Expert Specialized Fine-Tuning

Python 718 260 Updated May 22, 2025

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Python 328 21 Updated Apr 24, 2025

DeepSeek LLM: Let there be answers

Makefile 6,676 1,043 Updated Feb 4, 2024

Secrets of RLHF in Large Language Models Part I: PPO

Python 1,408 105 Updated Mar 3, 2024

A curated list of reinforcement learning with human feedback resources (continually updated)

4,246 250 Updated Dec 9, 2025

A curated list of open-source projects related to DeepSeek Coder

742 203 Updated Nov 11, 2025

[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

Python 2,992 359 Updated Apr 22, 2025

DeepSeek Coder: Let the Code Write Itself

Python 22,537 2,689 Updated Nov 11, 2025

A high-performance deep learning training platform with task-level time-sharing scheduling of GPU compute

Python 726 94 Updated Oct 24, 2023

PyTorch implementations of deep reinforcement learning algorithms and environments

Python 5,906 1,209 Updated Jul 25, 2024

AI magics meet Infinite draw board.

Jupyter Notebook 1,938 177 Updated May 9, 2024

The test of different distributed-training methods on High-Flyer AIHPC

Python 26 3 Updated Oct 18, 2022

FireFlyer Record file format, writer and reader for DL training samples.

Python 238 25 Updated Dec 1, 2022

Python 42 6 Updated Jun 10, 2022

CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP

Python 145 16 Updated Jun 10, 2022