Stars
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
Understand Human Behavior to Align True Needs
Set of tools to assess and improve LLM security.
🚀 Efficient implementations of state-of-the-art linear attention models
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.
🚪✊Knock Knock: Get notified when your training ends with only two additional lines of code
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
A library for advanced large language model reasoning
Minimalistic large language model 3D-parallelism training
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
micronet, a model compression and deployment library. Compression: 1. quantization: quantization-aware training (QAT), high-bit (>2b) (DoReFa / Quantization and Training of Neural Networks for Efficient Integer-…
deepspeedai / Megatron-DeepSpeed
Forked from NVIDIA/Megatron-LM. Ongoing research training transformer language models at scale, including: BERT & GPT-2
Muon is an optimizer for hidden layers in neural networks
Minimalistic 4D-parallelism distributed training framework for educational purposes
YaRN: Efficient Context Window Extension of Large Language Models
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Stanford NLP Python library for Representation Finetuning (ReFT)
[NeurIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official codebase for the paper "Group-in-Group Policy Optimization for LLM Agent Training"
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unita…