Stars
Arena-Hard-Auto: An automatic LLM benchmark.
FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.
🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"
Ring attention implementation with flash attention
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs); a minimal DPO-loss sketch appears after this list.
depyf is a tool to help you understand and adapt to the PyTorch compiler, torch.compile.
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
Automatically split your PyTorch models across multiple GPUs for training & inference
Processed / Cleaned Data for Paper Copilot
Chinese translation of "Reinforcement Learning: An Introduction" (Second Edition)
Unofficial PyTorch implementation of Denoising Diffusion Probabilistic Models
Explore the Multimodal "Aha Moment" on a 2B Model
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
A recipe for online RLHF and online iterative DPO.
[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
Trainable fast and memory-efficient sparse attention
[ICML 2024] CLLMs: Consistency Large Language Models
Implementation of Denoising Diffusion Probabilistic Models in PyTorch (a sketch of the forward noising process appears after this list)
[ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule
AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample- and compute-efficient than reinforcement learning methods…
Universal Adversarial Triggers for Attacking and Analyzing NLP (EMNLP 2019)
RL from zero pretraining: can it be done? Yes.
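Several of the entries above (the HALOs library and the online iterative DPO recipe) center on Direct Preference Optimization. Below is a minimal sketch of the standard DPO loss from Rafailov et al. (2023) in plain PyTorch, assuming per-sequence log-probabilities have already been computed; the function and argument names are illustrative and are not the API of any library listed here.

```python
# Minimal DPO loss sketch (not the HALOs API); argument names are illustrative.
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    """Each argument is a 1-D tensor of summed token log-probs per sequence."""
    # Log-ratios of policy vs. reference model for preferred / dispreferred responses.
    chosen_logratios = policy_chosen_logps - ref_chosen_logps
    rejected_logratios = policy_rejected_logps - ref_rejected_logps
    # Standard DPO objective: -log sigmoid(beta * margin of log-ratios).
    logits = beta * (chosen_logratios - rejected_logratios)
    return -F.logsigmoid(logits).mean()

# Example with random log-probs for a batch of 4 preference pairs.
if __name__ == "__main__":
    b = 4
    loss = dpo_loss(torch.randn(b), torch.randn(b), torch.randn(b), torch.randn(b))
    print(loss.item())
```

The beta hyperparameter scales the implicit reward margin, i.e. how strongly the policy is penalized for drifting from the reference model on the preferred vs. dispreferred response.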
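The two DDPM implementations above both rely on the same closed-form forward (noising) process, q(x_t | x_0) = N(sqrt(ᾱ_t) x_0, (1 − ᾱ_t) I). The sketch below shows that step with the linear beta schedule from Ho et al. (2020); it is an illustration under those assumptions, not code taken from either repository.

```python
# Minimal DDPM forward-process sketch; tensor names are illustrative.
import torch

T = 1000
betas = torch.linspace(1e-4, 0.02, T)       # linear noise schedule
alphas = 1.0 - betas
alpha_bars = torch.cumprod(alphas, dim=0)   # \bar{alpha}_t = prod_{s<=t} alpha_s

def q_sample(x0, t, noise=None):
    """Sample x_t ~ q(x_t | x_0) for a batch of images x0 and integer timesteps t."""
    if noise is None:
        noise = torch.randn_like(x0)
    ab = alpha_bars[t].view(-1, 1, 1, 1)    # broadcast over (B, C, H, W)
    return ab.sqrt() * x0 + (1.0 - ab).sqrt() * noise

# Example: noise a batch of 8 fake 32x32 RGB images at random timesteps.
x0 = torch.randn(8, 3, 32, 32)
t = torch.randint(0, T, (8,))
xt = q_sample(x0, t)
print(xt.shape)  # torch.Size([8, 3, 32, 32])
```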