Skip to content
View dhcode-cpp's full-sized avatar
🐒
Making AI Safer
🐒
Making AI Safer

Block or report dhcode-cpp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
118 stars written in Python
Clear filter

Arena-Hard-Auto: An automatic LLM benchmark.

Python 955 134 Updated Jun 21, 2025

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 930 77 Updated Sep 4, 2024
Python 923 55 Updated Oct 20, 2025

🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"

Python 919 47 Updated Mar 19, 2025

Ring attention implementation with flash attention

Python 906 88 Updated Sep 10, 2025

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 893 49 Updated Sep 30, 2025

PyTorch DDPM implementation

Python 829 122 Updated May 23, 2022

Pipeline Parallelism for PyTorch

Python 781 88 Updated Aug 21, 2024

depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.

Python 756 26 Updated Oct 13, 2025

Code for Quiet-STaR

Python 741 90 Updated Aug 21, 2024

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python 676 50 Updated Jan 20, 2025

Automatically split your PyTorch models on multiple GPUs for training & inference

Python 658 45 Updated Jan 2, 2024

[ICCV 2023] OccNet: Scene as Occupancy

Python 644 55 Updated Jul 2, 2025

Microsoft Automatic Mixed Precision Library

Python 627 48 Updated Sep 29, 2024

Processed / Cleaned Data for Paper Copilot

Python 617 34 Updated Nov 8, 2025

《Reinforcement Learning: An Introduction》(第二版)中文翻译

Python 616 109 Updated Apr 9, 2022

Unofficial PyTorch implementation of Denoising Diffusion Probabilistic Models

Python 615 77 Updated Jun 11, 2024

Explore the Multimodal “Aha Moment” on 2B Model

Python 614 22 Updated Mar 18, 2025

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 591 68 Updated Oct 14, 2025
Python 545 53 Updated Sep 23, 2025

A recipe for online RLHF and online iterative DPO.

Python 536 49 Updated Dec 28, 2024

[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Python 499 34 Updated Feb 10, 2025

Trainable fast and memory-efficient sparse attention

Python 434 39 Updated Nov 7, 2025

[ICML 2024] CLLMs: Consistency Large Language Models

Python 405 17 Updated Nov 16, 2024

Implementation of Denoising Diffusion Probabilistic Models in PyTorch

Python 389 44 Updated Jun 14, 2022

Dion optimizer algorithm

Python 381 32 Updated Nov 1, 2025

[ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule

Python 360 21 Updated Sep 15, 2025

AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods…

Python 302 11 Updated Nov 1, 2024

Universal Adversarial Triggers for Attacking and Analyzing NLP (EMNLP 2019)

Python 299 56 Updated Jul 25, 2024

rl from zero pretrain, can it be done? yes.

Python 279 21 Updated Sep 28, 2025