Skip to content
View dhcode-cpp's full-sized avatar
🐒
Making AI Safer
🐒
Making AI Safer

Block or report dhcode-cpp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
118 stars written in Python
Clear filter

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,223 375 Updated Sep 11, 2025
Python 4,158 444 Updated Jul 31, 2025

Understand Human Behavior to Align True Needs

Python 4,018 389 Updated Aug 13, 2025

Set of tools to assess and improve LLM security.

Python 3,857 661 Updated Nov 4, 2025

🚀 Efficient implementations of state-of-the-art linear attention models

Python 3,780 296 Updated Nov 6, 2025

NanoGPT (124M) in 3 minutes

Python 3,774 489 Updated Nov 6, 2025

Compute FID scores with PyTorch.

Python 3,774 524 Updated Jul 3, 2024

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Python 3,307 194 Updated Oct 25, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,874 305 Updated Mar 10, 2025

🚪✊Knock Knock: Get notified when your training ends with only two additional lines of code

Python 2,816 233 Updated Jun 23, 2023

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

Python 2,802 90 Updated Nov 7, 2025
Python 2,547 306 Updated May 19, 2024

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,390 186 Updated Nov 7, 2025

A library for advanced large language model reasoning

Python 2,300 202 Updated Jun 10, 2025

Minimalistic large language model 3D-parallelism training

Python 2,299 253 Updated Sep 3, 2025

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 2,265 293 Updated May 11, 2025

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-…

Python 2,263 478 Updated May 6, 2025

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,187 365 Updated Aug 14, 2025

Muon is an optimizer for hidden layers in neural networks

Python 1,981 94 Updated Jul 12, 2025

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 1,891 144 Updated Aug 26, 2025

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,634 127 Updated Apr 17, 2024

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,554 126 Updated Sep 8, 2025

Stanford NLP Python library for Representation Finetuning (ReFT)

Python 1,526 129 Updated Feb 6, 2025

The MATH Dataset (NeurIPS 2021)

Python 1,236 108 Updated Sep 6, 2025

[NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS

Python 1,224 110 Updated Sep 19, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,145 97 Updated Oct 20, 2025

Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unita…

Python 1,132 132 Updated Oct 6, 2025
Python 994 46 Updated Jul 2, 2025
Python 963 110 Updated Jan 23, 2025