dhcode-cpp

🐒

Making AI Safer

dhcode95 dhcode-cpp

Making AI Safer. Focus on LLM, RL, Infra

196 followers · 327 following

Stars

118 stars written in Python

deepseek-ai / DeepSeek-V3

Python 100,187 16,321 Updated Aug 28, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,456 11,114 Updated Nov 7, 2025

meta-llama / llama

Inference code for Llama models

Python 58,906 9,812 Updated Jan 26, 2025

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,027 3,931 Updated Nov 7, 2025

openai / gym

A toolkit for developing and comparing reinforcement learning algorithms.

Python 36,749 8,713 Updated Oct 11, 2024

exo-explore / exo

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 32,316 2,190 Updated Nov 3, 2025

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,516 6,481 Updated Nov 7, 2025

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 29,073 3,477 Updated Jan 26, 2025

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 27,564 2,536 Updated Nov 7, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,618 2,400 Updated Sep 8, 2025

openai / gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,345 5,815 Updated Aug 14, 2024

langchain-ai / langgraph

Build resilient language agents as graphs.

Python 20,739 3,661 Updated Nov 7, 2025

openai / swarm

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,586 2,209 Updated Mar 11, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 19,971 3,301 Updated Nov 7, 2025

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,128 1,909 Updated Nov 1, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 16,216 2,279 Updated Nov 7, 2025

kvcache-ai / ktransformers

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,460 1,117 Updated Nov 7, 2025

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 14,129 3,252 Updated Nov 7, 2025

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 12,903 856 Updated Dec 17, 2024

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 12,370 1,522 Updated Apr 24, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,339 809 Updated Oct 31, 2025

OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 7,791 599 Updated Nov 6, 2025

AntixK / PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Python 7,419 1,175 Updated Mar 21, 2025

XuehaiPan / nvitop

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Python 6,265 194 Updated Oct 27, 2025

Lightning-AI / lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 6,082 524 Updated Jul 1, 2025

meta-pytorch / torchtune

PyTorch native post-training library

Python 5,579 678 Updated Nov 7, 2025

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 5,415 463 Updated Sep 8, 2025

xlite-dev / Awesome-LLM-Inference

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4,668 319 Updated Aug 19, 2025

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 4,660 596 Updated Nov 7, 2025

OpenDriveLab / UniAD

[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving

Python 4,298 482 Updated Oct 29, 2025