Skip to content
View dhcode-cpp's full-sized avatar
🐒
Making AI Safer
🐒
Making AI Safer

Block or report dhcode-cpp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
118 stars written in Python
Clear filter

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,456 11,114 Updated Nov 7, 2025

Inference code for Llama models

Python 58,906 9,812 Updated Jan 26, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,027 3,931 Updated Nov 7, 2025

A toolkit for developing and comparing reinforcement learning algorithms.

Python 36,749 8,713 Updated Oct 11, 2024

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 32,316 2,190 Updated Nov 3, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,516 6,481 Updated Nov 7, 2025

The official Meta Llama 3 GitHub site

Python 29,073 3,477 Updated Jan 26, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 27,564 2,536 Updated Nov 7, 2025

Fully open reproduction of DeepSeek-R1

Python 25,618 2,400 Updated Sep 8, 2025

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,345 5,815 Updated Aug 14, 2024

Build resilient language agents as graphs.

Python 20,739 3,661 Updated Nov 7, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,586 2,209 Updated Mar 11, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 19,971 3,301 Updated Nov 7, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,128 1,909 Updated Nov 1, 2025

Train transformer language models with reinforcement learning.

Python 16,216 2,279 Updated Nov 7, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,460 1,117 Updated Nov 7, 2025

Ongoing research training transformer models at scale

Python 14,129 3,252 Updated Nov 7, 2025

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 12,903 856 Updated Dec 17, 2024

Minimal reproduction of DeepSeek R1-Zero

Python 12,370 1,522 Updated Apr 24, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,339 809 Updated Oct 31, 2025

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 7,791 599 Updated Nov 6, 2025

A Collection of Variational Autoencoders (VAE) in PyTorch.

Python 7,419 1,175 Updated Mar 21, 2025

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Python 6,265 194 Updated Oct 27, 2025

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 6,082 524 Updated Jul 1, 2025

PyTorch native post-training library

Python 5,579 678 Updated Nov 7, 2025

Robust recipes to align language models with human and AI preferences

Python 5,415 463 Updated Sep 8, 2025

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4,668 319 Updated Aug 19, 2025

A PyTorch native platform for training generative AI models

Python 4,660 596 Updated Nov 7, 2025

[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving

Python 4,298 482 Updated Oct 29, 2025
Next