Skip to content
View dhcode-cpp's full-sized avatar
🐒
Making AI Safer
🐒
Making AI Safer

Block or report dhcode-cpp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,404 11,099 Updated Nov 7, 2025

Inference code for Llama models

Python 58,905 9,812 Updated Jan 26, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,008 3,930 Updated Nov 7, 2025

A toolkit for developing and comparing reinforcement learning algorithms.

Python 36,747 8,713 Updated Oct 11, 2024

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 32,304 2,189 Updated Nov 3, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,511 6,478 Updated Nov 7, 2025

The official Meta Llama 3 GitHub site

Python 29,073 3,477 Updated Jan 26, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 27,561 2,535 Updated Nov 7, 2025

Fully open reproduction of DeepSeek-R1

Python 25,617 2,400 Updated Sep 8, 2025

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,346 5,817 Updated Aug 14, 2024

Distribute and run LLMs with a single file.

C 23,342 1,234 Updated Nov 5, 2025

Build resilient language agents as graphs.

Python 20,718 3,659 Updated Nov 6, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,585 2,210 Updated Mar 11, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 19,909 3,296 Updated Nov 7, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,123 1,907 Updated Nov 1, 2025

✨✨Latest Advances on Multimodal Large Language Models

16,635 1,072 Updated Nov 6, 2025

Train transformer language models with reinforcement learning.

Python 16,206 2,279 Updated Nov 7, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,451 1,115 Updated Nov 6, 2025

Ongoing research training transformer models at scale

Python 14,122 3,251 Updated Nov 7, 2025

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 13,647 2,009 Updated Aug 8, 2024

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 12,901 856 Updated Dec 17, 2024

Minimal reproduction of DeepSeek R1-Zero

Python 12,365 1,522 Updated Apr 24, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,850 896 Updated Sep 30, 2025

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 8,855 697 Updated Aug 18, 2024

DeepEP: an efficient expert-parallel communication library

Cuda 8,697 976 Updated Nov 6, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,471 542 Updated May 18, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,334 807 Updated Oct 31, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,929 286 Updated May 15, 2025

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 7,790 599 Updated Nov 6, 2025
Next