dhcode-cpp

🐒

Making AI Safer

dhcode95 dhcode-cpp

Making AI Safer. Focus on LLM, RL, Infra

196 followers · 327 following

Stars

deepseek-ai / DeepSeek-V3

Python 100,183 16,322 Updated Aug 28, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,404 11,099 Updated Nov 7, 2025

meta-llama / llama

Inference code for Llama models

Python 58,905 9,812 Updated Jan 26, 2025

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,008 3,930 Updated Nov 7, 2025

openai / gym

A toolkit for developing and comparing reinforcement learning algorithms.

Python 36,747 8,713 Updated Oct 11, 2024

exo-explore / exo

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 32,304 2,189 Updated Nov 3, 2025

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,511 6,478 Updated Nov 7, 2025

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 29,073 3,477 Updated Jan 26, 2025

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 27,561 2,535 Updated Nov 7, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,617 2,400 Updated Sep 8, 2025

openai / gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,346 5,817 Updated Aug 14, 2024

mozilla-ai / llamafile

Distribute and run LLMs with a single file.

C 23,342 1,234 Updated Nov 5, 2025

langchain-ai / langgraph

Build resilient language agents as graphs.

Python 20,718 3,659 Updated Nov 6, 2025

openai / swarm

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,585 2,210 Updated Mar 11, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 19,909 3,296 Updated Nov 7, 2025

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,123 1,907 Updated Nov 1, 2025

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

16,635 1,072 Updated Nov 6, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 16,206 2,279 Updated Nov 7, 2025

kvcache-ai / ktransformers

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,451 1,115 Updated Nov 6, 2025

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 14,122 3,251 Updated Nov 7, 2025

karpathy / micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 13,647 2,009 Updated Aug 8, 2024

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 12,901 856 Updated Dec 17, 2024

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 12,365 1,522 Updated Apr 24, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,850 896 Updated Sep 30, 2025

adam-maj / tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 8,855 697 Updated Aug 18, 2024

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 8,697 976 Updated Nov 6, 2025

FoundationVision / VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,471 542 Updated May 18, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,334 807 Updated Oct 31, 2025

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,929 286 Updated May 15, 2025

OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 7,790 599 Updated Nov 6, 2025