dhcode95 dhcode-cpp

🐒

Making AI Safer

Making AI Safer. Focus on LLM、RL、Infra

Achievements

7 results for sponsorable starred repositories written in Python

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,473 11,121 Updated Nov 8, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,031 3,932 Updated Nov 7, 2025

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Python 6,265 194 Updated Oct 27, 2025

[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving

Python 4,298 482 Updated Oct 29, 2025

[ICCV 2023] OccNet: Scene as Occupancy

Python 644 55 Updated Jul 2, 2025

《Reinforcement Learning: An Introduction》（第二版）中文翻译

Python 616 109 Updated Apr 9, 2022

Forked from PKU-Alignment/safe-rlhf

Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 4 Updated May 16, 2024