🐒
Making AI Safer
Making AI Safer. Focus on LLM、RL、Infra
Stars
7
results
for sponsorable starred repositories
written in Python
Clear filter
A high-throughput and memory-efficient inference and serving engine for LLMs
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
《Reinforcement Learning: An Introduction》(第二版)中文翻译
XuehaiPan / safe-rlhf
Forked from PKU-Alignment/safe-rlhfSafe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback