🐒
Making AI Safer
Making AI Safer. Focus on LLM、RL、Infra
Stars
10
results
for sponsorable starred repositories
Clear filter
Copy-paste Liquid Glass shader with SVG
《Reinforcement Learning: An Introduction》(第二版)中文翻译
A high-throughput and memory-efficient inference and serving engine for LLMs
XuehaiPan / safe-rlhf
Forked from PKU-Alignment/safe-rlhfSafe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Automation scripts for setting up a basic development environment.
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving