🐒
Making AI Safer. Focus on LLM, RL, Infra
Stars
5 results for forked starred repositories
hijkzzz / awesome-RLHF
Forked from opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
4
Updated Jan 10, 2025
philschmid / MixEval
Forked from JinjieNi/MixEval
The official evaluation suite and dynamic data release for MixEval.
deepspeedai / Megatron-DeepSpeed
Forked from NVIDIA/Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
yang-song / pytorch-fid
Forked from mseitzer/pytorch-fid
A Port of Fréchet Inception Distance (FID score) to PyTorch
XuehaiPan / safe-rlhf
Forked from PKU-Alignment/safe-rlhf
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback