dhcode95 dhcode-cpp

🐒

Making AI Safer

Making AI Safer. Focus on LLM、RL、Infra

Achievements

5 results for forked starred repositories

Forked from opendilab/awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

4 Updated Jan 10, 2025

Forked from JinjieNi/MixEval

The official evaluation suite and dynamic data release for MixEval.

Python 11 2 Updated Sep 23, 2024

Forked from NVIDIA/Megatron-LM

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,186 365 Updated Aug 14, 2025

Forked from mseitzer/pytorch-fid

A Port of Fréchet Inception Distance (FID score) to PyTorch

Python 1 1 Updated Jan 31, 2019

Forked from PKU-Alignment/safe-rlhf

Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 4 Updated May 16, 2024