Pinned Loading
-
cde-offline-rl
cde-offline-rl PublicLearning from Sparse Offline Datasets via Conservative Density Estimation (ICLR 2024)
Python 3
-
Bridge-LLM-reasoning
Bridge-LLM-reasoning PublicBehavior Injection: Preparing Language Models for Reinforcement Learning (NeurIPS 2025)
Python 14
-
SalesforceAIResearch/PretrainRL-pipeline
SalesforceAIResearch/PretrainRL-pipeline PublicAn automated data pipeline scaling RL to pretraining levels
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.