Stars
Train auto_car in the CARLA simulator with RL algorithms (SAC).
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
Collection of reinforcement learning algorithms
Website for Practical Deep Learning for Coders 2022
An autoregressive character-level language model for making more things
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples
SGLang is a high-performance serving framework for large language models and multimodal models.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Huly — All-in-One Project Management Platform (alternative to Linear, Jira, Slack, Notion, Motion)
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
magnusja / ppo
Forked from pat-coady/trpo
Proximal Policy Optimization with TensorFlow and OpenAI Gym
Simple framework for image and video deblurring, implemented in PyTorch
Python Implementations of Monte Carlo Tree Search
A replica of the AlphaZero methodology for deep reinforcement learning in Python
An educational resource to help anyone learn deep reinforcement learning.
Python Implementation of Reinforcement Learning: An Introduction