Stars
SGLang is a high-performance serving framework for large language models and multimodal models.
Machine Learning Engineering Open Book
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks.This framework enables Claud…
🚀 Efficient implementations for emerging model architectures
A generative and self-guided robotic agent that endlessly propose and master new skills.
A project to improve skills of large language models
A lightweight inference engine supporting speculative speculative decoding (SSD).
Plugin for CTFd that integrates a web based shell
Babbleshack / RLGPUSchedule
Forked from matthewygf/GPUScheduleA GPU Cluster Simulator for Distributed Deep Learning Training using Deep Reinforcement Learning