Stars
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy-to-use hardware optimization tools
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
FlashMLA: Efficient Multi-head Latent Attention Kernels
NOFX: Defining the Next-Generation AI Trading Operating System. A multi-exchange AI trading platform (Binance/Hyperliquid/Aster) with multi-AI competition (deepseek/qwen/claude), self-evolution, and re…
The most intuitive desktop API client. Organize and execute REST, GraphQL, WebSockets, Server Sent Events, and gRPC 🦬
🚀 Efficient implementations of state-of-the-art linear attention models
Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond
A Practitioner's Guide to M(eow)ti Turn Agentic ReinfOrcement learning
Video translation and dubbing tool powered by LLMs. The video translator offers 100 language translations and one-click full-process deployment. The video translation output is optimized for platfo…
The absolute trainer to light up AI agents.
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Scalable toolkit for efficient model reinforcement
A debugging and profiling tool that can trace and visualize Python code execution
Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
PDF references add-on for Zotero.