Starred repositories
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…
Implementation of "FlashPreill: Instantaneous Pattern Discovery and Thresholding for Ultra-Fast Long-Context Prefilling"
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
分享AI Infra知识&代码练习:PyTorch/vLLM/SGLang框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等
A lightweight inference engine supporting speculative speculative decoding (SSD).
An AI-powered Texas Hold'em Poker framework driven by Large Language Models
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
Welcome to GR00T Whole-Body Control (WBC)! This is a unified platform for developing and deploying advanced humanoid controllers. This includes: Decoupled WBC models used in NVIDIA Isaac-Gr00t, Gr0…
[Development suspended] Advanced open-source Texas Hold'em GTO solver with optimized performance (web browser version)
🚀 A very efficient Texas Holdem GTO solver
A Java implemented Texas holdem and short deck Solver
Shannon Lite is an autonomous, white-box AI pentester for web applications and APIs. It analyzes your source code, identifies attack vectors, and executes real exploits to prove vulnerabilities bef…
RynnBrain: Open Embodied Foundation Models
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
Fast and memory-efficient exact attention
[CVPR2026] Detect Anything via Next Point Prediction
Real-time behaviour synthesis with MuJoCo, using Predictive Control
[ArXiv 2025] DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models
Toolbox for our GraspNet-1Billion dataset.