LLM
[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generation models.
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
An Open Large Reasoning Model for Real-World Solutions
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training
prime is a framework for efficient, globally distributed training of AI models over the internet.
SGLang is a fast serving framework for large language models and vision language models.
Kheish: A multi-role LLM agent for tasks like code auditing, file searching, and more, seamlessly leveraging RAG and extensible modules.
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…
SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime
Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, finetuning, evaluating, and serving LLMs in JAX/Flax.
noise_step: Training in 1.58b With No Gradient Memory
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
🍒 Cherry Studio is a desktop client that supports multiple LLM providers.
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
Enhanced ChatGPT Clone: Features Agents, MCP, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message…
🤯 LobeHub - an open-source, modern design AI Agent Workspace. Supports multiple AI providers, Knowledge Base (file upload / RAG), one-click install MCP Marketplace and Artifacts / Thinking. One-cl…
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
LLM4AD: A Platform for Algorithm Design with Large Language Model
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
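"Open-Source LLM Usage Guide": a tutorial tailored for beginners in China on quickly fine-tuning (full-parameter/LoRA) and deploying open-source large language models (LLMs) and multimodal large models (MLLMs), from China and abroad, in a Linux environment.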
ModelScope: bring the notion of Model-as-a-Service to life.
A curated, but incomplete, list of data-centric AI resources.
ICML2025: Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning
Qwen3 is the large language model series developed by the Qwen team, Alibaba Cloud.
AnimationGPT: An AIGC tool for generating game combat motion assets