Stars
Provides pre-built flash-attention 2 and 3 package wheels for Linux and Windows, built using GitHub Actions
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
Unified high-performance Python client for object and file stores.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Post-training with Tinker
A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗
Development repository for the Triton language and compiler
AI fundamentals: GPU architecture, CUDA programming, large-model basics, and AI Agent topics.
A framework for few-shot evaluation of language models.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environment…
The absolute trainer to light up AI agents.
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
[Lumina Embodied AI Community] Embodied AI technical guide (Embodied-AI-Guide)
The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
NexRL is an ultra-loosely-coupled LLM post-training framework.
Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
FastAPI framework, high performance, easy to learn, fast to code, ready for production
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
Papers for database systems powered by artificial intelligence (machine learning for database)