Stars
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
The definitive Web UI for local AI, with powerful features and easy setup.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
A library for efficient similarity search and clustering of dense vectors.
Google Research
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Open-Sora: Democratizing Efficient Video Production for All
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Awesome-LLM: a curated list of Large Language Model
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Graph Neural Network Library for PyTorch
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
Build resilient language agents as graphs.
Fast and memory-efficient exact attention
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
SGLang is a fast serving framework for large language models and vision language models.