Stars
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
Supercharge Your LLM Application Evaluations 🚀
Retrieval and Retrieval-augmented LLMs
Large Language Model Text Generation Inference
Hackable and optimized Transformers building blocks, supporting a composable construction.
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
A tutorial and implement of disease centered Medical knowledge graph and qa system based on it。知识图谱构建,自动问答,基于kg的自动问答。以疾病为中心的一定规模医药领域知识图谱,并以该知识图谱完成自动问答与分析服务。
Example models using DeepSpeed
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
The official PyTorch implementation of Google's Gemma models
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (DragGAN 全功能实现,在线Demo,本地部署试用,代码、模型已全部开源,支持Windows, macOS, Linux)
Aligning pretrained language models with instruction data generated by themselves.
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
General technology for enabling AI capabilities w/ LLMs and MLLMs
An open-source framework for training large multimodal models.
Gemma open-weight LLM library, from Google DeepMind
Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editin…
⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)
Openai style api for open large language models, using LLMs just as chatgpt! Support for LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3 etc.…
【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models
A parser for Google Scholar, written in Python