Starred repositories
Python tool for converting files and office documents to Markdown.
A high-throughput and memory-efficient inference and serving engine for LLMs
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
If you live in the terminal, kitty is made for you! Cross-platform, fast, feature-rich, GPU based.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
SGLang is a fast serving framework for large language models and vision language models.
リアルタイムボイスチェンジャー Realtime Voice Changer
A TTS model capable of generating ultra-realistic dialogue in one pass.
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Open source code for AlphaFold 2.
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Hierarchical Reasoning Model Official Release
A framework for few-shot evaluation of language models.
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Transformer: PyTorch Implementation of "Attention Is All You Need"
RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Official repository for the Boltz biomolecular interaction models
A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.
PyTorch code and models for V-JEPA self-supervised learning from video.
Official PyTorch implementation for "Large Language Diffusion Models"
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2
Efficient vision foundation models for high-resolution generation and perception.
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling