🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 9,265 1,219 Updated Nov 4, 2025

TianxingChen / Embodied-AI-Guide

[Lumina Embodied AI] 具身智能技术指南 Embodied-AI-Guide

8,677 578 Updated Sep 22, 2025

SJTU-IPADS / PowerInfer

High-speed Large Language Model Serving for Local Deployment

C++ 8,377 450 Updated Aug 2, 2025

gaogaotiantian / viztracer

A debugging and profiling tool that can trace and visualize python code execution

Python 7,294 464 Updated Nov 5, 2025

mit-han-lab / streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,110 391 Updated Jul 11, 2024

QwenLM / Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,346 468 Updated Aug 7, 2024

LMCache / LMCache

Supercharge Your LLM with the Fastest KV Cache Layer

Python 5,912 691 Updated Nov 6, 2025

vllm-project / aibrix

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,344 479 Updated Nov 6, 2025

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,227 420 Updated Nov 6, 2025

GT-RIPL / Awesome-LLM-Robotics

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

4,079 318 Updated Oct 17, 2025

ModelTC / LightLLM

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 3,703 281 Updated Nov 6, 2025

namhyung / uftrace

Function graph tracer for C/C++/Rust/Python

C 3,344 533 Updated Oct 10, 2025

huggingface / huggingface_hub

The official Python client for the Hugging Face Hub.

Python 3,037 840 Updated Nov 6, 2025

jingyi0000 / VLM_survey

Collection of AWESOME vision-language models for vision tasks

2,990 223 Updated Oct 14, 2025

huihongxiao / MIT6.S081

2,624 402 Updated Mar 6, 2024

RedisGraph / RedisGraph

A graph database as a Redis module

C 2,037 231 Updated Jul 21, 2025

alibaba / yalantinglibs

A collection of modern C++ libraries, include coro_http, coro_rpc, compile-time reflection, struct_pack, struct_json, struct_xml, struct_pb, easylog, async_simple etc.

C++ 2,013 300 Updated Nov 5, 2025

linux-rdma / rdma-core

RDMA core userspace libraries and daemons

C 2,012 792 Updated Nov 2, 2025

llm-d / llm-d

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 1,981 221 Updated Nov 6, 2025

SafeAILab / EAGLE

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

Python 1,975 220 Updated Nov 5, 2025

flexflow / flexflow-train

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,844 245 Updated Nov 4, 2025

zchoi / Awesome-Embodied-Robotics-and-Agent

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

1,582 89 Updated Oct 30, 2025

rogersce / cnpy

library to read/write .npy and .npz files in C/C++

C++ 1,434 326 Updated Jan 18, 2023

octo-models / octo

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Python 1,410 236 Updated Jul 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

zszdsze

Block or report zszdsze

Stars

vllm-project / vllm

deepspeedai / DeepSpeed

openai / CLIP

Genesis-Embodied-AI / Genesis

haotian-liu / LLaVA

brendangregg / FlameGraph

huggingface / accelerate