zszdsze

Follow

zszdsze

Follow

0 followers · 5 following

Stars

31 stars written in Python

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,340 11,080 Updated Nov 6, 2025

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,624 4,613 Updated Nov 6, 2025

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 27,558 2,535 Updated Nov 6, 2025

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 23,908 2,659 Updated Aug 12, 2024

FMInference / FlexLLMGen

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,374 583 Updated Oct 28, 2024

huggingface / accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 9,266 1,219 Updated Nov 4, 2025

gaogaotiantian / viztracer

A debugging and profiling tool that can trace and visualize python code execution

Python 7,297 464 Updated Nov 5, 2025

mit-han-lab / streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,110 391 Updated Jul 11, 2024

QwenLM / Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,345 469 Updated Aug 7, 2024

LMCache / LMCache

Supercharge Your LLM with the Fastest KV Cache Layer

Python 5,917 691 Updated Nov 6, 2025

openvla / openvla

Forked from TRI-ML/prismatic-vlms

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 4,336 518 Updated Mar 23, 2025

ModelTC / LightLLM

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 3,704 281 Updated Nov 6, 2025

huggingface / huggingface_hub

The official Python client for the Hugging Face Hub.

Python 3,037 841 Updated Nov 6, 2025

SafeAILab / EAGLE

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

Python 1,977 220 Updated Nov 5, 2025

google-research / robotics_transformer

Python 1,622 189 Updated Jan 31, 2024

octo-models / octo

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Python 1,410 236 Updated Jul 31, 2024

huangwl18 / ReKep

ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation

Python 857 90 Updated Feb 20, 2025

vimalabs / VIMA

Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

Python 831 96 Updated Apr 18, 2024

mees / calvin

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Python 736 91 Updated Sep 8, 2025

SpatialVLA / SpatialVLA

🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.

Python 564 33 Updated Jun 23, 2025

AlibabaPAI / llumnix

Efficient and easy multi-instance LLM serving

Python 506 41 Updated Sep 3, 2025

RoboFlamingo / RoboFlamingo

Code for RoboFlamingo

Python 407 37 Updated May 8, 2024

Infini-AI-Lab / Sequoia

scalable and robust tree-based speculative decoding algorithm

Python 361 37 Updated Jan 28, 2025

FMInference / DejaVu

Python 345 44 Updated Apr 2, 2024

ByteDance-Seed / ShadowKV

[ICML 2025 Spotlight] ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference

Python 269 18 Updated May 1, 2025

Infini-AI-Lab / MagicPIG

[ICLR2025 Spotlight] MagicPIG: LSH Sampling for Efficient LLM Generation

Python 238 16 Updated Dec 16, 2024

snu-comparch / InfiniGen

InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)

Python 157 29 Updated Jul 10, 2024

Infini-AI-Lab / UMbreLLa

LLM Inference on consumer devices

Python 125 15 Updated Mar 17, 2025

RedisGraph / redisgraph-bulk-loader

A Python utility for building RedisGraph databases from CSV inputs

Python 68 35 Updated May 14, 2023

hyy02 / Corki

Python 27 2 Updated Aug 27, 2025