Skip to content
View zszdsze's full-sized avatar

Block or report zszdsze

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
31 stars written in Python
Clear filter

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,207 11,052 Updated Nov 6, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,614 4,613 Updated Nov 6, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 27,548 2,531 Updated Nov 5, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 23,902 2,659 Updated Aug 12, 2024

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,372 584 Updated Oct 28, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 9,264 1,219 Updated Nov 4, 2025

A debugging and profiling tool that can trace and visualize python code execution

Python 7,294 464 Updated Nov 5, 2025

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,110 391 Updated Jul 11, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,345 468 Updated Aug 7, 2024

Supercharge Your LLM with the Fastest KV Cache Layer

Python 5,912 691 Updated Nov 6, 2025

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 4,326 518 Updated Mar 23, 2025

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 3,703 281 Updated Nov 6, 2025

The official Python client for the Hugging Face Hub.

Python 3,037 840 Updated Nov 6, 2025

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

Python 1,972 220 Updated Nov 5, 2025

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Python 1,409 236 Updated Jul 31, 2024

ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation

Python 857 90 Updated Feb 20, 2025

Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

Python 831 96 Updated Apr 18, 2024

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Python 736 91 Updated Sep 8, 2025

🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.

Python 562 32 Updated Jun 23, 2025

Efficient and easy multi-instance LLM serving

Python 505 41 Updated Sep 3, 2025

Code for RoboFlamingo

Python 407 37 Updated May 8, 2024

scalable and robust tree-based speculative decoding algorithm

Python 361 37 Updated Jan 28, 2025
Python 345 44 Updated Apr 2, 2024

[ICML 2025 Spotlight] ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference

Python 267 18 Updated May 1, 2025

[ICLR2025 Spotlight] MagicPIG: LSH Sampling for Efficient LLM Generation

Python 238 16 Updated Dec 16, 2024

InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)

Python 157 29 Updated Jul 10, 2024

LLM Inference on consumer devices

Python 125 15 Updated Mar 17, 2025

A Python utility for building RedisGraph databases from CSV inputs

Python 68 35 Updated May 14, 2023
Python 27 2 Updated Aug 27, 2025
Next