Skip to content
View zszdsze's full-sized avatar

Block or report zszdsze

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A generative world for general-purpose robotics & embodied AI learning.

Python 27,557 2,534 Updated Nov 6, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,624 4,613 Updated Nov 6, 2025

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 1,983 223 Updated Nov 6, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,334 11,080 Updated Nov 6, 2025

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 3,704 281 Updated Nov 6, 2025

The official Python client for the Hugging Face Hub.

Python 3,037 841 Updated Nov 6, 2025

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,345 479 Updated Nov 6, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 5,916 691 Updated Nov 6, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,231 420 Updated Nov 6, 2025

A debugging and profiling tool that can trace and visualize python code execution

Python 7,297 464 Updated Nov 5, 2025

A collection of modern C++ libraries, include coro_http, coro_rpc, compile-time reflection, struct_pack, struct_json, struct_xml, struct_pb, easylog, async_simple etc.

C++ 2,013 300 Updated Nov 5, 2025

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

Python 1,977 220 Updated Nov 5, 2025

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,844 245 Updated Nov 4, 2025

πŸš€ A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 9,266 1,219 Updated Nov 4, 2025

RDMA core userspace libraries and daemons

C 2,012 792 Updated Nov 2, 2025

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! πŸ”₯

1,583 89 Updated Oct 30, 2025

A throughput-oriented high-performance serving framework for LLMs

Jupyter Notebook 912 44 Updated Oct 29, 2025

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

4,079 318 Updated Oct 17, 2025

Collection of AWESOME vision-language models for vision tasks

2,990 223 Updated Oct 14, 2025

Function graph tracer for C/C++/Rust/Python

C 3,344 533 Updated Oct 10, 2025

πŸ“° Must-read papers on KV Cache Compression (constantly updating πŸ€—).

594 15 Updated Sep 30, 2025

[Lumina Embodied AI] ε…·θΊ«ζ™Ίθƒ½ζŠ€ζœ―ζŒ‡ε— Embodied-AI-Guide

8,681 578 Updated Sep 22, 2025

FlexFlow Serve: Low-Latency, High-Performance LLM Serving

C++ 63 5 Updated Sep 15, 2025

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Python 736 91 Updated Sep 8, 2025

This repository collects papers on VLLM applications. We will update new papers irregularly.

174 14 Updated Sep 7, 2025

Efficient and easy multi-instance LLM serving

Python 505 41 Updated Sep 3, 2025
Python 27 2 Updated Aug 27, 2025

TAPA compiles task-parallel HLS program into high-performance FPGA accelerators.

C++ 175 35 Updated Aug 16, 2025

paper and its code for AI System

334 23 Updated Aug 15, 2025
Next