68 results for source starred repositories

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,273 11,066 Updated Nov 6, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,618 4,613 Updated Nov 6, 2025

CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image

Jupyter Notebook 31,443 3,820 Updated Jul 23, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 27,551 2,533 Updated Nov 5, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 23,905 2,659 Updated Aug 12, 2024

Stack trace visualizer

Perl 18,901 2,058 Updated Oct 20, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 9,265 1,219 Updated Nov 4, 2025

[Lumina Embodied AI] Embodied AI Technical Guide (Embodied-AI-Guide)

8,677 578 Updated Sep 22, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 8,377 450 Updated Aug 2, 2025

A debugging and profiling tool that can trace and visualize python code execution

Python 7,294 464 Updated Nov 5, 2025

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,110 391 Updated Jul 11, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,346 468 Updated Aug 7, 2024

Supercharge Your LLM with the Fastest KV Cache Layer

Python 5,912 691 Updated Nov 6, 2025

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,344 479 Updated Nov 6, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,227 420 Updated Nov 6, 2025

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

4,079 318 Updated Oct 17, 2025

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 3,703 281 Updated Nov 6, 2025

Function graph tracer for C/C++/Rust/Python

C 3,344 533 Updated Oct 10, 2025

The official Python client for the Hugging Face Hub.

Python 3,037 840 Updated Nov 6, 2025

Collection of AWESOME vision-language models for vision tasks

2,990 223 Updated Oct 14, 2025

A graph database as a Redis module

C 2,037 231 Updated Jul 21, 2025

A collection of modern C++ libraries, including coro_http, coro_rpc, compile-time reflection, struct_pack, struct_json, struct_xml, struct_pb, easylog, async_simple, etc.

C++ 2,013 300 Updated Nov 5, 2025

RDMA core userspace libraries and daemons

C 2,012 792 Updated Nov 2, 2025

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 1,981 221 Updated Nov 6, 2025

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

Python 1,975 220 Updated Nov 5, 2025

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,844 245 Updated Nov 4, 2025

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

1,582 89 Updated Oct 30, 2025

Library to read/write .npy and .npz files in C/C++

C++ 1,434 326 Updated Jan 18, 2023

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Python 1,410 236 Updated Jul 31, 2024