angerybob

Fast Multimodal LLM on Mobile Devices

C++ 1,297 156 Updated Dec 23, 2025

Running VLA at 30Hz frame rate and 480Hz trajectory frequency

Python 327 23 Updated Dec 14, 2025

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

Python 577 125 Updated Dec 25, 2025

HD-MoE: Hybrid and Dynamic Parallelism for Mixture-of-Expert LLMs with 3D Near-Memory Processing

Python 9 4 Updated Sep 12, 2025

Repo for SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting (ISCA25)

C++ 70 9 Updated Apr 25, 2025

LLM inference in C/C++

C++ 91,989 14,244 Updated Dec 25, 2025

Code release for AdapMoE accepted by ICCAD 2024

Jupyter Notebook 35 4 Updated Apr 28, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,181 12,186 Updated Dec 25, 2025
Verilog 1 Updated Jun 11, 2025

小猿口算 (Xiaoyuan Kousuan)

Python 1,342 167 Updated Nov 5, 2024

Helper scripts for 学在浙大 (Learning at ZJU) and 智云课堂 (Zhiyun Classroom)

JavaScript 47 2 Updated Dec 5, 2024