Skip to content
View cubele's full-sized avatar
😈
😈

Highlights

  • Pro

Block or report cubele

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A tiny yet powerful LLM inference system tailored for researching purpose. vLLM-equivalent performance with only 2k lines of code (2% of vLLM).

Python 302 35 Updated Jun 10, 2025

A lightweight design for computation-communication overlap.

Cuda 200 9 Updated Oct 10, 2025

A framework for generating realistic LLM serving workloads

Python 92 6 Updated Oct 9, 2025
C++ 15 5 Updated Sep 10, 2025
C++ 773 125 Updated Oct 29, 2025
HTML 227 47 Updated Dec 5, 2025

pop'n music 難易度表 自動記入ツール

JavaScript 3 3 Updated Oct 20, 2025

DeepSeek-V3/R1 inference performance simulator

Jupyter Notebook 174 26 Updated Mar 27, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,827 1,036 Updated Dec 23, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,947 288 Updated May 15, 2025

The ASPLOS 2025 / EuroSys 2025 Contest Track

37 5 Updated Aug 7, 2025

Summary of some awesome work for optimizing LLM inference

151 5 Updated Nov 30, 2025

ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale

C++ 493 173 Updated Dec 17, 2025

[SIGCOMM'23] DONS: Fast and Affordable Discrete Event Network Simulation with Automatic Parallelization.

C# 52 11 Updated Apr 11, 2024

my mac/linux config file。快速配置 mac 终端环境

Shell 218 102 Updated Dec 6, 2025

This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.

4,130 358 Updated Jan 25, 2024

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).

Dockerfile 96,619 10,733 Updated Dec 9, 2025

A Solidity starter template for developing smart contracts.

TypeScript 117 16 Updated Jul 4, 2023

SCIONLab user interface and administration

Python 10 17 Updated Nov 29, 2025

SCION Internet Architecture

Go 474 176 Updated Dec 22, 2025

Extensions for Wireshark

Lua 349 19 Updated Oct 30, 2018

High-speed packet processing framework

C 2,898 363 Updated Dec 22, 2025

Web-based Traffic and Security Network Traffic Monitoring

Lua 7,386 720 Updated Dec 23, 2025

Open Source Deep Packet Inspection Software Toolkit

C 4,289 963 Updated Dec 20, 2025

Open source components and extensions for n2disk

C 536 13 Updated Dec 5, 2025

Realtime Robust Malicious Traffic Detection via Frequency Domain Analysis

C++ 132 28 Updated Oct 30, 2023

Parallel sparse direct solver for circuit simulation

C 48 18 Updated Jun 13, 2022

A C++ project template for quick start.

CMake 3 2 Updated Mar 27, 2022

why not Rewrite It In Rust

732 8 Updated Sep 17, 2023
Next