FirwoodLin

💭

🎣

Fir Wood FirwoodLin

💭

🎣

18 followers · 35 following

Shanghai, China
00:05 (UTC +08:00)

Achievements

Highlights

Lists (6)

Sort

Stars

Yeachan-Heo / oh-my-codex

OmX - Oh My codeX: Your codex is not alone. Add hooks, agent teams, HUDs, and so much more.

TypeScript 16,387 1,555 Updated Apr 5, 2026

jax-ml / scaling-book

Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs

HTML 896 128 Updated Mar 15, 2026

ModelTC / LightLLM

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 3,995 317 Updated Apr 3, 2026

obra / superpowers

An agentic skills framework & software development methodology that works.

Shell 135,921 11,405 Updated Apr 2, 2026

233stone / vocotype-cli

VocoType 是一款运行在本地端侧的隐私安全语音输入工具，通过快捷键即可将语音实时转换为文字并自动输入到当前应用。支持语音转文字MCP、AI 优化文本、自定义替换词典、录音视频转文字等功能，让语音输入更高效、更安全。

Python 514 52 Updated Mar 23, 2026

pengsida / learning_research

本人的科研经验

11,136 575 Updated Mar 7, 2026

byungsoo-oh / ml-systems-papers

Curated collection of papers in machine learning systems

532 36 Updated Feb 7, 2026

vllm-project / vllm-omni

A framework for efficient model inference with omni-modality models

Python 4,125 693 Updated Apr 5, 2026

High-Logic / Genie-TTS

GPT-SoVITS ONNX Inference Engine & Model Converter

Python 1,474 101 Updated Apr 1, 2026

Yanlewen / TradeTrap

🧨 TradeTrap: Are LLM-based Trading Agents Truly Reliable and Faithful?

Python 74 13 Updated Nov 27, 2025

zhuzilin / ring-flash-attention

Ring attention implementation with flash attention

Python 1,001 97 Updated Sep 10, 2025

jd-opensource / xllm

A high-performance inference engine for LLMs, optimized for diverse AI accelerators.

C++ 1,167 168 Updated Apr 5, 2026

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes for ML SYS.

Python 5,896 383 Updated Apr 3, 2026

KuangjuX / NVSHMEM-Tutorial

NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer

Cuda 173 14 Updated Feb 11, 2026

uccl-project / uccl

UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)

C++ 1,276 131 Updated Apr 5, 2026