Skip to content
View lingengyuan's full-sized avatar
🎯
Focusing
🎯
Focusing
  • china
  • 20:38 (UTC -12:00)

Block or report lingengyuan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
lingengyuan/README.md

Hugh Lin · 林耿塬

Pushing LLMs to the edge of what a single MacBook can do.

Independent researcher / engineer. I work on local-first AI on Apple Silicon — paper reproduction, optimizer studies, KV cache quantization, and the tooling that actually makes it all run.

If it can be done on a MacBook, I want to know exactly how well.


🔬 Selected work

gpt2-from-scratch — Rebuilt GPT-2 Small (124M) end-to-end on a MacBook Air M5. 1B FineWeb tokens, AdamW vs Muon, MLX port at 98% wall-clock efficiency, 9.14× KV cache compression with Δ PPL < 1.05. No cloud.

qjl-mlx — First MLX / Apple Silicon native port of QJL & TurboQuant (Google Research). Parity vs PyTorch reference to 1.79e-7 across 4 mode×codebook configs.

minimind-autoresearch — 17 autonomous ablations on a 26M Chinese LLM in 40 minutes. Muon beats best AdamW config by 23.8% val_loss on M5 MPS.

docflow — Fully local multi-format RAG. 100% offline, zero telemetry, retrieval Recall@5 = 1.0 on internal eval.

git-dungeon — A roguelike where your git commits are the monsters. CLI, Python, just for fun.


🛠️ Currently

  • 🇨🇳→🌐 Translating my Chinese technical writing into English releases
  • 📊 Compiling a single "Apple M5 LLM Performance Report" from the experiments above
  • 👀 Looking at: speculative decoding on MLX, 1-bit attention, MoE on unified memory

📫 Find me

GitHub issues on any of my repos.

"Take ideas that look good on paper and push them until they either work on a Mac or fail for a clear reason."

Popular repositories Loading

  1. my-skills my-skills Public

    Python 8 1

  2. minimind-autoresearch minimind-autoresearch Public

    Autoresearch: 17 hyperparameter experiments on MiniMind 26M Chinese LLM. Muon optimizer achieves 27.6% faster convergence on Apple Silicon MPS.

    Python 5

  3. x-algorithm-simulator x-algorithm-simulator Public

    TypeScript 4 1

  4. qjl-mlx qjl-mlx Public

    First MLX / Apple Silicon native implementation of QJL and TurboQuant

    Python 3 2

  5. git-dungeon git-dungeon Public

    Git Dungeon 是一个将 Git 提交历史游戏化的命令行 Roguelike:它把每个 commit 映射为可战斗的敌人,按章节推进,支持经验/金币成长与 AI 动态文 案(含缓存与限流回退),并配套完整的单元、功能与黄金回归测试。

    Python 1

  6. CodeSnippets CodeSnippets Public

    Python 1