Starred repositories
- A library to analyze PyTorch traces.
- Official project page for Deep Delta Learning (https://huggingface.co/papers/2601.00417).
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
- A cross-platform, all-in-one desktop assistant for Claude Code, Codex, OpenCode & Gemini CLI.
- Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
- Visual Skills Pack for Obsidian: generate Canvas, Excalidraw, and Mermaid diagrams from text with Claude Code.
- An agentic skills framework & software development methodology that works.
- Multi-Level Triton Runner supporting Python, IR, PTX, and cubin.
- The simplest, fastest repository for training/finetuning medium-sized GPTs.
- Securely synchronize files with your devices on iOS using Syncthing.
- CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-based computation patterns and optimizations targeting NVIDIA te…
- Paper2Slides: From Paper to Presentation in One Click
- An AI-native PPT generator built on nano banana pro 🍌, aiming at a true "Vibe PPT": upload any template image; upload any assets with smart parsing; auto-generate a PPT from a one-sentence prompt, an outline, or per-page descriptions; verbally revise specified regions; export an editable PPT in one click.
- Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️ 🍸 🍹 🍷
- Chimera: bidirectional pipeline parallelism for efficiently training large-scale models.
- cuTile is a programming model for writing parallel kernels for NVIDIA GPUs.
- The official implementation of the NeurIPS 2025 Oral paper "Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free".
- Ring attention implementation with flash attention.
- Helpful tools and examples for working with flex-attention (see the sketch after this list).
- My learning notes for ML systems.
- Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
- FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16–32 tokens.
- Evaluate and enhance your LLM deployments for real-world inference needs.
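
For context on the flex-attention item above, here is a minimal sketch of the PyTorch `flex_attention` API that the repo builds on (`torch.nn.attention.flex_attention`, torch >= 2.5). The shapes, the causal `mask_mod`, and the ALiBi-style `score_mod` are illustrative assumptions, not code taken from that repo.

```python
# Minimal flex_attention sketch; assumes a CUDA-enabled PyTorch >= 2.5 build.
import torch
from torch.nn.attention.flex_attention import flex_attention, create_block_mask

B, H, S, D = 2, 4, 256, 64  # batch, heads, sequence length, head dim (arbitrary)
q, k, v = (torch.randn(B, H, S, D, device="cuda") for _ in range(3))

# mask_mod returns True where attention is allowed; here, standard causal masking.
def causal(b, h, q_idx, kv_idx):
    return q_idx >= kv_idx

block_mask = create_block_mask(causal, B=None, H=None, Q_LEN=S, KV_LEN=S, device="cuda")

# score_mod rewrites each pre-softmax score; here, an ALiBi-style distance penalty
# with a per-head slope of 2^-(h+1).
def alibi(score, b, h, q_idx, kv_idx):
    return score - (q_idx - kv_idx) * torch.exp2(-(h + 1.0))

out = flex_attention(q, k, v, score_mod=alibi, block_mask=block_mask)
print(out.shape)  # torch.Size([2, 4, 256, 64])
```

The appeal of this API is that mask and bias variants are expressed as small Python callables rather than hand-written kernels; under `torch.compile` they are fused into a single attention kernel.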