  • MSRA

Starred repositories

PyTorch memory allocation visualizer

Rust 67 6 Updated Jul 14, 2025

A Quirky Assortment of CuTe Kernels

Python 799 80 Updated Feb 17, 2026

A library to analyze PyTorch traces.

Python 465 80 Updated Feb 4, 2026

Official Project Page for Deep Delta Learning (https://huggingface.co/papers/2601.00417)

Python 334 25 Updated Feb 5, 2026

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,353 984 Updated Feb 12, 2026

A cross-platform desktop All-in-One assistant tool for Claude Code, Codex, OpenCode & Gemini CLI.

TypeScript 18,602 1,160 Updated Feb 17, 2026

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 3,698 255 Updated Jan 14, 2026

Visual Skills Pack for Obsidian: generate Canvas, Excalidraw, and Mermaid diagrams from text with Claude Code

1,426 130 Updated Feb 12, 2026

An agentic skills framework & software development methodology that works.

Shell 53,203 4,044 Updated Feb 12, 2026

Multi-Level Triton Runner supporting Python, IR, PTX, and cubin.

Python 84 3 Updated Jan 27, 2026

mHC kernels implemented in CUDA

Cuda 252 19 Updated Jan 14, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 53,340 9,029 Updated Nov 12, 2025

Securely synchronize files with your devices on iOS using Syncthing

Swift 1,358 35 Updated Feb 15, 2026
Python 1,533 221 Updated Jun 26, 2025

CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-based computation patterns and optimizations targeting NVIDIA te…

MLIR 833 61 Updated Feb 13, 2026

"Paper2Slides: From Paper to Presentation in One Click"

Python 3,079 416 Updated Dec 31, 2025

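An AI-native PPT generator based on nano banana pro🍌, moving toward a true "Vibe PPT". Supports uploading any template image; uploading arbitrary materials with intelligent parsing; automatic PPT generation from a single sentence, an outline, or page descriptions; spoken edits to specified regions; and one-click export to an editable PPT.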

Python 12,025 1,401 Updated Feb 17, 2026

Build RL environments for LLM training

Python 655 63 Updated Feb 17, 2026

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,901 327 Updated Feb 14, 2026

Chimera: bidirectional pipeline parallelism for efficiently training large-scale models.

Python 70 9 Updated Mar 20, 2025
Python 856 67 Updated Dec 4, 2025

Automating analysis from trace files

Python 58 9 Updated Feb 16, 2026

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 1,926 116 Updated Feb 17, 2026

The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Jupyter Notebook 846 53 Updated Dec 20, 2025

Ring attention implementation with flash attention

Python 981 94 Updated Sep 10, 2025

Helpful tools and examples for working with flex-attention

Python 1,132 70 Updated Feb 8, 2026

My learning notes for ML SYS.

Python 5,355 348 Updated Jan 30, 2026

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 882 109 Updated Feb 17, 2026

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.

Python 1,018 85 Updated Sep 4, 2024

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

Python 856 124 Updated Feb 17, 2026