Peking University
Shenzhen
Stars
Paper Debugger is the best Overleaf companion
Helpful kernel tutorials and examples for tile-based GPU programming
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
Curated collection of papers in MoE model inference
PyTorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support
StreamingVLM: Real-Time Understanding for Infinite Video Streams
NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer
🌙 LunarVim is an IDE layer for Neovim. Completely free and community driven.
The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration"
A theme-driven, out-of-the-box modern configuration of Neovim (HardHackerNvim)💎
A keyboard shortcut browser extension for keyboard-based navigation and tab operations with an advanced omnibar
🤙 Easy replacement for LaTeX Beamer! 🥂 custom Marp templates with a selection of over a dozen themes
[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.
HuggingFace conversion and training library for Megatron-based models
A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention
A Collection of Foundation Driving Models by OpenDriveLab
verl: Volcano Engine Reinforcement Learning for LLMs
A collection of AWESOME things about mixture-of-experts
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
Simple & Scalable Pretraining for Neural Architecture Research
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025)
WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning
Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradient
This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".