Peking University
Shenzhen
Stars
Paper Debugger is the best overleaf companion
Helpful kernel tutorials and examples for tile-based GPU programming
openvla / openvla
Forked from TRI-ML/prismatic-vlms
OpenVLA: An open-source vision-language-action model for robotic manipulation.
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
Curated collection of papers in MoE model inference
PyTorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support
StreamingVLM: Real-Time Understanding for Infinite Video Streams
NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer
🌙 LunarVim is an IDE layer for Neovim. Completely free and community driven.
The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration"
A theme-driven, out-of-the-box modern configuration of neovim (HardHackerNvim)💎
A keyboard shortcut browser extension for keyboard-based navigation and tab operations with an advanced omnibar
🤙 Easy replacement for LaTeX Beamer! 🥂 custom Marp templates with a selection of over a dozen themes
[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.
HuggingFace conversion and training library for Megatron-based models
A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention
A Collection of Foundation Driving Models by OpenDriveLab
verl: Volcano Engine Reinforcement Learning for LLMs
A collection of AWESOME things about mixture-of-experts
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
Simple & Scalable Pretraining for Neural Architecture Research
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025)
WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning
Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradient