Highlights
- Pro
Lists (15)
Sort Name ascending (A-Z)
Starred repositories
CrowdPose: Efficient Crowded Scenes Pose Estimation and A New Benchmark, CVPR 2019, Oral
[ECCV 2024] WiMANS: A Benchmark Dataset for WiFi-based Multi-user Activity Sensing
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
DecodeX: Exploring and Benchmarking of LDPC Decoding across CPU, GPU, and ASIC Platforms
High-speed Large Language Model Serving for Local Deployment
Scalable Edge-Assisted Serving Framework for Interactive LLMs [NeurIPS 2025 Spotlight]
Real-Time Inference of 5G NR Multi-user MIMO Neural Receivers
Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond
📰 Must-read papers and blogs on Speculative Decoding ⚡️
[NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning
A compilation of notes for E2E RAN and Core Slicing, SRv6, L4S from my work at NYU Wireless. Potentially helpful for 5G research, OpenAirInterface users and develelopers.
A list of awesome papers and cool resources on WiFi CSI sensing.
Python 5G toolbox provide 3GPP 5G NR physical layer high-phy and low-phy libraries. It has passed 60K+ testcases which were generated from Matlab 5G toolbox.
Code for papers Linear Algebra with Transformers (TMLR) and What is my Math Transformer Doing? (AI for Maths Workshop, Neurips 2022)
☀️ [ArXiv 2025] Rasterizing Wireless Radiance Field via Deformable 2D Gaussian Splatting
Hackable and optimized Transformers building blocks, supporting a composable construction.
[OSDI 2025] Bayesian Code Diffusion for Efficient Automatic Deep Learning Program Optimization
An open-source AI agent that brings the power of Gemini directly into your terminal.
UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer