tuidan

🎯

Focusing

Xin Wang tuidan

🎯

Focusing

CS PhD @ Ohio State University

22 followers · 5 following

Ohio State University
Columbus, OH, USA
20:37 (UTC +08:00)
https://scholar.google.com/citations?user=Q7yOQTMAAAAJ&hl=zh-CN

Achievements

x2 x2 x2

Achievements

x2 x2 x2

Organizations

Stars

deepseek-ai / TileKernels

A kernel library written in tilelang

Python 1,587 138 Updated Apr 23, 2026

dexmal / realtime-vla

Running VLA at 30Hz frame rate and 480Hz trajectory frequency

Python 571 41 Updated Feb 10, 2026

killthefullmoon / MMSpec

MMSpec: Benchmarking Speculative Decoding for Vision-Language Models

Python 33 2 Updated Mar 17, 2026

attention-survey / Efficient_Attention_Survey

A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention

298 5 Updated Dec 1, 2025

HallerPatrick / Awesome-LA-Papers

A curated list of resources related to linear attention mechanisms.

17 3 Updated Mar 16, 2025

hku-netexplo-lab / QSpec

[EMNLP 2025 Main Conference] QSpec: Speculative Decoding with Complementary Quantisation Schemes

Python 7 1 Updated Mar 9, 2026

AIoT-MLSys-Lab / MMDeepResearch-Bench

MMDeepResearch-Bench (MMDR)

Python 29 2 Updated Apr 1, 2026

sgl-project / SpecForge

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

Python 888 253 Updated Jun 14, 2026

vllm-project / vllm-omni

A framework for efficient model inference with omni-modality models

Python 5,135 1,108 Updated Jun 14, 2026

Relaxed-System-Lab / Flash-Sparse-Attention

🚀🚀 Efficient implementations of Native Sparse Attention

Python 619 15 Updated Sep 29, 2025

sspec-project / SparseSpec

Accelerating Large-Scale Reasoning Model Inference with Sparse Self-Speculative Decoding

Python 109 8 Updated Dec 2, 2025

vipshop / cache-dit

A PyTorch-native inference engine with cache, parallelism, quantization and cpu offload for DiTs.

Python 1,199 75 Updated Jun 12, 2026

vllm-project / speculators

A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

Python 515 104 Updated Jun 13, 2026

Infini-AI-Lab / vortex_torch

Vortex: Programmable Sparse Attention for Agents as Algorithm Designers

Python 60 7 Updated Jun 8, 2026

Denghaoyuan123 / Awesome-RL-VLA

A Survey on Reinforcement Learning of Vision-Language-Action Models for Robotic Manipulation

740 22 Updated May 18, 2026

PKU-Alignment / VLA-Arena

VLA-Arena is an open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models.

Python 178 15 Updated Mar 14, 2026

microsoft / MInference

[NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filli…

Python 1,221 78 Updated Apr 8, 2026

deepseek-ai / FlashMLA

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,706 1,058 Updated Apr 30, 2026

OpenHelix-Team / VLA-Adapter

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Python 2,200 199 Updated Mar 19, 2026

fla-org / flash-linear-attention

🚀 Efficient implementations for emerging model architectures

Python 5,217 556 Updated Jun 11, 2026

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 6,116 895 Updated Jun 13, 2026

TencentARC / TokLIP

TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation

Python 236 6 Updated Aug 18, 2025

meituan-longcat / LongCat-Flash-Chat

1,338 67 Updated Jun 11, 2026

sgl-project / sgl-learning-materials

Materials for learning SGLang

843 64 Updated Jan 5, 2026

hemingkx / Spec-Bench

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Python 397 49 Updated Apr 22, 2025

GeeeekExplorer / nano-vllm

Nano vLLM

Python 14,020 2,211 Updated Apr 26, 2026

bilibili / Index-anisora

Python 2,461 145 Updated Jun 3, 2026

killthefullmoon / PhyX

PhyX: Does Your Model Have the "Wits" for Physical Reasoning?

Python 52 1 Updated Mar 16, 2026

Liu-xiandong / How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…

Cuda 1,315 181 Updated Jul 29, 2023

hemingkx / SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

1,254 80 Updated Jun 2, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Xin Wang tuidan

Achievements

Achievements

Organizations

Block or report tuidan

Stars

deepseek-ai / TileKernels

dexmal / realtime-vla

killthefullmoon / MMSpec

attention-survey / Efficient_Attention_Survey

HallerPatrick / Awesome-LA-Papers

hku-netexplo-lab / QSpec

AIoT-MLSys-Lab / MMDeepResearch-Bench

sgl-project / SpecForge

vllm-project / vllm-omni

Relaxed-System-Lab / Flash-Sparse-Attention

sspec-project / SparseSpec

vipshop / cache-dit

vllm-project / speculators

Infini-AI-Lab / vortex_torch

Denghaoyuan123 / Awesome-RL-VLA

PKU-Alignment / VLA-Arena

microsoft / MInference

deepseek-ai / FlashMLA

OpenHelix-Team / VLA-Adapter

fla-org / flash-linear-attention

THUDM / slime

TencentARC / TokLIP

meituan-longcat / LongCat-Flash-Chat

sgl-project / sgl-learning-materials

hemingkx / Spec-Bench

GeeeekExplorer / nano-vllm

bilibili / Index-anisora

killthefullmoon / PhyX

Liu-xiandong / How_to_optimize_in_GPU

hemingkx / SpeculativeDecodingPapers