alexzms

Follow

🎯

Focusing

alexzms alexzms

🎯

Focusing

Follow

MLsys / Long-context modeling

17 followers · 33 following

UC San Diego
La Jolla
01:28 (UTC -07:00)
alexzms.github.io
in/minshen-zhang-416a0b291

Achievements

Achievements

Highlights

Pro

Organizations

Lists (1)

Sort

🚀 My stack

Stars

ScalingIntelligence / KernelBench

KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)

Jupyter Notebook 883 148 Updated Mar 24, 2026

tinygrad / tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 31,954 3,983 Updated Mar 24, 2026

facebookresearch / map-anything

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python 3,015 220 Updated Mar 23, 2026

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,390 949 Updated Mar 24, 2026

jzhang38 / TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,925 605 Updated May 3, 2024

lmgame-org / GamingAgent

[ICLR 2026] LLM/VLM gaming agents and model evaluation through games.

Python 896 96 Updated Nov 16, 2025

hao-ai-lab / cse291-s26

HTML 1 Updated Mar 22, 2026

hao-ai-lab / DistCA

Efficient Long-context Language Model Training by Core Attention Disaggregation

Python 97 7 Updated Mar 5, 2026

GindaChen / nsys-ai

Terminal UI for NVIDIA Nsight Systems profiles — timeline viewer, kernel navigator, NVTX hierarchy

Python 45 8 Updated Mar 23, 2026

RightNow-AI / autokernel

Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.

Python 789 66 Updated Mar 19, 2026

flashinfer-ai / flashinfer-bench

Building the Virtuous Cycle for AI-driven LLM Systems

Python 206 31 Updated Mar 23, 2026

Jacky-hate / HiAR

Python 107 6 Updated Mar 12, 2026

CASIA-IVA-Lab / VideoNIAH

VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs

Python 55 Updated Mar 9, 2025

ByteDance-Seed / Bagel

Open-source unified multimodal model

Python 5,764 508 Updated Oct 27, 2025

chengtao-lv / LightForcing

Official repository for the paper "Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention"

28 1 Updated Feb 4, 2026

guandeh17 / Self-Forcing

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 3,242 252 Updated Sep 12, 2025

TencentARC / RollingForcing

[ICLR 2026] Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time

Python 353 16 Updated Oct 31, 2025

SandAI-org / MAGI-1

MAGI-1: Autoregressive Video Generation at Scale

Python 3,664 235 Updated Jun 17, 2025

moeru-ai / airi

💖🧸 Self hosted, you-owned Grok Companion, a container of souls of waifu, cyber livings to bring them into our worlds, wishing to achieve Neuro-sama's altitude. Capable of realtime voice chat, Minec…

TypeScript 35,372 3,517 Updated Mar 24, 2026

PKU-YuanGroup / Helios

Helios: Real Real-Time Long Video Generation Model

Python 1,460 109 Updated Mar 24, 2026

GeeeekExplorer / nano-vllm

Nano vLLM

Python 12,398 1,774 Updated Nov 3, 2025

solaris-wm / solaris-engine

Scalable Minecraft multiplayer data collection engine

JavaScript 122 5 Updated Mar 14, 2026

H1yori233 / TriBench

Python 4 1 Updated Mar 6, 2026

elie222 / inbox-zero

The world's best AI personal assistant for email. Open source app to help you reach inbox zero fast.

TypeScript 10,322 1,241 Updated Mar 24, 2026

vipshop / cache-dit

A PyTorch-native inference engine with hybrid cache acceleration and massive parallelism for DiTs.

Python 1,107 66 Updated Mar 24, 2026

tangshuang / webcut

A powerful Web video editing UI framework, designed to help Web applications quickly integrate professional-grade video editing features.

TypeScript 208 56 Updated Mar 5, 2026

video-db / Director

AI video agents framework for next-gen video interactions and workflows.

Python 1,335 216 Updated Jan 23, 2026

ChenLiu-1996 / figures4papers

My Python scripts to make high-quality figures for publications in top AI conferences and journals.

Python 699 53 Updated Mar 18, 2026

memovai / memov

Give git-like & traceable memory to OpenClaw and any coding agents. By https://memov.ai/ aka Entire CLI for every coding agents by MCP.

Python 170 18 Updated Feb 5, 2026

open-tinker / OpenTinker

OpenTinker is an RL-as-a-Service infrastructure for foundation models

Python 650 61 Updated Mar 21, 2026