Skip to content
View alexzms's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Organizations

@FoundationResearch

Block or report alexzms

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)

Jupyter Notebook 883 148 Updated Mar 24, 2026

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 31,954 3,983 Updated Mar 24, 2026

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python 3,015 220 Updated Mar 23, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,390 949 Updated Mar 24, 2026

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,925 605 Updated May 3, 2024

[ICLR 2026] LLM/VLM gaming agents and model evaluation through games.

Python 896 96 Updated Nov 16, 2025
HTML 1 Updated Mar 22, 2026

Efficient Long-context Language Model Training by Core Attention Disaggregation

Python 97 7 Updated Mar 5, 2026

Terminal UI for NVIDIA Nsight Systems profiles — timeline viewer, kernel navigator, NVTX hierarchy

Python 45 8 Updated Mar 23, 2026

Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.

Python 789 66 Updated Mar 19, 2026

Building the Virtuous Cycle for AI-driven LLM Systems

Python 206 31 Updated Mar 23, 2026
Python 107 6 Updated Mar 12, 2026

VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs

Python 55 Updated Mar 9, 2025

Open-source unified multimodal model

Python 5,764 508 Updated Oct 27, 2025

Official repository for the paper "Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention"

28 1 Updated Feb 4, 2026

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 3,242 252 Updated Sep 12, 2025

[ICLR 2026] Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time

Python 353 16 Updated Oct 31, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,664 235 Updated Jun 17, 2025

💖🧸 Self hosted, you-owned Grok Companion, a container of souls of waifu, cyber livings to bring them into our worlds, wishing to achieve Neuro-sama's altitude. Capable of realtime voice chat, Minec…

TypeScript 35,372 3,517 Updated Mar 24, 2026

Helios: Real Real-Time Long Video Generation Model

Python 1,460 109 Updated Mar 24, 2026

Nano vLLM

Python 12,398 1,774 Updated Nov 3, 2025

Scalable Minecraft multiplayer data collection engine

JavaScript 122 5 Updated Mar 14, 2026
Python 4 1 Updated Mar 6, 2026

The world's best AI personal assistant for email. Open source app to help you reach inbox zero fast.

TypeScript 10,322 1,241 Updated Mar 24, 2026

A PyTorch-native inference engine with hybrid cache acceleration and massive parallelism for DiTs.

Python 1,107 66 Updated Mar 24, 2026

A powerful Web video editing UI framework, designed to help Web applications quickly integrate professional-grade video editing features.

TypeScript 208 56 Updated Mar 5, 2026

AI video agents framework for next-gen video interactions and workflows.

Python 1,335 216 Updated Jan 23, 2026

My Python scripts to make high-quality figures for publications in top AI conferences and journals.

Python 699 53 Updated Mar 18, 2026

Give git-like & traceable memory to OpenClaw and any coding agents. By https://memov.ai/ aka Entire CLI for every coding agents by MCP.

Python 170 18 Updated Feb 5, 2026

OpenTinker is an RL-as-a-Service infrastructure for foundation models

Python 650 61 Updated Mar 21, 2026
Next