jiamings

🚤

win8

Jiaming Song jiamings

🚤

win8

@lumalabs

1.2k followers · 214 following

Luma AI
Palo Alto, CA
http://tsong.me

Achievements

x4 x3 x3

Achievements

x4 x3 x3

Organizations

Stars

meta-pytorch / torchft

Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)

Python 501 64 Updated Apr 3, 2026

PrimeIntellect-ai / prime-rl

Agentic RL Training at Scale

Python 1,379 290 Updated May 17, 2026

hxixixh / gumbel-distill

Official implementation of Gumbel Distillation for Parallel Text Generation

Python 13 1 Updated Mar 24, 2026

david3684 / flm

Official Codebase For paper "One-step Language Modeling via Continuous Denoising"

Python 133 8 Updated Apr 27, 2026

Luo-Yihong / TDM-R1

[ICML 2026][Ultra Powerful Few-Step Diffusion RL] TDM-R1: Reinforcing Few-Step Diffusion Models with Non-Differentiable Reward

Python 94 Updated May 17, 2026

PKU-YuanGroup / Helios

Helios: Real Real-Time Long Video Generation Model

Python 1,814 142 Updated Apr 16, 2026

lumalabs / tvm

Terminal Velocity Matching

Python 86 1 Updated Feb 14, 2026

EGalahad / vla-scratch

Python 346 14 Updated Feb 10, 2026

KoljaB / RealtimeSTT

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 9,787 835 Updated May 10, 2026

TEN-framework / ten-framework

Open-source framework for conversational voice AI agents

Python 10,582 1,279 Updated May 14, 2026

QwenLM / Qwen3-TTS

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 11,402 1,485 Updated Mar 17, 2026

OthmanAdi / planning-with-files

Claude Code skill implementing Manus-style persistent markdown planning — the workflow pattern behind the $2B acquisition.

Python 21,483 1,902 Updated May 16, 2026

slopus / happy

Mobile and Web client for Codex and Claude Code, with realtime voice, encryption and fully featured

TypeScript 20,846 1,725 Updated May 15, 2026

presenton / presenton

Open-Source AI Presentation Generator and API (Gamma, Beautiful AI, Decktopus Alternative)

JavaScript 5,021 977 Updated May 17, 2026

MiniMax-AI / VTP

Towards Scalable Pre-training of Visual Tokenizers for Generation

Python 483 14 Updated Apr 15, 2026

XiaomiMiMo / MiMo-Audio

MiMo-Audio: Audio Language Models are Few-Shot Learners

Python 1,042 103 Updated Mar 3, 2026

thu-ml / TurboDiffusion

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,500 256 Updated Apr 15, 2026

NVIDIA-NeMo / Automodel

🚀 Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

Python 505 154 Updated May 17, 2026

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 39,128 4,686 Updated Aug 19, 2024

poetiq-ai / poetiq-arc-agi-solver

This repository allows reproduction of Poetiq's record-breaking submission to the ARC-AGI-1 and ARC-AGI-2 benchmarks.

Python 1,274 214 Updated Dec 16, 2025

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 5,707 797 Updated May 14, 2026

boson-ai / higgs-audio

Text-audio foundation model from Boson AI

Python 8,059 621 Updated Jan 18, 2026

CodeGoat24 / UnifiedReward

Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex

Python 777 41 Updated Mar 19, 2026

krea-ai / realtime-video

Krea Realtime 14B. An open-source realtime AI video model.

Python 542 37 Updated Nov 13, 2025

fishaudio / fish-speech

SOTA Open Source TTS

Python 30,377 2,576 Updated May 12, 2026

OpenImagingLab / FlashVSR

[CVPR 2026] Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny co…

Python 1,600 131 Updated Dec 23, 2025

RLinf / RLinf

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 3,369 453 Updated May 17, 2026

xiquan-li / MeanAudio

[ACL 2026 Main] MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows

Python 139 17 Updated Sep 2, 2025

qiuzh20 / gated_attention

The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Jupyter Notebook 947 57 Updated Dec 20, 2025

NVlabs / rcm

rCM & Causal-rCM: Best Algorithms/Infrastructures for Bidirectional/Autoregressive Video Diffusion Distillation at Scale

Python 636 27 Updated May 16, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jiaming Song jiamings

Achievements

Achievements

Organizations

Block or report jiamings

Stars

meta-pytorch / torchft

PrimeIntellect-ai / prime-rl

hxixixh / gumbel-distill

david3684 / flm

Luo-Yihong / TDM-R1

PKU-YuanGroup / Helios

lumalabs / tvm

EGalahad / vla-scratch

KoljaB / RealtimeSTT

TEN-framework / ten-framework

QwenLM / Qwen3-TTS

OthmanAdi / planning-with-files

slopus / happy

presenton / presenton

MiniMax-AI / VTP

XiaomiMiMo / MiMo-Audio

thu-ml / TurboDiffusion

NVIDIA-NeMo / Automodel

suno-ai / bark

poetiq-ai / poetiq-arc-agi-solver

THUDM / slime

boson-ai / higgs-audio

CodeGoat24 / UnifiedReward

krea-ai / realtime-video

fishaudio / fish-speech

OpenImagingLab / FlashVSR

RLinf / RLinf

xiquan-li / MeanAudio

qiuzh20 / gated_attention

NVlabs / rcm