SeuTao

🎯

Focusing

Tao Shen SeuTao

🎯

Focusing

AI Developer / Kaggle Grandmaster / Engineer / Data Scientist / Researcher

989 followers · 27 following

ByteDance
Shanghai/Shenzhen
https://www.kaggle.com/shentao
@SeuTao1
https://scholar.google.com/citations?user=8cprenoAAAAJ&hl=zh-CN

Achievements

Stars

ddlBoJack / Omni-Captioner

[ICLR 2026] Data Pipeline, Models, and Benchmark for Omni-Captioner.

Python 136 Updated Apr 7, 2026

harry0703 / MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

Python 87,137 12,458 Updated Jun 13, 2026

ernie-research / NAVA

Official Code of NAVA: Native Audio-Visual Alignment for Generation.

Python 185 21 Updated Jun 8, 2026

character-ai / Ovi

Python 1,723 200 Updated Nov 15, 2025

MeiGen-AI / InfiniteTalk

Unlimited-length talking video generation that supports image-to-video and video-to-video generation

Python 6,884 1,212 Updated May 22, 2026

nv-tlabs / PiD

PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion

Python 726 36 Updated Jun 3, 2026

zghhui / OmniNFT

Code for "OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation"

Python 94 5 Updated Jun 1, 2026

microsoft / Lens

Lens is a 3.8B-parameter text-to-image diffusion model that achieves quality competitive with and in several cases surpassing models like FLUX and SD3, while requiring significantly less training c…

Python 239 17 Updated May 25, 2026

Visko-Platform / VEFX-Bench

VEFX-Bench: A Holistic Benchmark for Generic Video Editing and Visual Effects

Python 216 17 Updated May 16, 2026

inspatio / inspatio-world

Python 915 68 Updated Apr 13, 2026

OpenDCAI / OpenWorldLib

Unified Codebase for Advanced World Models.

Python 817 43 Updated Jun 11, 2026

etched-ai / open-oasis

Inference script for Oasis 500M

Python 2,100 181 Updated Nov 8, 2024

SkyworkAI / Matrix-Game

Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory

Python 2,235 237 Updated Mar 30, 2026

facebookresearch / tuna-2

Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation

Python 711 28 Updated Jun 9, 2026

TencentAI4S / tfold

open source code for Tencent tFold

Python 158 27 Updated Mar 14, 2025

HVision-NKU / ASID-Caption

ASID-Caption: Attribute-Structured and Quality-Verified Audiovisual Instruction Dataset and Training Pipeline for Fine-Grained Video Understanding.

Python 65 2 Updated Mar 3, 2026

lucidrains / d4rt

Implementation of D4RT, Efficiently Reconstructing Dynamic Scenes, from Deepmind

Python 70 Updated Jun 8, 2026

SandAI-org / MagiAttention

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 848 58 Updated Jun 13, 2026

vllm-project / vllm-omni

A framework for efficient model inference with omni-modality models

Python 5,130 1,106 Updated Jun 13, 2026

nv-tlabs / lyra

Project Lyra: Open Generative 3D World Models

Python 2,086 222 Updated Jun 11, 2026

Netflix / void-model

Python 1,895 178 Updated May 4, 2026

tulerfeng / Gen-Searcher

Gen-Searcher: Reinforcing Agentic Search for Image Generation

Python 365 33 Updated Apr 7, 2026

black-forest-labs / Self-Flow

[ICML'26] Code and website for Self-Flow: Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis

Python 508 19 Updated May 23, 2026

karpathy / autoresearch

AI agents running research on single-GPU nanochat training automatically

Python 86,491 12,529 Updated Mar 26, 2026

H-EmbodVis / HyDRA

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Python 252 14 Updated Apr 29, 2026

Robbyant / lingbot-map

A feed-forward 3D foundation model for reconstructing scenes from streaming data

Python 7,197 712 Updated Jun 2, 2026

Martinser / REG

[NeurIPS 2025 Oral] Representation Entanglement for Generation: Training Diffusion Transformers Is Much Easier Than You Think

Python 267 18 Updated Oct 4, 2025

sihyun-yu / REPA

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,647 94 Updated Mar 16, 2025

jd-opensource / JoyAI-Image

JoyAI-Image is the unified multimodal foundation model for image understanding, text-to-image generation, and instruction-guided image editing.

Python 2,169 157 Updated Jun 12, 2026

GAIR-NLP / daVinci-MagiHuman

Python 2,050 211 Updated Apr 11, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tao Shen SeuTao

Achievements

Achievements

Block or report SeuTao

Stars

ddlBoJack / Omni-Captioner

harry0703 / MoneyPrinterTurbo

ernie-research / NAVA

character-ai / Ovi

MeiGen-AI / InfiniteTalk

nv-tlabs / PiD

zghhui / OmniNFT

microsoft / Lens

Visko-Platform / VEFX-Bench

inspatio / inspatio-world

OpenDCAI / OpenWorldLib

etched-ai / open-oasis

SkyworkAI / Matrix-Game

facebookresearch / tuna-2

TencentAI4S / tfold

HVision-NKU / ASID-Caption

lucidrains / d4rt

SandAI-org / MagiAttention

vllm-project / vllm-omni

nv-tlabs / lyra

Netflix / void-model

tulerfeng / Gen-Searcher

black-forest-labs / Self-Flow

karpathy / autoresearch

H-EmbodVis / HyDRA

Robbyant / lingbot-map

Martinser / REG

sihyun-yu / REPA

jd-opensource / JoyAI-Image

GAIR-NLP / daVinci-MagiHuman