Ziyang Chen IFICL

Research Scientist at Luma AI. Ph.D. from the University of Michigan.

101 followers · 127 following

Lists (1)

Awesome-Paper

4 repositories

Stars

SonyResearch / Woosh

Public release of the Sound Effect Foundation model by Sony AI.

Python · 319 stars · 22 forks · Updated May 21, 2026

facebookresearch / dacvae

DACVAE

Python · 224 stars · 17 forks · Updated Dec 22, 2025

facebookresearch / sam-audio

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python · 3,528 stars · 319 forks · Updated May 26, 2026

Kai-46 / minFM

HTML · 175 stars · 9 forks · Updated Oct 27, 2025

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python · 20,162 stars · 2,095 forks · Updated Jun 9, 2026

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python · 16,213 stars · 2,010 forks · Updated Mar 17, 2026

boson-ai / higgs-audio

Text-audio foundation model from Boson AI

Python · 8,194 stars · 629 forks · Updated Jun 5, 2026

MeiGen-AI / MultiTalk

[NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation

Python · 2,945 stars · 491 forks · Updated May 22, 2026

guandeh17 / Self-Forcing

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python · 3,397 stars · 274 forks · Updated Sep 12, 2025

gen-omnimatte / gen-omnimatte-public

Generative Omnimatte (CVPR 2025)

Python · 182 stars · 17 forks · Updated Jun 3, 2025

fishaudio / fish-speech

SOTA Open Source TTS

Python · 30,811 stars · 2,630 forks · Updated Jun 9, 2026

baaivision / NOVA

[ICLR 2025] Autoregressive Video Generation without Vector Quantization

Python · 651 stars · 22 forks · Updated Oct 29, 2025

Yaofang-Liu / Pusa-VidGen

Pusa: Thousands Timesteps Video Diffusion Model

Python · 683 stars · 46 forks · Updated Feb 13, 2026

harlanhong / ACTalker

ICCV 2025 ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control (e.g., audio, expression).

Python · 458 stars · 54 forks · Updated Aug 20, 2025

Lakonik / GMFlow

[ICML 2025] Gaussian Mixture Flow Matching Models (GMFlow)

Python · 191 stars · 7 forks · Updated Nov 7, 2025

JavisVerse / JavisDiT

[ICLR 2026] Official implementation of JavisDiT and JavisDiT++ series.

Python · 370 stars · 31 forks · Updated Mar 29, 2026

ErgastiAlex / TECMO-FLAV

Python · 11 stars · 1 fork · Updated Sep 22, 2025

WikiChao / FreSca

[CVPR 2025 GMCV] Test-Time Frequency Scaling: Instant Frequency Control for Any Diffusion Model

Python · 55 stars · 2 forks · Updated May 31, 2025

NVlabs / QLIP

[arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation

Jupyter Notebook · 97 stars · 3 forks · Updated Mar 1, 2025

yoyomimi / AV-Cloud

[NeurIPS 2024] AV-Cloud: Spatial Audio Rendering Through Audio-Visual Cloud Splatting

Python · 14 stars · 3 forks · Updated Nov 22, 2025

QwenLM / Qwen2.5-Omni

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook · 4,022 stars · 324 forks · Updated Jun 12, 2025

kuleshov-group / bd3lms

[ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Python · 1,012 stars · 77 forks · Updated Jul 10, 2025

haoyu-bu / CAFe

Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"

Python · 33 stars · 1 fork · Updated Mar 26, 2025

WeichenFan / CFG-Zero-star

Official repo for CFG-Zero*

Python · 704 stars · 26 forks · Updated May 2, 2025

MiZhenxing / ThinkDiff

ICML2025, I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models

Python · 192 stars · 8 forks · Updated Sep 7, 2025

tulip-berkeley / open_clip

Forked from mlfoundations/open_clip

An open source implementation of CLIP (With TULIP Support)

Python · 165 stars · 3 forks · Updated May 14, 2025

tianweiy / CausVid

ZeyueT / AudioX

lumalabs / imm

sakshamsingh1 / vintage_aud_gen