-
UR & PKU & UESTC
-
01:17
(UTC -04:00) - https://infaaa.github.io/
- @vhjf36495872
- in/jinfa-huang-262a2929b
Highlights
- Pro
Lists (10)
Sort Name ascending (A-Z)
Starred repositories
[CVPR 2026] Official codes of "Monet: Reasoning in Latent Visual Space Beyond Image and Language"
Official Implementation of LongLive-RAG: A general retrieval-augmented framework for long video generation.
Visualize Your Ideas With Code
Code release for "i1: A Simple and Fully Open Recipe for Strong Text-to-Image Models"
Official repo for paper "Echo-Infinity: Learnable Evolving Memory for Real-Time Infinite Video Generation"
MoReGen: Multi-Agent Motion-Reasoning Engine for Code-based Text-to-Video Synthesis (CVPR 2026)
UniRL is a Framework for Unified Multimodal Model Reinforcement Learning
Multimodal RL training framework for diffusion & omni models
A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.
Programmatic video for coding agents — HTML to video on your laptop. Turn HTML, CSS & data into real MP4s with pluggable render engines, 21 templates, AI soundtrack. Apache-2.0, no per-render fees.…
DSPy: The framework for programming—not prompting—language models
Ideogram 4: Open image model at the forefront of design
JoyAI-Echo: Pushing the Frontier of Long Audio-Visual Generation
[ICLR-26, NeurIPS-25] Lumos-Custom Project: research for customized video generation in the Lumos Project.
Official Code for GPIC: A Giant Permissive Image Corpus for Visual Generation
A Comprehensive Survey of Interactive Video World Models
We propose Bidirectional Evolutionary Search (BES), a search framework that couples forward candidate evolution with backward goal decomposition.
PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion
Official code for "Stitched Value Model for Diffusion Alignment"
Official repository for the paper "Advancing Narrative Long Video Generation via Training-Free Identity-Aware Memory"
Local AI filmmaking studio — skills, canvas, timeline — driven from your coding agent.