Lists (6)
Sort Name ascending (A-Z)
Stars
[Arxiv'25] IC-Custom: Diverse Image Customization via In-Context Learning
RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
微舆:人人可用的多Agent舆情分析助手,打破信息茧房,还原舆情原貌,预测未来走向,辅助决策!从0实现,不依赖任何框架。
Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time
ACE-Step: A Step Towards Music Generation Foundation Model
Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion
Identity-GRPO: Optimizing Multi-Human Identity-preserving Video Generation via Reinforcement Learning
[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing
MineContext is your proactive context-aware AI partner(Context-Engineering+ChatGPT Pulse)
A geometry-shader-based, global CUDA sorted high-performance 3D Gaussian Splatting rasterizer. Can achieve a 5-10x speedup in rendering compared to the vanialla diff-gaussian-rasterization.
MiMo-Audio: Audio Language Models are Few-Shot Learners
A curated collection of fun and creative examples generated with Nano Banana🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the community's development…
Code and data for AAAI'24 paper "Text2Analysis: A Benchmark of Table Question Answering with Advanced Data Analysis and Unclear Queries".
This repository provides an implementation of "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models" and "A Training-free LLM-base…
Open-source framework for conversational voice AI agents
📹 A more flexible framework that can generate videos at any resolution and creates videos from images.
HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
ModelTC / Wan2.2-Lightning
Forked from Wan-Video/Wan2.2Wan2.2-Lightning: Speed up wan2.2 model with distillation
4-steps distilled version of Wan2.2-TI2V-5B
Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
Unlimited-length talking video generation that supports image-to-video and video-to-video generation