Stars
[NeurIPS 2025] JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
🔥[CVPR 2025 Highlight, Official Code] for paper "Rethinking Personalized Aesthetics Assessment: Employing Physique Aesthetics Assessment as An Exemplification". Official Weights and Demos provided.…
Aiming to integrate most existing feature caching-based diffusion acceleration schemes into a unified framework.
This repository contains demos I made with the Transformers library by HuggingFace.
Continuous Conditional Generative Adversarial Networks (CcGAN)
Official repository for the paper "Instance-Conditioned GAN" by Arantxa Casanova, Marlene Careil, Jakob Verbeek, Michał Drożdżal, Adriana Romero-Soriano.
The official code of CVPR 2024: VRetouchEr: Semi-supervised High-Quality Face Retouching Transformer with Prior-Based Selective Self-Attention.
The official code of AAAI 2024: RetouchFormer: Semi-supervised High-Quality Face Retouching Transformer with Prior-Based Selective Self-Attention.
Spatial Transcriptomic Analysis using Reference-Free auxiliarY deep generative modeling and Shared Histology
LaTeX templates for master's and doctoral theses at South China University of Technology
Official inference repo for FLUX.1 models
Open-Sora: Democratizing Efficient Video Production for All
Nodes for image juxtaposition for Flux in ComfyUI
Solve Visual Understanding with Reinforced VLMs
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
HunyuanVideo Keyframe Control Lora is an adapter for HunyuanVideo T2V model for keyframe-based video generation
HunyuanVideo: A Systematic Framework For Large Video Generation Model
ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Dueling DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
[CVPR 2025] Official code repository for "Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach"
[NeurIPS 2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat
[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming