Starred repositories
🔥 JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization
A framework for efficient model inference with omni-modality models
High quality training free inpaint for every stable diffusion model. Supports ComfyUI
A next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…
Official PyTorch Code for "OmniAID: Decoupling Semantic and Artifacts for Universal AI-Generated Image Detection in the Wild".
RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards.
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
[ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer
Fish-like autosuggestions for zsh
A cd command that learns - easily navigate directories from the command line
🎯 告别信息过载,AI 助你看懂新闻资讯热点,简单的舆情监控分析 - 多平台热点聚合+基于 MCP 的AI分析工具。监控35个平台(抖音、知乎、B站、华尔街见闻、财联社等),智能筛选+自动推送+AI对话分析(用自然语言深度挖掘新闻:趋势追踪、情感分析、相似检索等13种工具)。支持企业微信/个人微信/飞书/钉钉/Telegram/邮件/ntfy/bark/slack 推送,1分钟手机通知,无需…
Elegant reading of real-time and hottest news
mair-lab / EARL
Forked from saba96/EARLEARL: Editing with Autoregression and RL
MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE
A curated list of Diffusion Model in RL resources (continually updated)
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
DiffusionNFT: Online Diffusion Reinforcement with Forward Process
A high-throughput and memory-efficient inference and serving engine for LLMs
Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024)
[NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark
A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.