Stars
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
Official impl. of "MagicMirror: A Large-Scale Dataset and Benchmark for Fine-Grained Artifacts Assessment in Text-to-Image Generation"
The official repository of "So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection"
Bridging the Gap Between Ideal and Real-world Evaluation: Benchmarking AI-Generated Image Detection in Challenging Scenarios
[ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
Real-Time Deepfake Detection in the Real-World
[ICLR 2026 Oral] Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning.
Code for paper: Reinforced Vision Perception with Tools
Control Claude Code remotely via email, Discord, or Telegram. Start tasks locally, receive notifications when Claude completes them, and send new commands by simply replying to emails.
Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs
A simple yet powerful agent framework that delivers with open-source models
Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"
[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training
[ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)
VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection
DetToolChain: A new prompting paradigm to unleash detection ability of MLLM
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
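Latest 2026 guide to Claude top-ups and subscriptions, including Claude registration, buying Claude accounts, sharing a Claude subscription, Claude Pro top-up services, and a tutorial for using Claude Code in mainland China!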
MedRAX: Medical Reasoning Agent for Chest X-ray - ICML 2025
[NeurIPS 2025] JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent
[NeurIPS 2025 Spotlight 🔥] Official implementation of 🛸 "UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface"
A flexible & scalable MLLM-based AIGC detection pipeline
Image forgery recognition algorithm