[ICLR 2026] AudioMCQ: A 571k audio multiple-choice question dataset for post-training Large Audio Language Models with dual CoT annotations and audio-contribution filtering. 🏆 1st place in DCASE 20…

Python 51 4 Updated Apr 21, 2026

xhyumiracle / Awesome-AgenticLLM-RL-Papers

1,813 79 Updated Jan 20, 2026

yayafengzi / LMM-HiMTok

HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model

Python 95 4 Updated Jul 17, 2025

mbzuai-oryx / groundingLMM

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 959 56 Updated Aug 5, 2025

SCLBD / DeepfakeBench

A comprehensive benchmark of deepfake detection

Python 1,055 186 Updated Aug 20, 2025

Purdue-M2 / AI-Face-FairnessBench

We introduce AI-Face, the first million-scale AI-generated face dataset with demographic annotations, and conduct a comprehensive fairness benchmark. Our work has been accepted at CVPR 2025.

Python 94 7 Updated Mar 2, 2026

hacksider / Deep-Live-Cam

real time face swap and one-click video deepfake with only a single image

Python 93,873 13,689 Updated Jun 14, 2026

PKU-YuanGroup / UAE

Official repository for the UAE paper, unified-GRPO, and unified-Bench

Python 165 7 Updated Sep 12, 2025

deepinsight / insightface

State-of-the-art 2D and 3D Face Analysis Project

Python 29,000 6,040 Updated May 23, 2026

langchain-ai / open_deep_research

Python 11,709 1,674 Updated Jun 7, 2026

AIGText / Glyph-ByT5

[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Mu…

Jupyter Notebook 622 31 Updated Sep 5, 2025

VectorSpaceLab / OmniGen2

OmniGen2: Exploration to Advanced Multimodal Generation. https://arxiv.org/abs/2506.18871

Jupyter Notebook 4,087 29 Updated Mar 20, 2026

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 25,627 1,892 Updated Jul 31, 2025

WeChatCV / opencv_3rdparty

Forked from opencv/opencv_3rdparty

OpenCV - 3rdparty

451 118 Updated Jul 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dandanJing

Achievements

Achievements

Block or report dandanJing

Stars

SwinTransformer / Video-Swin-Transformer

venus-guangjian / Venus-DeFakerOne

NousResearch / hermes-agent

QwenLM / Qwen-Image

facebookresearch / videoseal

Stability-AI / invisible-watermark-gpu

guofei9987 / blind_watermark

karpathy / autoresearch

Gavinic / ForensicsAI

openclaw / openclaw

handsome-rich / MIRROR

modelscope / AgentEvolver

inclusionAI / AudioMCQ