Stars
AirPods liberated from Apple's ecosystem.
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
About [ICCV'25 FAS Workshop] Applying Semantic Anchor in Face Anti-Spoofing Detection for Unified Physical-Digital Attacks
[CVPR2023] Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution
Latest Advances on System-2 Reasoning
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
real time face swap and one-click video deepfake with only a single image
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …
[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning
Official inference framework for 1-bit LLMs
A comprehensive benchmark of deepfake detection
Python tool for converting files and office documents to Markdown.
[CVPR 2024] Code release for TransNeXt model
[NeurIPS 2024] SHMT: Self-supervised Hierarchical Makeup Transfer via Latent Diffusion Models
Pytorch Implementation of "Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model" (SIGGRAPH 2025)
[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
Blind Face Restoration via Deep Multi-scale Component Dictionaries (ECCV 2020)
[CVPR 2023] Collaborative Diffusion
“让爷康康”是一款手机 AI 应用程序,可以监测不良坐姿并进行语音提示
[Pattern Recognition 25] CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
Baidu Rope3d detector based on yolov7
Image forgery recognition algorithm
Implementation of the "Learn No to Say Yes Better" paper.