- Chendu, Sichuan
-
02:31
(UTC +08:00)
Lists (1)
Sort Name ascending (A-Z)
Stars
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation
For the paper of https://arxiv.org/abs/2507.10638
[DEIMv2] Real Time Object Detection Meets DINOv3
🥢像老乡鸡🐔那样做饭。主要部分于2024年完工,非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》,并做归纳、编辑与整理。CookLikeHOC.
Offical implementation of "Visual Instruction Pretraining for Domain-Specific Foundation Models"
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Flash-Muon: An Efficient Implementation of Muon Optimizer
Awesome Monocular 3D detection
Voice Activity Detector (VAD) : low-latency, high-performance and lightweight
Multi-Scale Convolutional Transformer Network for Motor Imagery Brain-Computer Interface
Solve Visual Understanding with Reinforced VLMs
Official repository for the Boltz biomolecular interaction models
An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.
Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]
Text-audio foundation model from Boson AI
[CVPR 2025] The official implementation of EMRDM, which is a novel diffusion model for cloud removal of remote sensing images.
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
🕵️♂️🔊 Automatically update Audio Deepfake Detection (ADD) papers daily using GitHub Actions (updates every 12 hours)
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
[ICML 2025] Official repository of the TQNet paper: "Temporal Query Network for Efficient Multivariate Time Series Forecasting". This work is developed by the Lab of Professor Weiwei Lin (linww@scu…
An agent benchmark with tasks in a simulated software company.