View Decem-Y's full-sized avatar
:octocat:
Doing cool stuff
  • Chengdu, Sichuan
  • 02:31 (UTC +08:00)


Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Python 282 11 Updated Oct 27, 2025

For the paper https://arxiv.org/abs/2507.10638

Jupyter Notebook 1 Updated Aug 11, 2025

[DEIMv2] Real Time Object Detection Meets DINOv3

Jupyter Notebook 933 89 Updated Nov 3, 2025

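🥢 Cook like Lao Xiang Ji 🐔. The main part was completed in 2024; this is not an official Lao Xiang Ji repository. The text comes from the "Lao Xiang Ji Dish Traceability Report" and has been summarized, edited, and organized. CookLikeHOC.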

JavaScript 21,915 2,202 Updated Oct 17, 2025

Official implementation of "Visual Instruction Pretraining for Domain-Specific Foundation Models"

Python 81 1 Updated Oct 21, 2025

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

Python 2,021 214 Updated Oct 9, 2025

Official Repository for QuantAgent

HTML 238 55 Updated Nov 4, 2025

For the benefit of everyone who needs to run animal experiments

Vue 13 1 Updated Nov 5, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 16,925 1,291 Updated Nov 3, 2025

Flash-Muon: An Efficient Implementation of Muon Optimizer

Python 205 13 Updated Jun 15, 2025

Awesome Monocular 3D detection

430 47 Updated Dec 6, 2024

Voice Activity Detector (VAD) : low-latency, high-performance and lightweight

C 1,555 129 Updated Oct 15, 2025

Accelerating cosyvoice2 inference with vllm

Jupyter Notebook 443 53 Updated Apr 26, 2025
Python 14 1 Updated Jun 22, 2025
Python 40 6 Updated Jul 15, 2025

A data synthesis tool that simply and efficiently synthesizes large-model training data for different business scenarios

Python 31 7 Updated Jan 2, 2025

Multi-Scale Convolutional Transformer Network for Motor Imagery Brain-Computer Interface

Python 109 2 Updated Jul 19, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,673 366 Updated Oct 21, 2025

Official repository for the Boltz biomolecular interaction models

Python 3,429 673 Updated Oct 3, 2025

An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.

TypeScript 18,332 2,547 Updated Nov 5, 2025

Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]

Python 730 39 Updated Sep 19, 2025

Text-audio foundation model from Boson AI

Python 7,570 559 Updated Sep 15, 2025

[CVPR 2025] The official implementation of EMRDM, which is a novel diffusion model for cloud removal of remote sensing images.

Python 38 Updated Jul 3, 2025

Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes

Python 473 59 Updated Jul 20, 2025

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 828 54 Updated May 14, 2025
Python 33 3 Updated Jun 4, 2025

🕵️‍♂️🔊 Automatically update Audio Deepfake Detection (ADD) papers daily using GitHub Actions (updates every 12 hours)

Python 10 Updated Nov 5, 2025

UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition

Python 422 33 Updated Sep 28, 2025

[ICML 2025] Official repository of the TQNet paper: "Temporal Query Network for Efficient Multivariate Time Series Forecasting". This work is developed by the Lab of Professor Weiwei Lin (linww@scu…

Python 69 10 Updated Jun 11, 2025

An agent benchmark with tasks in a simulated software company.

Python 579 94 Updated Oct 12, 2025