Skip to content
View soeaver's full-sized avatar
  • BUPT
  • Beijing

Block or report soeaver

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Depth Anything 3

Python 3,685 319 Updated Dec 12, 2025
Python 173 21 Updated May 22, 2025

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 1,860 111 Updated Dec 8, 2025

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 6,444 750 Updated Dec 21, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,162 195 Updated Oct 9, 2025

Universal memory layer for AI Agents

Python 44,636 4,853 Updated Dec 17, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 9,002 664 Updated Nov 20, 2025

Easily train a good VC model with voice data <= 10 mins!

Python 33,507 4,772 Updated Nov 24, 2024

Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoning

Python 131 7 Updated Jun 30, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,055 1,275 Updated Oct 11, 2025

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 18,789 2,351 Updated Dec 23, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 51,441 8,976 Updated Nov 17, 2025

Yet Another Document Translator

Python 6,250 459 Updated Dec 11, 2025

Train your AI self, amplify you, bridge the world

Python 14,806 1,135 Updated Sep 30, 2025

[ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark

Python 175 10 Updated Oct 15, 2025

[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding

Python 492 10 Updated Nov 14, 2025

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 1,210 66 Updated Feb 25, 2025

[CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence

Python 1,368 176 Updated Sep 26, 2025

Official implementation of the WACV 2025 ( Oral ) paper. RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positive Supervision.

Python 304 30 Updated Mar 18, 2025

Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496

Jupyter Notebook 91 5 Updated Apr 13, 2025

Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed

Jupyter Notebook 108 6 Updated Oct 25, 2024

EVE Series: Encoder-Free Vision-Language Models from BAAI

Python 362 12 Updated Jul 24, 2025
Python 26 Updated Feb 27, 2025

Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding

Python 209 8 Updated Oct 15, 2025

[CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

Python 1,638 139 Updated Oct 7, 2025
Python 22 4 Updated Aug 9, 2024

[NeurIPS 2023] HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception

Python 44 6 Updated Mar 25, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,873 371 Updated Dec 17, 2025

Pytorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025)

Python 212 10 Updated Nov 12, 2025
Next