soeaver
  • BUPT
  • Beijing

Depth Anything 3

Python 3,638 311 Updated Dec 12, 2025
Python 169 20 Updated May 22, 2025

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 1,832 108 Updated Dec 8, 2025

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 6,304 729 Updated Dec 21, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,145 193 Updated Oct 9, 2025

Universal memory layer for AI Agents

Python 44,537 4,838 Updated Dec 17, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,884 655 Updated Nov 20, 2025

Easily train a good VC model with voice data <= 10 mins!

Python 33,464 4,766 Updated Nov 24, 2024

Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning

Python 131 7 Updated Jun 30, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,030 1,272 Updated Oct 11, 2025

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 18,754 2,354 Updated Dec 21, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 51,392 8,969 Updated Nov 17, 2025

Yet Another Document Translator

Python 6,230 456 Updated Dec 11, 2025

Train your AI self, amplify you, bridge the world

Python 14,795 1,135 Updated Sep 30, 2025

[ICCV 2025] Referring to any person or object given a natural language description. Code base for RexSeek and HumanRef Benchmark

Python 174 10 Updated Oct 15, 2025

[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding

Python 490 10 Updated Nov 14, 2025

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 1,209 66 Updated Feb 25, 2025

[CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence

Python 1,360 176 Updated Sep 26, 2025

Official implementation of the WACV 2025 (Oral) paper. RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positive Supervision.

Python 301 30 Updated Mar 18, 2025

Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496

Jupyter Notebook 91 5 Updated Apr 13, 2025

Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed

Jupyter Notebook 108 6 Updated Oct 25, 2024

EVE Series: Encoder-Free Vision-Language Models from BAAI

Python 361 12 Updated Jul 24, 2025
Python 25 Updated Feb 27, 2025

Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding

Python 209 8 Updated Oct 15, 2025

[CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

Python 1,628 139 Updated Oct 7, 2025
Python 22 4 Updated Aug 9, 2024

[NeurIPS 2023] HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception

Python 44 6 Updated Mar 25, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,870 371 Updated Dec 17, 2025

PyTorch implementation of "SMITE: Segment Me In TimE" (ICLR 2025)

Python 212 10 Updated Nov 12, 2025