Azx030512

Highlights

  • Pro

Organizations

@PKU-OV3-LAB


[CVPR 2026 Oral] VGGT Omega

Python 3,157 149 Updated May 18, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 72,406 8,859 Updated Jun 23, 2026

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

Python 3,674 931 Updated Aug 26, 2022

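Programmer Yupi's AI resource collection + Vibe Coding tutorial for complete beginners, sharing a step-by-step OpenClaw tutorial, ways to use large models (DeepSeek / GPT / Gemini / Claude / GLM), the latest AI news, a prompt collection, an AI knowledge encyclopedia (Agent Skills / RAG / MCP / A2A), AI programming tutorials (Harness Engineering), AI tool usage…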

JavaScript 16,215 1,817 Updated Jun 20, 2026

Code for the ShapeR research paper

Python 793 60 Updated Apr 30, 2026

MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion (CVPR 2025)

Python 522 38 Updated Jul 9, 2025

A synthetic satellite imagery dataset for semantic segmentation and domain adaptation.

44 7 Updated Feb 17, 2020

DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solving. The framework leverages a top-level planning agent to coo…

Python 3,470 448 Updated May 4, 2026

SkillOpt is a text-space optimizer that trains reusable natural-language skills for frozen LLM agents through trajectory-driven edits, validation-gated updates, and deployable best_skill.md artifacts.

Python 8,893 855 Updated Jun 23, 2026

Code and Data for Tau-Bench

Python 1,292 207 Updated Mar 18, 2026
Python 89 6 Updated May 20, 2026

[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

Python 556 101 Updated Sep 6, 2024

ALFWorld: Aligning Text and Embodied Environments for Interactive Learning

Python 777 90 Updated Feb 8, 2026

The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…

Python 3,294 393 Updated Feb 19, 2026

Code & Models for 3DETR - an End-to-end transformer model for 3D object detection

Python 707 97 Updated Nov 10, 2022

[CVPR 2024] EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion

Python 143 10 Updated Aug 30, 2024

Code for PhysDreamer

Python 629 30 Updated Feb 10, 2025

A Minimal and Elegant Framework & Tutorial for Real-Time Interactive World Models

Python 629 13 Updated Jun 15, 2026

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,826 92 Updated Nov 28, 2025

[ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors

Python 449 19 Updated Oct 2, 2025

[ICCV 2025, Oral] TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

Python 856 42 Updated Dec 17, 2025

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 2,143 164 Updated Jun 22, 2026

VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control

Python 400 31 Updated Feb 26, 2026

Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation [SIGGRAPH Asia 2025]

Python 555 29 Updated Sep 21, 2025

PartFlow: two-stage image-conditioned 3D editing (inference code)

Python 65 Updated May 27, 2026

The implementation of Extreme Viewpoint 4D Video Generation

Python 261 19 Updated Sep 6, 2025

[SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control

Python 821 40 Updated Jun 9, 2025

HorizonDrive: Self-Corrective Autoregressive World Model for Long-horizon Driving Simulation

Python 45 3 Updated Jun 16, 2026

A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, LTX-2, Qwen Image, Hunyuan Video, LTX Video and Flux.

Python 6,252 961 Updated Jun 21, 2026

[TPAMI 2025] ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis

Python 1,567 56 Updated Dec 13, 2025