Official implementation of "Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation".

Jupyter Notebook 329 16 Updated Jun 11, 2026

🔎 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Python 3,276 246 Updated May 31, 2026

Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"

Python 1,313 79 Updated Jan 5, 2026

CLIP+MLP Aesthetic Score Predictor

Python 1,313 113 Updated Jul 1, 2024

AIDE: AI-Driven Exploration in the Space of Code. The machine learning engineering agent that automates AI R&D.

Python 1,317 195 Updated May 2, 2026

slime is an LLM post-training framework for RL Scaling.

Python 6,102 892 Updated Jun 12, 2026

[BMVC2021] The first image composition assessment dataset. Used in the paper "Image Composition Assessment with Saliency-augmented Multi-pattern Pooling". Useful for image composition assessment, i…

Python 168 15 Updated Feb 17, 2026

The official repository of Qwen-VLA

570 21 Updated May 29, 2026

This is the official code repo for DiT4DiT, a Vision-Action-Model (VAM) framework that combines video generation model with flow-matching-based action prediction for generalizable robotic manipulat…

Python 343 16 Updated Jun 9, 2026

VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control

Python 398 31 Updated Feb 26, 2026

[CVPR 2026] SpatialVID: A Large-Scale Video Dataset with Spatial Annotations

Python 570 20 Updated Apr 22, 2026

UAV-GESTURE: A Dataset for UAV Control and Gesture Recognition

C++ 32 5 Updated Jul 16, 2019

[CVPR2021] UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles

Python 252 16 Updated Jul 11, 2022

Next generation frontend tooling. It's fast!

TypeScript 81,427 8,296 Updated Jun 12, 2026

PX4 Autopilot Software

C++ 11,935 15,557 Updated Jun 12, 2026

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 4,414 373 Updated Jun 10, 2026
JavaScript 4,377 266 Updated Jun 12, 2026

A framework for efficient model inference with omni-modality models

Python 5,120 1,106 Updated Jun 12, 2026

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Python 3,988 331 Updated Jun 12, 2024

CUDA Accelerated Robot Library

Python 1,623 290 Updated Jun 11, 2026

AerialClaw: Towards General Intelligence for Autonomous Aerial Agents

Python 89 17 Updated May 27, 2026

The agent that grows with you

Python 191,849 33,398 Updated Jun 12, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …

Python 14,485 1,474 Updated Jun 12, 2026

Post-training with Tinker

Python 3,463 446 Updated Jun 12, 2026

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,921 442 Updated Nov 13, 2025

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 4,218 721 Updated Jun 12, 2026
Python 181 21 Updated May 9, 2026

The awesome collection of OpenClaw skills. 5,400+ skills filtered and categorized from the official OpenClaw Skills Registry.🦞

50,151 4,885 Updated Jun 8, 2026

⏬ Download AVA dataset (A Large-Scale Database for Aesthetic Visual Analysis)

452 75 Updated Feb 14, 2024

Wan: Open and Advanced Large-Scale Video Generative Models

Python 16,194 2,004 Updated Mar 17, 2026