Skip to content
View yanweifu's full-sized avatar

Block or report yanweifu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

Python 2,815 350 Updated Jun 12, 2026

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 24,925 4,797 Updated Jun 13, 2026

This package contains the original 2012 AlexNet code.

Cuda 2,886 377 Updated Mar 12, 2025

[ICRA 2024]: Train your parkour robot in less than 20 hours.

Python 1,085 171 Updated Nov 28, 2023

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

Python 3,431 328 Updated Jul 7, 2025

Fast and memory-efficient exact attention

Python 24,128 2,826 Updated Jun 10, 2026

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 10,059 782 Updated Sep 22, 2025

[T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection

Python 39 1 Updated Aug 29, 2023

[ICCV 2023] "TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition" (Official Implementation)

Python 817 101 Updated Mar 6, 2025

Refine high-quality datasets and visual AI models

Python 10,775 768 Updated Jun 13, 2026

Oracle Character Recognition Dataset - Oracle-50K

13 1 Updated May 5, 2022

[ICCV2023] GO-SLAM: Global Optimization for Consistent 3D Instant Reconstruction

Python 442 38 Updated Feb 8, 2024

[CoRL 2023 Oral] GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields

Python 140 11 Updated Dec 28, 2023

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 34,548 7,950 Updated Jun 7, 2026

Repository to train and evaluate RoboAgent

Python 372 30 Updated Apr 2, 2024

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,825 3,421 Updated May 18, 2024

Code for "PourIt!: Weakly-supervised Liquid Perception from a Single Image for Visual Closed-Loop Robotic Pouring" ICCV2023

Python 29 5 Updated Dec 6, 2023

✨✨Latest Advances on Multimodal Large Language Models

17,877 1,128 Updated May 1, 2026
Python 108 6 Updated Feb 20, 2024

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Jupyter Notebook 5,787 584 Updated May 5, 2026

CLIP+MLP Aesthetic Score Predictor

Python 1,313 113 Updated Jul 1, 2024

Computer Vision Annotation Tool (CVAT) is a leading platform for building high-quality visual datasets for vision AI. It offers open-source, cloud, and enterprise products, as well as labeling serv…

Python 16,053 3,706 Updated Jun 12, 2026

Code for "SAR-Net: Shape Alignment and Recovery Network for Category-level 6D Object Pose and Size Estimation" CVPR2022

Python 83 7 Updated Feb 6, 2024

Split-screen video comparison tool using FFmpeg and SDL2

C++ 1,656 66 Updated May 31, 2026

ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural Priors (TPAMI2023)

Python 100 11 Updated Dec 17, 2023

An English-language shell for any OS, powered by LLMs

Python 2,154 181 Updated Feb 7, 2026

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 54,342 6,355 Updated Sep 18, 2024

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Python 3,989 331 Updated Jun 12, 2024
Next