yushengjiexy

yushengjiexy

Stars

77 stars written in Python

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 160,831 33,284 Updated May 21, 2026

jingyaogong / minimind

🧠「大模型」2小时完全从0训练64M的小参数LLM！Train a 64M-parameter LLM from scratch in just 2h!

Python 50,326 6,415 Updated May 19, 2026

Vision-CAIR / MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,714 2,900 Updated Sep 2, 2024

huggingface / lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 24,195 4,594 Updated May 21, 2026

Physical-Intelligence / openpi

Python 11,950 1,937 Updated May 5, 2026

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 10,038 779 Updated Sep 22, 2025

jingyaogong / minimind-v

👀「大模型」2小时从0训练65M参数的视觉多模态VLM！Train a 65M-parameter VLM from scratch in just 2h!

Python 7,965 866 Updated May 19, 2026

datajuicer / data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 6,437 372 Updated May 21, 2026

openvla / openvla

Forked from TRI-ML/prismatic-vlms

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 6,221 738 Updated Mar 23, 2025

yenchenlin / nerf-pytorch

A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.

Python 6,027 1,129 Updated Jul 25, 2024

huggingface / nanoVLM

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,872 494 Updated Oct 27, 2025

fundamentalvision / BEVFormer

[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

Python 4,468 728 Updated Aug 15, 2024

PINTO0309 / PINTO_model_zoo

A repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8…

Python 4,161 638 Updated Apr 30, 2026

google-research / multinerf

A Code Release for Mip-NeRF 360, Ref-NeRF, and RawNeRF

Python 3,804 357 Updated Dec 8, 2023

median-research-group / LibMTL

A PyTorch Library for Multi-Task Learning

Python 2,550 232 Updated May 14, 2025

HuangJunJie2017 / BEVDet

Code base of the BEVDet series .

Python 1,786 305 Updated Jul 4, 2024

PRIME-RL / SimpleVLA-RL

[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 1,668 107 Updated Jan 6, 2026

autonomousvision / occupancy_networks

This repository contains the code for the paper "Occupancy Networks - Learning 3D Reconstruction in Function Space"

Python 1,659 305 Updated Jun 27, 2023

nv-tlabs / lift-splat-shoot

Lift, Splat, Shoot: Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D (ECCV 2020)

Python 1,341 253 Updated Oct 15, 2024

sjtuytc / UnboundedNeRFPytorch

State-of-the-art, simple, fast unbounded / large-scale NeRFs.

Python 1,324 112 Updated Jun 11, 2024

hustvl / VAD

[ICCV 2023 & ICLR 2026] VAD: Vectorized Scene Representation for Efficient Autonomous Driving

Python 1,310 160 Updated Jan 31, 2026

kakaobrain / nerf-factory

An awesome PyTorch NeRF library

Python 1,273 104 Updated Jul 23, 2024

ziyc / drivestudio

A 3DGS framework for omni urban scene reconstruction and simulation.

Python 1,183 149 Updated Aug 27, 2025

yinyunie / 3D-Shape-Analysis-Paper-List

A list of recent papers, libraries and datasets about 3D shape/scene analysis (by topics, updating).

Python 959 113 Updated Dec 5, 2023

zju3dv / neuralbody

Code for "Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans" CVPR 2021 best paper candidate

Python 952 131 Updated Jan 21, 2024

Kai-46 / nerfplusplus

improves over nerf in 360 capture of unbounded scenes

Python 942 101 Updated Mar 24, 2022

taco-group / OpenEMMA

OpenEMMA, a permissively licensed open source "reproduction" of Waymo’s EMMA model.

Python 935 126 Updated May 13, 2025

WangYueFt / detr3d

Python 910 164 Updated Dec 22, 2022

OpenDriveLab / Vista

[NeurIPS 2024] A Generalizable World Model for Autonomous Driving

Python 881 61 Updated Jul 2, 2025

dyfcalid / CameraCalibration

Fisheye or Normal Camera Intrinsic and Extrinsic Calibration. Surround Camera Bird Eye View Generator.

Python 863 222 Updated May 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly