HIT (Shenzhen) - Shenzhen - https://xiaojieli0903.github.io
Stars
A comprehensive list of papers on the definition of World Models and on using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, code, and related websites.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
[CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation
Fully Open Framework for Democratized Multimodal Training
InternVLA-M1: A Spatially Grounded Foundation Model for Generalist Robot Policy
InternRobotics' open platform for building generalized navigation foundation models.
[ICRA'24 Best UAV Paper Award Finalist] An Efficient Global Planner for Aerial Coverage
Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"
Nav-R1: Reasoning and Navigation in Embodied Scenes
A paper list of some recent Mamba-based CV works.
The new spin-off of Vision-and-Language Navigation.
Reference PyTorch implementation and models for DINOv3
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.
Latest Papers, Codes and Datasets on VTG-LLMs.
Awesome collection of resources and papers on Diffusion Models for Robotic Manipulation.
[TMLR 2025] Efficient Reasoning Models: A Survey
Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]
📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.
Awesome things towards foundation agents. Papers / Repos / Blogs / ...
This repository provides a valuable reference for researchers in the field of multimodality; start your exploration of RL-based Reasoning MLLMs here!
A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.