Skip to content
View zhulf0804's full-sized avatar
🎯
Busy
🎯
Busy
  • Beijing · China

Block or report zhulf0804

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR2026] Detect Anything via Next Point Prediction

Jupyter Notebook 1,351 91 Updated Feb 22, 2026

LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …

Python 24,958 6,574 Updated Jun 7, 2024

BoxMOT: Pluggable python and c++ SOTA multi-object tracking modules with support for axis-aligned and oriented bounding boxes

Python 8,158 1,895 Updated May 17, 2026

BoT-SORT: Robust Associations Multi-Pedestrian Tracking

Jupyter Notebook 1,429 488 Updated Aug 8, 2024

Efficient vision foundation models for high-resolution generation and perception.

Python 3,308 245 Updated Sep 5, 2025

Scenic: A Jax Library for Computer Vision Research and Beyond

Python 3,801 480 Updated May 8, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 19,193 1,764 Updated Jan 30, 2026

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,647 490 Updated Aug 7, 2024

A Diagnostic Guardrail Framework for AI Agent Safety and Security

Python 471 18 Updated May 14, 2026

Notes from How Diffusion Models Work by DeepLearning.ai

Jupyter Notebook 28 20 Updated Dec 4, 2024

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 24,086 4,555 Updated May 17, 2026

Benchmarking Knowledge Transfer in Lifelong Robot Learning

Jupyter Notebook 1,841 409 Updated Mar 15, 2025

Fast and memory-efficient exact attention

Python 23,813 2,731 Updated May 16, 2026

Official Code for LightVLA (ICRA 2026)

Python 97 6 Updated Jan 31, 2026

Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success

Python 1,200 169 Updated Sep 9, 2025

ImageBind One Embedding Space to Bind Them All

Python 9,027 846 Updated Nov 21, 2025

[Lumina具身智能社区] 具身智能技术指南 Embodied-AI-Guide

13,686 886 Updated Mar 12, 2026

[TLLM'23] PandaGPT: One Model To Instruction-Follow Them All

Python 852 68 Updated Jun 1, 2023

A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone

Python 25,020 1,957 Updated May 17, 2026

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 6,180 732 Updated Mar 23, 2025

LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [ICRA 2026]

Python 187 5 Updated Mar 12, 2026

Fully Open Framework for Democratized Multimodal Training

Python 839 67 Updated May 18, 2026

Super Rays and Culling Region for Real-Time Updates on Grid-based Occupancy Maps

C++ 75 12 Updated Jan 9, 2025

Example models using DeepSpeed

Python 6,819 1,121 Updated Mar 30, 2026

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,801 2,766 Updated Aug 12, 2024

The official GitHub mirror of the Chromium source

C++ 23,702 8,870 Updated May 18, 2026

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 10,410 839 Updated Mar 30, 2026

每个人都能看懂的大模型知识分享,LLMs春/秋招大模型面试前必看,让你和面试官侃侃而谈

Jupyter Notebook 6,530 606 Updated May 9, 2026
Next