YBZh

Follow

🐢

Yabin Zhang YBZh

🐢

Follow

136 followers · 78 following

The Hong Kong Polytechnic University
Hong Kong
https://ybzh.github.io/

Achievements

Achievements

Lists (1)

Sort

medical

Stars

zchoi / Awesome-Embodied-Robotics-and-Agent

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

1,751 91 Updated Feb 12, 2026

isaac-sim / IsaacSim

NVIDIA Isaac Sim™ is an open-source application on NVIDIA Omniverse for developing, simulating, and testing AI-driven robots in realistic virtual environments.

Python 2,827 368 Updated Mar 16, 2026

aiming-lab / AutoResearchClaw

Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞

Python 8,376 889 Updated Mar 23, 2026

CryptoAILab / Awesome-LM-SSP

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

1,905 129 Updated Mar 16, 2026

AI45Lab / Awesome-Trustworthy-Embodied-AI

JavaScript 98 3 Updated Mar 24, 2026

DravenALG / awesome-vla-wam

A Curated List of Vision-Language-Action (VLA) and World Action Models (WAM) Research and Beyond

147 3 Updated Mar 24, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 334,176 65,177 Updated Mar 24, 2026

Wang-pengfei / One2Scene

[ICLR 2026] - One2Scene

Python 31 2 Updated Feb 26, 2026

anthropics / claude-code

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 82,143 6,861 Updated Mar 20, 2026

ComposioHQ / awesome-claude-skills

A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows

Python 47,513 4,885 Updated Feb 19, 2026

tongjingqi / Thinking-with-Video

We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that Sora-2 surpasses GPT5 by 10% on eyeballing puzzles and reache…

Python 287 5 Updated Mar 21, 2026

Jonathan-LeRoux / IguanaTex

A PowerPoint add-in to insert LaTeX equations into PowerPoint presentations on Windows and Mac

VBA 1,341 76 Updated Jan 30, 2025

changyeyu / LLM-RL-Visualized

🌟100+ 原创 LLM / RL 原理图📚，《大模型算法》作者巨献！💥（100+ LLM/RL Algorithm Maps ）

Python 3,873 364 Updated Feb 23, 2026

rajpurkarlab / cheXpert-test-set-labels

84 12 Updated Oct 21, 2022

rajpurkarlab / ReXrank

HTML 25 3 Updated Mar 24, 2026

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,751 1,690 Updated Jan 30, 2026

jbdel / RadEval

Python 54 10 Updated Mar 19, 2026

JiuTian-VL / Large-VLM-based-VLA-for-Robotic-Manipulation

A curated list of large VLM-based VLA models for robotic manipulation.

372 12 Updated Dec 21, 2025

xtudbxk / GPSToken

The official code for paper "GPSToken: Gaussian Parameterized Spatially-adaptive Tokenization for Image Representation and Generation"

Python 49 Updated Sep 28, 2025

ZhuWenjie98 / ANTS

CVPR2026 ANTS: Shaping the Adaptive Negative Textual Space by MLLM for OOD Detection

Python 32 Updated Mar 20, 2026

google / langextract

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python 34,897 2,349 Updated Mar 22, 2026

MIT-LCP / mimic-cxr

Code, documentation, and discussion around the MIMIC-CXR database

Jupyter Notebook 300 59 Updated Jul 13, 2020

csxmli2016 / MARCONetPlusPlus

Enhanced Generative Structure Prior for Text Image Super-Resolution [TPAMI]

Python 69 6 Updated Aug 20, 2025

zai-org / GLM-V

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 2,232 156 Updated Mar 18, 2026

uzh-dqbm-cmi / RadVLM

A Multitask Conversational Vision-Language Model for Radiology

Python 16 2 Updated Jul 3, 2025

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,936 2,064 Updated Jan 13, 2026

MinghanLi / FiVE-Bench

[ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models

Python 33 Updated Dec 27, 2025

karminski / one-small-step

这是一个简单的技术科普教程项目，主要聚焦于解释一些有趣的，前沿的技术概念和原理。每篇文章都力求在 5 分钟内阅读完成。

Python 6,836 611 Updated Mar 8, 2026

Joyies / TVT

[ICCV2025] Official code for Fine-structure Preserved Real-world Image Super-resolution via Transfer VAE Training

Python 122 3 Updated Jan 6, 2026

NVlabs / VideoITG

[CVPR2026] VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding

Python 96 2 Updated Mar 17, 2026