This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

1,751 91 Updated Feb 12, 2026

NVIDIA Isaac Sim™ is an open-source application on NVIDIA Omniverse for developing, simulating, and testing AI-driven robots in realistic virtual environments.

Python 2,827 368 Updated Mar 16, 2026

Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞

Python 8,376 889 Updated Mar 23, 2026

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

1,905 129 Updated Mar 16, 2026

A Curated List of Vision-Language-Action (VLA) and World Action Models (WAM) Research and Beyond

147 3 Updated Mar 24, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 334,176 65,177 Updated Mar 24, 2026

[ICLR 2026] - One2Scene

Python 31 2 Updated Feb 26, 2026

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 82,143 6,861 Updated Mar 20, 2026

A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows

Python 47,513 4,885 Updated Feb 19, 2026

We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that Sora-2 surpasses GPT5 by 10% on eyeballing puzzles and reache…

Python 287 5 Updated Mar 21, 2026

A PowerPoint add-in to insert LaTeX equations into PowerPoint presentations on Windows and Mac

VBA 1,341 76 Updated Jan 30, 2025

🌟100+ original LLM / RL schematic diagrams📚, presented by the author of 《大模型算法》 (Large Model Algorithms)!💥 (100+ LLM/RL Algorithm Maps)

Python 3,873 364 Updated Feb 23, 2026
HTML 25 3 Updated Mar 24, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,751 1,690 Updated Jan 30, 2026
Python 54 10 Updated Mar 19, 2026

A curated list of large VLM-based VLA models for robotic manipulation.

372 12 Updated Dec 21, 2025

The official code for paper "GPSToken: Gaussian Parameterized Spatially-adaptive Tokenization for Image Representation and Generation"

Python 49 Updated Sep 28, 2025

CVPR2026 ANTS: Shaping the Adaptive Negative Textual Space by MLLM for OOD Detection

Python 32 Updated Mar 20, 2026

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python 34,897 2,349 Updated Mar 22, 2026

Code, documentation, and discussion around the MIMIC-CXR database

Jupyter Notebook 300 59 Updated Jul 13, 2020

Enhanced Generative Structure Prior for Text Image Super-Resolution [TPAMI]

Python 69 6 Updated Aug 20, 2025

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 2,232 156 Updated Mar 18, 2026

A Multitask Conversational Vision-Language Model for Radiology

Python 16 2 Updated Jul 3, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,936 2,064 Updated Jan 13, 2026

[ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models

Python 33 Updated Dec 27, 2025

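This is a simple technical explainer project that focuses on interesting, cutting-edge technical concepts and principles. Each article is meant to be readable in under 5 minutes.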

Python 6,836 611 Updated Mar 8, 2026

[ICCV2025] Official code for Fine-structure Preserved Real-world Image Super-resolution via Transfer VAE Training

Python 122 3 Updated Jan 6, 2026

[CVPR2026] VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding

Python 96 2 Updated Mar 17, 2026