-
Tsinghua University
Stars
This repo provide a python script to create a composite image from a video.
Safety in Embodied AI: A Survey of Risks, Attacks, and Defenses | 500+ Papers | Perception, Cognition, Planning, Interaction, Agentic System
Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.
Control of the fully actuated PX4 Omnicopter
A real-world multimodal VLM reasoning benchmark for industrial real-time safety assessment
Low-level control of PX4 Multi-rotor vehicles in Offboard mode
Example of PX4 offboard control over microdds using python ROS 2
A benchmark fault diagnosis dataset featuring multi-modal signals collected from a three-phase asynchronous motor under variable speed/load conditions with deliberately induced faults, covering mul…
Automated, hardware-independent Hand-Eye Calibration for ROS2
A modernized vision-language successor to GroundingDINO
A benchmark fault diagnosis dataset comprises vibration data collected from a gearbox under variable working conditions with intentionally induced faults, encompassing diverse fault severities and …
[NeurIPS 2025 Spotlight] Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning.
A vision-language-safety action architecture, named AEGIS, which contains a plug-and-play safety constraint layer formulated via control barrier functions.
Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence
RynnVLA-002: A Unified Vision-Language-Action and World Model
The source code and pre-trained models for Motion Matters: Neural Motion Transfer for Better Camera Physiological Sensing (WACV 2024, Oral).
A General Toolkit for Advanced Online Learning, Online Active Learning, Online Semi-supervised Learning Approaches
A General Toolkit for Online Learning Approaches
Python package for estimation of vital signs such as heart rate, HRV, and respiratory rate from face video.
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
Multi-mode Fault Diagnosis Datasets with TE process (MMFDD-TEP) can be used for the purpose of comparison studies or validation of algorithms