ABot-Claw is a next-generation embodied AI framework built on OpenClaw that unifies VLN, VLA, and WAM models through the VLAC (Vision-Language-Action-Critic) loop, enabling real-time task monitoring and self-adaptation. By sharing a multimodal memory—centered on vision, semantics, and geometry—across devices and leveraging centralized control, it breaks down the silos of single-task robots, delivering a robust, collaborative, and elastically scalable multi-agent system.
Introduces the VLAC (Vision-Language-Action-Critic) closed-loop feedback mechanism for dynamic task control.
- Real-time Assessment: Agents assess task completion and execution progress.
- Adaptive Adjustment: VLAC provides feedback and triggers strategy updates when deviations occur, enhancing success rates and robustness in long-horizon tasks.
Supports VLA (Vision-Language-Action) and WAM (World Action Model) for full-cycle autonomy.
- Seamless Integration: Tightly aligns perception with action to accurately follow natural language instructions.
- High Autonomy: Executes complex multi-step tasks end-to-end without human intervention.
Enables decentralized collaboration via a shared "Brain."
- Unified Decision Making: All robots share one Agent Runtime for synchronized state and joint reasoning.
- Hot-Swappable Support: Robots can join or replace others anytime without disrupting task flow.
Features a unified Memory System for persistent knowledge storage.
- Integrated Structure: Combines geometric maps (for localization) and semantic maps / image-feature-GPS (for understanding).
- Long-Context Understanding: Retains and retrieves key visual history to overcome occlusion and delayed feedback.
ABot-Claw employs a layered microservices architecture to ensure high cohesion and low coupling:
- Infrastructure Layer:
- GPU Server: Hosts high-performance computing models including Yolo, Depth, VLA, Grasp Anything, VLN, and WAM.
- Robots & Cameras: Physical terminals including Dog, G1, PiPer, and various camera devices.
- Runtime Core (Agent Runtime):
- Gateway: Handles message routing for CLI/Web UI and Channels (Telegram, DingTalk, Feishu).
- Agent Loop: The core intelligence cycle containing Context, Tools, and Skills.
- Scheduler & Device: Manages task scheduling (Heartbeat/Cron) and local device interaction (File System, Shell, Browser).
- Memory & Knowledge:
- Vision-centric Memory: A central repository at the top layer storing geometric maps, semantic maps, and image feature indices for global access.
Please refer to the installation instructions in each service.
- Configure the OpenClaw workspace before running multi-robot tasks:
# Merge AbotClaw workspace files into an existing OpenClaw setup
./setup.sh
# Or rebuild the OpenClaw workspace, then apply AbotClaw files
./setup.sh --fresh- Update robot fleet endpoints in
~/.openclaw/workspace/skills/ROBOT.md:
PIPER_BASE_URL=http://<PIPER_HOST>:<PIPER_PORT>G1_BASE_URL=http://<G1_HOST>:<G1_PORT>GO2_BASE_URL=http://<GO2_HOST>:<GO2_PORT>
Example:
PIPER_BASE_URL=http://192.168.1.10:8880
G1_BASE_URL=http://192.168.1.20:8880
GO2_BASE_URL=http://192.168.1.30:8880
- Update shared service host in
~/.openclaw/workspace/skills/SERVICE.md:
SERVICE_HOST=<SERVICE_HOST>
Example:
SERVICE_HOST=192.168.1.100
Service URLs are derived as:
http://<SERVICE_HOST>:8012(SpatialMemory)http://<SERVICE_HOST>:8013(YOLO)http://<SERVICE_HOST>:8014(VLAC)http://<SERVICE_HOST>:8015(GraspAnything)
- Verify robot-type routing rules in
~/.openclaw/workspace/skills/MISSION.md:
- Piper: fixed-base manipulation tasks
- Unitree G1: humanoid interaction / whole-body tasks
- Unitree Go2: mobility, scouting, and inspection tasks
- Restart the OpenClaw gateway:
openclaw gateway restart- Install Python dependencies:
cd robot_layer/arm_piper/agent_server
pip3 install -r requirements.txt- Build ROS driver in
robot_driver_ros(download the corresponding ROS driver for your robot):
cd robot_driver_ros/src
git clone https://github.com/agilexrobotics/piper_ros.git
git clone https://github.com/realsenseai/realsense-ros.git -b ros1-legacy
# Install dependencies required by the corresponding driver.
cd ../..
catkin_make- Update robot topics and source path in
robot_layer/arm_piper/agent_server/robot_sdk/config.yaml:
ros:
image_topic: "/your_camera/color/image_raw"
depth_topic: "/your_camera/aligned_depth_to_color/image_raw"
camera_info_topic: "/your_camera/color/camera_info"
joint_state_topic: "/your_joint_states_topic"
end_pose_topic: "/your_end_pose_topic"
piper:
setup_bash: "/absolute_path_to/robot_layer/arm_piper/agent_server/robot_driver_ros/devel/setup.bash"Please modify the ROS topics according to your robot setup, and make sure
setup_bashpoints to your localdevel/setup.bash.
- Launch components in order:
- Arm driver
cd robot_layer/arm_piper/agent_server/robot_driver_ros/src/piper_ros ./can_activate.sh cd ../.. source devel/setup.bash roslaunch piper start_single_piper.launch can_port:=can0 auto_enable:=true
- Camera driver
cd robot_layer/arm_piper/agent_server/robot_driver_ros source devel/setup.bash roslaunch realsense2_camera rs_rgbd.launch
- Arm MoveIt
cd robot_layer/arm_piper/agent_server/robot_driver_ros source devel/setup.bash roslaunch piper_with_gripper_moveit demo.launch use_rviz:=false
- After all three components are running, start the robot agent server:
cd robot_layer/arm_piper/agent_server
python3 server.py --port 8888This project builds upon the following open-source projects. We thank these teams for their contributions:
We also gratefully acknowledge Deepblue College for their support in the humanoid robot deployment, providing access to the Unitree G1-Romp Edu robot and the LinkerBot-O6 hand system.