EC-Diffuser: Multi-Object Manipulation via Entity-Centric Behavior Generation

Official PyTorch code release for the ICLR 2025 paper:

EC-Diffuser: Multi-Object Manipulation via Entity-Centric Behavior Generation
Carl Qi, Dan Haramati, Tal Daniel, Aviv Tamar, and Amy Zhang
Project Website • arXiv • OpenReview • BibTeX

Overview

This repository contains the official PyTorch implementation of EC-Diffuser, a novel behavioral cloning (BC) approach that leverages object-centric representations and an entity-centric Transformer with diffusion-based optimization, enabling efficient learning from offline image data. For more details, please refer to the project website and the associated paper.

Installation

Follow these steps to set up the environment:

Create and activate a Conda environment (we tested on Python 3.8):
```
conda create -n dlp python=3.8
conda activate dlp
```
Install main dependencies:

The key libraries are:

Library Version

torch 2.1.2

stable-baselines3 1.5.0

isaacgym (as required)

For a complete list, please see requirements.txt.
Install Diffuser-related packages:
```
cd diffuser
pip install -e .
cd ../
```

Setup for the FrankaKitchen environment:

Install D4RL by cloning the repository:

git clone https://github.com/Farama-Foundation/d4rl.git
cd d4rl
pip install -e .
cd ../

Finalize environment setup:

Run the provided setup script:
```
bash setup_env.sh
```
(If the script requires sourcing, you can also run: source setup_env.sh)

Downloading Datasets

Download the required datasets, pretrained agents, and DLP representations from our HuggingFace dataset:

git lfs install
git clone https://huggingface.co/datasets/carlq/ecdiffuser-data

Evaluating a Pretrained Agent

You can evaluate the pretrained agents with the following commands. Replace CUDA_VISIBLE_DEVICES=0,1 with the GPU devices you wish to use (Note IsaacGym env has to be on GPU 0).

PushCube Agent:

CUDA_VISIBLE_DEVICES=0,1 python diffuser/scripts/eval_agent.py --config config.plan_pandapush_pint --num_entity 3 --planning_only

PushT Agent:

CUDA_VISIBLE_DEVICES=0,1 python diffuser/scripts/eval_agent.py --config config.plan_pandapush_pint --push_t --num_entity 3 --push_t_num_color 1 --planning_only

FrankaKitchen Agent:

CUDA_VISIBLE_DEVICES=0,1 python diffuser/scripts/eval_agent.py --config config.plan_pandapush_pint --kitchen --planning_only

Training an Agent

Train your own agents using the commands below. Replace CUDA_VISIBLE_DEVICES=0,1 with the GPU devices you wish to use (Note IsaacGym env has to be on GPU 0).

Train a PushCube Agent (3 cubes):

CUDA_VISIBLE_DEVICES=0,1 python diffuser/scripts/train.py --config config.pandapush_pint --num_entity 3

Train a PushT Agent (1 T-shaped object):

CUDA_VISIBLE_DEVICES=0,1 python diffuser/scripts/train.py --config config.pandapush_pint --push_t --num_entity 1

Train a FrankaKitchen Agent:

CUDA_VISIBLE_DEVICES=0,1 python diffuser/scripts/train_kitchen.py --config config.pandapush_pint --kitchen

Citation

If you find our work useful, please consider citing:

Carl Qi, Dan Haramati, Tal Daniel, Aviv Tamar, and Amy Zhang. "EC-Diffuser: Multi-Object Manipulation via Entity-Centric Behavior Generation." Proceedings of the Twelfth International Conference on Learning Representations (ICLR). 2025.

@inproceedings{
    qi2025ecdiffuser,
    title={{EC}-Diffuser: Multi-Object Manipulation via Entity-Centric Behavior Generation},
    author={Carl Qi and Dan Haramati and Tal Daniel and Aviv Tamar and Amy Zhang},
    booktitle={The Thirteenth International Conference on Learning Representations},
    year={2025},
    url={https://openreview.net/forum?id=o3pJU5QCtv}
    }

Common Issues

For error installing stable-baselines3 / gym:

https://stackoverflow.com/questions/77124879/pip-extras-require-must-be-a-dictionary-whose-values-are-strings-or-lists-of
For AttributeError: module 'numpy' has no attribute 'float', run pip install numpy==1.21.0
For error Compiling Cython file in mujoco-py, run pip install "cython<3": openai/mujoco-py#773

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.idea		.idea
assets		assets
diffuser		diffuser
dlp2		dlp2
env_config		env_config
franka_kitchen		franka_kitchen
media		media
panda_controller		panda_controller
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
dataset_paths.py		dataset_paths.py
dlp_utils.py		dlp_utils.py
isaac_env_wrappers.py		isaac_env_wrappers.py
isaac_panda_push_env.py		isaac_panda_push_env.py
isaac_vec_env.py		isaac_vec_env.py
latent_classifier.py		latent_classifier.py
requirements.txt		requirements.txt
setup_env.sh		setup_env.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

EC-Diffuser: Multi-Object Manipulation via Entity-Centric Behavior Generation

Table of Contents

Overview

Installation

Downloading Datasets

Evaluating a Pretrained Agent

Training an Agent

Citation

Common Issues

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Library	Version
`torch`	`2.1.2`
`stable-baselines3`	`1.5.0`
`isaacgym`	(as required)

License

carl-qi/EC-Diffuser

Folders and files

Latest commit

History

Repository files navigation

EC-Diffuser: Multi-Object Manipulation via Entity-Centric Behavior Generation

Table of Contents

Overview

Installation

Downloading Datasets

Evaluating a Pretrained Agent

Training an Agent

Citation

Common Issues

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages