[CVPR 2026] DreamSAC: Learning Hamiltonian World Models via Symmetry Exploration

DreamSAC is a model-based reinforcement learning agent that integrates structured world models with physics-inspired dynamics for robust visual control and zero-shot domain transfer.

Abstract

Learned world models excel at interpolative generalization but fail at extrapolative generalization to novel physical properties. This limitation arises because they learn statistical correlations rather than the environment's underlying generative rules, such as physical invariances and conservation laws. We argue that learning these invariances is key to robust extrapolation. To achieve this, we first introduce Symmetry Exploration, an unsupervised exploration strategy where an agent is intrinsically motivated by a Hamiltonian-based curiosity bonus to actively probe and challenge its understanding of conservation laws, thereby collecting physically informative data. Second, we design a Hamiltonian-based world model that learns from the collected data, using a novel self-supervised contrastive objective to identify the invariant physical state from raw, view-dependent pixel observations. Our framework, DreamSAC, trained on this actively curated data, significantly outperforms state-of-the-art baselines in 3D physics simulations on tasks requiring extrapolation.

Installation

pip install -r requirements.txt
# Optional: for SE(3)-equivariant networks
pip install e3nn-jax

Requires JAX with CUDA 12. See JAX installation guide.

Usage

Two-Stage Training (Pretrain → Finetune)

bash run.sh

Custom Run

# Stage 1: Unsupervised pre-training
python dreamsac/main.py \
    --configs dreamsac_e3nn pretrain dmc_vision \
    --logdir logdir/pretrain \
    --run.steps 1000000

# Stage 2: Fine-tuning
python dreamsac/main.py \
    --configs dreamsac_e3nn finetune dmc_vision \
    --logdir logdir/finetune \
    --run.from_checkpoint logdir/pretrain/ckpt/<step> \
    --run.steps 500000

Domain Randomization

bash run_transfer.sh

Environments

Evaluated on DeepMind Control Suite (visual observations):

walker_walk, cheetah_run, hopper_hop, reacher_hard, humanoid_walk

Acknowledgement

Some parts of this project are adapted from DreamerV3 and Slot Attention. We would like to thank the authors for their work.

Citation

Coming soon ...

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
assets		assets
dreamsac		dreamsac
embodied		embodied
.gitignore		.gitignore
README.md		README.md
experiment_config.py		experiment_config.py
plot.py		plot.py
plot_loss.py		plot_loss.py
plot_transfer_results.py		plot_transfer_results.py
requirements.txt		requirements.txt
run.sh		run.sh
run_transfer.sh		run_transfer.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

[CVPR 2026] DreamSAC: Learning Hamiltonian World Models via Symmetry Exploration

Abstract

Installation

Usage

Two-Stage Training (Pretrain → Finetune)

Custom Run

Domain Randomization

Environments

Acknowledgement

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

[CVPR 2026] DreamSAC: Learning Hamiltonian World Models via Symmetry Exploration

Abstract

Installation

Usage

Two-Stage Training (Pretrain → Finetune)

Custom Run

Domain Randomization

Environments

Acknowledgement

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages