BlockFusion for Volumetric Scalar field data

Environment Setup (if running on Sophia):

Load the conda module and activate the base environment:

module use /soft/modulefiles; module load conda; conda activate base

Then install the missing packages:

pip install --user easydict

Setting Environment Variables for using shadow volume sampler:

export PYTHONPATH=/home/kctung/Projects/instant-vnr-pytorch/bindings/build:$PYTHONPATH
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/home/kctung/Projects/instant-vnr-pytorch/build_deps/install/lib
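
Before launching training, it can help to confirm that both variables actually contain the expected directories. The snippet below is a minimal sketch of such a check; the helper name `path_var_contains` is hypothetical and not part of this repository, and the directories are simply the ones from the exports above.

```python
import os


def path_var_contains(var_name, directory, environ=None):
    """Return True if `directory` is an entry of the path-like variable `var_name`."""
    environ = os.environ if environ is None else environ
    entries = environ.get(var_name, "").split(os.pathsep)
    target = os.path.normpath(directory)
    # Compare normalized paths so a trailing slash does not cause a false negative.
    return any(os.path.normpath(entry) == target for entry in entries if entry)


if __name__ == "__main__":
    # Directories copied from the export lines above; adjust them to your checkout.
    expected = {
        "PYTHONPATH": "/home/kctung/Projects/instant-vnr-pytorch/bindings/build",
        "LD_LIBRARY_PATH": "/home/kctung/Projects/instant-vnr-pytorch/build_deps/install/lib",
    }
    for var_name, directory in expected.items():
        status = "ok" if path_var_contains(var_name, directory) else "MISSING"
        print(f"{var_name}: {status}")
```

The check only inspects the variables of the current process, so run it from the same shell session in which the exports were made.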

Stage 1 Training - Triplane Overfitting:

Go to ./fit_triplane and follow the instructions there.

Stage 2 Training - Variational Autoencoder Training on Triplanes

To train the Variational Autoencoder (VAE), run the following sample command, setting --pretrained_triplane_file_path to your pretrained triplanes file:

python triplane_VAE_training.py --expname="various_loss_combination_exp" \
 --description="L1 + L2 + Geo loss combination (No MS-SSIM) (lr scheduling and KL-annealing roughly following the record from logs/triplane_model_g/20250703-184718)" \
 --batch_size=6 --epochs=2100 --ckpt_freq=500 --model_config=model_g \
 --mae_loss_weight 1.0 --mse_loss_weight 0.5 --ms_ssim_loss_weight 0.0 --lpips_loss_weight 0.0 \
 --kl_loss_weight_values 0.000001 0.00001 0.00005 --kl_loss_weight_epochs 0 500 1000 \
 --geometry_loss_weight 0.8 --scheduler_type MultiStepLR \
 --init_lr 0.0001 --lr_gamma 0.5 --milestones 500 1000 1213 1500 1650 1750 1850 1925 2000 2050 \
 --pretrained_triplane_file_path path_to_pretrained_triplanes_file
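
To see what the schedule flags in the sample command amount to, the sketch below evaluates the learning rate and KL weight at a given epoch. The learning rate follows the closed form of PyTorch's `MultiStepLR` (multiply by `lr_gamma` at every milestone reached). The KL weight lookup is an assumption: it treats `--kl_loss_weight_values` as piecewise-constant steps starting at the matching `--kl_loss_weight_epochs`, whereas the training script may interpolate or anneal differently. The function names are hypothetical and not taken from `triplane_VAE_training.py`.

```python
from bisect import bisect_right

# Values copied from the sample command above.
INIT_LR = 0.0001
LR_GAMMA = 0.5
MILESTONES = [500, 1000, 1213, 1500, 1650, 1750, 1850, 1925, 2000, 2050]
KL_VALUES = [0.000001, 0.00001, 0.00005]
KL_EPOCHS = [0, 500, 1000]


def multistep_lr(epoch, init_lr=INIT_LR, gamma=LR_GAMMA, milestones=MILESTONES):
    """Learning rate at `epoch`: init_lr * gamma ** (number of milestones <= epoch)."""
    return init_lr * gamma ** bisect_right(milestones, epoch)


def kl_weight(epoch, values=KL_VALUES, epochs=KL_EPOCHS):
    """KL weight at `epoch`, ASSUMING each value holds from its start epoch onward."""
    index = max(bisect_right(epochs, epoch) - 1, 0)
    return values[index]


if __name__ == "__main__":
    for epoch in (0, 499, 500, 1000, 1213, 2099):
        print(f"epoch {epoch:4d}: lr={multistep_lr(epoch):.3e}  kl_weight={kl_weight(epoch):.1e}")
```

With these settings the learning rate is halved ten times over the run, so by the last milestone it is `init_lr * 0.5 ** 10`, roughly three orders of magnitude below its starting value.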

Note: VAE checkpoints can be large (about 9 GB), so make sure you have enough disk space to save all checkpoint files.
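
As a rough way to budget disk space, the sketch below estimates how many checkpoints a run writes and checks the free space on the target filesystem. It assumes one checkpoint of about 9 GB is written every `--ckpt_freq` epochs and that none are deleted; the script may also save a final or best checkpoint, so treat the result as a lower bound. The function names are hypothetical.

```python
import shutil


def estimate_checkpoint_space_gb(epochs, ckpt_freq, ckpt_size_gb=9.0):
    """Return (number of checkpoints, total size in GB) under the stated assumptions."""
    num_checkpoints = epochs // ckpt_freq
    return num_checkpoints, num_checkpoints * ckpt_size_gb


def has_enough_space(path, required_gb):
    """Return True if the filesystem containing `path` has at least `required_gb` free."""
    free_gb = shutil.disk_usage(path).free / 1e9
    return free_gb >= required_gb


if __name__ == "__main__":
    # Values from the sample command: --epochs=2100 --ckpt_freq=500
    count, total_gb = estimate_checkpoint_space_gb(epochs=2100, ckpt_freq=500)
    print(f"{count} checkpoints, about {total_gb:.0f} GB")
    print("enough free space here:", has_enough_space(".", total_gb))
```

For the sample command this gives 4 checkpoints (epochs 500, 1000, 1500, and 2000), or about 36 GB.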

To run inference and reconstruct triplanes with a pretrained VAE, run the following sample command (specify --expdir and --model_file_name):

python triplane_VAE_inference.py --expdir path_to_the_experiment_directory_created_when_training_VAE --model_file_name filename_of_the_model_used_to_inference --model_config model_g

For an example of distributed training and job submission, see ./sophia_exp/exp_mae_mse_geo_example.sh (specify the project name on the first line of the bash script).

BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation

ACM Transactions on Graphics (SIGGRAPH'24)

Project page | Paper | Video

This repo contains the official implementation for the paper "BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation".

Installation

PyTorch3D is required for postprocessing. See the official page for installation instructions.

We modified diffusers to support the triplane structure. Build and install it from ./src:

# clone this repo 
python setup.py install

Pretrained model

We provide pretrained weights for the MLP, VAE, and Diffuser (conditioned and unconditioned). Download the model here [10GB] and extract it to ./checkpoints.

Inference

To run unconditional inference:

python unconditioned_prediction.py --batch 4 --save output/uncond

To run single-block conditional inference:

python conditioned_prediction.py --layout samplelayouts/exp1_32-56-24/0_0.npy --save output/cond

We provide some sample layouts for demonstration. To draw your own layout maps, use drawtkinter.py. Here is a tutorial:

The graduations are in meters, and furniture must be drawn at a reasonable scale. Walls should surround the floor, and furniture should be placed on the floor.

The output layout directory is named expname_xscale-zscale-stride, where xscale, zscale, and stride are given in decimeters.
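
The naming convention can be decoded mechanically. The sketch below splits a layout directory name such as `exp1_32-56-24` into its parts and converts the decimeter values to meters. The helper names are hypothetical, and the parsing relies only on the `expname_xscale-zscale-stride` pattern described above.

```python
import os


def parse_layout_dir(path):
    """Split '<expname>_<xscale>-<zscale>-<stride>' into (expname, xscale, zscale, stride)."""
    name = os.path.basename(os.path.normpath(path))
    # Split on the last underscore so that expname itself may contain underscores.
    expname, separator, dims = name.rpartition("_")
    parts = dims.split("-")
    if not separator or len(parts) != 3:
        raise ValueError(f"unexpected layout directory name: {name!r}")
    xscale, zscale, stride = (int(part) for part in parts)
    return expname, xscale, zscale, stride


def decimeters_to_meters(value_dm):
    """Convert a length in decimeters to meters."""
    return value_dm / 10.0


if __name__ == "__main__":
    expname, xscale, zscale, stride = parse_layout_dir("samplelayouts/exp1_32-56-24")
    print(expname, [decimeters_to_meters(v) for v in (xscale, zscale, stride)])
```

For `exp1_32-56-24` this yields an experiment name of `exp1` with xscale 3.2 m, zscale 5.6 m, and stride 2.4 m.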

Usage:

python drawtkinter.py --h 56 --w 56 --expname draw

The full pipeline (conditional prediction + extrapolation + resampling) is contained in fullpipeline.py. Specify the layout directory before running:

python fullpipeline.py --layout samplelayouts/exp2_32-56-24 --resample 15 --save output
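
To process several layout directories in one go, the command above can be wrapped in a small driver. The following is a sketch only: it uses just the `--layout`, `--resample`, and `--save` flags shown above, and it assumes that `--save` accepts an arbitrary output path so that each layout can get its own subdirectory. The function names are hypothetical.

```python
import os
import subprocess
import sys


def build_pipeline_command(layout_dir, save_dir, resample=15):
    """Build the argument list for one fullpipeline.py run."""
    return [
        sys.executable, "fullpipeline.py",
        "--layout", layout_dir,
        "--resample", str(resample),
        "--save", save_dir,
    ]


def run_layouts(layout_dirs, save_root="output", resample=15, dry_run=True):
    """Build (and, unless dry_run is set, execute) one command per layout directory."""
    commands = []
    for layout_dir in layout_dirs:
        name = os.path.basename(os.path.normpath(layout_dir))
        # ASSUMPTION: a per-layout subdirectory is an acceptable --save target.
        command = build_pipeline_command(layout_dir, os.path.join(save_root, name), resample)
        commands.append(command)
        if not dry_run:
            subprocess.run(command, check=True)  # raises if the pipeline exits with an error
    return commands


if __name__ == "__main__":
    for command in run_layouts(["samplelayouts/exp2_32-56-24"], dry_run=True):
        print(" ".join(command))
```

With `dry_run=True` the driver only prints the commands, which is a cheap way to review them before committing GPU time.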

Citation

If you find our code or paper helpful, please consider citing:

@article{blockfusion,
  title={BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation},
  author={Wu, Zhennan and Li, Yang and Yan, Han and Shang, Taizhang and Sun, Weixuan and Wang, Senbo and Cui, Ruikai and Liu, Weizhe and Sato, Hiroyuki and Li, Hongdong and Ji, Pan},
  journal={ACM Transactions on Graphics},
  volume={43},
  number={4},
  year={2024},
  doi={10.1145/3658188}
}
