Hurricast 🌀

A novel deep learning framework for hurricane forecasting

📋 Overview

This repository contains the official implementation of the paper:
"Hurricane Forecasting: A Novel Multimodal Machine Learning Framework"

by Léonard Boussioux, Cynthia Zeng, Théo Guénais, Dimitris Bertsimas
Published in Weather and Forecasting

Key Features

🔮 24-hour lead time track and intensity forecasts
🗺️ Evaluated on North Atlantic and Eastern Pacific basins (2016-2019)
⚡ Computation in seconds
📊 Comparable accuracy to operational forecast models
🤝 Complementary to existing NHC approaches

🔬 Methodology

Hurricast employs a novel multimodal machine learning framework that combines multiple ML techniques and diverse data sources. Our approach follows a 3-step mechanism:

1️⃣ Feature Extraction

Uses encoder-decoder architectures
Processes reanalysis maps
Generates one-dimensional representations

2️⃣ Data Fusion

Concatenates statistical data
Combines with extracted map features

3️⃣ Prediction

Trains separate XGBoost models for:

24h intensity prediction
24h latitude displacement
24h longitude displacement

📈 Results

Our framework demonstrates:

Comparable mean absolute error to operational models
Improved forecast accuracy when integrated with consensus model
Significant reduction in computation time

🔗 References

Paper Link

📝 Citation

@article{boussioux2022hurricane,
  title={Hurricane forecasting: A novel multimodal machine learning framework},
  author={Boussioux, L{\'e}onard and Zeng, Cynthia and Gu{\'e}nais, Th{\'e}o and Bertsimas, Dimitris},
  journal={Weather and forecasting},
  volume={37},
  number={6},
  pages={817--831},
  year={2022}
}

📄 License

This project is licensed under the MIT License.

Encoder - Decoder Architectures

To perform feature extraction, we experimented with encoder-decoder architectures under a supervised learning mechanism.

The encoder component consists of a Convolutional Neural Network (CNN).
We provide two decoder variations:
- The first one relies on Recurrent Neural Networks (RNNs). We support RNN, LSTM, and GRU.
- The second one uses Transformers. While the GRU model the temporal aspect through a recurrence mechanism, the Transformers utilize attention mechanisms and positional encoding to model long-range dependencies.

To perform feature extraction from a given input sequence of reanalysis maps and statistical data, we pass them through the whole frozen encoder-decoder, except the last fully-connected layer(s).

Here is an example of the CNN-encoder GRU-decoder architecture we used:

At each time step, we utilize the CNN to produce a one-dimensional representation of the reanalysis maps. Then, we concatenate these embeddings with the corresponding statistical features to create a sequence of inputs fed sequentially to the GRU. At each time step, the GRU outputs a hidden state passed to the next time step. Finally, we concatenate all the successive hidden states and pass them through three fully connected layers to predict intensity or track with a 24-hour lead time. We finally extract our spatial-temporal embeddings as the output of the second fully connected layer.

Here is an example of the CNN-encoder Transformer-decoder architecture we used:

Code Structure

We bring your attention to a few particular files and folders:

src/: holds the core of the code, in particular:
- run.py:
- setup.py: command line parser with all arguments
- models/: contains all code related to building encoder-decoder models
- utils/: contains all utils function (e.g., data preprocessing, data downloading)
  - utils_vision_data.py
  - data_processing.py
  - models.py

How to use ?

scripts: The python file to run a model and model configs. run_hurricast.py and config.py
The command line parser is in src/setup.py

The entire code base is wrapped up in src.

prepro.py :
- 1. Class to process the data
- 1. Add a collate function that allows to batch the data using dictionary. Together with a dataloader
    the command next(iter(loader)) will output a dictionary.
```
Example: mode = intensity; loader = DataLoader(foofoo); in_model, in_loss = next(iter(loader))
>>> print(in_loss.keys(), in_model.keys())
"trg_y", "x_viz", "x_stat" 
```

All the files work upon choosing a "mode". Depending on the mode, the variables will be different, as well as the layer of our model. For instance, for each mode, here is the "target variable" we aim at predicting.

accepted_modes = {#Modes and associated targets
    'intensity': 'tgt_intensity',
    'displacement': 'tgt_displacement',
    'intensity_cat': 'tgt_intensity_cat',
    'baseline_intensity_cat': 'tgt_intensity_cat_baseline',
    'baseline_displacement': 'tgt_displacement_baseline'
    }

Each mode is also associated with a task, i.e classification/regression (or potentially new tasks in the future).

modes = {#Modes and associated tasks
    'intensity': 'regression',
    'displacement': 'regression',
    'intensity_cat': 'classification',
    'baseline_intensity_cat': 'classification',
    'baseline_displacement': 'regression'
}

The model creation is wrapped up in the src/models/factory.py file. The different models are in the src/models/hurricast_models.py, and each model should be "registered" before being used (with a simple decorator.) Example:

@RegisterModel('TRANSFORMER')
class TRANSFORMER(nn.Module):
  ...

The baselines and older versions are in a separate file (baselines.py). The file experimental_models.py is still experimental, but class inheritance can probably allow us to do something cool :sunglass:

About the "experimental model"

Image Encoder: Any module that transforms $(BS \times num_channels\times H_1\times H_2) \rightarrow (BS, H_{out})$ is accepted as an encoder.
Decoder: Any module that models a sequence $(BS \times Sequence_length \times H_{in_decoder}) \rightarrow (BS \times H_{out_decoder})$ is an accepted decoder.
Hurricast Model: Wraps the two models above.
- The Encoder can be ommited by initializing the model with a None encoder (or batching a None image).
- Similarly, we can decide not to use one the tabular data by specifying no_stats=True
- The fusion of the tabular and vision data can eventually be changed.
- A last activation and linear layer will transform $(BS \times H_{out_decoder}) \rightarrow Activation \rightarrow (BS \times 1)$

How to use from the command line

python run_hurricast.py\
    --mode=intensity\ #Or intensity_cat/displacement
    --full_encoder\ #Or no_encoder/split_encoder
    --encoder_config=full_encoder_config\ #Name of the config in config.py
    --transformer\ #Whether we use the transformer or a recurrent model
    --decoder_config=transformer_config\ #Name of the config in config.py
    --y_name ...\
    --vision_name ...\
    --predict_at ...\
    --window_size..\
    --train_test_split
    #Can add --no_stat if we want the vision data only

Data:

The vision data is very large (>30Gb): we currently don't host it online.

Name		Name	Last commit message	Last commit date
Latest commit History 246 Commits
data		data
misc		misc
notebooks		notebooks
results/results_toy		results/results_toy
scripts		scripts
src		src
.gitignore		.gitignore
README.md		README.md
overview.ipynb		overview.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Hurricast 🌀

📋 Overview

Key Features

🔬 Methodology

1️⃣ Feature Extraction

2️⃣ Data Fusion

3️⃣ Prediction

📈 Results

🔗 References

📝 Citation

📄 License

Encoder - Decoder Architectures

Code Structure

How to use ?

How to use from the command line

Data:

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

leobix/hurricast

Folders and files

Latest commit

History

Repository files navigation

Hurricast 🌀

📋 Overview

Key Features

🔬 Methodology

1️⃣ Feature Extraction

2️⃣ Data Fusion

3️⃣ Prediction

📈 Results

🔗 References

📝 Citation

📄 License

Encoder - Decoder Architectures

Code Structure

How to use ?

How to use from the command line

Data:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages