Skip to content

feng-li/catastrophe-loss-prediction-with-NLP

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Catastrophe loss prediction with NLP

This repository provides the code implementation for our paper

Han Wang, Wen Wang, Feng Li, Yanfei Kang and Han Li (2025). “Catastrophe Duration and Loss Prediction via Natural Language Processing”. Variance, Vol. 18

Disclaimer

This code is provided AS IS, with no further updates or maintenance. If you have any questions, please contact Feng Li via email feng.li@gsm.pku.edu.cn.

Introduction

In this project, we predict the duration and loss of catastrophes using textual information from online news articles. This new approach is informative in providing timely warnings of the severity of a catastrophe, which can aid decision-making and support appropriate responses.

  • get_text.py: Build a web crawler to extract its news contents.

  • get_url.py: Obtain news title, news source, published time.

  • vector-embedded-by-BERT.py & vector-embedded-by-Word2Vec.py: Construct the document vector.

  • keywords.txt: Select 300 words as our keyword database.

  • rf_clf.py & xgb_clf.py & lgb_clf.py: Predict the losses of each catastrophe-affected country by machine learning models such as Random Forest, XGBoost, and LightGBM.

  • rf_reg.py & xgb_reg.py & lgb_reg.py: Predict the duration days of each catastrophe-affected country by machine learning models such as Random Forest, XGBoost, and LightGBM.

About

Catastrophe loss prediction with NLP

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •  

Languages