This repository contains the accompanying code for the paper:
"Injecting Numerical Reasoning Skills into Language Models." Mor Geva*, Ankit Gupta* and Jonathan Berant. In ACL, 2020. [PDF]
The repository contains:
- Implementation, pre-training, and fine-tuning of GenBERT on MLM, synthetic data, DROP, and SQuAD (in the `pre_training` dir)
- Code and vocabularies for textual data generation (in the `textual_data_generation` dir)
- Code for numerical data generation (in the `pre_training/numeric_data_generation` dir)
Instructions for downloading our data and models are in the README of the `pre_training` dir.
@inproceedings{ggb2020injecting,
  title={Injecting Numerical Reasoning Skills into Language Models},
  author={Geva, Mor and Gupta, Ankit and Berant, Jonathan},
  booktitle={ACL},
  year={2020}
}