Skip to content

This repository contains all the code of our participation (HULAT) at emEval-2023 Task 9: Data augmentation for pre-trained transformers applied to Multilingual Tweet Intimacy Analysis

Notifications You must be signed in to change notification settings

isegura/hulat_intimacy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 

Repository files navigation

hulat_intimacy

This repository contains all the code of our participation (HULAT) at emEval-2023 Task 9: Data augmentation for pre-trained transformers applied to Multilingual Tweet Intimacy Analysis (https://codalab.lisn.upsaclay.fr/competitions/7096)

  • 1_dataset_MINT.ipynb: in this notebook, you can find code to study the dataset of the task. For example, you will find some plots about the distribution of the intimacy scores, languages and text length.
  • 2_system.ipynb: this notebook allows you to reproduce all our experiments by using the training and test dataset provided by the organizers of the task. To obtain the files train-full.csv and test_labeled.csv, you must contact with the organizers.
  • how_split_MINT.ipynb: in this notebook, you can find the code to split the original training dataset into the three splits that we used during the development phase.
  • DA4intimacy.ipynb: this notebook includes the data augmentation techniques that we used to increase the training dataset for this task.

Please properly cite our work if you find them useful.

@InProceedings{segura2023hulatintimacy,

title={HULAT at SemEval-2023 Task 9: Data augmentation for pre-trained transformers applied to Multilingual Tweet   Intimacy Analysis},

author={Segura-Bedmar, Isabel},

booktitle = {17th International Workshop on Semantic Evaluation (SemEval-2023)},

year = {2023}

}

Funding:

This work is part of the R&D&i ACCESS2MEET (PID2020-116527RB-I0) project financed by MCIN AEI/10.13039/501100011033/

About

This repository contains all the code of our participation (HULAT) at emEval-2023 Task 9: Data augmentation for pre-trained transformers applied to Multilingual Tweet Intimacy Analysis

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published