Feature-based Sentiment Analysis on Food Reviews

This project aims to develop a web application that helps e-commerce users and product suppliers understand the strengths and weaknesses of products sold online. This information is obtained by applying feature-based sentiment analysis to customer reviews from Amazon. The approach consists of three main steps: extracting product features, identifying opinion orientation, and summarizing the results. For opinion orientation identification, two methods were compared: the first is based on the Semantic Orientation CALculator (SO-CAL) framework, while the second relies on the Valence Aware Dictionary and sEntiment Reasoner (VADER) framework.
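The second and third steps can be illustrated with a minimal sketch. It is not the project's code and it uses neither SO-CAL nor VADER: the tiny lexicon, the negation rule, and the function names are all assumptions made for illustration, and the feature list is supplied by hand instead of being extracted automatically.

```python
import re
from collections import defaultdict

# Hypothetical toy lexicon: word -> polarity. Real lexicons (SO-CAL, VADER)
# are far larger and also model intensifiers, negation scope, and more.
LEXICON = {"great": 1.0, "delicious": 1.0, "fresh": 0.5,
           "bland": -1.0, "stale": -1.0, "expensive": -0.5}
NEGATIONS = {"not", "never", "no"}


def split_sentences(text):
    """Split a review into sentences on '.', '!' and '?'."""
    return [s.strip() for s in re.split(r"[.!?]+", text) if s.strip()]


def sentence_score(sentence):
    """Sum lexicon polarities; a negation flips the next opinion word."""
    score, negate = 0.0, False
    for token in re.findall(r"[a-z']+", sentence.lower()):
        if token in NEGATIONS:
            negate = True
        elif token in LEXICON:
            score += -LEXICON[token] if negate else LEXICON[token]
            negate = False
    return score


def summarize(reviews, features):
    """Count positive and negative sentences that mention each feature."""
    summary = defaultdict(lambda: {"positive": 0, "negative": 0})
    for review in reviews:
        for sentence in split_sentences(review):
            tokens = set(re.findall(r"[a-z']+", sentence.lower()))
            score = sentence_score(sentence)
            for feature in features:
                if feature in tokens and score != 0:
                    label = "positive" if score > 0 else "negative"
                    summary[feature][label] += 1
    return dict(summary)


# Made-up reviews, used only to exercise the sketch.
reviews = [
    "The flavor is delicious. The price is expensive!",
    "Flavor was not great. Packaging arrived fresh.",
]
result = summarize(reviews, features=["flavor", "price", "packaging"])
print(result)
```

Scoring at sentence level keeps an opinion word attached to the feature it most likely describes, which is why each review is split before scoring.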

Requirements

To install the requirements:

pip install -r requirements.txt

Data

The project is based on a reduced version of the Amazon Fine Food Reviews dataset (https://www.kaggle.com/snap/amazon-fine-food-reviews), which originally included about 500,000 food reviews spanning a period of over ten years (until October 2012). The reduced version consists of 35,172 reviews, each containing the product ID, the user ID, the rating score given by the user and, finally, the review text.
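As a rough sketch of how such records could be read and grouped per product, the snippet below parses a CSV with the four fields listed above. The column names `ProductId`, `UserId`, `Score`, and `Text` are assumed to follow the original Kaggle file and may differ in the reduced version; the `Review` class, the helper names, and the sample rows are hypothetical.

```python
import csv
import io
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Review:
    product_id: str
    user_id: str
    score: int
    text: str


def load_reviews(handle):
    """Parse CSV rows into Review records, keeping only the four fields."""
    return [
        Review(row["ProductId"], row["UserId"], int(row["Score"]), row["Text"])
        for row in csv.DictReader(handle)
    ]


def group_by_product(reviews):
    """Map each product ID to the list of its reviews."""
    grouped = defaultdict(list)
    for review in reviews:
        grouped[review.product_id].append(review)
    return dict(grouped)


# Made-up sample rows standing in for the real file (assumed column names).
SAMPLE = io.StringIO(
    "ProductId,UserId,Score,Text\n"
    "P001,U1,5,Great taste\n"
    "P001,U2,2,Too salty\n"
    "P002,U1,4,\"Crunchy, fresh\"\n"
)
reviews = load_reviews(SAMPLE)
by_product = group_by_product(reviews)
print({pid: len(items) for pid, items in by_product.items()})
```

To read an actual file, an open file object can be passed to `load_reviews` in place of `SAMPLE`.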

Usage

script.py includes all the functions used for dataset preprocessing and figure generation.
main.py includes all the functions used to perform sentiment analysis and store the results in Elasticsearch.
app.py contains the web-app code.
preprocess.py, SO_Calc.py, SO_Run.py, and the Resources directory are adapted from the SO-CAL Python library (https://github.com/sfu-discourse-lab/SO-CAL).
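To give an idea of what storing per-product results in Elasticsearch can involve, the sketch below builds a request body in the newline-delimited JSON format expected by the Elasticsearch bulk API. It is not taken from main.py or ElasticSearchClient.py: the index name, the document fields, the counts, and the helper name are all assumptions, and sending the payload to a cluster is left out.

```python
import json


def to_bulk_payload(index_name, documents):
    """Build a bulk-API body: one action line, then one source line per doc."""
    lines = []
    for doc_id, doc in documents.items():
        action = {"index": {"_index": index_name, "_id": doc_id}}
        lines.append(json.dumps(action))
        lines.append(json.dumps(doc))
    # The bulk API requires the body to end with a newline.
    return "\n".join(lines) + "\n"


# Hypothetical document shape with made-up counts, for illustration only.
documents = {
    "P001-flavor": {"product_id": "P001", "feature": "flavor",
                    "positive": 12, "negative": 3},
}
payload = to_bulk_payload("product-features", documents)
print(payload, end="")
```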

Screenshots

References

J. J. McAuley and J. Leskovec, “From amateurs to connoisseurs: modeling the evolution of user expertise through online reviews,” in Proceedings of the 22nd international conference on World Wide Web, 2013, pp. 897–908.
B. Liu et al., “Sentiment analysis and subjectivity,” Handbook of natural language processing, vol. 2, no. 2010, pp. 627–666, 2010.
M. Hu and B. Liu, “Mining and summarizing customer reviews,” in Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, 2004, pp. 168–177.
M. Eirinaki, S. Pisal, and J. Singh, “Feature-based opinion mining and ranking,” Journal of Computer and System Sciences, vol. 78, no. 4, pp. 1175–1184, 2012.
M. Taboada, J. Brooke, M. Tofiloski, K. Voll, and M. Stede, “Lexicon-based methods for sentiment analysis,” Computational linguistics, vol. 37, no. 2, pp. 267–307, 2011.
C. Hutto and E. Gilbert, “Vader: A parsimonious rule-based model for sentiment analysis of social media text,” in Proceedings of the International AAAI Conference on Web and Social Media, vol. 8, no. 1, 2014.
R. Campos, V. Mangaravite, A. Pasquali, A. Jorge, C. Nunes, and A. Jatowt, “Yake! keyword extraction from single documents using multiple local features,” Information Sciences, vol. 509, pp. 257–289, 2020.

Authors

Lorenzo Pirola
