Skip to content
#

yelp-dataset

Here are 226 public repositories matching this topic...

Hybrid Learning-to-Rank system processing 2.4M Yelp reviews. Features a custom NLP pipeline (SBERT + VADER), Neural Ranking architecture, and MMR diversity re-ranking to solve the Cold-Start problem.

  • Updated Mar 30, 2026
  • Python

A large-scale data analysis project built on Apache Hadoop and Apache Spark, analyzing 7M+ Yelp reviews, 150K businesses, and 2M users. Covers business intelligence, user behavior, rating patterns, and review trends using PySpark and Hive on a multi-node cluster. Visualized through Apache Zeppelin notebooks.

  • Updated Mar 28, 2026
  • Python

DineSmart — Turning Unstructured Yelp Reviews into Strategic Insights. 📈 A comprehensive Business Analytics project integrating Sentiment Analysis and Machine Learning to help customers discover dining experiences and entrepreneurs identify market gaps. Built with Python, Gensim, and scikit-learn.

  • Updated Feb 24, 2026
  • Jupyter Notebook

In this NLP project, we will classify Yelp reviews into 1-star or 5-star categories using simplified methods, utilizing the Yelp Review Data Set from Kaggle, which includes a "stars" column for ratings and user votes on "cool," "useful," and "funny" reviews.

  • Updated Feb 5, 2026
  • Jupyter Notebook
Predicting_Yelp_Review_Quality

[Archived] Classical NLP pipeline (2019-2020) predicting Yelp review quality using TF-IDF, FastText, LDA, and traditional ML. Pre-transformer era techniques preserved as a learning resource.

  • Updated Jan 13, 2026
  • Jupyter Notebook

This project analyzes the Yelp dataset for the state of Arizona to extract insights about restaurant businesses and user behavior. Using Apache Spark and PySpark for distributed data processing, the project demonstrates how big data tools can be used to uncover patterns in customer reviews, business performance, and user engagement.

  • Updated Dec 19, 2025
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the yelp-dataset topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the yelp-dataset topic, visit your repo's landing page and select "manage topics."

Learn more