vbalalian/README.md

Vincent Balalian — Data Analytics & Engineering Projects

I’m a Business Analytics major with a focus on designing data pipelines, dashboards, and predictive models.


Projects

| Name | What & Why | Tech Highlights |
| --- | --- | --- |
| E-Store Marketing Analytics | Event-driven batch analytics pipeline that processes 400M+ ecommerce events to calculate customer churn and conversion metrics and to perform RFM analysis. | dbt · Dagster · GCP |
| Predicting Restaurant Success with Yelp Data | Investigates whether sentiment analysis of early restaurant reviews improves prediction of long-term Yelp ratings. | dbt · BigQuery · Python (pandas, scikit-learn, XGBoost, VADER) |
| Littlefield Factory Sim Analytics Pipeline | Tool for scraping, analyzing, and reporting real-time metrics in the Littlefield simulation. Helps users understand throughput, utilization, and decision latency. | Python · SQL · GCP |
| Roman Coins Open Source Data Pipeline | End-to-end data engineering pipeline for collecting, transforming, and serving historical coin data. Demonstrates robustness, open-source tooling, and CI/CD. | Python · PostgreSQL · FastAPI · dbt · Dagster · Docker |

Connect

Pinned

  1. estore-analytics (Public)

    Event-driven batch analytics pipeline for a large ecommerce events dataset. Built on GCP with dbt and Dagster.

    Python 1

  2. roman_coins_data_pipeline (Public)

    ELT pipeline used for learning the fundamentals of data engineering.

    Python 4

  3. littlefield (Public)

    Combined web-scraping, loading, and reporting tool for the Littlefield simulation, built for use with Google Cloud Run functions and Google Cloud Scheduler.

    Python

  4. three-gits (Public)

    Group analytics project for a predictive analytics course that uses the Yelp Open Dataset to predict restaurant success.

    Jupyter Notebook