Visualizes the Random Forest debug string from Spark MLlib using D3.js
Updated Sep 8, 2022 - Python
spark (scala and python)
Implementation of Inferring Networks of Substitutable and Complementary Products Model paper
Example from Spark MLlib (in Python)
Kaggle machine learning with Spark
An item-based recommender model that computes the cosine similarity of each item pair from the item factors matrix generated by Spark MLlib's ALS algorithm and recommends the top 5 items for the selected item.
A movie recommendation system on MovieLens 25M dataset using Python and Apache Spark
Scalable Trust-Signal Detection: A Big Data pipeline using PySpark and GCP Dataproc to classify 8GB+ of Amazon reviews with high-precision Random Forest modeling. Engineered for horizontal scalability and verified data integrity
#MachineLearning project for predicting house rent, demonstrating MLOps
NYU Real Time Big Data Analysis Final Project
End-to-end customer churn prediction pipeline built with Apache Spark MLlib — featuring synthetic data generation, feature engineering, Chi-Square selection, and multi-model hyperparameter tuning with cross-validation.
CSCI-GA.3033-005 - Big Data Application Development
Image Captioning with pretrained last layer of DCNN and RNN
📊 Predict and reduce customer churn with a production-ready machine learning pipeline, ensuring quick and accurate insights for better retention strategies.
An automatic machine learning based customer segmentation model with RFM analysis at ICTA conference 2024
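The item-based recommender listed above ranks items by the cosine similarity of their ALS item factors. A minimal sketch of that idea in plain Python follows, assuming the item factors have already been exported from the ALS model into a dictionary. The names `item_factors`, `cosine_similarity`, and `top_similar_items` and the toy vectors are hypothetical and are not taken from that repository.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length factor vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_u * norm_v)

def top_similar_items(item_factors, selected_id, k=5):
    """Return the k items most similar to selected_id as (id, score) pairs."""
    selected = item_factors[selected_id]
    scored = [
        (other_id, cosine_similarity(selected, vec))
        for other_id, vec in item_factors.items()
        if other_id != selected_id  # never recommend the selected item itself
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]

# Toy factors standing in for the ALS item factors matrix (assumed values).
item_factors = {
    "A": [1.0, 0.0],
    "B": [2.0, 0.0],
    "C": [0.0, 1.0],
    "D": [1.0, 1.0],
    "E": [-1.0, 0.0],
}

recommendations = top_similar_items(item_factors, "A", k=2)
```

With real data, the dictionary would be built from the trained model's item factors, and `k=5` matches the top 5 items mentioned in the description.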