-
-
online-purchase-propensity Public
In progress. Modeling shopper purchase propensity for a ficticious online retail website.
Jupyter Notebook UpdatedJul 12, 2021 -
game-ab-testing Public
A/B testing impact of progression system changes on player retention / interaction. Non-parametric hypothesis testing and power transformations for non-normally distributed data.
jupyter-notebook python3 statistical-inference ab-testing wilcoxon-mann-whitney-test retention-analysis yeo-johnson-transformationsJupyter Notebook UpdatedJul 6, 2021 -
retail-rfm-analysis Public
Recency, Frequency, and Monetary Value (RFM) analysis and customer segmentation for transactional data. Unsupervised cluster learning to delineate customer types.
-
dask-emnist-classification Public
Dask-parallelized project, contrasting GaussianNB and LightGBM models for EMNIST handwritten character classification.
Jupyter Notebook UpdatedJun 9, 2021 -
mnist-svm-tuning Public
Optimizing LinearSVC models trained on the MNIST Handwritten Digits dataset, includes ensemble methods and bayesian optimization.
Jupyter Notebook UpdatedMay 15, 2021 -
Quick visualization of linear decision boundaries for a scratch-implemented perceptron classifier. Model evaluates loss function with each weight / bias update and will store away best performing p…
Jupyter Notebook UpdatedMay 15, 2021 -
tweet-sentiment-nlp Public
A notebook covering tweet natural language processing, from data munging to predictive model creation and evaluation. Sentiment analysis will be used to classify tweet body text as positive or nega…
-
heart-descent-methods Public
Partial scratch-implementation of coordinate descent, using a specified feature selector method to choose and later update a coordinate's corresponding weight. This behavior mimics the "fit" functi…
Jupyter Notebook UpdatedMay 3, 2021 -
Comparison of scratch-implemented and sklearn nearest neighbors methods for classification of vertebral patient data. Using multiple distance metrics, models were k-value optimized to reduce error …
sklearn jupyter-notebook python3 classification nearest-neighbors distance-metric scratch-implementationJupyter Notebook UpdatedApr 16, 2021 -
animal-attribute-clustering Public
A quick comparison of two (K-Means and hierarchical/agglomerative) clustering methods for text-based animal species classification.
python3 jupyter-notebooks kmeans-clustering hierarchical-clustering clustering-methods text-classificaiton animals-with-attributesJupyter Notebook UpdatedApr 14, 2021 -
mnist-digit-classifier Public
Gaussian naive Bayes classifier for digits in the MNIST dataset. Similar in nature to my other repo ("newsgroup-naive-bayes"), albeit instead of multinomial document classification, this repo explo…
python jupyter-notebook image-classification mnist-handwriting-recognition scratch-implementation gaussian-naive-bayes covariance-smoothingJupyter Notebook UpdatedApr 14, 2021 -
newsgroups-naive-bayes Public
Multinomial naive Bayes newsgroup document classification without relying on pre-built sklearn modules. Smoothing and inverse document frequencies utilized to improve model accuracy.
text-classification python3 without-sklearn multinomial-naive-bayes jupyer-notebook laplace-smoothing inverse-document-frequencyJupyter Notebook UpdatedApr 14, 2021 -
nintendo-game-ratings Public
EDA and association rule mining for a Metacritic-sourced dataset of Nintendo games, with comprehensive game-specific data like review scores, release date, developers, genres, and ESRB rating.
ratings exploratory-data-analysis jupyter-notebook python3 tableau metacritic association-rule-miningJupyter Notebook UpdatedApr 14, 2021 -
xbox-game-pass Public
Xbox Game Pass subscription value quantification and visualization.
-
billboard-artist-frequency Public
Frequency analysis of historical Billboard Hot 100 lists based on an artist's supporting vs main artist numbers.
Jupyter Notebook UpdatedApr 2, 2021 -
drug-use-and-health Public
A brief look at drug use trends in the United States, as they relate to: earliest age of consumption, and impact on mental health.
Jupyter Notebook UpdatedSep 10, 2020