-
data-engineering-with-data-build-tool-dbt-4458303 Public
Forked from LinkedInLearning/data-engineering-with-data-build-tool-dbt-4458303This is a code repository for the course Data Engineering with Data Build Tool (DBT).
Dockerfile Other UpdatedAug 31, 2024 -
kNN_Recommender_System Public
The objective of this project is to build a kNN-based recommender system in order to predict the top 5 movie based on a given movie, in this case "The Post". As there is no need for classification …
-
-
This project demonstrates experience summarizing Subqueries and Joins in SQL to fulfill multiple data requests.
UpdatedMay 25, 2021 -
SQL_UNION_CASE Public
This project demonstrates experience using UNION and CASE functions in SQL to fulfill multiple data requests.
UpdatedMay 25, 2021 -
SQL_Indexes Public
This project uses the a custom database table in combination with various Index functions to fulfill multiple data requests.
UpdatedMay 25, 2021 -
COVID_19_Reopening Public
Given its current circumstances, should a county reopen to prevent a second outbreak of COVID 19? We will answer this question by identifying similar counties, which can be referred to when decidin…
Jupyter Notebook UpdatedMay 25, 2021 -
Ensemble_Models Public
In this project, four predictive machine learning models are applied to one dataset in order to select the model that returns the most accurate predictions. Different values for the hyperparameters…
-
Wine_Quality_Analysis Public
Several wines are analyzed in order to gain insights on what features affect a wine's rating using logistic regressions models, kNN algorithms, topic modeling, LDA, and Gensim models.
-
Decision_Tree_Algorithms Public
This project aims to build various Decision Tree Classifier Models to predict the income group of people along with seven demographic variables. Out of all decision trees, the best model is chosen,…
-
This project employs a Logistic Regression model with the objective to predict the risk of Cardiovascular Disease and identify factors that increase may increase risks.
-
EDA_Indian_Air_Pollution Public
This project analyzes levels of various pollutants in India by state. The results are used to calculate air quality, which is then compared among the states with a focus on Andhra Pradesh.
Jupyter Notebook UpdatedApr 22, 2021 -
The model is trained with a set of emails labelled as either from Spam or Not Spam. There are 702 emails equally divided into spam and non spam category. Next, we shall test the model on 260 emails…
-
SQL_Data_Requests Public
This project demonstrates an understanding of various SQL syntaxes to complete multiple data requests, including custom requests, using the Sales_Order and sakila databases.
UpdatedMar 11, 2021 -
SQL_String_Date_Functions Public
This project demonstrates an understanding of Date and String functions by fulfilling various data requests. It also demonstrates knowledge of writing and executing SQL within Jupyter notebook, whi…
Jupyter Notebook UpdatedMar 3, 2021 -
SQL_Windows_and_CTE Public
This project uses the sakila database in combination with various window functions and CTEs to fulfill multiple data requests.
UpdatedMar 3, 2021 -
Basic_RegEx Public
A regular expression is a sequence of characters that define a search pattern. Usually such patterns are used by string-searching algorithms for "find" or "find and replace" operations on strings, …
Jupyter Notebook UpdatedFeb 2, 2021 -
SQL_Joins Public
This project uses the sakila sample database which contains customer- and inventory information for a movie rental business. Inner Join and Left Join statements are used to execute specific tasks s…
UpdatedFeb 2, 2021