List of projects
1. Sales Prediction Using Linear Regression
• Objective: Build a linear regression model to predict sales based on factors like
advertising costs, product stock, and the number of potential customers.
• Dataset: Use available sales datasets from Kaggle or other open e-commerce datasets.
• Implementation: Focus on basic data analysis, cleaning, and building a linear regression
model for prediction.
2. Email Spam Classification Using Logistic Regression
• Objective: Use a logistic regression model to classify emails into two categories: spam
and not spam.
• Dataset: Use a spam email dataset, such as the Spam Email Dataset from the UCI
Machine Learning Repository.
• Implementation: Students work with simple text data processing and apply logistic
regression for classification.
3. Diabetes Detection Using Decision Trees
• Objective: Use a decision tree to detect whether a patient is likely to have diabetes based
on basic health indicators.
• Dataset: Use the Pima Indians Diabetes dataset from UCI or Kaggle, which includes
health metrics such as age, weight, and blood pressure.
• Implementation: Build a basic decision tree model for classification to predict the
likelihood of diabetes.
4. House Price Prediction Using Multiple Linear Regression
• Objective: Predict house prices based on factors like area, number of bedrooms, and
location.
• Dataset: Use a housing dataset such as the Boston Housing Dataset available in Scikit-
learn or on Kaggle.
• Implementation: Focus on building a multiple linear regression model and evaluating its
performance.
5. Handwritten Digit Recognition Using a Simple Neural Network
• Objective: Use a simple neural network to classify handwritten digits.
• Dataset: Use the well-known MNIST dataset, which contains images of handwritten
digits.
• Implementation: Students build a simple neural network with 1-2 hidden layers to
classify digits from images.