Car Fuel Efficiency Prediction

The document outlines a project to predict car fuel efficiency using Polynomial Regression based on engine size, utilizing a dataset from Kaggle. It details the steps of loading the dataset, visualizing relationships, implementing both Polynomial and Simple Linear Regression models, and evaluating their performance through Mean Squared Error and R² scores. The results indicate that Polynomial Regression (degree=3) outperforms Simple Linear Regression in predictive accuracy due to its ability to capture nonlinear relationships.

Uploaded by

mcanarender


Predicting Car Fuel Efficiency

Objective: Use Polynomial Regression to predict car fuel efficiency based on engine size.
Dataset: https://www.kaggle.com/uciml/autompg-dataset
Tasks:
1. Load and explore the dataset.
2. Create scatter plots to visualize the relationships between engine size and fuel efficiency.
3. Implement Polynomial Regression (e.g., degree=3) to predict fuel efficiency.
4. Evaluate and compare the performance with a Simple Linear Regression model.

# Import necessary libraries

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error, r2_score

# Step 1: Load and explore the dataset

file_path = "/content/drive/MyDrive/nkphd/auto-mpg.csv"
df = pd.read_csv(file_path)

# Display basic information about the dataset

print("Dataset Overview:")
print(df.head())
print("\nSummary Statistics:")
print(df.describe())

# Check for missing values

print("\nMissing Values:")
print(df.isnull().sum())

# Drop rows with missing values

df.dropna(inplace=True)

# Step 2: Scatter plot of engine size vs. fuel efficiency

plt.figure(figsize=(8, 6))
plt.scatter(df['displacement'], df['mpg'], color='blue', alpha=0.6)
plt.title("Engine Size vs. Fuel Efficiency")
plt.xlabel("Engine Size (Displacement)")
plt.ylabel("Fuel Efficiency (MPG)")
plt.grid()
plt.show()

# Step 3: Polynomial Regression

# Define features (engine size) and target (mpg)
X = df[['displacement']]
y = df['mpg']

# Split the data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Create polynomial features of degree 3

poly = PolynomialFeatures(degree=3)
X_train_poly = poly.fit_transform(X_train)
X_test_poly = poly.transform(X_test)

# Train the polynomial regression model

poly_model = LinearRegression()
poly_model.fit(X_train_poly, y_train)

# Predict using the polynomial regression model

y_pred_poly = poly_model.predict(X_test_poly)

# Step 4: Simple Linear Regression

# Train the simple linear regression model
linear_model = LinearRegression()
linear_model.fit(X_train, y_train)

# Predict using the simple linear regression model

y_pred_linear = linear_model.predict(X_test)

# Evaluate the models

mse_poly = mean_squared_error(y_test, y_pred_poly)
r2_poly = r2_score(y_test, y_pred_poly)

mse_linear = mean_squared_error(y_test, y_pred_linear)

r2_linear = r2_score(y_test, y_pred_linear)

print("\nModel Performance:")
print(f"Polynomial Regression (degree=3) - MSE: {mse_poly:.2f}, R²: {r2_poly:.2f}")
print(f"Simple Linear Regression - MSE: {mse_linear:.2f}, R²: {r2_linear:.2f}")

# Visualize the Polynomial Regression fit

plt.figure(figsize=(8, 6))
plt.scatter(X, y, color='blue', alpha=0.6, label="Actual")
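# Sort by displacement so the fitted curves are drawn left to right;
# keep a DataFrame so the column name matches the one used during fitting
X_sorted = X.sort_values(by='displacement')
plt.plot(X_sorted['displacement'], poly_model.predict(poly.transform(X_sorted)), color='red', label="Polynomial Regression (degree=3)")
plt.plot(X_sorted['displacement'], linear_model.predict(X_sorted), color='green', linestyle='--', label="Simple Linear Regression")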
plt.title("Model Comparison")
plt.xlabel("Engine Size (Displacement)")
plt.ylabel("Fuel Efficiency (MPG)")
plt.legend()
plt.grid()
plt.show()
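
The degree-3 model above is still a linear regression, only fitted on expanded inputs. As a minimal sketch of that expansion, the snippet below builds the columns 1, x, x², x³ with NumPy's `np.vander`, which mirrors what `PolynomialFeatures(degree=3)` generates for a single input feature with its default bias column. The displacement values are hypothetical and chosen only for illustration.

```python
import numpy as np

# Hypothetical displacement values, for illustration only
x = np.array([2.0, 3.0])

# Columns in increasing power: [1, x, x^2, x^3]
expanded = np.vander(x, N=4, increasing=True)

print(expanded)
```

Fitting `LinearRegression` on these four columns then amounts to estimating the coefficients of mpg ≈ b0 + b1·x + b2·x² + b3·x³.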

Performance Evaluation

The Mean Squared Error (MSE) and R² score, both computed on the held-out test set, are used to compare the two models:

• Polynomial Regression (degree=3) provides a lower MSE and a higher R² score, indicating a better fit and improved predictive accuracy.
• Simple Linear Regression has a higher MSE and a lower R² score, because a straight line cannot capture the nonlinear relationship between engine size and fuel efficiency effectively.
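
To make the two metrics concrete, here is a small sketch that computes them by hand in plain Python. The helper names `mean_squared_error_manual` and `r2_score_manual` are hypothetical, and the numbers are made up for illustration; they are not results from the auto-mpg dataset.

```python
def mean_squared_error_manual(y_true, y_pred):
    """Average of the squared residuals."""
    n = len(y_true)
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n


def r2_score_manual(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot


# Hypothetical values, for illustration only
y_true = [10.0, 20.0, 30.0]
y_pred = [12.0, 18.0, 30.0]

mse = mean_squared_error_manual(y_true, y_pred)  # (4 + 4 + 0) / 3
r2 = r2_score_manual(y_true, y_pred)             # 1 - 8 / 200

print(f"MSE: {mse:.2f}, R²: {r2:.2f}")
```

A lower MSE means smaller average squared error, while an R² closer to 1 means the model explains more of the variance in the target than simply predicting its mean.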

Comparison

• Linear Regression assumes a straight-line relationship, leading to underfitting in cases where the relationship is nonlinear.
• Polynomial Regression captures the curvature in the data, fitting more accurately but at the cost of increased model complexity.
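
The trade-off between underfitting and model complexity can be sketched on synthetic data. The example below is an assumption-laden illustration, not part of the original experiment: it uses `np.polyfit` instead of scikit-learn, a made-up quadratic signal with noise, and an arbitrary set of degrees (1, 3, 8). It reports training and test MSE for each degree.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic curved data, for illustration only (not the auto-mpg dataset)
x = np.linspace(-1.0, 1.0, 40)
y = 1.0 - 2.0 * x + 1.5 * x**2 + rng.normal(scale=0.1, size=x.size)

# Alternate points between a training split and a test split
x_train, y_train = x[::2], y[::2]
x_test, y_test = x[1::2], y[1::2]


def mse(coeffs, xs, ys):
    """Mean squared error of a fitted polynomial on (xs, ys)."""
    return float(np.mean((np.polyval(coeffs, xs) - ys) ** 2))


results = {}
for degree in (1, 3, 8):
    coeffs = np.polyfit(x_train, y_train, degree)
    results[degree] = (mse(coeffs, x_train, y_train), mse(coeffs, x_test, y_test))
    print(f"degree={degree}  train MSE={results[degree][0]:.4f}  "
          f"test MSE={results[degree][1]:.4f}")
```

Training error can only stay the same or fall as the degree increases, because each higher-degree model contains the lower-degree one. Test error carries no such guarantee, which is why the comparison in this document is made on held-out data.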

Conclusion

Polynomial Regression (degree=3) predicts fuel efficiency better than Simple Linear Regression. Its lower MSE and higher R² score on the test set confirm its superior accuracy on this dataset.
