import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestRegressor
from sklearn.feature_selection import SelectKBest, f_regression
# Load the dataset
df = pd.read_csv('auto-mpg.csv')
# Display the first few rows of the dataset
print("Dataset Preview:")
print(df.head())
# Check for null values
print("\nNull Values:")
print(df.isnull().sum())
# Basic information about the dataset
print("\nDataset Info:")
print(df.info())
# Handle missing values. In common copies of the UCI auto-mpg CSV, missing
# horsepower is encoded as '?' rather than NaN (an assumption about this
# particular file; skip the coercion if your copy is already numeric).
df['horsepower'] = pd.to_numeric(df['horsepower'], errors='coerce')
df = df.dropna()
# Remove non-numeric columns like 'car name' (or any other categorical columns)
df = df.drop(columns=['car name'])
# If there are any other categorical columns, encode them (e.g., using one-hot encoding)
# df = pd.get_dummies(df, drop_first=True)  # Uncomment if you have categorical features
# Splitting into features (X) and target (y)
X = df.drop(columns=['mpg']) # Assuming 'mpg' is the target variable
y = df['mpg']
# Feature selection using SelectKBest
selector = SelectKBest(score_func=f_regression, k='all')
X_new = selector.fit_transform(X, y)
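# Note: with k='all', SelectKBest removes nothing, so X_new is just X as a
# NumPy array; fitting is still useful because it populates selector.scores_,
# which we use below to rank the features.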
# Display feature scores
feature_scores = pd.DataFrame({'Feature': X.columns, 'Score': selector.scores_})
print("\nFeature Scores:")
print(feature_scores)
# Visualizing feature scores
plt.figure(figsize=(10, 6))
sns.barplot(x='Feature', y='Score', data=feature_scores)
plt.title('Feature Selection Scores')
plt.xticks(rotation=45)
plt.tight_layout()  # keep the rotated tick labels from being clipped
plt.show()
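# Optional: the bars above appear in column order; sorting by score first can
# make the chart easier to read, e.g.
# sns.barplot(x='Feature', y='Score',
#             data=feature_scores.sort_values('Score', ascending=False))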
# Selecting top 3 features (example)
top_features = feature_scores.nlargest(3, 'Score')['Feature'].tolist()
print("\nTop 3 Relevant Features:")
print(top_features)
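# --- Optional modeling step (a minimal sketch, not part of the original script) ---
# train_test_split and RandomForestRegressor are imported above but never used;
# one plausible follow-up is to train a model on the top-ranked features. The
# values below (test_size=0.2, n_estimators=100) are illustrative assumptions,
# not tuned hyperparameters.
X_top = df[top_features]
X_train, X_test, y_train, y_test = train_test_split(
    X_top, y, test_size=0.2, random_state=42)
model = RandomForestRegressor(n_estimators=100, random_state=42)
model.fit(X_train, y_train)
print(f"\nRandom Forest R^2 on held-out test set: {model.score(X_test, y_test):.3f}")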