DS Practical

The document is a lab report submitted by Amandeep (Roll No. 105/20, Branch: CSE, 6th Sem) to Dr. Vinay Chopra (Associate Professor) at D.A.V. Institute of Engineering & Technology, Jalandhar. It contains 15 experiments performed using Python libraries and concepts related to data science and data analysis. The experiments include using pandas to load and analyze NBA player data, performing matrix operations in NumPy, combining DataFrames in pandas, adding/selecting columns in pandas DataFrames, and creating visualizations like box plots, histograms, pivot tables and heatmaps.


DAVIET/CSE/2003635

D.A.V INSTITUTE OF ENGINEERING & TECHNOLOGY

JALANDHAR

Data Science LAB(BTCS 616-18)

Submitted To: Dr. Vinay Chopra (Associate Professor)
Submitted By: Amandeep
Roll: 105/20
Branch: CSE/6th Sem


INDEX
S.No.  Experiment
1   Using the pandas Python Library
2   a) Write a basic code in Python for matrix input from user
    b) Create matrix using map function using NumPy
    c) Write a Python program to demonstrate the operations add, subtract and divide using NumPy
3   Combine two DataFrames in Python-Pandas
    a) Concatenating dataframe
    b) Joining dataframe
    c) Concatenating using append
4   Adding new column to existing dataframe in Pandas
    a) By declaring a new list as a column
    b) By using dataframe insert()
    c) Using dataframe assign() method
    d) By using a dictionary
5   Create a new column in Pandas dataframe based on the existing columns
    a) Use dataframe.apply()
    b) We can achieve the same result by directly performing the required operation on the desired column element-wise
    c) Using dataframe.map() function
6   Create a new column in Pandas dataframe based on a given condition
    a) Using list comprehension
    b) Using dataframe.apply() function
    c) Using dataframe.map() function
    d) Using numpy.where() function
7   Selecting rows in Pandas dataframe based on a given condition
    a) Selecting all the rows from the given dataframe in which 'Percentage' is greater than 80 using loc[]
    b) Selecting those rows whose column value is present in the list using isin() method of the dataframe
    c) Selecting rows based on multiple column conditions using '&' operator
8   Pandas dataframe where()
9   Create Box Plot
10  Create Histogram
11  Create Pivot Table
12  Create Heatmap
13  Demonstrating mean() in a Python program
14  Demonstrating standard deviation in a Python program
15  Calculating skewness and kurtosis in Python


Task 1: Using the pandas Python Library

import requests

download_url = "https://raw.githubusercontent.com/fivethirtyeight/data/master/nba-elo/nbaallelo.csv"
target_csv_path = "nba_all_elo.csv"
response = requests.get(download_url)
response.raise_for_status()  # Check that the request was successful
with open(target_csv_path, "wb") as f:
    f.write(response.content)
print("Download ready.")

import pandas as pd
nba = pd.read_csv("nba_all_elo.csv")
type(nba)

len(nba)

nba.shape

nba.describe()


import numpy as np

nba.describe(include=object)

nba["team_id"].value_counts()


Task 2: a) Write a basic code in python for Matrix input from user

INPUT:
R = int(input("Enter the number of rows:"))
C = int(input("Enter the number of columns:"))
matrix = []
print("Enter the entries rowwise:")
for i in range(R):  # A for loop for row entries
    a = []
    for j in range(C):  # A for loop for column entries
        a.append(int(input()))
    matrix.append(a)
# Print the matrix row by row
for i in range(R):
    for j in range(C):
        print(matrix[i][j], end=" ")
    print()

OUTPUT:

b) Create Matrix using map function using Numpy


INPUT:
import numpy as np
R = int(input("Enter the number of rows:"))
C = int(input("Enter the number of columns:"))
print("Enter the entries in a single line (separated by space): ")
entries = list(map(int, input().split()))
matrix = np.array(entries).reshape(R, C)
print(matrix)

OUTPUT:

c) Write a python program to demonstrate the operations add , subtract and divide
using Numpy

INPUT:
import numpy as np
print("Add:")
print(np.add(1.0, 4.0))
print("Subtract:")
print(np.subtract(1.0, 4.0))
print("Multiply:")
print(np.multiply(1.0, 4.0))
print("Divide:")
print(np.divide(1.0, 4.0))

OUTPUT:
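
The same NumPy functions also work element-wise on whole arrays, which is closer to the matrix setting of parts (a) and (b). A minimal sketch, assuming two small 2x2 arrays named a and b whose values are chosen only for illustration (they do not come from the listing above):

```python
import numpy as np

# Illustrative 2x2 matrices (assumed values, not from the listing above)
a = np.array([[1.0, 2.0], [3.0, 4.0]])
b = np.array([[4.0, 5.0], [6.0, 8.0]])

total = np.add(a, b)            # element-wise a + b
difference = np.subtract(a, b)  # element-wise a - b
product = np.multiply(a, b)     # element-wise a * b (not a matrix product)
quotient = np.divide(a, b)      # element-wise a / b

print("Add:\n", total)
print("Subtract:\n", difference)
print("Multiply:\n", product)
print("Divide:\n", quotient)
```

Note that np.multiply() multiplies matching entries; a true matrix product would use np.matmul() or the @ operator instead.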


Task 3: Combine two DataFrames in Python-Pandas

a) Concatenating dataframe

INPUT:
import pandas as pd
df1 = pd.DataFrame({'id': ['A01', 'A02', 'A03', 'A04'],
'Name': ['ABC', 'PQR', 'DEF', 'GHI']})
df2 = pd.DataFrame({'id': ['B05', 'B06', 'B07', 'B08'],
'Name': ['XYZ', 'TUV', 'MNO', 'JKL']})
frames = [df1, df2]
result = pd.concat(frames)
display(result)

OUTPUT:

b) Joining dataframe

INPUT:
import pandas as pd
df1 = pd.DataFrame({'id': ['A01', 'A02', 'A03', 'A04'],
                    'Name': ['ABC', 'PQR', 'DEF', 'GHI']})
df3 = pd.DataFrame({'City': ['MUMBAI', 'PUNE', 'MUMBAI', 'DELHI'],
                    'Age': ['12', '13', '14', '12']})
# Place the two frames side by side, keeping only index labels present in both
result = pd.concat([df1, df3], axis=1, join='inner')
display(result)

OUTPUT:

c) Concatenating using append
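
No listing is shown for this part. A minimal sketch, assuming the same df1 and df2 as in part (a): on pandas versions before 2.0 this would be written as df1.append(df2, ignore_index=True), but DataFrame.append() was removed in pandas 2.0, so the sketch uses pd.concat() as the equivalent. The use of ignore_index=True is an assumption, not something taken from the report.

```python
import pandas as pd

# Same frames as in part (a)
df1 = pd.DataFrame({'id': ['A01', 'A02', 'A03', 'A04'],
                    'Name': ['ABC', 'PQR', 'DEF', 'GHI']})
df2 = pd.DataFrame({'id': ['B05', 'B06', 'B07', 'B08'],
                    'Name': ['XYZ', 'TUV', 'MNO', 'JKL']})

# Stack df2 below df1; ignore_index=True renumbers the rows 0..7
# (assumed option, replaces the removed df1.append(df2, ignore_index=True))
result = pd.concat([df1, df2], ignore_index=True)
print(result)
```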

OUTPUT:


Task 4: Adding new column to existing dataframe in Pandas

a) By declaring a new list as a column.

INPUT:
import pandas as pd
data = {'Name': ['Jai', 'Princi', 'Gaurav', 'Anuj'],
'Height': [5.1, 6.2, 5.1, 5.2],
'Qualification': ['Msc', 'MA', 'Msc', 'Msc']}
df = pd.DataFrame(data)
address = ['Delhi', 'Bangalore', 'Chennai', 'Patna']
df['Address'] = address
print(df)

OUTPUT:


b) By using dataframe insert()

INPUT:
import pandas as pd
data = {'Name': ['Jai', 'Princi', 'Gaurav', 'Anuj'],
'Height': [5.1, 6.2, 5.1, 5.2],
'Qualification': ['Msc', 'MA', 'Msc', 'Msc']}
df = pd.DataFrame(data)
df.insert(2, "Age", [21, 23, 24, 21], True)
print(df)
OUTPUT:

c) Using dataframe assign() method

INPUT:
import pandas as pd
data = {'Name': ['Jai', 'Princi', 'Gaurav', 'Anuj'],
'Height': [5.1, 6.2, 5.1, 5.2],
'Qualification': ['Msc', 'MA', 'Msc', 'Msc']}
df = pd.DataFrame(data)
df2 = df.assign(address=['Delhi', 'Bangalore', 'Chennai', 'Patna'])
print(df2)

OUTPUT:


d) By using a dictionary

INPUT:
import pandas as pd
data = {'Name': ['Jai', 'Princi', 'Gaurav', 'Anuj'],
'Height': [5.1, 6.2, 5.1, 5.2],
'Qualification': ['Msc', 'MA', 'Msc', 'Msc']}
address = {'Delhi': 'Jai', 'Bangalore': 'Princi',
'Patna': 'Gaurav', 'Chennai': 'Anuj'}
df = pd.DataFrame(data)
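# Invert the {city: name} dictionary and look up each name in it,
# so every row gets the city that belongs to that person
df['Address'] = df['Name'].map({name: city for city, name in address.items()})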
print(df)

OUTPUT:


Task 5: Create a new column in Pandas dataframe based on the existing columns
a) Use dataframe.apply()

OUTPUT:


b) We can achieve the same result by directly performing the required operation on
the desired column element-wise

INPUT:
import pandas as pd
df = pd.DataFrame({'Date': ['10/2/2011', '11/2/2011', '12/2/2011', '13/2/2011'],
                   'Event': ['Music', 'Poetry', 'Theatre', 'Comedy'],
                   'Cost': [10000, 5000, 15000, 2000]})
# Element-wise arithmetic on the whole 'Cost' column
df['Discounted_Price'] = df['Cost'] - (0.1 * df['Cost'])
print(df)
# Same result with apply(), which evaluates the lambda once per row
df['Discounted_Price'] = df.apply(lambda row: row.Cost - (row.Cost * 0.1), axis=1)
print(df)

OUTPUT:


c) Using dataframe.map() function

INPUT:
import pandas as pd

data = {
    "name": ["John", "Ted", "Dev", "Brad", "Rex", "Smith", "Samuel", "David"],
    "salary": [10000, 20000, 50000, 45500, 19800, 95000, 5000, 50000]
}
df = pd.DataFrame(data)
display(df.head())  # display() is provided by Jupyter/IPython; use print() elsewhere

def salary_stats(value):
    # Map a salary to a descriptive band
    if value < 10000:
        return "very low"
    elif 10000 <= value < 25000:
        return "low"
    elif 25000 <= value < 40000:
        return "average"
    elif 40000 <= value < 50000:
        return "better"
    elif value >= 50000:
        return "very good"

# Apply the function to every value of the 'salary' column
df['salary_stats'] = df['salary'].map(salary_stats)
display(df.head())

OUTPUT:


Task 6: Create a new column in Pandas dataframe based on a given condition

a) Using list comprehension


INPUT:
import pandas as pd
df = pd.DataFrame({'Date' : ['11/8/2011', '11/9/2011', '11/10/2011',
'11/11/2011', '11/12/2011'],
'Event' : ['Music', 'Poetry', 'Music', 'Comedy', 'Poetry']})
print(df)
df['Price'] = [1500 if x =='Music' else 800 for x in df['Event']]
print(df)

OUTPUT:

b) Using dataframe.apply() function

INPUT:
def set_value(row_number, assigned_value):
    # row_number receives each value of the 'Event' column; look up its price
    return assigned_value[row_number]
event_dictionary = {'Music': 1500, 'Poetry': 800, 'Comedy': 1200}
df['Price'] = df['Event'].apply(set_value, args=(event_dictionary,))
print(df)

OUTPUT:


c) Using dataframe.map() function

INPUT:
event_dictionary ={'Music' : 1500, 'Poetry' : 800, 'Comedy' : 1200}
df['Price'] = df['Event'].map(event_dictionary)
print(df)

OUTPUT:

d) Using numpy.where() function

INPUT:
import numpy as np
# 1500 where the event is 'Music', 800 everywhere else
df['Price'] = np.where(df['Event'] == 'Music', 1500, 800)
print(df)
OUTPUT:


Task 7: Selecting rows in Pandas dataframe based on a given condition

a) Selecting all the rows from the given dataframe in which 'Percentage' is greater
than 80 using loc[]

INPUT:
import pandas as pd
record = {
'Name': ['Ankit', 'Amit', 'Aishwarya', 'Priyanka', 'Priya', 'Shaurya' ],
'Age': [21, 19, 20, 18, 17, 21],
'Stream': ['Math', 'Commerce', 'Science', 'Math', 'Math', 'Science'],
'Percentage': [88, 92, 95, 70, 65, 78] }
dataframe = pd.DataFrame(record, columns = ['Name', 'Age', 'Stream',
'Percentage'])
print("Given Dataframe :\n", dataframe)
rslt_df = dataframe.loc[dataframe['Percentage'] > 80]
print('\nResult dataframe :\n', rslt_df)

OUTPUT:

b) Selecting those rows whose column value is present in the list using isin()
method of the dataframe

INPUT:
import pandas as pd


record = {
'Name': ['Ankit', 'Amit', 'Aishwarya', 'Priyanka', 'Priya', 'Shaurya' ],
'Age': [21, 19, 20, 18, 17, 21],
'Stream': ['Math', 'Commerce', 'Science', 'Math', 'Math', 'Science'],
'Percentage': [88, 92, 95, 70, 65, 78]}
dataframe = pd.DataFrame(record, columns = ['Name', 'Age', 'Stream',
'Percentage'])
print("Given Dataframe :\n", dataframe)
options = ['Math', 'Commerce']
rslt_df = dataframe[dataframe['Stream'].isin(options)]
print('\nResult dataframe :\n', rslt_df)

OUTPUT:

c) Selecting rows based on multiple column conditions using '&' operator


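# Reuses the pandas import and the record dictionary defined in part (b)
dataframe = pd.DataFrame(record, columns=['Name', 'Age', 'Stream',
                                          'Percentage'])
print("Given Dataframe :\n", dataframe)
options = ['Math', 'Science']
# Both conditions must hold: Age equals 21 AND Stream is one of the options
rslt_df = dataframe[(dataframe['Age'] == 21) &
                    dataframe['Stream'].isin(options)]
print('\nResult dataframe :\n', rslt_df)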

OUTPUT:


Task 8: Pandas dataframe where()

INPUT:
import pandas as pd
data = {
"age": [50, 40, 30, 40, 20, 10, 30],
"qualified": [True, False, False, False, False, True, True]
}
df = pd.DataFrame(data)
print(df)
newdf = df.where(df["age"] > 30)
print(newdf)

OUTPUT:
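
By default, where() replaces every entry that fails the condition with NaN. A minimal sketch of the optional other argument, applied to the age column alone; the replacement value 0 and the name filtered are assumptions made for illustration only:

```python
import pandas as pd

df = pd.DataFrame({"age": [50, 40, 30, 40, 20, 10, 30]})

# Keep ages above 30; replace the others with 0 instead of NaN
# (0 is an illustrative choice, not taken from the report)
filtered = df["age"].where(df["age"] > 30, other=0)
print(filtered.tolist())
```

Supplying other keeps the column free of missing values, which can be convenient when later steps expect integers.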


Task 9: Create Box Plot

INPUT:
import matplotlib.pyplot as plt
import numpy as np
np.random.seed(10)
data_1 = np.random.normal(100, 10, 200)
data_2 = np.random.normal(90, 20, 200)
data_3 = np.random.normal(80, 30, 200)
data_4 = np.random.normal(70, 40, 200)
data = [data_1, data_2, data_3, data_4]
fig = plt.figure(figsize =(10, 7))
ax = fig.add_axes([0, 0, 1, 1])
bp = ax.boxplot(data)
plt.show()

OUTPUT:


Task 10: Create Histogram

INPUT:
import matplotlib.pyplot as plt
import numpy as np
from matplotlib import colors
from matplotlib.ticker import PercentFormatter
np.random.seed(23685752)
N_points = 10000
n_bins = 20
x = np.random.randn(N_points)
y = .8 ** x + np.random.randn(10000) + 25
fig, axs = plt.subplots(1, 1,
figsize =(6, 5),
tight_layout = True)
axs.hist(x, bins = n_bins)
plt.show()


OUTPUT:

Task 11: Create Pivot Table

INPUT:
import pandas as pd
df = pd.DataFrame({'Product' : ['Carrots', 'Broccoli', 'Banana', 'Banana',
'Beans', 'Orange', 'Broccoli', 'Banana'],
'Category' : ['Vegetable', 'Vegetable', 'Fruit', 'Fruit',
'Vegetable', 'Fruit', 'Vegetable', 'Fruit'],
'Quantity' : [8, 5, 3, 4, 5, 9, 11, 8],
'Amount' : [270, 239, 617, 384, 626, 610, 62, 90]})
pivot = df.pivot_table(index =['Product'],
values =['Amount'],
aggfunc ='sum')
print(pivot)

OUTPUT:


Task 12: Create Heatmap

INPUT:
import numpy as np
import seaborn as sns
import matplotlib.pyplot as plt
data_set = np.random.rand( 10 , 10 )
ax = sns.heatmap( data_set , linewidth = 0.5 , cmap = 'coolwarm' )
plt.title( "2-D Heat Map" )
plt.show()

OUTPUT:


Task 13: Demonstrating mean() in a Python program

INPUT:
import statistics
data1 = [1, 3, 4, 5, 7, 9, 2]
x = statistics.mean(data1)
print("Mean is :", x)

OUTPUT:


Task 14: Demonstrating standard deviation in a Python program

INPUT:
import statistics
sample = [1, 2, 3, 4, 5]
print("Standard Deviation of sample is % s " % (statistics.stdev(sample)))

OUTPUT:
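
statistics.stdev() computes the sample standard deviation, which divides by n - 1. A minimal sketch contrasting it with statistics.pstdev(), the population version that divides by n, on the same data; the variable names are illustrative:

```python
import statistics

sample = [1, 2, 3, 4, 5]

sample_sd = statistics.stdev(sample)       # divides by n - 1
population_sd = statistics.pstdev(sample)  # divides by n

print("Sample standard deviation:", sample_sd)
print("Population standard deviation:", population_sd)
```

For this list the squared deviations from the mean sum to 10, so the sample value is the square root of 10/4 and the population value is the square root of 10/5.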


Task 15: Calculating skewness and kurtosis in Python

INPUT:

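import scipy
from scipy.stats import kurtosis, skew

# Data listed next to the kurtosis import
kurtosis_data = [10, 25, 14, 26, 35, 45, 67, 90,
                 40, 50, 60, 10, 16, 18, 20]
# Data listed next to the skew import
skew_data = [88, 85, 82, 97, 67, 77, 74, 86,
             81, 95, 77, 88, 85, 76, 81]

print("SKEWNESS")
print(skew(skew_data, axis=0, bias=True))
print("KURTOSIS")
# Assumption: the listing breaks off after the label above; this call
# mirrors the skew() call and uses the first data list
print(kurtosis(kurtosis_data, axis=0, bias=True))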

OUTPUT:
