0% found this document useful (0 votes)

52 views18 pages

FDS Record-1-4

The document outlines a laboratory exercise at Panimalar Engineering College for exploring Python packages such as NumPy, SciPy, Jupyter, Statsmodels, and Pandas. It includes installation instructions, feature exploration, and sample code demonstrating array creation, operations, and data manipulation. The exercise aims to familiarize students with data science tools and their functionalities.

Uploaded by

shobanasofficial

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

52 views18 pages

FDS Record-1-4

Uploaded by

shobanasofficial

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 18

PANIMALAR ENGINEERING COLLEGE

DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

REG.NO:
DATE :
EX.NO : 01
Download, install and explore the features of NumPy, SciPy, Jupyter, Statsmodels and
Pandas packages.

AIM:
To install and explore the features of NumPy, SciPy, Jupyter, Statsmodels and Pandas
packages.
Procedure:
1. Installation
NumPy, SciPy, Jupyter, Statsmodels, and Pandas can be easily installed using Python's package
manager, pip. Open a terminal or command prompt and type the following commands one by one:
● pip install numpy
● pip install scipy
● pip install jupyter
● pip install statsmodels
● pip install pandas
Explore the Features:
● NumPy: NumPy is a fundamental package for scientific computing with Python. It
provides support for arrays, matrices, and high-level mathematical functions to operate on
these arrays.
● SciPy: SciPy is built on top of NumPy and provides additional functionality for scientific
computing. It includes modules for optimization, integration, interpolation, linear algebra,
and more.
● Jupyter: Jupyter is a web-based interactive computing platform that allows you to create
and share documents containing live code, equations, visualizations, and narrative text.
● Statsmodels: Statsmodels is a Python module that provides classes and functions for
estimating many different statistical models, as well as for conducting statistical tests and
exploring data.

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

PANIMALAR ENGINEERING COLLEGE
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

● Pandas: Pandas is a powerful data analysis and manipulation library for Python. It
provides data structures like Series and DataFrame, which are ideal for working with
structured data.
2. Launch Jupyter Notebook.
3. Explore the Features
● Create an array
● Perform element-wise operations
● Basic statistical functions

NUMPY
PROGRAM:
import numpy as np
print("===== 1. Array Creation =====")
arr1 = np.array([1, 2, 3, 4])
print("Array from list:", arr1)
arr2 = np.zeros((2, 3))
print("Array of zeros:\n", arr2)
arr3 = np.ones((3, 2))
print("Array of ones:\n", arr3)
arr4 = np.arange(0, 10, 2) # [0, 2, 4, 6, 8]
print("Array with range:", arr4)
arr5 = np.linspace(0, 1, 5) # [0. , 0.25, 0.5 , 0.75, 1.]
print("Array with linspace:", arr5)
print("\n===== 2. Array Operations =====")
arr6 = np.array([1, 2, 3, 4])
arr7 = np.array([5, 6, 7, 8])
sum_arr = arr6 + arr7
print("Array addition:", sum_arr)
prod_arr = arr6 * arr7
print("Array multiplication:", prod_arr)
23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY
PANIMALAR ENGINEERING COLLEGE
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

exp_arr = arr6 ** 2
print("Array exponentiation:", exp_arr)
print("\n===== 3. Indexing and Slicing =====")
# Creating a 2D array
matrix = np.array([[1, 2, 3], [4, 5, 6], [7, 8, 9]])
# Indexing
element = matrix[1, 2]
print("Element at [1, 2]:", element)
# Slicing: Extracting subarrays
sub_matrix = matrix[0:2, 1:3]
print("Sub-matrix:\n", sub_matrix)
# Boolean indexing
mask = matrix > 5
print("Elements greater than 5:\n", matrix[mask])
print("\n===== 4. Broadcasting =====")
arr8 = np.array([1, 2, 3])
arr9 = np.array([[10], [20], [30]]) # Shape (3, 1)
broadcasted_result = arr8 + arr9
print("Broadcasted result:\n", broadcasted_result)
print("\n===== 5. Linear Algebra =====")
a = np.array([[1, 2], [3, 4]])
b = np.array([[5, 6], [7, 8]])
# Matrix multiplication
matmul_result = np.dot(a, b)
print("Matrix multiplication result:\n", matmul_result)
# Determinant of a matrix
det_a = np.linalg.det(a)
print("Determinant of matrix a:", det_a)

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

PANIMALAR ENGINEERING COLLEGE
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

# ===== 6. Statistical Operations =====

print("\n===== 6. Statistical Operations =====")
arr10 = np.array([1, 2, 3, 4, 5])
mean_val = np.mean(arr10)
print("Mean:", mean_val)
std_val = np.std(arr10)
print("Standard Deviation:", std_val)
median_val = np.median(arr10)
print("Median:", median_val)
OUTPUT:
===== 1. Array Creation =====
Array from list: [1 2 3 4]
Array of zeros:
[[0. 0. 0.]
[0. 0. 0.]]
Array of ones:
[[1. 1.]
[1. 1.]
[1. 1.]]
Array with range: [0 2 4 6 8]
Array with linspace: [0. 0.25 0.5 0.75 1. ]
===== 2. Array Operations =====
Array addition: [ 6 8 10 12]
Array multiplication: [ 5 12 21 32]
Array exponentiation: [ 1 4 9 16]
===== 3. Indexing and Slicing =====
Element at [1, 2]: 6
Sub-matrix:

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

PANIMALAR ENGINEERING COLLEGE
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

[[2 3]
[5 6]]
Elements greater than 5:
[6 7 8 9]
===== 4. Broadcasting =====
Broadcasted result:
[[11 12 13]
[21 22 23]
[31 32 33]]
===== 5. Linear Algebra =====
Matrix multiplication result:
[[19 22]
[43 50]]
Determinant of matrix a: -2.0
===== 6. Statistical Operations =====
Mean: 3.0
Standard Deviation: 1.4142135623730951
Median: 3.0
PANDAS
import pandas as pd
import numpy as np
print("===== 1. Create DataFrame =====")
# Creating a DataFrame from a dictionary
data = {
'Name': ['Alice', 'Bob', 'Charlie', 'David', 'Edward'],
'Age': [24, 27, 22, 32, 29],
'City': ['New York', 'Los Angeles', 'Chicago', 'Houston', 'Phoenix'],
'Salary': [70000, 80000, 120000, 90000, 100000]

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

PANIMALAR ENGINEERING COLLEGE
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

}
df = pd.DataFrame(data)
print("DataFrame created from a dictionary:\n", df)
print("\n===== 2. DataFrame Operations =====")
age_column = df['Age']
print("Age column:\n", age_column)
row_2 = df.iloc[2]
print("\nRow 2:\n", row_2)
row_label = df.loc[1] # 1 is the index label for Bob
print("\nRow with label 1 :\n", row_label)
print("\n===== 3. Filtering and Conditions =====")
filtered_df = df[df['Age'] > 25]
print("Filtered DataFrame (Age > 25):\n", filtered_df)
# Filtering using multiple conditions (Age > 25 and Salary < 100000)
filtered_df_multi_cond = df[(df['Age'] > 25) & (df['Salary'] < 100000)]
print("\nFiltered DataFrame (Age > 25 and Salary < 100000):\n", filtered_df_multi_cond)
print("\n===== 4. Summary Statistics =====")
summary_stats = df.describe()
print("Summary statistics of numeric columns:\n", summary_stats)
mean_salary = df['Salary'].mean()
print("\nMean Salary:", mean_salary)
max_salary = df['Salary'].max()
print("\nMaximum Salary:", max_salary)
print("\n===== 5. Grouping Data =====")
# Group by 'City' and calculate the mean salary for each city
grouped_by_city = df.groupby('City')['Salary'].mean()
print("Average Salary grouped by City:\n", grouped_by_city)
print("\n===== 6. Sorting Data =====")

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

PANIMALAR ENGINEERING COLLEGE
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

# Sorting the DataFrame by 'Salary' in descending order

sorted_by_salary = df.sort_values(by='Salary', ascending=False)
print("DataFrame sorted by Salary (descending):\n", sorted_by_salary)
# Sorting the DataFrame by 'Age' in ascending order
sorted_by_age = df.sort_values(by='Age', ascending=True)
print("\nDataFrame sorted by Age (ascending):\n", sorted_by_age)
print("\n===== 7. Adding and Removing Columns =====")
df['Experience'] = [2, 5, 1, 8, 4]
print("DataFrame with 'Experience' column added:\n", df)
df_dropped = df.drop(columns=['Experience'])
print("\nDataFrame after dropping 'Experience' column:\n", df_dropped)
print("\n===== 8. Merging DataFrames =====")
# Creating another DataFrame to merge with
data2 = {
'Name': ['Alice', 'Bob', 'Charlie', 'David', 'Edward'],
'Department': ['HR', 'IT', 'Finance', 'Marketing', 'Sales']
}
df2 = pd.DataFrame(data2)
# Merging the two DataFrames based on 'Name' column
merged_df = pd.merge(df, df2, on='Name')
print("Merged DataFrame:\n", merged_df)
print("\n===== 9. Handling Missing Data =====")
df_with_na = df.copy()
df_with_na.loc[1, 'Salary'] = np.nan # Introducing NaN for Bob's salary
print("DataFrame with missing data:\n", df_with_na)
# Fill missing data (for 'Salary' column, using the mean)
df_filled = df_with_na.fillna({'Salary': df['Salary'].mean()})
print("\nDataFrame after filling missing data:\n", df_filled)

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

PANIMALAR ENGINEERING COLLEGE
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

# Dropping rows with missing data

df_dropped_na = df_with_na.dropna()
print("\nDataFrame after dropping rows with missing data:\n", df_dropped_na)
OUTPUT
===== 1. Create DataFrame =====
DataFrame created from a dictionary:
Name Age City Salary
0 Alice 24 New York 70000
1 Bob 27 Los Angeles 80000
2 Charlie 22 Chicago 120000
3 David 32 Houston 90000
4 Edward 29 Phoenix 100000
===== 2. DataFrame Operations =====
Age column:
0 24
1 27
2 22
3 32
4 29
Name: Age, dtype: int64
Row 2:
Name Charlie
Age 22
City Chicago
Salary 120000
Name: 2, dtype: object
Row with label 1 (Bob):
Name Bob

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

PANIMALAR ENGINEERING COLLEGE
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

Age 27
City Los Angeles
Salary 80000
Name: 1, dtype: object
===== 3. Filtering and Conditions =====
Filtered DataFrame (Age > 25):
Name Age City Salary
1 Bob 27 Los Angeles 80000
3 David 32 Houston 90000
4 Edward 29 Phoenix 100000
Filtered DataFrame (Age > 25 and Salary < 100000):
Name Age City Salary
1 Bob 27 Los Angeles 80000
===== 4. Summary Statistics =====
Summary statistics of numeric columns:
Age Salary
count 5.000000 5.0
mean 26.800000 94000.0
std 3.774917 17124.1
min 22.000000 70000.0
25% 24.000000 80000.0
50% 27.000000 90000.0
75% 29.000000 100000.0
max 32.000000 120000.0
Mean Salary: 94000.0
Maximum Salary: 120000
===== 5. Grouping Data =====
Average Salary grouped by City:

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

PANIMALAR ENGINEERING COLLEGE
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

City
Chicag

RESULT:
Thus the program to explore the features of NumPy, SciPy, Jupyter, Statsmodels and Pandas
packages is executed.

REG.NO:
DATE:
EX.NO : 02
PROGRAM TO REMOVE ROWS IN NUMPY ARRAY THAT

CONTAINS NON-NUMERIC VALUES

AIM:
23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY
PANIMALAR ENGINEERING COLLEGE
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

The aim of the program is to remove rows from a NumPy array that contain non-
numeric values.

ALGORITHM:
● Import the numpy library to work with arrays.
● Create a Sample NumPy Array.
● Check for Non-Numeric Values.
● Use a vectorized approach np.vectorize()) to check whether each element is
numeric(either an integer or a float).
○ Filter the Rows Containing Only Numeric Values.
● Use logical indexing to identify the rows where all elements are numeric.
● Row Validation: np.all(mask, axis=1) ensures that only rows with all numeric
values are retained.
● Remove any rows where at least one element is non-numeric.
● Print both the original array and the cleaned array (with non-numeric rows
removed).
PROGRAM:
import numpy as np
data=np.array([[1,2,3],[4,’x’,6],[7,8,9],[‘a’,2,3],[10,11,12]])
mask=np.Vectorize(lambda X : isinstance(x,(int,float)))(data)
Valid_rows=np.all(mask,axis=1)
Cleaned_data=data[Valid_rows]
print(“Original Array:”)
print(data)
print(“\n Cleaned Array(rows with non-numeric values removed):”)
print(Cleaned_data)

OUTPUT:
Original Array:
23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY
PANIMALAR ENGINEERING COLLEGE
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

[[‘1’ ‘2’ ’3’]

[‘4’ ‘x’ ‘6’]
[‘7’ ‘8’ ‘9’]
[‘a’ ‘2’ ‘3’]
[‘10’ ‘11’ ‘12’]]
Cleaned Array (rows with non-numeric values removed):
[[‘1’ ‘2’ ‘3’]
[‘7’ ‘8’ ‘9’]
[‘10’ ‘11’ ‘12’]]

RESULT:
Thus the above program to remove rows in numpy array that contains non-numeric
values is executed.

REG.NO:
DATE:
EX.NO : 03
CREATE AN EMPTY & A FULL NUMPY ARRAY

AIM:
The aim of the program is to create and initialize NumPy arrays using two different
functions empty() and full().

ALGORITHM:
23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY
PANIMALAR ENGINEERING COLLEGE
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

● Install NumPy
● Import NumPy
● Creating an Empty Array using np.empty(shape)
● Creating a Full Array using np.full(shape, fill_value)
PROGRAM:
import numpy as np
empty_array=np.empty((3,3))
print(“Empty Array”)
print(empty_array)
full_array=np.full((3,3),7)
print(“Full Array”)
print(full_array)

OUTPUT:
Empty Array:
[[0.00000000e+000 1.77956813e-321 0.00000000e+000]
[6.93909653e-310 6.93909653e-310 0.00000000e+000]
[6.93909653e-310 6.93909653e-310 2.12199579e-314]]
Full Array:
[[7 7 7]
[7 7 7]

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

PANIMALAR ENGINEERING COLLEGE
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

[7 7 7]]

RESULT:
Thus the program to create and initialize NumPy arrays using two different functions empty()
and full() is executed.

REG.NO:
DATE:
ADDITIONAL PROGRAM : 01
CASE STUDY:
You are a data analyst working for a school discrete.the school wants to track the grades of
students in different subjects across multiple terms.you ask with a performing basic
operations of numpy.you have five grades in three subjects for two terms.
TASK TO PERFORM:
1.Increase the grade for all students by 5 points in each subjects.
2.Calculate the average grade for each students across all the subjects.

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

PANIMALAR ENGINEERING COLLEGE
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

3.Find the highest grade in each subject.

4.extract the grade of student 3.
5.Reshape the grade array so that the subject becomes rows and student becomes column.
6.Add 10 points to each student the math grade and 5 points to science and no change in
english.

AIM:
To analyze and track students' grades across different subjects and multiple terms using
NumPy. The analysis involves performing basic operations such as calculating the average,
maximum, and minimum grades for each subject and term.

ALGORITHM:
● Import Necessary Library:
Import NumPy to handle numerical computations efficiently.
● Define Grade Data:
Create a NumPy array to store students' grades.
The array should have dimensions corresponding to (students × subjects × terms).
● Perform Basic Operations:
Compute the average grade for each student in each term.
Find the highest and lowest grades in each subject across terms.
Calculate the overall average grade for each subject over both terms.
● Display the Results:
Print the processed data, including individual subject averages and term-wise
performance.

PROGRAM:
import numpy as np
grades = np.array([
[75, 80, 85],
[88, 76, 90],
[92, 85, 87],
[78, 88, 82],
23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY
PANIMALAR ENGINEERING COLLEGE
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

[85, 89, 84]

])
# 1. Increase all grades by 5 points
grades += 5
print("Grades after adding 5 points:")
print(grades)
# 2. Calculate the average grade for each student across all subjects
average_grades = np.mean(grades, axis=1)
print("\nAverage grade for each student:")
print(average_grades)
# 3. Find the highest grade in each subject
highest_grades = np.max(grades, axis=0)
print("\nHighest grade in each subject:")
print(highest_grades)
# 4. Extract grade of Student 3 (index 2 in 0-based indexing)
student_3_grades = grades[2]
print("\nGrades of Student 3:")
print(student_3_grades)
# 5. Reshape array so that subjects become rows and students become columns
reshaped_grades = grades.T
print("\nReshaped grades (Subjects as rows, Students as columns):")
print(reshaped_grades)
# 6. Add 10 points to Math, 5 to Science, and no change in English
grades[:, 0] += 10 # Math
grades[:, 1] += 5 # Science (English remains the same)
print("\nGrades after specific modifications:")
print(grades)

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

PANIMALAR ENGINEERING COLLEGE
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

OUTPUT:
Grades after adding 5 points:
[[80 85 90]
[93 81 95]
[97 90 92]
[83 93 87]
[90 94 89]]
Average grade for each student:
[85. 89.66666667 93. 87.66666667 91. ]
Highest grade in each subject:
[97 94 95]
Grades of Student 3:
[97 90 92]
Reshaped grades (Subjects as rows, Students as columns):
[[80 93 97 83 90]

[85 81 90 93 94]
[90 95 92 87 89]]
Grades after specific modifications:
[[ 90 90 90]
[103 86 95]
[107 95 92]
[ 93 98 87]
[100 99 89]]

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

PANIMALAR ENGINEERING COLLEGE
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

RESULT:
Thus the Student performance analysis is executed successfully.

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

NumPy and Pandas Basics Guide
No ratings yet
NumPy and Pandas Basics Guide
8 pages
NumPy and Pandas
No ratings yet
NumPy and Pandas
12 pages
Ilovepdf Merged (2) Merged
No ratings yet
Ilovepdf Merged (2) Merged
65 pages
Practicals 1 To 4
No ratings yet
Practicals 1 To 4
15 pages
DSC Lab Programs
No ratings yet
DSC Lab Programs
24 pages
Python Unit IV
No ratings yet
Python Unit IV
12 pages
FDS Lab
No ratings yet
FDS Lab
43 pages
Pandas & PyNumS Essentials
No ratings yet
Pandas & PyNumS Essentials
10 pages
Python Programming U5
No ratings yet
Python Programming U5
46 pages
NumPy and Pandas Tutorial
No ratings yet
NumPy and Pandas Tutorial
8 pages
ML Lab File Vijay Kumar
No ratings yet
ML Lab File Vijay Kumar
16 pages
Data Analysis Tools
No ratings yet
Data Analysis Tools
26 pages
Usage of NumPy For Numerical Data in Detail
No ratings yet
Usage of NumPy For Numerical Data in Detail
52 pages
FDS Final Manual
No ratings yet
FDS Final Manual
41 pages
Int254 Unit 2
No ratings yet
Int254 Unit 2
33 pages
ML Contenthalf
No ratings yet
ML Contenthalf
35 pages
Foundation of Data Science Lab Manual Full
No ratings yet
Foundation of Data Science Lab Manual Full
8 pages
DSA LAB Manual - Good Content
No ratings yet
DSA LAB Manual - Good Content
70 pages
Data Analysis Lab with Python
No ratings yet
Data Analysis Lab with Python
11 pages
NumPy and Pandas Step
No ratings yet
NumPy and Pandas Step
9 pages
EXP1-siddhant Gupta (23 - SE - 148)
No ratings yet
EXP1-siddhant Gupta (23 - SE - 148)
17 pages
Data Science
No ratings yet
Data Science
42 pages
Fds Merged
No ratings yet
Fds Merged
102 pages
Data Handling Module
No ratings yet
Data Handling Module
10 pages
Cheat Sheet: Python For Data Science
No ratings yet
Cheat Sheet: Python For Data Science
4 pages
Cheat Sheet: Python For Data Science
No ratings yet
Cheat Sheet: Python For Data Science
4 pages
AI & Data Science Lab Record
No ratings yet
AI & Data Science Lab Record
28 pages
Module 6 NumPY and Pandas
No ratings yet
Module 6 NumPY and Pandas
12 pages
DV Lab Manual Modified
No ratings yet
DV Lab Manual Modified
31 pages
Learninng Plan
No ratings yet
Learninng Plan
6 pages
Ai Programs
No ratings yet
Ai Programs
22 pages
Ds Lab-1
No ratings yet
Ds Lab-1
40 pages
Lab Manual
No ratings yet
Lab Manual
81 pages
Pandas Research
No ratings yet
Pandas Research
14 pages
Data Preprocessing
No ratings yet
Data Preprocessing
159 pages
Report
No ratings yet
Report
18 pages
Staff Manula 01
No ratings yet
Staff Manula 01
7 pages
ML Programs
No ratings yet
ML Programs
34 pages
ML Manual
No ratings yet
ML Manual
21 pages
CS3361-Data Science Lab Manual - B.rethina Kumar
No ratings yet
CS3361-Data Science Lab Manual - B.rethina Kumar
36 pages
Wa0000.
No ratings yet
Wa0000.
53 pages
Python Interviews
No ratings yet
Python Interviews
154 pages
Python Unit-5
No ratings yet
Python Unit-5
14 pages
FOD Record Sem 1
No ratings yet
FOD Record Sem 1
25 pages
Practical 1
No ratings yet
Practical 1
5 pages
Dav 2 Unit
No ratings yet
Dav 2 Unit
55 pages
Python Data Science 101
100% (1)
Python Data Science 101
41 pages
Fds PDF
No ratings yet
Fds PDF
58 pages
Unit 3 (FODS)
No ratings yet
Unit 3 (FODS)
34 pages
DAP 3 Module
No ratings yet
DAP 3 Module
62 pages
Exp - 1 - Introduction To Data Analytics and Python Fundamentals - SDK - Ok
No ratings yet
Exp - 1 - Introduction To Data Analytics and Python Fundamentals - SDK - Ok
9 pages
Lab 3 & 4
No ratings yet
Lab 3 & 4
10 pages
Assignment 7
No ratings yet
Assignment 7
1 page
12 Ai Practical File
100% (2)
12 Ai Practical File
5 pages
AD3301 DEV Lab Manual
No ratings yet
AD3301 DEV Lab Manual
26 pages
Final Dev Record
No ratings yet
Final Dev Record
49 pages
Dsa Lab Manual
No ratings yet
Dsa Lab Manual
72 pages
ML Lab Manual
No ratings yet
ML Lab Manual
59 pages
Toc-Pedalogical Report
No ratings yet
Toc-Pedalogical Report
1 page
Ieee
No ratings yet
Ieee
1 page
Unit Iii
No ratings yet
Unit Iii
89 pages
Brainstorming Session On Next-Gen OS Mission
No ratings yet
Brainstorming Session On Next-Gen OS Mission
2 pages
DBMS - Unit 5
No ratings yet
DBMS - Unit 5
54 pages
21CS1411-DBMS Lab-Assignment-Marks
No ratings yet
21CS1411-DBMS Lab-Assignment-Marks
38 pages
21CS1503 Toc QB
No ratings yet
21CS1503 Toc QB
6 pages
FDS Rec 9-12
No ratings yet
FDS Rec 9-12
6 pages
FDS Record 5-8
No ratings yet
FDS Record 5-8
15 pages
Closure Properties of Context Free Languages
No ratings yet
Closure Properties of Context Free Languages
1 page
Unit 5-Undecidability
No ratings yet
Unit 5-Undecidability
17 pages
Computational Thinking With Python 2 Lab
No ratings yet
Computational Thinking With Python 2 Lab
12 pages
Praveen's Resume
No ratings yet
Praveen's Resume
1 page
Wolfram Mathematica With JupyterLab
No ratings yet
Wolfram Mathematica With JupyterLab
18 pages
Python String Slicing for O&G
No ratings yet
Python String Slicing for O&G
7 pages
QuantLib Python Guide
No ratings yet
QuantLib Python Guide
285 pages
Python by Example Book 1 (Fundamentals and Basics)
100% (1)
Python by Example Book 1 (Fundamentals and Basics)
57 pages
YOLOv8 v9 Dataset Setup 1
No ratings yet
YOLOv8 v9 Dataset Setup 1
219 pages
Python Strings
No ratings yet
Python Strings
12 pages
Python Programming Training
No ratings yet
Python Programming Training
2 pages
GLA Gen AI Jun-Jul'25 PDF
No ratings yet
GLA Gen AI Jun-Jul'25 PDF
6 pages
Marine Creatures Project Report
No ratings yet
Marine Creatures Project Report
8 pages
Python Notes
No ratings yet
Python Notes
54 pages
1.introduction To Machine Learning and Toolkit
No ratings yet
1.introduction To Machine Learning and Toolkit
102 pages
Azure HDInsight Spark Lab Guide
No ratings yet
Azure HDInsight Spark Lab Guide
29 pages
Pre-Training BERT From Scratch With Cloud TPU
No ratings yet
Pre-Training BERT From Scratch With Cloud TPU
11 pages
Python Lec 1
No ratings yet
Python Lec 1
29 pages
AI Interview Notes
No ratings yet
AI Interview Notes
11 pages
JupyterLab: Evolution of Notebooks
No ratings yet
JupyterLab: Evolution of Notebooks
22 pages
Python Setup for Beginners
No ratings yet
Python Setup for Beginners
12 pages
Python Content Manual (1) - 8
No ratings yet
Python Content Manual (1) - 8
1 page
Sample Project Report
No ratings yet
Sample Project Report
26 pages
Nbconvert Readthedocs Io en Latest
No ratings yet
Nbconvert Readthedocs Io en Latest
185 pages
PDF Basics of Python Programming 2nd Edition Dr. Pratiyush Guleria Download
100% (13)
PDF Basics of Python Programming 2nd Edition Dr. Pratiyush Guleria Download
89 pages
Week1a-Notes - Jupyter Notebook
No ratings yet
Week1a-Notes - Jupyter Notebook
9 pages
Chapter 1
No ratings yet
Chapter 1
85 pages
Region Proposal Object Detection With Opencv, Keras, and Tensorflow
No ratings yet
Region Proposal Object Detection With Opencv, Keras, and Tensorflow
20 pages
Handout 1 - Introduction To Setting Up Python
No ratings yet
Handout 1 - Introduction To Setting Up Python
49 pages
Lab 1.1-JupyterNotebook-TheBasics - MD
No ratings yet
Lab 1.1-JupyterNotebook-TheBasics - MD
2 pages
Jupyter Notebook Tutorial
No ratings yet
Jupyter Notebook Tutorial
23 pages
Cloud MD Simulations for Educators
No ratings yet
Cloud MD Simulations for Educators
13 pages

FDS Record-1-4

Uploaded by

FDS Record-1-4

Uploaded by

PANIMALAR ENGINEERING COLLEGE

DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

# ===== 6. Statistical Operations =====

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

# Sorting the DataFrame by 'Salary' in descending order

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

# Dropping rows with missing data

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

CONTAINS NON-NUMERIC VALUES

[[‘1’ ‘2’ ’3’]

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

3.Find the highest grade in each subject.

[85, 89, 84]

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

23AD1413 – FOUNDATIONS OF DATA SCIENCE LABORATORY

You might also like