0% found this document useful (0 votes)

20 views37 pages

Chapter4 3

This document provides an introduction to using pandas for data analysis, focusing on efficient iteration over DataFrames. It covers various methods for calculating statistics, such as win percentages and run differentials, and compares the performance of different iteration techniques including .iloc, .iterrows(), and .itertuples(). The document emphasizes the importance of vectorization and using NumPy for optimal performance in data manipulation.

Uploaded by

Ousmane

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views37 pages

Chapter4 3

Uploaded by

Ousmane

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 37

Intro to pandas

DataFrame iteration
W RITIN G EF F ICIEN T P YTH ON CODE

Logan Thomas
Senior Data Scientist, Protection
Engineering Consultants
pandas recap
See pandas overview in Intermediate Python for Data Science

Library used for data analysis

Main data structure is the DataFrame

Tabular data with labeled rows and columns

Built on top of the NumPy array structure

Chapter Objective:
Best practice for iterating over a pandas DataFrame

WRITING EFFICIENT PYTHON CODE

Baseball stats
import pandas as pd

baseball_df = pd.read_csv('baseball_stats.csv')
print(baseball_df.head())

Team League Year RS RA W G Playoffs

0 ARI NL 2012 734 688 81 162 0
1 ATL NL 2012 700 600 94 162 1
2 BAL AL 2012 712 705 93 162 1
3 BOS AL 2012 734 806 69 162 0
4 CHC NL 2012 613 759 61 162 0

WRITING EFFICIENT PYTHON CODE

Baseball stats
Team
0 ARI
1 ATL
2 BAL
3 BOS
4 CHC

WRITING EFFICIENT PYTHON CODE

Baseball stats
Team League Year RS RA W G Playoffs
0 ARI NL 2012 734 688 81 162 0
1 ATL NL 2012 700 600 94 162 1
2 BAL AL 2012 712 705 93 162 1
3 BOS AL 2012 734 806 69 162 0
4 CHC NL 2012 613 759 61 162 0

WRITING EFFICIENT PYTHON CODE

Calculating win percentage
import numpy as np

def calc_win_perc(wins, games_played):

win_perc = wins / games_played

return np.round(win_perc,2)

win_perc = calc_win_perc(50, 100)

print(win_perc)

0.5

WRITING EFFICIENT PYTHON CODE

Adding win percentage to DataFrame
win_perc_list = []

for i in range(len(baseball_df)):
row = baseball_df.iloc[i]

wins = row['W']
games_played = row['G']

win_perc = calc_win_perc(wins, games_played)

win_perc_list.append(win_perc)

baseball_df['WP'] = win_perc_list

WRITING EFFICIENT PYTHON CODE

Adding win percentage to DataFrame
print(baseball_df.head())

Team League Year RS RA W G Playoffs WP

0 ARI NL 2012 734 688 81 162 0 0.50
1 ATL NL 2012 700 600 94 162 1 0.58
2 BAL AL 2012 712 705 93 162 1 0.57
3 BOS AL 2012 734 806 69 162 0 0.43
4 CHC NL 2012 613 759 61 162 0 0.38

WRITING EFFICIENT PYTHON CODE

Iterating with .iloc
%%timeit
win_perc_list = []

for i in range(len(baseball_df)):
row = baseball_df.iloc[i]

wins = row['W']
games_played = row['G']

win_perc = calc_win_perc(wins, games_played)

win_perc_list.append(win_perc)

baseball_df['WP'] = win_perc_list

183 ms ± 1.73 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)

WRITING EFFICIENT PYTHON CODE

Iterating with .iterrows()
win_perc_list = []

for i,row in baseball_df.iterrows():

wins = row['W']
games_played = row['G']

win_perc = calc_win_perc(wins, games_played)

win_perc_list.append(win_perc)

baseball_df['WP'] = win_perc_list

WRITING EFFICIENT PYTHON CODE

Iterating with .iterrows()
%%timeit
win_perc_list = []

for i,row in baseball_df.iterrows():

wins = row['W']
games_played = row['G']

win_perc = calc_win_perc(wins, games_played)

win_perc_list.append(win_perc)

baseball_df['WP'] = win_perc_list

95.3 ms ± 3.57 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)

WRITING EFFICIENT PYTHON CODE

Practice DataFrame
iterating with
.iterrows()
W RITIN G EF F ICIEN T P YTH ON CODE
Another iterator
method: .itertuples()
W RITIN G EF F ICIEN T P YTH ON CODE

Logan Thomas
Senior Data Scientist, Protection
Engineering Consultants
Team wins data
print(team_wins_df)

Team Year W
0 ARI 2012 81
1 ATL 2012 94
2 BAL 2012 93
3 BOS 2012 69
4 CHC 2012 61
...

WRITING EFFICIENT PYTHON CODE

for row_tuple in team_wins_df.iterrows():
print(row_tuple)
print(type(row_tuple[1]))

(0, Team ARI

Year 2012
W 81
Name: 0, dtype: object)
<class 'pandas.core.series.Series'>

(1, Team ATL

Year 2012
W 94
Name: 1, dtype: object)
<class 'pandas.core.series.Series'>
...

WRITING EFFICIENT PYTHON CODE

Iterating with .itertuples()
for row_namedtuple in team_wins_df.itertuples():
print(row_namedtuple)

Pandas(Index=0, Team='ARI', Year=2012, W=81)

Pandas(Index=1, Team='ATL', Year=2012, W=94)
...

print(row_namedtuple.Index)

print(row_namedtuple.Team)

ATL

WRITING EFFICIENT PYTHON CODE

Comparing methods
%%timeit
for row_tuple in team_wins_df.iterrows():
print(row_tuple)

527 ms ± 41.1 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

%%timeit
for row_namedtuple in team_wins_df.itertuples():
print(row_namedtuple)

7.48 ms ± 243 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

WRITING EFFICIENT PYTHON CODE

for row_tuple in team_wins_df.iterrows():
print(row_tuple[1]['Team'])

ARI
ATL
...

for row_namedtuple in team_wins_df.itertuples():

print(row_namedtuple['Team'])

TypeError: tuple indices must be integers or slices, not str

for row_namedtuple in team_wins_df.itertuples():

print(row_namedtuple.Team)

ARI
ATL
...

WRITING EFFICIENT PYTHON CODE

Let's keep iterating!
W RITIN G EF F ICIEN T P YTH ON CODE
pandas alternative to
looping
W RITIN G EF F ICIEN T P YTH ON CODE

Logan Thomas
Senior Data Scientist, Protection
Engineering Consultants
print(baseball_df.head())

Team League Year RS RA W G Playoffs

0 ARI NL 2012 734 688 81 162 0
1 ATL NL 2012 700 600 94 162 1
2 BAL AL 2012 712 705 93 162 1
3 BOS AL 2012 734 806 69 162 0
4 CHC NL 2012 613 759 61 162 0

def calc_run_diff(runs_scored, runs_allowed):

run_diff = runs_scored - runs_allowed

return run_diff

WRITING EFFICIENT PYTHON CODE

Run differentials with a loop
run_diffs_iterrows = []

for i,row in baseball_df.iterrows():

run_diff = calc_run_diff(row['RS'], row['RA'])
run_diffs_iterrows.append(run_diff)

baseball_df['RD'] = run_diffs_iterrows
print(baseball_df)

Team League Year RS RA W G Playoffs RD

0 ARI NL 2012 734 688 81 162 0 46
1 ATL NL 2012 700 600 94 162 1 100
2 BAL AL 2012 712 705 93 162 1 7
...

WRITING EFFICIENT PYTHON CODE

pandas .apply() method
Takes a function and applies it to a DataFrame
Must specify an axis to apply ( 0 for columns; 1 for rows)

Can be used with anonymous functions ( lambda functions)

Example:

baseball_df.apply(

lambda row: calc_run_diff(row['RS'], row['RA']),

axis=1
)

WRITING EFFICIENT PYTHON CODE

Run differentials with .apply()
run_diffs_apply = baseball_df.apply(
lambda row: calc_run_diff(row['RS'], row['RA']),
axis=1)

baseball_df['RD'] = run_diffs_apply
print(baseball_df)

Team League Year RS RA W G Playoffs RD

0 ARI NL 2012 734 688 81 162 0 46
1 ATL NL 2012 700 600 94 162 1 100
2 BAL AL 2012 712 705 93 162 1 7
...

WRITING EFFICIENT PYTHON CODE

Comparing approaches
%%timeit
run_diffs_iterrows = []

for i,row in baseball_df.iterrows():

run_diff = calc_run_diff(row['RS'], row['RA'])
run_diffs_iterrows.append(run_diff)

baseball_df['RD'] = run_diffs_iterrows

86.8 ms ± 3 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)

WRITING EFFICIENT PYTHON CODE

Comparing approaches
%%timeit
run_diffs_apply = baseball_df.apply(
lambda row: calc_run_diff(row['RS'], row['RA']),
axis=1)

baseball_df['RD'] = run_diffs_apply

30.1 ms ± 1.75 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)

WRITING EFFICIENT PYTHON CODE

Let's practice using
pandas .apply()
method!
W RITIN G EF F ICIEN T P YTH ON CODE
Optimal pandas
iterating
W RITIN G EF F ICIEN T P YTH ON CODE

Logan Thomas
Senior Data Scientist, Protection
Engineering Consultants
pandas internals
Eliminating loops applies to using pandas as well

pandas is built on NumPy

Take advantage of NumPy array ef ciencies

WRITING EFFICIENT PYTHON CODE

print(baseball_df)

Team League Year RS RA W G Playoffs

0 ARI NL 2012 734 688 81 162 0
1 ATL NL 2012 700 600 94 162 1
2 BAL AL 2012 712 705 93 162 1
...

wins_np = baseball_df['W'].values

print(type(wins_np))

print(wins_np)

[ 81 94 93 ...]

WRITING EFFICIENT PYTHON CODE

Power of vectorization
Broadcasting (vectorizing) is extremely ef cient!

baseball_df['RS'].values - baseball_df['RA'].values

array([ 46, 100, 7, ..., 188, 110, -117])

WRITING EFFICIENT PYTHON CODE

Run differentials with arrays
run_diffs_np = baseball_df['RS'].values - baseball_df['RA'].values

baseball_df['RD'] = run_diffs_np
print(baseball_df)

Team League Year RS RA W G Playoffs RD

0 ARI NL 2012 734 688 81 162 0 46
1 ATL NL 2012 700 600 94 162 1 100
2 BAL AL 2012 712 705 93 162 1 7
3 BOS AL 2012 734 806 69 162 0 -72
4 CHC NL 2012 613 759 61 162 0 -146
...

WRITING EFFICIENT PYTHON CODE

Comparing approaches
%%timeit
run_diffs_np = baseball_df['RS'].values - baseball_df['RA'].values

baseball_df['RD'] = run_diffs_np

124 µs ± 1.47 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)

WRITING EFFICIENT PYTHON CODE

Let's put our skills
into practice!
W RITIN G EF F ICIEN T P YTH ON CODE
Congratulations!
W RITIN G EF F ICIEN T P YTH ON CODE

Logan Thomas
Senior Data Scientist, Protection
Engineering Consultants
What you have learned
The de nition of ef cient and Pythonic code

How to use Python's powerful built-in library

The advantages of NumPy arrays

Some handy magic commands to pro le code

How to deploy ef cient solutions with zip() , itertools , collections , and set theory

The cost of looping and how to eliminate loops

Best practices for iterating with pandas DataFrames

WRITING EFFICIENT PYTHON CODE

Well done!
W RITIN G EF F ICIEN T P YTH ON CODE

Chapter 4
No ratings yet
Chapter 4
37 pages
Practical File Question 28.09.2022
No ratings yet
Practical File Question 28.09.2022
15 pages
Data 1542431842578
No ratings yet
Data 1542431842578
11 pages
Ip Practical File
No ratings yet
Ip Practical File
23 pages
DS - Lab Manual
No ratings yet
DS - Lab Manual
31 pages
PMA Experiment 3
No ratings yet
PMA Experiment 3
8 pages
Lab Mannual
No ratings yet
Lab Mannual
49 pages
Ip Practical File
No ratings yet
Ip Practical File
39 pages
Info Practical
No ratings yet
Info Practical
111 pages
Chapter 1
No ratings yet
Chapter 1
28 pages
Lab Manual Python Programming Language
No ratings yet
Lab Manual Python Programming Language
21 pages
Practical File 2024
No ratings yet
Practical File 2024
25 pages
CLASS XII - IP List of Practicals With Coding 2020
No ratings yet
CLASS XII - IP List of Practicals With Coding 2020
15 pages
Practical File Ip Class 12
No ratings yet
Practical File Ip Class 12
40 pages
FDSA Lab Manual 1
No ratings yet
FDSA Lab Manual 1
34 pages
Python Lab PRG
No ratings yet
Python Lab PRG
20 pages
Data Science Practical Problems
No ratings yet
Data Science Practical Problems
40 pages
Rufh 4
No ratings yet
Rufh 4
24 pages
Python Lab Programs
No ratings yet
Python Lab Programs
58 pages
Pandas Worksheet
No ratings yet
Pandas Worksheet
3 pages
Class 12 Ip Practical Programs 2024-25 Revised
No ratings yet
Class 12 Ip Practical Programs 2024-25 Revised
42 pages
Class 12 Ip Practical Exercises 2022-23 (Updated)
No ratings yet
Class 12 Ip Practical Exercises 2022-23 (Updated)
29 pages
Grade 12 Python Pandas Programs
No ratings yet
Grade 12 Python Pandas Programs
40 pages
Foundations For Efficiencies Writing Efficiency Code With Python
No ratings yet
Foundations For Efficiencies Writing Efficiency Code With Python
28 pages
Ipclass 12
No ratings yet
Ipclass 12
21 pages
Badri Project New 1
No ratings yet
Badri Project New 1
26 pages
Practical - With Solution - XII - IP
No ratings yet
Practical - With Solution - XII - IP
13 pages
017) Pandas - Batch 2 - Day 017
No ratings yet
017) Pandas - Batch 2 - Day 017
47 pages
Dataframe Programs
No ratings yet
Dataframe Programs
12 pages
Ip Practical
No ratings yet
Ip Practical
31 pages
Fundamentals of Data Science Lab Manual New
No ratings yet
Fundamentals of Data Science Lab Manual New
33 pages
Creating A Series Using Scalar Values
No ratings yet
Creating A Series Using Scalar Values
15 pages
Python API Lab: Pandas & NBA API
No ratings yet
Python API Lab: Pandas & NBA API
8 pages
National Public School: Name-Karan Choudhary Class-XII Subject - Informatics Practices (065) Board Roll No.
No ratings yet
National Public School: Name-Karan Choudhary Class-XII Subject - Informatics Practices (065) Board Roll No.
24 pages
School File Python (1) Manan (1) Final
No ratings yet
School File Python (1) Manan (1) Final
20 pages
Class X - A.I. - Practical Lab Manual - VVA 2024-25
No ratings yet
Class X - A.I. - Practical Lab Manual - VVA 2024-25
50 pages
12 - Ip Prac
No ratings yet
12 - Ip Prac
52 pages
Xii - Ip - Holiday HW
No ratings yet
Xii - Ip - Holiday HW
2 pages
11th PGM
No ratings yet
11th PGM
9 pages
DSF Lab Exp Full
No ratings yet
DSF Lab Exp Full
88 pages
Class XII Python & Matplotlib Guide
No ratings yet
Class XII Python & Matplotlib Guide
19 pages
24UAD315 DEV Final Record
No ratings yet
24UAD315 DEV Final Record
49 pages
Ip Cbse Practical File GR 12
No ratings yet
Ip Cbse Practical File GR 12
34 pages
Practical (Data Science)
No ratings yet
Practical (Data Science)
13 pages
Dsa Lab
No ratings yet
Dsa Lab
28 pages
Final Print
No ratings yet
Final Print
43 pages
12 IP Practical
No ratings yet
12 IP Practical
14 pages
Dsa Lab Record (Ai&Ds)
No ratings yet
Dsa Lab Record (Ai&Ds)
34 pages
Ankit Class 12 Practical File
No ratings yet
Ankit Class 12 Practical File
33 pages
Python Practical Questions
No ratings yet
Python Practical Questions
13 pages
Python Operators and Functions Guide
No ratings yet
Python Operators and Functions Guide
21 pages
Xii Ip Sample Practical File 2022 23 2
No ratings yet
Xii Ip Sample Practical File 2022 23 2
30 pages
Practical File 2024-25
No ratings yet
Practical File 2024-25
25 pages
Ip Project Work 2
No ratings yet
Ip Project Work 2
52 pages
Class-12-Ip-Practical - Old
No ratings yet
Class-12-Ip-Practical - Old
42 pages
I.P Practical
No ratings yet
I.P Practical
12 pages
Ilovepdf Merged (2) Merged
No ratings yet
Ilovepdf Merged (2) Merged
65 pages
Chapter 1
No ratings yet
Chapter 1
28 pages
Your Pathway To Success 6 Typical Competence Profiles
No ratings yet
Your Pathway To Success 6 Typical Competence Profiles
8 pages
3 - Stockprice Workflow Resource
No ratings yet
3 - Stockprice Workflow Resource
2 pages
Safety Guide For The Mining Industry 1
No ratings yet
Safety Guide For The Mining Industry 1
18 pages
NQA IMS Quote Request Form UK (2) 250528 112838
No ratings yet
NQA IMS Quote Request Form UK (2) 250528 112838
15 pages
Unions
No ratings yet
Unions
5 pages
Product Sales
No ratings yet
Product Sales
1 page
Enrol Today - Diploma in ESG
No ratings yet
Enrol Today - Diploma in ESG
4 pages
Safety Audt VS Safety Inspection
No ratings yet
Safety Audt VS Safety Inspection
11 pages
SharePoint - Maven Certificate
No ratings yet
SharePoint - Maven Certificate
3 pages
Pass 99 VENTIUS INTERNATIONAL 2021 en
No ratings yet
Pass 99 VENTIUS INTERNATIONAL 2021 en
1 page
PURE Project Manager® Syllabus 2025
No ratings yet
PURE Project Manager® Syllabus 2025
54 pages
30 - Sales-Procedure-Template
100% (1)
30 - Sales-Procedure-Template
2 pages
CSRD Professional 2025 - Course Brochure
No ratings yet
CSRD Professional 2025 - Course Brochure
18 pages
ISO 45001 2018 Legal and Other Requirements Register Sample
100% (6)
ISO 45001 2018 Legal and Other Requirements Register Sample
5 pages
Chapter2 2
No ratings yet
Chapter2 2
51 pages
35 - Warehousing-Procedure
No ratings yet
35 - Warehousing-Procedure
2 pages
26 NVQ Level 7 Web
No ratings yet
26 NVQ Level 7 Web
1 page
Chapter3 11
No ratings yet
Chapter3 11
58 pages
01 01 25 Updated CV Samir Bougueroua For Panel Operator
No ratings yet
01 01 25 Updated CV Samir Bougueroua For Panel Operator
3 pages
Two-Way Screenarray Cinema Loudspeakers: Key Features
No ratings yet
Two-Way Screenarray Cinema Loudspeakers: Key Features
3 pages
High-Frequency Trading Insights
No ratings yet
High-Frequency Trading Insights
4 pages
API 579 Fitness For Service Using INSPECT - Codeware
No ratings yet
API 579 Fitness For Service Using INSPECT - Codeware
4 pages
Byte Filling Function Guide
No ratings yet
Byte Filling Function Guide
3 pages
Friction and Automobile Tires
No ratings yet
Friction and Automobile Tires
3 pages
Hind Swaraj
No ratings yet
Hind Swaraj
16 pages
Philippines-Japan Local Administration Seminar 2015
No ratings yet
Philippines-Japan Local Administration Seminar 2015
62 pages
Fuji Xerox PCL 6 Driver License
No ratings yet
Fuji Xerox PCL 6 Driver License
14 pages
CSI - 24 Charcha-ae-Celebal PPT Script
No ratings yet
CSI - 24 Charcha-ae-Celebal PPT Script
5 pages
Ques
No ratings yet
Ques
3 pages
Ang Lakas Ay Daig NG Paraan
No ratings yet
Ang Lakas Ay Daig NG Paraan
5 pages
Assignment 2 (2013)
No ratings yet
Assignment 2 (2013)
1 page
Lab TOC Analyser Guide
No ratings yet
Lab TOC Analyser Guide
12 pages
w5-ITT565-Lecture5-UFUTURE Configure Routing and Remote Access
No ratings yet
w5-ITT565-Lecture5-UFUTURE Configure Routing and Remote Access
31 pages
GroupActivity G3
No ratings yet
GroupActivity G3
3 pages
Better Catalogue 2024
No ratings yet
Better Catalogue 2024
33 pages
iSolarCloud WEB 3.0 User Manual
No ratings yet
iSolarCloud WEB 3.0 User Manual
74 pages
Dental Age Estimation of 6-15 Year Old Indian Children Using Demirjian Method
No ratings yet
Dental Age Estimation of 6-15 Year Old Indian Children Using Demirjian Method
4 pages
【穿越─正義】策展論述 (長版) 英文
No ratings yet
【穿越─正義】策展論述 (長版) 英文
14 pages
Test Instructions For Applicants
100% (1)
Test Instructions For Applicants
5 pages
Emerging Trends in Philippine Literature: Creative Nonfiction
100% (1)
Emerging Trends in Philippine Literature: Creative Nonfiction
14 pages
The Figures in The Margin Indicate Full Marks. Candidates Are Required To Write Their Answers in Their Own Words As Far As Practicable
No ratings yet
The Figures in The Margin Indicate Full Marks. Candidates Are Required To Write Their Answers in Their Own Words As Far As Practicable
3 pages
Theology's Historical Challenges
No ratings yet
Theology's Historical Challenges
15 pages
EEPROM 24LC512 - 21754e
No ratings yet
EEPROM 24LC512 - 21754e
26 pages
Indigenous Landscapes and Spanish Missions New Perspectives From Archaeology and Ethnohistory 1st Edition Lee Panich PDF Download
100% (7)
Indigenous Landscapes and Spanish Missions New Perspectives From Archaeology and Ethnohistory 1st Edition Lee Panich PDF Download
59 pages
28 Day Shred Day05
No ratings yet
28 Day Shred Day05
2 pages
Reported Speecg
No ratings yet
Reported Speecg
29 pages
Black Francophone Power Dynamics
100% (8)
Black Francophone Power Dynamics
22 pages
Compilation - of - Swimming - Officials - Questions - and - Answers 2
No ratings yet
Compilation - of - Swimming - Officials - Questions - and - Answers 2
13 pages

Chapter4 3

Uploaded by

Chapter4 3

Uploaded by

Intro to pandas

Library used for data analysis

Main data structure is the DataFrame

Built on top of the NumPy array structure

WRITING EFFICIENT PYTHON CODE

Team League Year RS RA W G Playoffs

WRITING EFFICIENT PYTHON CODE

WRITING EFFICIENT PYTHON CODE

WRITING EFFICIENT PYTHON CODE

def calc_win_perc(wins, games_played):

win_perc = wins / games_played

win_perc = calc_win_perc(50, 100)

WRITING EFFICIENT PYTHON CODE

win_perc = calc_win_perc(wins, games_played)

WRITING EFFICIENT PYTHON CODE

Team League Year RS RA W G Playoffs WP

WRITING EFFICIENT PYTHON CODE

win_perc = calc_win_perc(wins, games_played)

WRITING EFFICIENT PYTHON CODE

for i,row in baseball_df.iterrows():

win_perc = calc_win_perc(wins, games_played)

WRITING EFFICIENT PYTHON CODE

for i,row in baseball_df.iterrows():

win_perc = calc_win_perc(wins, games_played)

WRITING EFFICIENT PYTHON CODE

WRITING EFFICIENT PYTHON CODE

(0, Team ARI

(1, Team ATL

WRITING EFFICIENT PYTHON CODE

Pandas(Index=0, Team='ARI', Year=2012, W=81)

WRITING EFFICIENT PYTHON CODE

WRITING EFFICIENT PYTHON CODE

for row_namedtuple in team_wins_df.itertuples():

TypeError: tuple indices must be integers or slices, not str

for row_namedtuple in team_wins_df.itertuples():

WRITING EFFICIENT PYTHON CODE

Team League Year RS RA W G Playoffs

def calc_run_diff(runs_scored, runs_allowed):

run_diff = runs_scored - runs_allowed

WRITING EFFICIENT PYTHON CODE

for i,row in baseball_df.iterrows():

Team League Year RS RA W G Playoffs RD

WRITING EFFICIENT PYTHON CODE

Can be used with anonymous functions ( lambda functions)

lambda row: calc_run_diff(row['RS'], row['RA']),

WRITING EFFICIENT PYTHON CODE

Team League Year RS RA W G Playoffs RD

WRITING EFFICIENT PYTHON CODE

for i,row in baseball_df.iterrows():

86.8 ms ± 3 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)

WRITING EFFICIENT PYTHON CODE

WRITING EFFICIENT PYTHON CODE

pandas is built on NumPy

WRITING EFFICIENT PYTHON CODE

Team League Year RS RA W G Playoffs

WRITING EFFICIENT PYTHON CODE

array([ 46, 100, 7, ..., 188, 110, -117])

WRITING EFFICIENT PYTHON CODE

Team League Year RS RA W G Playoffs RD

WRITING EFFICIENT PYTHON CODE

WRITING EFFICIENT PYTHON CODE

How to use Python's powerful built-in library

The advantages of NumPy arrays

Some handy magic commands to pro le code

The cost of looping and how to eliminate loops

Best practices for iterating with pandas DataFrames

WRITING EFFICIENT PYTHON CODE

You might also like