Performance

This document discusses performance metrics for computer systems. It covers execution time, throughput, and component metrics, and how to measure and analyze performance properly. Key points include what makes a good metric, how to keep experiments reproducible, which metrics to avoid (such as MIPS), and how averages should be calculated.

Uploaded by

bijan shrestha


Performance Metrics

Why study performance metrics?

• determine the benefit/lack of benefit of designs
• computer design is too complex to intuit performance &
performance bottlenecks
• have to be careful about what you mean to measure & how
you measure it

What you should get out of this discussion

• good metrics for measuring computer performance
• what they should be used for
• what metrics you shouldn’t use & how metrics are misused

perf
Performance of Computer Systems
Many different factors to take into account when determining
performance:
• Technology
   • circuit speed (clock, MHz)
   • processor technology (how many transistors on a chip)
• Organization
   • type of processor (ILP)
   • configuration of the memory hierarchy
   • type of I/O devices
   • number of processors in the system
• Software
   • quality of the compilers
   • organization & quality of OS, databases, etc.

“Principles” of Experimentation

Meaningful metrics
execution time & component metrics that explain it

Reproducibility
machine configuration, compiler & optimization level, OS, input

Real programs
no toys, kernels, synthetic programs
SPEC is the norm (integer, floating point, graphics, webserver)
TPC-B, TPC-C & TPC-D for database transactions

Simulation
long executions, warm start to mimic steady-state behavior
usually applications only; some OS simulation
simulator “validation” & internal checks for accuracy

Metrics that Measure Performance
Raw speed: peak performance (never attained)

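Execution time: time to execute one program from beginning to end
• the “performance bottom line”
• wall clock time, response time
• Unix time command: 13.7u 23.6s 18:27 3%
  (13.7 s user CPU time, 23.6 s system CPU time, 18:27 elapsed, 3% of the elapsed time spent on the CPU)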
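
The split between elapsed time and CPU time can also be observed from inside a program. The following is a minimal Python sketch, not part of the original slides: `busy_work` is a hypothetical workload, `time.perf_counter` supplies wall-clock time, and `os.times` supplies the user and system CPU seconds charged to the process.

```python
import os
import time


def busy_work(n):
    """Hypothetical CPU-bound workload, used only for illustration."""
    total = 0
    for i in range(n):
        total += i * i
    return total


def measure(func, *args):
    """Return (user, system, elapsed, cpu_fraction) for one call of func."""
    cpu_before = os.times()
    wall_before = time.perf_counter()
    func(*args)
    wall_after = time.perf_counter()
    cpu_after = os.times()

    user = cpu_after.user - cpu_before.user        # CPU seconds in user mode
    system = cpu_after.system - cpu_before.system  # CPU seconds in the kernel
    elapsed = wall_after - wall_before             # wall-clock seconds
    cpu_fraction = (user + system) / elapsed if elapsed > 0 else 0.0
    return user, system, elapsed, cpu_fraction


if __name__ == "__main__":
    u, s, e, frac = measure(busy_work, 200_000)
    print(f"{u:.2f}u {s:.2f}s {e:.2f} elapsed {frac:.0%}")
```

A low CPU fraction, like the 3% in the example line, suggests the program spent most of its elapsed time waiting (I/O, other processes) rather than computing.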

Throughput: total amount of work completed in a given time

• transactions (database) or packets (web servers) / second
• an indication of how well hardware resources are being used
• good metrics for chip designers or managers of computer
systems

(Often improving execution time will improve throughput & vice versa.)

Component metrics: subsystem performance, e.g., memory behavior
• help explain how execution time was obtained
• pinpoint performance bottlenecks

Execution Time

Performance_A = 1 / ExecutionTime_A

Processor A is faster than processor B, i.e.,

ExecutionTime_A < ExecutionTime_B

Performance_A > Performance_B

Relative Performance

Performance_A / Performance_B = ExecutionTime_B / ExecutionTime_A = n

performance of A is n times that of B

execution time of B is n times that of A
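
As a quick check of these definitions, here is a small Python sketch. The function names and the 10 s and 15 s timings are illustrative assumptions, not values from the slides.

```python
def performance(execution_time_s):
    """Performance is defined as the reciprocal of execution time."""
    return 1.0 / execution_time_s


def relative_performance(time_a_s, time_b_s):
    """Return n such that A's performance is n times B's."""
    return performance(time_a_s) / performance(time_b_s)


if __name__ == "__main__":
    # Hypothetical measurements: A takes 10 s, B takes 15 s on the same program.
    n = relative_performance(10.0, 15.0)
    print(f"A is about {n:.2f} times as fast as B")
```

The ratio of performances equals the inverse ratio of execution times, so the same n can be computed directly as `time_b_s / time_a_s`.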

CPU Execution Time
The time the CPU spends executing an application
• no memory effects
• no I/O
• no effects of multiprogramming

CPUExecutionTime = CPUClockCycles * ClockCycleTime

The clock is specified either as a cycle time (clock period) or as a rate (frequency)

• clock cycle time = 1 / clock cycle rate

CPUExecutionTime = CPUClockCycles / ClockCycleRate

• clock cycle rate of 1 MHz = cycle time of 1 µs

• clock cycle rate of 1 GHz = cycle time of 1 ns
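
The two forms of the equation give the same answer, which a short Python sketch can confirm. The function names and the cycle count below are assumptions chosen for illustration.

```python
def cpu_time_from_cycle_time(cpu_clock_cycles, clock_cycle_time_s):
    """CPUExecutionTime = CPUClockCycles * ClockCycleTime."""
    return cpu_clock_cycles * clock_cycle_time_s


def cpu_time_from_rate(cpu_clock_cycles, clock_rate_hz):
    """CPUExecutionTime = CPUClockCycles / ClockCycleRate."""
    return cpu_clock_cycles / clock_rate_hz


if __name__ == "__main__":
    cycles = 2_000_000_000   # hypothetical cycle count for one program
    rate_hz = 1e9            # 1 GHz
    cycle_time_s = 1 / rate_hz  # 1 ns
    print(cpu_time_from_rate(cycles, rate_hz))
    print(cpu_time_from_cycle_time(cycles, cycle_time_s))
```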

CPI

CPUClockCycles = NumberOfInstructions * CPI

Average number of clock cycles per instruction

• throughput metric
• component metric, not a measure of performance
• used for processor organization studies, given a fixed compiler
& ISA

Can have different CPIs for classes of instructions

e.g., floating point instructions take longer than integer
instructions

$$\text{CPUClockCycles} = \sum_{i=1}^{n} (\text{CPI}_i \times C_i)$$

where CPI_i = CPI for a particular class of instructions

where C_i = the number of instructions of the ith class that have been executed

Improving part of the architecture can improve a CPI_i

• Talk about the contribution to CPI of a class of instructions
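
To make the per-class sum concrete, here is a Python sketch. The instruction mix (class names, CPIs, and counts) is invented for illustration and does not come from the slides.

```python
# Hypothetical instruction mix: class name -> (CPI_i, C_i).
MIX = {
    "integer": (1.0, 600),
    "load/store": (2.0, 300),
    "floating point": (4.0, 100),
}


def cpu_clock_cycles(mix):
    """Sum of CPI_i * C_i over all instruction classes."""
    return sum(cpi * count for cpi, count in mix.values())


def average_cpi(mix):
    """Total cycles divided by total instructions executed."""
    instructions = sum(count for _, count in mix.values())
    return cpu_clock_cycles(mix) / instructions


def cpi_contribution(mix):
    """Each class's share of the average CPI (the shares sum to the average)."""
    instructions = sum(count for _, count in mix.values())
    return {name: cpi * count / instructions for name, (cpi, count) in mix.items()}


if __name__ == "__main__":
    print(cpu_clock_cycles(MIX))
    print(average_cpi(MIX))
    print(cpi_contribution(MIX))
```

In this made-up mix, floating point instructions are only 10% of the instructions but contribute a quarter of the average CPI, which is the kind of observation the per-class breakdown is meant to surface.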

CPU Execution Time

CPUExecutionTime =
NumberOfInstructions * CPI * ClockCycleTime

To measure:
• execution time: depends on all 3 factors
   • time the program
• number of instructions: determined by the ISA
   • programmable hardware counters
   • profiling
      • count number of times each basic block is executed
   • instruction sampling
• CPI: determined by the ISA & implementation
   • simulator: interpret (in software) every instruction &
     calculate the number of cycles it takes to simulate it
• clock cycle time: determined by the implementation & process
  technology

Factors are interdependent:

• RISC: increases instructions/program, but decreases CPI &
clock cycle time because the instructions are simple
• CISC: decreases instructions/program, but increases CPI &
clock cycle time because many instructions are more complex
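
The trade-off can be explored numerically. In the Python sketch below, every number is a made-up assumption chosen only to show the three factors pulling in different directions; it is not a claim about any real RISC or CISC machine.

```python
def cpu_execution_time(instruction_count, cpi, clock_cycle_time_s):
    """CPUExecutionTime = NumberOfInstructions * CPI * ClockCycleTime."""
    return instruction_count * cpi * clock_cycle_time_s


# Hypothetical designs running the same source program.
DESIGNS = {
    # more instructions, lower CPI, shorter cycle
    "RISC-like": {"instruction_count": 1.3e9, "cpi": 1.2, "clock_cycle_time_s": 1.0e-9},
    # fewer instructions, higher CPI, longer cycle
    "CISC-like": {"instruction_count": 1.0e9, "cpi": 2.0, "clock_cycle_time_s": 1.25e-9},
}

if __name__ == "__main__":
    for name, factors in DESIGNS.items():
        print(f"{name}: {cpu_execution_time(**factors):.2f} s")
```

With different invented numbers the ranking could flip, which is why only the product of the three factors, execution time, is a reliable basis for comparison.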

Metrics Not to Use
MIPS (millions of instructions per second)
instruction count / (execution time * 10^6) =
clock rate / (CPI * 10^6)
- instruction set-dependent (even true for similar architectures)
- implementation-dependent
- compiler technology-dependent
- program-dependent
+ intuitive: the higher, the better
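
A small Python sketch shows how MIPS can rank two machines in the opposite order from execution time. The machine names, instruction counts, and CPIs are hypothetical, and both machines are assumed to run at the same 1 GHz clock.

```python
def execution_time(instruction_count, cpi, clock_rate_hz):
    """Execution time in seconds from the three basic factors."""
    return instruction_count * cpi / clock_rate_hz


def mips(instruction_count, execution_time_s):
    """MIPS = instruction count / (execution time * 10^6)."""
    return instruction_count / (execution_time_s * 1e6)


# Hypothetical: the same program compiled for two different instruction sets.
MACHINES = {
    "X": {"instruction_count": 2.0e9, "cpi": 1.0, "clock_rate_hz": 1e9},
    "Y": {"instruction_count": 1.0e9, "cpi": 1.5, "clock_rate_hz": 1e9},
}

if __name__ == "__main__":
    for name, m in MACHINES.items():
        t = execution_time(**m)
        print(f"{name}: {t:.2f} s, {mips(m['instruction_count'], t):.0f} MIPS")
```

In this example X reports the higher MIPS rating yet takes longer to finish, because it executes twice as many (simpler) instructions to do the same work.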

MFLOPS (millions of floating point operations per second)

floating point operations / (execution time * 10^6)
+ FP operations are independent of FP instruction
implementation
- different machines implement different FP operations
- different FP operations take different amounts of time
- only measures FP code

static metrics (code size)

Means
Measuring the performance of a workload
• arithmetic: used for averaging execution times

$$\left( \sum_{i=1}^{n} \text{time}_i \right) \times \frac{1}{n}$$

• harmonic: used for averaging rates ("the average of", as
opposed to "the average statistic of")

$$\frac{p}{\sum_{i=1}^{p} \frac{1}{\text{rate}_i}}$$

• weighted means: the programs are executed with different
frequencies, for example:

$$\left( \sum_{i=1}^{n} \text{time}_i \times \text{weight}_i \right) \times \frac{1}{n}$$

Means

Time (secs)

             FP Ops   Computer A   Computer B   Computer C
program 1    100      1            10           20
program 2    100      1000         100          20
total                 1001         110          40
arith mean            500.5        55           20

Rate (FLOPS)

             FP Ops   Computer A   Computer B   Computer C
program 1    100      100          10           5
program 2    100      .1           1            5
harm mean             .2           1.8          5
arith mean            50.1         5.5          5

Computer C is ~25 times faster than A when measuring execution time

Still true when measuring FLOPS (a rate) with the harmonic mean (5 vs. .2); the arithmetic mean of the rates would wrongly rank A first (50.1 vs. 5)
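
The comparison can be recomputed from the execution times in the table. This Python sketch assumes 100 FP operations per program, as in the table; the function and variable names are illustrative.

```python
def arithmetic_mean(values):
    """Sum of the values divided by their count."""
    return sum(values) / len(values)


def harmonic_mean(rates):
    """Count of the rates divided by the sum of their reciprocals."""
    return len(rates) / sum(1.0 / r for r in rates)


FP_OPS = 100  # FP operations in each program
TIMES = {"A": [1, 1000], "B": [10, 100], "C": [20, 20]}  # seconds per program

if __name__ == "__main__":
    for name, times in TIMES.items():
        rates = [FP_OPS / t for t in times]  # FLOPS per program
        print(
            f"{name}: mean time {arithmetic_mean(times):.1f} s, "
            f"harmonic mean rate {harmonic_mean(rates):.2f}, "
            f"arithmetic mean rate {arithmetic_mean(rates):.2f}"
        )
```

Because every program here performs the same number of operations, the harmonic mean of the rates equals total operations divided by total time, so it preserves the ranking given by execution time. The arithmetic mean of the rates does not.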

Speedup

Speedup = ExecutionTime_beforeImprovement / ExecutionTime_afterImprovement

Amdahl’s Law:
Performance improvement from speeding up a part of a
computer system is limited by the proportion of time the
enhancement is used.
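
Amdahl's Law is commonly written as overall speedup = 1 / ((1 - f) + f / s), where f is the fraction of the original execution time that can use the enhancement and s is the speedup of that fraction. The slide does not spell out the formula, so the Python sketch below, including its parameter names and sample values, is an illustrative assumption.

```python
def amdahl_speedup(fraction_enhanced, speedup_enhanced):
    """Overall speedup when a fraction of execution time is sped up.

    fraction_enhanced: share of the original time that benefits (0..1).
    speedup_enhanced: speedup of that share (> 0).
    """
    if not 0.0 <= fraction_enhanced <= 1.0:
        raise ValueError("fraction_enhanced must be between 0 and 1")
    if speedup_enhanced <= 0:
        raise ValueError("speedup_enhanced must be positive")
    return 1.0 / ((1.0 - fraction_enhanced) + fraction_enhanced / speedup_enhanced)


if __name__ == "__main__":
    # Hypothetical: 40% of the time is sped up by 10x, then by 1000x.
    print(amdahl_speedup(0.4, 10))
    print(amdahl_speedup(0.4, 1000))
```

Even with an enormous speedup of the enhanced part, the overall speedup in this example cannot exceed 1 / (1 - 0.4), about 1.67, because the remaining 60% of the time is untouched.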
