Lecture: Metrics To Evaluate Performance

The document discusses various metrics for evaluating computer system performance, including wall clock time, throughput, and benchmark suites. It describes averaging metrics like arithmetic mean (AM), geometric mean (GM), and harmonic mean (HM) for summarizing performance across multiple programs or workloads. AM provides a single number representation but depends on the choice of reference machine, while GM avoids this but may produce inconsistent results. The key factors affecting performance are clock speed, cycles per instruction (CPI), and number of instructions.

Uploaded by

srikar_datta

Lecture: Metrics to Evaluate Performance

Topics: Benchmark suites, performance equation, summarizing performance with AM, GM, HM

Video 1: Using AM as a performance summary
Video 2: GM, Performance Equation
Video 3: AM vs. HM vs. GM

Measuring Performance

Two primary metrics: wall clock time (response time for a program) and throughput (jobs performed in unit time)

To optimize throughput, we must ensure that there is minimal waste of resources

Benchmark Suites

Performance is measured with benchmark suites: a collection of programs that are likely relevant to the user

SPEC CPU 2006: CPU-oriented programs (for desktops)
SPECweb, TPC: throughput-oriented (for servers)
EEMBC: for embedded processors/workloads

Summarizing Performance

Consider 25 programs from a benchmark set: how do we capture the behavior of all 25 programs with a single number?

         P1   P2   P3
Sys-A    10    8   25
Sys-B    12    9   20
Sys-C     8    8   30

Sum of execution times (AM)
Sum of weighted execution times (AM)
Geometric mean of execution times (GM)
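
As a rough illustration of the first and third summaries, the sketch below computes the sum, AM, and GM of the execution times in the table. The helper names are made up for this example, and the lecture does not state the time units, so none are assumed here.

```python
import math

# Execution times copied from the table (units are not given in the lecture).
times = {
    "Sys-A": [10, 8, 25],
    "Sys-B": [12, 9, 20],
    "Sys-C": [8, 8, 30],
}

def arithmetic_mean(xs):
    """Sum divided by count; ranks systems the same way as the plain sum."""
    return sum(xs) / len(xs)

def geometric_mean(xs):
    """n-th root of the product of n values."""
    return math.prod(xs) ** (1.0 / len(xs))

# For each system: (sum, AM, GM) of its execution times.
summary = {
    name: (sum(t), arithmetic_mean(t), geometric_mean(t))
    for name, t in times.items()
}

for name, (total, am, gm) in summary.items():
    print(f"{name}: sum={total}, AM={am:.2f}, GM={gm:.2f}")
```

On these numbers the two summaries rank the systems differently: Sys-B has the smallest sum (41), while Sys-C has the smallest GM (the cube root of 1920, versus 2000 for Sys-A and 2160 for Sys-B).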

Sum of Weighted Exec Times Example

We fixed a reference machine X and ran 4 programs A, B, C, D on it such that each program ran for 1 second

The exact same workload (the four programs execute the same number of instructions that they did on machine X) is run on a new machine Y, and the execution times for the programs are 0.8, 1.1, 0.5, and 2 seconds

With AM of normalized execution times, we can conclude that Y is 1.1 times slower than X; perhaps not for all workloads, but definitely for one specific workload (where all programs run on the ref-machine for an equal #cycles)
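
The 1.1 figure can be reproduced with a few lines. This is only a sketch of the arithmetic in the example; the dictionary names are invented for illustration.

```python
# Times in seconds on reference machine X and on new machine Y.
ref_times = {"A": 1.0, "B": 1.0, "C": 1.0, "D": 1.0}
new_times = {"A": 0.8, "B": 1.1, "C": 0.5, "D": 2.0}

# Normalize each program's time on Y to its time on X.
normalized = {p: new_times[p] / ref_times[p] for p in ref_times}

# AM of the normalized execution times.
am_normalized = sum(normalized.values()) / len(normalized)

# Because every program ran for 1 second on X, this equals the ratio of
# total run time on Y to total run time on X for that workload.
total_ratio = sum(new_times.values()) / sum(ref_times.values())

print(f"AM of normalized times: {am_normalized:.2f}")
print(f"Total-time ratio Y/X:   {total_ratio:.2f}")
```

Both values come out to 1.1, which is why the AM conclusion holds exactly for the workload in which each program runs for equal time on the reference machine.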

Summarizing Performance

For the same question (how do we capture the behavior of all 25 programs with a single number?), the candidate summaries are:

Sum of execution times (AM)
Sum of weighted execution times (AM)
Geometric mean of execution times (GM), which may lead to inconsistencies

GM Example

        Computer-A    Computer-B    Computer-C
P1      1 sec         10 secs       20 secs
P2      1000 secs     100 secs      20 secs

Conclusion with GMs: (i) A = B; (ii) C is ~1.6 times faster

For (i) to be true, P1 must occur 100 times for every occurrence of P2

With the above assumption, (ii) is no longer true

Hence, GM can lead to inconsistencies
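
The inconsistency can be checked numerically. The sketch below uses the times from the table; the `mix` dictionary encodes the 100-to-1 workload from the slide, and all names are chosen for this example only.

```python
import math

# Execution times in seconds, from the GM example table.
times = {
    "A": {"P1": 1, "P2": 1000},
    "B": {"P1": 10, "P2": 100},
    "C": {"P1": 20, "P2": 20},
}

def geometric_mean(xs):
    xs = list(xs)
    return math.prod(xs) ** (1.0 / len(xs))

# GM of execution times per computer.
gm = {m: geometric_mean(t.values()) for m, t in times.items()}

# Workload under which A and B really do take equal total time:
# 100 runs of P1 for every run of P2.
mix = {"P1": 100, "P2": 1}
total = {m: sum(mix[p] * t[p] for p in mix) for m, t in times.items()}

print({m: round(v, 2) for m, v in gm.items()})
print(total)
```

The GMs are about 31.6 for A and B and 20 for C, giving the roughly 1.6x advantage for C. Under the 100-to-1 mix, however, A and B each take 1100 seconds while C takes 2020 seconds, so C is the slowest machine on the only workload that makes conclusion (i) true.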

Summarizing Performance

GM: does not require a reference machine, but does not predict performance very well
So we multiplied execution times and determined that sys-A is 1.2x faster, but on what workload?

AM: does predict performance for a specific workload, but that workload was determined by executing programs on a reference machine
Every year or so, the reference machine will have to be updated

CPU Performance Equation

Clock cycle time = 1 / clock speed

CPU time = clock cycle time x cycles per instruction x number of instructions

Influencing factors for each:

clock cycle time: technology and pipeline
CPI: architecture and instruction set design
instruction count: instruction set design and compiler

CPI (cycles per instruction) or IPC (instructions per cycle) cannot be accurately estimated analytically
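
A minimal sketch of the performance equation follows. The function name and the input values (2 GHz clock, CPI of 1.5, one billion instructions) are hypothetical and chosen only to make the arithmetic easy to follow; they do not come from the lecture.

```python
def cpu_time_seconds(clock_speed_hz, cpi, instruction_count):
    """CPU time = clock cycle time x CPI x number of instructions."""
    clock_cycle_time = 1.0 / clock_speed_hz  # seconds per cycle
    return clock_cycle_time * cpi * instruction_count

# Hypothetical machine and program, for illustration only.
example = cpu_time_seconds(clock_speed_hz=2e9, cpi=1.5, instruction_count=1e9)
print(f"CPU time: {example:.2f} s")
```

With these assumed inputs the cycle time is 0.5 ns and the CPU time is 0.75 seconds. In practice the CPI would have to be measured or simulated, since it cannot be accurately estimated analytically.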

An Alternative Perspective - I

Each program is assumed to run for an equal number of cycles, so we're fair to each program

The number of instructions executed per cycle is a measure of how well a program is doing on a system

The appropriate summary measure is the sum of IPCs or, after dividing by the number of programs, the AM of IPCs:

1.2 instr/cyc + 1.8 instr/cyc + 0.5 instr/cyc

This measure implicitly assumes that 1 instr in prog-A has the same importance as 1 instr in prog-B
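
One way to see why the AM of IPCs fits the equal-cycles case is to compute the aggregate IPC of such a workload directly. In this sketch `cycles_per_program` is an arbitrary assumed value; the result does not depend on it.

```python
import math

ipcs = [1.2, 1.8, 0.5]        # instructions per cycle, from the slide
cycles_per_program = 1000     # assumed: every program runs this many cycles

# Instructions completed by each program in its share of cycles.
instructions = [ipc * cycles_per_program for ipc in ipcs]

# Aggregate IPC of the whole workload.
aggregate_ipc = sum(instructions) / (cycles_per_program * len(ipcs))

# Arithmetic mean of the individual IPCs.
am_ipc = sum(ipcs) / len(ipcs)

print(f"aggregate IPC = {aggregate_ipc:.4f}, AM of IPCs = {am_ipc:.4f}")
assert math.isclose(aggregate_ipc, am_ipc)
```

Both values are 3.5 / 3, about 1.17 instructions per cycle.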

An Alternative Perspective - II

Each program is assumed to run for an equal number of instructions, so we're fair to each program

The number of cycles required per instruction is a measure of how well a program is doing on a system

The appropriate summary measure is the sum of CPIs or, after dividing by the number of programs, the AM of CPIs:

0.8 cyc/instr + 0.6 cyc/instr + 2.0 cyc/instr

This measure implicitly assumes that 1 instr in prog-A has the same importance as 1 instr in prog-B
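
The equal-instructions case can be checked the same way. Here `instructions_per_program` is an arbitrary assumed value that cancels out of the result.

```python
import math

cpis = [0.8, 0.6, 2.0]            # cycles per instruction, from the slide
instructions_per_program = 1000   # assumed: every program runs this many instructions

# Cycles consumed by each program for its share of instructions.
cycles = [cpi * instructions_per_program for cpi in cpis]

# Aggregate CPI of the whole workload.
aggregate_cpi = sum(cycles) / (instructions_per_program * len(cpis))

# Arithmetic mean of the individual CPIs.
am_cpi = sum(cpis) / len(cpis)

print(f"aggregate CPI = {aggregate_cpi:.4f}, AM of CPIs = {am_cpi:.4f}")
assert math.isclose(aggregate_cpi, am_cpi)
```

Both values are 3.4 / 3, about 1.13 cycles per instruction.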

AM and HM

Note that AM of IPCs = 1 / HM of CPIs and AM of CPIs = 1 / HM of IPCs

So if the programs in a benchmark suite are weighted such that each runs for an equal number of cycles, then AM of IPCs or HM of CPIs are both appropriate measures

If the programs in a benchmark suite are weighted such that each runs for an equal number of instructions, then AM of CPIs or HM of IPCs are both appropriate measures
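
The two identities can be verified numerically with Python's standard `statistics` module. This sketch takes the example IPCs and derives each CPI as exactly 1 / IPC, which is an assumption made so that the identity holds to floating-point precision.

```python
import math
from statistics import mean, harmonic_mean

ipcs = [1.2, 1.8, 0.5]
cpis = [1.0 / ipc for ipc in ipcs]   # exact reciprocals (assumed)

am_ipc = mean(ipcs)
hm_cpi = harmonic_mean(cpis)
am_cpi = mean(cpis)
hm_ipc = harmonic_mean(ipcs)

# AM of IPCs = 1 / HM of CPIs
assert math.isclose(am_ipc, 1.0 / hm_cpi)
# AM of CPIs = 1 / HM of IPCs
assert math.isclose(am_cpi, 1.0 / hm_ipc)

print(f"AM(IPC) = {am_ipc:.4f}, 1/HM(CPI) = {1.0 / hm_cpi:.4f}")
print(f"AM(CPI) = {am_cpi:.4f}, 1/HM(IPC) = {1.0 / hm_ipc:.4f}")
```

The identities follow from the definition of the harmonic mean: n divided by the sum of reciprocals, where the reciprocal of a CPI is an IPC and vice versa.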

AM vs. GM

GM of IPCs = 1 / GM of CPIs

AM of IPCs represents throughput for a workload where each program runs sequentially for 1 cycle each; but high-IPC programs contribute more to the AM

GM of IPCs does not represent run-time for any real workload (what does it mean to multiply instructions?); but every program's IPC contributes equally to the final measure
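
To make the contrast concrete, the sketch below doubles the IPC of one program at a time and compares how much the AM and the GM move. The IPC values are the ones used earlier in the lecture; the `doubled` helper and the choice of doubling as the perturbation are assumptions of this example.

```python
import math
from statistics import mean, geometric_mean

ipcs = [1.2, 1.8, 0.5]
cpis = [1.0 / ipc for ipc in ipcs]

# GM of IPCs = 1 / GM of CPIs
assert math.isclose(geometric_mean(ipcs), 1.0 / geometric_mean(cpis))

def doubled(values, index):
    """Return a copy of values with the entry at index multiplied by 2."""
    out = list(values)
    out[index] *= 2
    return out

high, low = 1, 2  # indices of the highest-IPC (1.8) and lowest-IPC (0.5) programs

am_gain_high = mean(doubled(ipcs, high)) / mean(ipcs)
am_gain_low = mean(doubled(ipcs, low)) / mean(ipcs)
gm_gain_high = geometric_mean(doubled(ipcs, high)) / geometric_mean(ipcs)
gm_gain_low = geometric_mean(doubled(ipcs, low)) / geometric_mean(ipcs)

print(f"AM gain: high-IPC program {am_gain_high:.3f}, low-IPC program {am_gain_low:.3f}")
print(f"GM gain: high-IPC program {gm_gain_high:.3f}, low-IPC program {gm_gain_low:.3f}")
```

Doubling the high-IPC program raises the AM by a factor of about 1.51, while doubling the low-IPC program raises it by only about 1.14. The GM rises by the same factor (the cube root of 2, about 1.26) in both cases.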

Speedup Vs. Percentage

Speedup is a ratio = old exec time / new exec time

Improvement, Increase, Decrease usually refer to percentage relative to the baseline
= (new perf - old perf) / old perf

A program ran in 100 seconds on my old laptop and in 70 seconds on my new laptop
What is the speedup?
What is the percentage increase in performance?
What is the reduction in execution time?
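
The three questions can be worked through as follows. This sketch assumes performance is defined as the reciprocal of execution time, which the slide does not state explicitly; the variable names are illustrative.

```python
old_time = 100.0  # seconds on the old laptop
new_time = 70.0   # seconds on the new laptop

# Speedup is a ratio of execution times.
speedup = old_time / new_time

# Assumed definition: performance = 1 / execution time.
old_perf = 1.0 / old_time
new_perf = 1.0 / new_time
perf_increase_pct = (new_perf - old_perf) / old_perf * 100.0

# Reduction in execution time, relative to the baseline.
time_reduction_pct = (old_time - new_time) / old_time * 100.0

print(f"speedup = {speedup:.2f}")
print(f"performance increase = {perf_increase_pct:.1f}%")
print(f"execution time reduction = {time_reduction_pct:.1f}%")
```

Under that definition the speedup is about 1.43, the performance increase is about 42.9%, and the execution time reduction is 30%. The two percentages differ because they are taken relative to different baselines.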
