Multi-Core Computer Architecture
Lecture 1D
 Performance Evaluation Methods
                  John Jose
             Associate Professor
 Department of Computer Science & Engineering
    Indian Institute of Technology Guwahati
             Measuring Performance
❖ When can we say one computer / architecture / design is
  better than another?
   ❖ Desktop PC – execution time of a program
   ❖ Server – transactions per unit time
❖ When can we say X is n times faster than Y?
   ❖ Execution time_Y / Execution time_X = n
   ❖ Throughput_X / Throughput_Y = n
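For instance (with made-up numbers): if a program takes 15 s on Y and 5 s on X,
then Execution time_Y / Execution time_X = 15 / 5 = 3, so X is 3 times faster than Y.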
            Measuring Performance
❖ Typical performance metrics:
     ❖Response time
     ❖Throughput
     ❖CPU time
     ❖Wall clock time
     ❖Speedup
❖ Benchmarks
     ❖Toy programs (e.g. sorting, matrix multiply)
     ❖Synthetic benchmarks (e.g. Dhrystone)
     ❖Benchmark suites (e.g. SPEC06, SPLASH)
Benchmark Suite
Benchmark Based Evaluation
SPEC Ratio
      Reference for SPEC 2006:
      Sun Ultra Enterprise 2 workstation with
      a 296-MHz UltraSPARC II processor
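A minimal Python sketch of how a SPEC score is formed: each benchmark's SPECratio is the
reference machine's run time divided by the measured machine's run time, and the overall
score is the geometric mean of the ratios. The benchmark names and times below are
hypothetical placeholders, not real SPEC results.

from math import prod

# Hypothetical run times in seconds (not real SPEC data).
ref_times  = {"bench_a": 9000.0, "bench_b": 12000.0}   # reference machine times
meas_times = {"bench_a":  500.0, "bench_b":  1200.0}   # machine under test

ratios = {b: ref_times[b] / meas_times[b] for b in ref_times}   # SPECratio per benchmark
score  = prod(ratios.values()) ** (1.0 / len(ratios))           # geometric mean = overall score
print(ratios, score)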
                         Amdahl's Law
❖ Amdahl’s Law defines the speedup that can be gained by improving
   some portion of a computer.
❖ The performance improvement to be gained from using some faster
   mode of execution is limited by the fraction of the time the faster mode
   can be used.
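In equation form: if the enhancement can be used for a fraction f of the original
execution time and speeds that portion up by a factor s, then
   Execution time_new = Execution time_old × ((1 − f) + f / s)
   Speedup_overall = 1 / ((1 − f) + f / s)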
            Amdahl's Law- Illustration
Example: Suppose we want to enhance the floating-point operations
of a processor by introducing a new, advanced FPU. The new FPU is
10 times faster on floating-point computations than the original
processor. Assuming floating-point operations account for 40% of
the program's execution time, what is the overall speedup gained
by incorporating the enhancement?
Solution:
Fraction enhanced = 0.4
Speedup enhanced = 10
Speedup overall = 1 / ((1 − 0.4) + 0.4 / 10) = 1 / 0.64 ≈ 1.56
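The same calculation as a small Python sketch (the helper name amdahl_speedup is just an
illustrative choice):

def amdahl_speedup(fraction_enhanced, speedup_enhanced):
    # Overall speedup = 1 / ((1 - f) + f / s)
    return 1.0 / ((1.0 - fraction_enhanced) + fraction_enhanced / speedup_enhanced)

print(amdahl_speedup(0.4, 10))   # ~1.5625, matching the example above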
   Amdahl's Law for Parallel Processing
Illustration: a program of 500 time units, of which 300 units (three blocks
of 100) are inherently serial and 200 units (two blocks of 100) can be split
evenly across processors.

Processors      1          2          4          ∞
Work            500        500        500        500
Time            500        400        350        ≈300
Speedup         1x         1.25x      ~1.4x      ~1.7x
How much speedup can you achieve?
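These numbers follow directly from Amdahl's law: the parallelizable fraction is
f = 200 / 500 = 0.4, so Speedup(N) = 1 / ((1 − f) + f / N), giving 1.25 for N = 2,
≈1.43 for N = 4, and 1 / 0.6 ≈ 1.67 as N → ∞. No matter how many processors are
added, the serial 60% caps the speedup below 1.67x.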
                       Design Example
A common transformation required in graphics processors is square
root. Implementations of floating-point (FP) square root vary
significantly in performance, especially among processors designed
for graphics. Suppose FP square root (FPSQR) is responsible for
20% of the execution time of a critical graphics benchmark.
         One proposal is to enhance the FPSQR hardware and
speed up this operation by a factor of 10. The other alternative is
just to try to make all FP instructions in the graphics processor run
faster by a factor of 1.6; FP instructions are responsible for half of
the execution time for the application. Compare these two design
alternatives using Amdahl's Law.
                        Design Example
Case A: FPSQR hardware optimization
  Speedup_A = 1 / ((1 − 0.2) + 0.2 / 10) = 1 / 0.82 ≈ 1.22

Case B: FP instructions optimization
  Speedup_B = 1 / ((1 − 0.5) + 0.5 / 1.6) = 1 / 0.8125 ≈ 1.23

Improving all FP instructions gives a slightly higher overall speedup,
so Case B is (marginally) the better design choice.
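A quick cross-check of both alternatives in Python (a sketch using the standard Amdahl
formula, with the fractions and speedups taken from the problem statement):

def amdahl_speedup(f, s):
    # Overall speedup = 1 / ((1 - f) + f / s)
    return 1.0 / ((1.0 - f) + f / s)

case_a = amdahl_speedup(0.2, 10)    # FPSQR hardware 10x faster -> ~1.22
case_b = amdahl_speedup(0.5, 1.6)   # all FP instructions 1.6x  -> ~1.23
print(case_a, case_b)               # speeding up all FP wins slightly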
         Principles of Computer Design
❖ All processors are driven by a clock.
❖ Expressed as a clock rate (in GHz) or a clock period (in ns)
❖ CPU Time = CPU clock cycles x clock cycle time
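Since CPU clock cycles = Instruction count (IC) × average cycles per instruction (CPI),
this can also be written as CPU Time = IC × CPI × Clock cycle time = IC × CPI / Clock rate.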
        Principles of Computer Design
❖ Clock cycle time – determined by hardware technology
❖ CPI (cycles per instruction) – determined by organization and the ISA
❖ IC (instruction count) – determined by the ISA and compiler technology
          Principles of Computer Design
❖ Different instruction types have different CPIs
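In that case, CPU clock cycles = Σ_i (IC_i × CPI_i), so the effective CPI is the
instruction-mix-weighted average Σ_i (IC_i / IC) × CPI_i; the next example applies
this directly.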
        Example: Basic Performance Analysis
Consider two programs A and B that solve a given problem. A is scheduled
to run on a processor P1 operating at 1 GHz, and B is scheduled to run on
a processor P2 running at 1.4 GHz. A has a total of 10000 instructions, of
which 20% are branch instructions, 40% are load/store instructions, and the
rest are ALU instructions. B is composed of 25% branch instructions, and the
number of load/store instructions in B is twice the number of ALU
instructions. The total instruction count of B is 12000. In both P1 and P2,
branch instructions have an average CPI of 5 and ALU instructions have an
average CPI of 1.5. The two architectures differ in the CPI of load/store
instructions: 2 for P1 and 3 for P2. Which mapping (A on P1 or B on P2)
solves the problem faster, and by how much?
         Example: Basic Performance Analysis
A on P1 (1 GHz → CCT = 1 ns):
  IC = 10000
  Fraction  BR : L/S : ALU = 20 : 40 : 40
  CPI       BR : L/S : ALU = 5 : 2 : 1.5
  (a) CPI_A_P1 = 0.2×5 + 0.4×2 + 0.4×1.5 = 2.4
      ExT = 2.4 × 10000 × 1 ns = 24000 ns

B on P2 (1.4 GHz → CCT ≈ 0.714 ns):
  IC = 12000
  Fraction  BR : L/S : ALU = 25 : 50 : 25
  CPI       BR : L/S : ALU = 5 : 3 : 1.5
  (b) CPI_B_P2 = 0.25×5 + 0.5×3 + 0.25×1.5 = 3.125
      ExT = 3.125 × 12000 × 0.714 ns ≈ 26775 ns
Hence A on P1 is faster, by a factor of 26775 / 24000 ≈ 1.12.
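The same calculation as a short Python sketch of the CPU performance equation
(instruction mixes and CPIs are taken from the problem statement; the slide's 0.714 ns
is 1/1.4 GHz rounded, which is why its answer is 26775 ns rather than ~26786 ns):

def exec_time_ns(ic, mix, cpi, cct_ns):
    # CPU time = IC x effective CPI x clock cycle time
    eff_cpi = sum(mix[c] * cpi[c] for c in mix)
    return ic * eff_cpi * cct_ns

a_on_p1 = exec_time_ns(10000, {"br": 0.2,  "ls": 0.4, "alu": 0.4},
                       {"br": 5, "ls": 2, "alu": 1.5}, 1.0)       # 24000 ns
b_on_p2 = exec_time_ns(12000, {"br": 0.25, "ls": 0.5, "alu": 0.25},
                       {"br": 5, "ls": 3, "alu": 1.5}, 1 / 1.4)   # ~26786 ns
print(a_on_p1, b_on_p2, b_on_p2 / a_on_p1)                        # ratio ~1.12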
                   Example: Amdahl's Law
A company is releasing two new versions (beta and gamma) of its basic
processor architecture, named alpha. Beta and gamma are designed by making
modifications to three major components (X, Y, and Z) of alpha. It was
observed that for a program A, the fractions of the total execution time
spent on these three components, X, Y, and Z, are 40%, 30%, and 20%,
respectively. Beta speeds up X and Z by 2 times but slows down Y by 1.3
times, whereas gamma speeds up X, Y, and Z by 1.2, 1.3, and 1.4 times,
respectively.
(a) How much faster is gamma than alpha for running A?
(b) Is beta or gamma faster for running A? Find the speedup factor.
Given: f_X = 0.4, f_Y = 0.3, f_Z = 0.2
Beta:  N_X = 2,   N_Y = 1/1.3, N_Z = 2
Gamma: N_X = 1.2, N_Y = 1.3,   N_Z = 1.4
                 Example: Amdahl's Law
(a) Time_gamma / Time_alpha = 0.1 + 0.4/1.2 + 0.3/1.3 + 0.2/1.4 ≈ 0.807,
    so gamma is 1 / 0.807 ≈ 1.239 times faster than alpha.
(b) Time_beta / Time_alpha = 0.1 + 0.4/2 + 0.3×1.3 + 0.2/2 = 0.79,
    so beta is 1 / 0.79 ≈ 1.267 times faster than alpha.
    Beta is faster than gamma: 1.267 / 1.239 ≈ 1.022 times.
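A sketch of the multi-component form of Amdahl's law used above (component fractions
and per-component speedups come from the problem; the remaining 10% of execution time
is untouched, and a slowdown is simply a speedup less than 1):

def overall_speedup(fractions, speedups):
    # New time = untouched fraction + sum over components of (f_i / s_i)
    untouched = 1.0 - sum(fractions.values())
    new_time = untouched + sum(fractions[c] / speedups[c] for c in fractions)
    return 1.0 / new_time

frac  = {"X": 0.4, "Y": 0.3, "Z": 0.2}
beta  = overall_speedup(frac, {"X": 2.0, "Y": 1 / 1.3, "Z": 2.0})   # ~1.266
gamma = overall_speedup(frac, {"X": 1.2, "Y": 1.3,     "Z": 1.4})   # ~1.239
print(gamma, beta, beta / gamma)                                    # beta/gamma ~1.022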
      johnjose@iitg.ac.in
http://www.iitg.ac.in/johnjose/