0% found this document useful (0 votes)

14 views61 pages

ch4 3

The document discusses data hazards in pipelined architectures, focusing on techniques like forwarding and stalling to resolve these hazards. It covers the detection of forwarding needs, conditions for forwarding, and the implications of load-use hazards, as well as control hazards related to branching and exceptions. Additionally, it explores instruction-level parallelism (ILP) through static and dynamic multiple issue techniques, speculation, and loop unrolling to enhance performance in pipelined processors.

Uploaded by

macbay prince

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views61 pages

ch4 3

Uploaded by

macbay prince

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 61

Data Hazards:

Forwarding versus Stalling

•
Consider this
sequence: SUB X2,
X1,X3 AND
X12,X2,X5 OR
X13,X6,X2
ADD X14,X2,X2
STUR X15,[X2,#100]

•
We can resolve hazards with forwarding
– How do we detect when to forward?
Dependencies & Forwarding

FIGURE 4.51
Detecting the Need to Forward
•
Pass register numbers along pipeline
–
e.g., ID/EX.RegisterRs = register number for Rs sitting
in ID/EX pipeline register
•
ALU operand register numbers in EX stage
are given by
–
ID/EX.RegisterRn1, ID/EX.RegisterRm2
•
Data hazards when
Fwd from
1a. EX/MEM.RegisterRd = ID/EX.RegisterRn1 EX/MEM
1b. EX/MEM.RegisterRd = ID/EX.RegisterRm2 pipeline reg

2a. MEM/WB.RegisterRd = ID/EX.RegisterRn1 Fwd from

2b. MEM/WB.RegisterRd = ID/EX.RegisterRm2 MEM/WB
pipeline reg
Detecting the Need to Forward
•
But only if forwarding instruction will write
to a register!
– EX/MEM.RegWrite, MEM/WB.RegWrite

•
And only if Rd for that instruction is not XZR
– EX/MEM.RegisterRd ≠ 31,
MEM/WB.RegisterRd ≠ 31
FIGURE 4.52
Forwarding Paths

FIGURE 4.53
Forwarding Conditions
Mux control Source Explanation
ForwardA = 00 ID/EX The first ALU operand comes from the register file.
ForwardA = 10 EX/MEM The first ALU operand is forwarded from the prior ALU
result.
ForwardA = 01 MEM/WB The first ALU operand is forwarded from data memory or
an earlier
ALU result.
ForwardB = 00 ID/EX The second ALU operand comes from the register file.

ForwardB = 10 EX/MEM The second ALU operand is forwarded from the prior ALU
result.
ForwardB = 01 MEM/WB The second ALU operand is forwarded from data memory
or an
earlier ALU result.

FIGURE 4.54 The control values for the forwarding multiplexors in Figure 4.53.
Double Data Hazard
• Consider the sequence:
add X1,X1,X2
add X1,X1,X3
add X1,X1,X4
• Both hazards occur
– Want to use the most recent
• Revise MEM hazard condition
– Only fwd if EX hazard condition isn’t true
Revised Forwarding Condition

• MEM hazard
– if (MEM/WB.RegWrite
and (MEM/WB.RegisterRd ≠ 31)
and not(EX/MEM.RegWrite and (EX/MEM.RegisterRd ≠ 31)
and (EX/MEM.RegisterRd ≠ ID/EX.RegisterRn1))
and (MEM/WB.RegisterRd = ID/EX.RegisterRn1)) ForwardA = 01
– if (MEM/WB.RegWrite
and (MEM/WB.RegisterRd ≠ 31)
and not(EX/MEM.RegWrite and (EX/MEM.RegisterRd ≠ 31)
and (EX/MEM.RegisterRd ≠ ID/EX.RegisterRm2))
and (MEM/WB.RegisterRd = ID/EX.RegisterRm2)) ForwardB = 01
Datapath with Forwarding

FIGURE 4.55 The datapath modified to resolve hazards via forwarding

FIGURE 4.56
Datapath with Forwarding

FIGURE 4.57 A
pipelined sequence of
instructions
Load-Use Hazard Detection
• Check when using instruction is decoded in ID
stage
• ALU operand register numbers in ID stage
are given by
– IF/ID.RegisterRn1, IF/ID.RegisterRm2
• Load-use hazard when
– If (ID/EX.MemRead and
((ID/EX.RegisterRd = IF/ID.RegisterRn1) or
(ID/EX.RegisterRd = IF/ID.RegisterRm2))
• If detected, stall and insert bubble
How to Stall the Pipeline
• Force control values in ID/EX register to 0
– EX, MEM and WB do nop (no-operation)
• Prevent update of PC and IF/ID register
– Using instruction is decoded again
– Following instruction is fetched again
– 1-cycle stall allows MEM to read data for LDUI
•
Can subsequently forward to EX stage
Load-Use Data Hazard

Stall inserted
here

FIGURE 4.58
Datapath with Hazard Detection

FIGURE 4.59
Stalls and Performance
The BIG Picture
•
Stalls reduce performance
–
But are required to get correct results
•
Compiler can arrange code to avoid
hazards and stalls
–
Requires knowledge of the pipeline structure
Control Hazards
Branch Hazards
•
If branch outcome determined in MEM

Flush these
instructions
(Set control
values to 0)

FIGURE 4.60 The impact of the pipeline on the branch instruction

Reducing Branch Delay
• Move hardware to determine outcome to ID stage
– Target address adder
– Register comparator
• Example: branch taken
36: SUB X10, X4, X8
40: CBZ X1, X3, 8// PC-relative branch to
40+8 4=72
* AND X12, X2, X5
44:
48: ORR X13, X2, X6
52: ADD X14, X4, X2
56: SUB X15, X6, X7
...
72: LDUR X4, [X7,#50]
Example: Branch Taken

FIGURE 4.61 a
Example: Branch Taken

FIGURE 4.61 b
Dynamic Branch Prediction
• In deeper and superscalar pipelines, branch
penalty is more significant
• Use dynamic prediction
– Branch prediction buffer (aka branch history table)
– Indexed by recent branch instruction addresses
– Stores outcome (taken/not taken)
– To execute a branch
•
Check table, expect the same outcome
•
Start fetching from fall-through or target
•
If wrong, flush pipeline and flip prediction
1-Bit Predictor: Shortcoming
•
Inner loop branches mispredicted twice!
outer: …
…
inner: …
…
CBZ …, …, inner
…
CBZ …, …, outer


Mispredict as taken on last
iteration of inner loop

Then mispredict as not taken on
first iteration of inner loop next
time around
2-Bit Predictor
•
Only change prediction on two
successive mispredictions

FIGURE 4.62 The states in a 2-bit prediction scheme

Exceptions and Interrupts
• “Unexpected” events requiring
change in flow of control
– Different ISAs use the terms differently
• Exception
– Arises within the CPU
•
e.g., undefined opcode, overflow, syscall, …
• Interrupt
– From an external I/O controller
• Dealing with them without sacrificing performance
is hard
Here are examples showing whether the situation is
internally generated by the processor or externally
generated and the name that ARM uses:
Handling Exceptions
• Save PC of offending (or interrupted) instruction
– In LEGv8: Exception Link Register (ELR)

• Save indication of the problem

– In LEGv8: Exception Syndrome Register (ESR)
– We’ll assume 1-bit
•
0 for undefined opcode, 1 for overflow
An Alternate Mechanism
• Vectored Interrupts
– Handler address determined by the cause
• Exception vector address to be added to a vector
table base register:
– Unknown Reason: 00 0000two
– Overflow: 10 1100two
– …: 11 1111two
• Instructions either
– Deal with the interrupt, or
– Jump to real handler
Exceptions in a Pipeline
• Another form of control hazard
• Consider overflow on add in EX stage
ADD X1, X2, X1
– Prevent X1 from being clobbered
– Complete previous instructions
– Flush add and subsequent instructions
– Set ESR and ELR register values
– Transfer control to handler
• Similar to mispredicted branch
– Use much of the same hardware
Pipeline with Exceptions

FIGURE 4.64
Exception Example
• Exception on ADD in
40 SUB X11, X2, X4
44 AND X12, X2, X5
48 ORR X13, X2, X6
4C ADD X1, X2, X1
50 SUB X15, X6, X7
54 LDUR X16, [X7,#100]
…
• Handler
80000180 STUR X26, [X0,#1000]
80000184 STUR X27, [X0,#1008]
…
Exception Example

FIGURE 4.65 Clock 6

Exception Example

FIGURE 4.65 Clock 7

Parallelism via Instructions
Instruction-Level Parallelism (ILP)

• Pipelining: executing multiple instructions in parallel

• To increase ILP
– Deeper pipeline
•
Less work per stage  shorter clock cycle
– Multiple issue
•
Replicate pipeline stages  multiple pipelines
•
Start multiple instructions per clock cycle
•
CPI < 1, so use Instructions Per Cycle (IPC)
•
E.g., 4GHz 4-way multiple-issue
– 16 BIPS, peak CPI = 0.25, peak IPC = 4
•
But dependencies reduce this in practice
Multiple Issue
• Static multiple issue
– Compiler groups instructions to be issued together
– Packages them into “issue slots”
– Compiler detects and avoids hazards
• Dynamic multiple issue
– CPU examines instruction stream and chooses
instructions to issue each cycle
– Compiler can help by reordering instructions
– CPU resolves hazards using advanced techniques
at runtime
Speculation
• “Guess” what to do with an instruction
– Start operation as soon as possible
– Check whether guess was right
•
If so, complete the operation
•
If not, roll-back and do the right thing

• Common to static and dynamic multiple issue

• Examples
– Speculate on branch outcome
•
Roll back if path taken is different
– Speculate on load
•
Roll back if location is updated
Compiler/Hardware Speculation
• Compiler can reorder instructions
– e.g., move load before branch
– Can include “fix-up” instructions to recover
from incorrect guess
• Hardware can look ahead for instructions
to execute
– Buffer results until it determines they are
actually needed
– Flush buffers on incorrect speculation
Speculation and Exceptions
• What if exception occurs on a
speculatively executed instruction?
– e.g., speculative load before null-pointer check
• Static speculation
– Can add ISA support for deferring exceptions
• Dynamic speculation
– Can buffer exceptions until instruction
completion (which may not occur)
Static Multiple Issue
• Compiler groups instructions into “issue
packets”
– Group of instructions that can be issued on a single
cycle
– Determined by pipeline resources required
• Think of an issue packet as a very long
instruction
– Specifies multiple concurrent operations
–  Very Long Instruction Word (VLIW)
Scheduling Static Multiple Issue
• Compiler must remove some/all hazards
– Reorder instructions into issue packets
– No dependencies with a packet
– Possibly some dependencies between packets
•
Varies between ISAs; compiler must know!
– Pad with nop if necessary
LEGv8 with Static Dual Issue
•
Two-issue packets
–
One ALU/branch instruction
–
One load/store instruction
–
64-bit aligned
•
ALU/branch, then load/store
•
Pad an unused instruction with nop

Address Instruction type Pipeline Stages

n ALU/branch IF ID EX MEM WB
n+4 Load/store IF ID EX MEM WB
n+8 ALU/branch IF ID EX MEM WB
n + 12 Load/store IF ID EX MEM WB
n + 16 ALU/branch IF ID EX MEM WB
n + 20 Load/store IF ID EX MEM WB

FIGURE 4.66
LEGv8 with Static Dual Issue

FIGURE 4.67
use latency

Number of clock cycles between a

load instruction and an instruction
that can use the result of the load
without stalling the pipeline.
Hazards in the Dual-Issue LEGv8

•
More instructions executing in parallel
•
EX data hazard
–
Forwarding avoided stalls with single-issue
–
Now can’t use ALU result in load/store in same packet
•
ADD X0, X0, X1
LDUR X2, [X0,#0]
•
Split into two packets, effectively a stall
•
Load-use hazard
–
Still one cycle use latency, but now two instructions
•
More aggressive scheduling required
Scheduling Example
•
Schedule this for dual-issue LEGv8
Loop: LDUR X0, [X20,#0] //
X0=array element
ADD X0, X0,X21 //
add scalar in X21
STUR X0, [X20,#0] //
store result
SUBI X20, X20,#4 //
decrement pointer
CMP X20, X22 //
branch $s1!=0
BGT Loop

ALU/branch Load/store cycle

Loop: nop LDUR X0, [X20,#0] 1
SUBI X20, X20,#4 nop 2
ADD X0, X0,X21 nop 3
CMP X20, X22 nop 4
BGT Loop STUR X0, [X20,#0] 5

FIGURE 4.68
Loop Unrolling
• Replicate loop body to expose
more parallelism
– Reduces loop-control overhead
• Use different registers per replication
– Called “register renaming”
– Avoid loop-carried “anti-dependencies”
• Store followed by a load of the same register
• Aka “name dependence”
– Reuse of a register name
Loop Unrolling Example
ALU/branch Load/store cycle
Loop: SUBI X20, X20,#32 LDUR X0, [X20,#0] 1
nop LDUR X1, [X20,#24] 2
ADD X0, X0, X21 LDUR X2, [X20,#16] 3
ADD X1, X1, X21 LDUR X3, [X20,#8] 4
ADD X2, X2, X21 STUR X0, [X20,#32] 5
ADD X3, X3, X21 sw X1, [X20,#24] 6
CMP X20,X22 sw X2, [X20,#16] 7
BGT Loop sw X3, [X20,#8] 8

Figure 4.69

•
IPC = 15/8 = 1.875
– Closer to 2, but at cost of registers and code size
Dynamic Multiple-Issue Processors

• “Superscalar” processors
• CPU decides whether to issue 0, 1, 2, …
each cycle
– Avoiding structural and data hazards
• Avoids the need for compiler scheduling
– Though it may still help
– Code semantics ensured by the CPU
Dynamic Pipeline Scheduling
• Allow the CPU to execute instructions out
of order to avoid stalls
– But commit result to registers in order
• Example
LDUR X0, [X21,#20]
ADD X1, X0, X2 SUB
X23,X23,X3 ANDI X5,
X23,#20
– Can start sub while ADD is waiting for LDUI
Dynamically Scheduled CPU
Preserves
dependencies

Hold pending
operands

Reorders buffer for Results also sent to

Can supply operands

for issued
instructions
FIGURE 4.70
Why Do Dynamic Scheduling?
• Why not just let the compiler schedule code?
• Not all stalls are predicable
– e.g., cache misses
• Can’t always schedule around branches
– Branch outcome is dynamically determined
• Different implementations of an ISA
have different latencies and hazards
Does Multiple Issue Work?
The BIG Picture
•
Yes, but not as much as we’d like
•
Programs have real dependencies that limit ILP
•
Some dependencies are hard to eliminate
–
e.g., pointer aliasing
•
Some parallelism is hard to expose
–
Limited window size during instruction issue
•
Memory delays and limited bandwidth
–
Hard to keep pipelines full
•
Speculation can help if done well
Power Efficiency
• Complexity of dynamic scheduling
and speculations requires power
• Multiple simpler cores may be better
Microprocessor Year Clock Rate Pipeline Issu Out-of- Cores Power
Stages e order/
widt Speculation
h
i486 1989 25MHz 5 1 No 1 5W
Pentium 1993 66MHz 5 2 No 1 10W
Pentium Pro 1997 200MHz 10 3 Yes 1 29W
P4 Willamette 2001 2000MHz 22 3 Yes 1 75W
P4 Prescott 2004 3600MHz 31 3 Yes 1 103W
Core 2006 2930MHz 14 4 Yes 2 75W
UltraSparc III 2003 1950MHz 14 4 No 1 90W
UltraSparc T1 2005 1200MHz 6 1 No 8 70W
FIGURE 4.71
Seach

Apple M1 vs Intel Core i9-10980HK

Fallacies
• Pipelining is easy (!)
– The basic idea is easy
– The devil is in the details
• e.g., detecting data hazards
• Pipelining is independent of technology
– So why haven’t we always done pipelining?
– More transistors make more advanced techniques feasible
– Pipeline-related ISA design needs to take account
of technology trends
• e.g., predicated instructions
Concluding Remarks
• ISA influences design of datapath and control
• Datapath and control influence design of ISA
• Pipelining improves instruction throughput
using parallelism
– More instructions completed per second
– Latency for each instruction not reduced
• Hazards: structural, data, control
• Multiple issue and dynamic scheduling (ILP)
– Dependencies limit achievable parallelism
– Complexity leads to the power wall

Lecture 13-14: Pipelines Hazards": Suggested Reading:" (HP Chapter 4.5-4.7) "
No ratings yet
Lecture 13-14: Pipelines Hazards": Suggested Reading:" (HP Chapter 4.5-4.7) "
51 pages
CA7 2024S2 New
No ratings yet
CA7 2024S2 New
30 pages
Pipeline Hazards Detailed Notes
No ratings yet
Pipeline Hazards Detailed Notes
49 pages
Instruction Pipelining Basics
No ratings yet
Instruction Pipelining Basics
20 pages
Lec12 Pipeline 2 Notes
No ratings yet
Lec12 Pipeline 2 Notes
58 pages
Pipeline Hazards: Structural Hazards: Resource Conflict
No ratings yet
Pipeline Hazards: Structural Hazards: Resource Conflict
49 pages
Lec 4
No ratings yet
Lec 4
35 pages
Lecture 4.3 - The Processor - Pipelining
No ratings yet
Lecture 4.3 - The Processor - Pipelining
27 pages
Table 1: Control Signals and Opcodes
No ratings yet
Table 1: Control Signals and Opcodes
6 pages
Week 4 - Pipelining
No ratings yet
Week 4 - Pipelining
44 pages
L13 Stalls and Flushes
No ratings yet
L13 Stalls and Flushes
27 pages
Comp206 Lecture9
No ratings yet
Comp206 Lecture9
53 pages
2b.pipeline RISC-V v2
No ratings yet
2b.pipeline RISC-V v2
13 pages
Ca08 2014 PDF
No ratings yet
Ca08 2014 PDF
54 pages
Computer Pipelining Explained
No ratings yet
Computer Pipelining Explained
45 pages
Data Hazards in ALU Instructions: Consider This Sequence
No ratings yet
Data Hazards in ALU Instructions: Consider This Sequence
14 pages
Pooja Vashisth
No ratings yet
Pooja Vashisth
68 pages
COA Unit 3
No ratings yet
COA Unit 3
89 pages
U33
No ratings yet
U33
61 pages
Unit 6 Part1 Ilp
No ratings yet
Unit 6 Part1 Ilp
39 pages
CPU Pipelining Concepts
No ratings yet
CPU Pipelining Concepts
28 pages
Arch4 Pipelined Processor Design Afterlecture
No ratings yet
Arch4 Pipelined Processor Design Afterlecture
130 pages
CA Unit-2 Chapter-2
No ratings yet
CA Unit-2 Chapter-2
36 pages
Pipelining
No ratings yet
Pipelining
29 pages
Lecture 6 The Processors-Improving The Performance
No ratings yet
Lecture 6 The Processors-Improving The Performance
40 pages
Enhancing Performance With Pipelining
No ratings yet
Enhancing Performance With Pipelining
71 pages
Lecture10 Cda3101
No ratings yet
Lecture10 Cda3101
32 pages
Tuesday, October 31, 2023 10:53 PM: Discuss, The Schemes For Dealing With The Pipeline Stalls Caused by Branch Hazards
No ratings yet
Tuesday, October 31, 2023 10:53 PM: Discuss, The Schemes For Dealing With The Pipeline Stalls Caused by Branch Hazards
7 pages
3 Pipeline
No ratings yet
3 Pipeline
21 pages
Kuliah 14 Pipeliningg
No ratings yet
Kuliah 14 Pipeliningg
28 pages
3 Pipeline
No ratings yet
3 Pipeline
38 pages
Chapter 04 RISC V Removed
No ratings yet
Chapter 04 RISC V Removed
99 pages
Topic 10: Pipelining: Cos / Ele 375 Computer Architecture and Organization
No ratings yet
Topic 10: Pipelining: Cos / Ele 375 Computer Architecture and Organization
64 pages
CMP3010L05-Hazard Continue ILP
No ratings yet
CMP3010L05-Hazard Continue ILP
54 pages
Computer Architecture: Exceptions
No ratings yet
Computer Architecture: Exceptions
35 pages
Parallelism Via Instructions: Instruction-Level Parallelism (ILP)
No ratings yet
Parallelism Via Instructions: Instruction-Level Parallelism (ILP)
21 pages
CH10-Processor Structure and Function
No ratings yet
CH10-Processor Structure and Function
14 pages
Advanced Computer Architecture
No ratings yet
Advanced Computer Architecture
214 pages
Onur 447 Spring15 Lecture12 Ooo Execution Afterlecture
No ratings yet
Onur 447 Spring15 Lecture12 Ooo Execution Afterlecture
67 pages
Pipeline Hazards
No ratings yet
Pipeline Hazards
53 pages
Lecture 4.4 - The Processor - Pipelining
No ratings yet
Lecture 4.4 - The Processor - Pipelining
21 pages
System On Chip: Inside Processor Pipeline Stalls
No ratings yet
System On Chip: Inside Processor Pipeline Stalls
12 pages
Ca07 2014 PDF
No ratings yet
Ca07 2014 PDF
56 pages
05 Risc V Pipeline
No ratings yet
05 Risc V Pipeline
31 pages
L8 PipelineHazards 1
No ratings yet
L8 PipelineHazards 1
28 pages
4-Advanced Pipelining - 241114 - 060906
No ratings yet
4-Advanced Pipelining - 241114 - 060906
80 pages
Lec 06
No ratings yet
Lec 06
18 pages
Hazards - V3
No ratings yet
Hazards - V3
34 pages
Lec 24
No ratings yet
Lec 24
3 pages
Advanced Pipelining Techniques
No ratings yet
Advanced Pipelining Techniques
44 pages
Pipelining New
No ratings yet
Pipelining New
33 pages
Slides Chapter 6 Pipelining
No ratings yet
Slides Chapter 6 Pipelining
60 pages
Lec13 Pipe Control
No ratings yet
Lec13 Pipe Control
19 pages
MIPS Pipeline Stages & Hazards
No ratings yet
MIPS Pipeline Stages & Hazards
84 pages
The Hour Between Dog and Wolf
No ratings yet
The Hour Between Dog and Wolf
131 pages
Assembly&C Language
No ratings yet
Assembly&C Language
6 pages
ch5 1
No ratings yet
ch5 1
44 pages
ch4 2
No ratings yet
ch4 2
42 pages
ch2 1
No ratings yet
ch2 1
54 pages
ch2 2
No ratings yet
ch2 2
36 pages
Fundamentals of Blockchains
No ratings yet
Fundamentals of Blockchains
41 pages
PE EC603Csdf
No ratings yet
PE EC603Csdf
1 page
Instruction Level Parallelism Guide
No ratings yet
Instruction Level Parallelism Guide
31 pages
Seminar PPT Reconfigurable Processors
100% (1)
Seminar PPT Reconfigurable Processors
21 pages
Memory Hierarchy for CS Students
No ratings yet
Memory Hierarchy for CS Students
29 pages
20a04504a Computer Architecture and Organization
No ratings yet
20a04504a Computer Architecture and Organization
1 page
Chapter 1 Introduction
No ratings yet
Chapter 1 Introduction
43 pages
Lecture 1 Introduction and History
No ratings yet
Lecture 1 Introduction and History
58 pages
Interfacing 16×2 LCD With 8051
No ratings yet
Interfacing 16×2 LCD With 8051
39 pages
MCS-012 J-16
No ratings yet
MCS-012 J-16
7 pages
Understanding Nonvolatile BIOS Memory
No ratings yet
Understanding Nonvolatile BIOS Memory
11 pages
8085 Microprocessor Guide
No ratings yet
8085 Microprocessor Guide
14 pages
Comp201TH Unit 3 Lecture 2 Addressing Modes
No ratings yet
Comp201TH Unit 3 Lecture 2 Addressing Modes
4 pages
I O Interface
No ratings yet
I O Interface
27 pages
Computer Maintenance & Support
No ratings yet
Computer Maintenance & Support
112 pages
A 14Nm Logic Technology Featuring 2 - Generation Finfet, Air-Gapped Interconnects, Self-Aligned Double Patterning and A 0.0588 M Sram Cell Size
No ratings yet
A 14Nm Logic Technology Featuring 2 - Generation Finfet, Air-Gapped Interconnects, Self-Aligned Double Patterning and A 0.0588 M Sram Cell Size
3 pages
Intel 8085 Microprocessor Guide
No ratings yet
Intel 8085 Microprocessor Guide
12 pages
Introduction To IC Design & Fabrication
No ratings yet
Introduction To IC Design & Fabrication
18 pages
Advanced Circuit Design Guide
No ratings yet
Advanced Circuit Design Guide
41 pages
Digital Electronics MCQs
No ratings yet
Digital Electronics MCQs
3 pages
ATmega328P - Instruction Set Manual
No ratings yet
ATmega328P - Instruction Set Manual
166 pages
16 October 2017
No ratings yet
16 October 2017
21 pages
Coa T1
No ratings yet
Coa T1
2 pages
ST9+ Microcontroller Programming Guide
No ratings yet
ST9+ Microcontroller Programming Guide
303 pages
NIT 1 4 7040 Lecture 5 VLSI EC601
No ratings yet
NIT 1 4 7040 Lecture 5 VLSI EC601
26 pages
Micro - 1 Introduction To Microprocessor 25-9-2023
No ratings yet
Micro - 1 Introduction To Microprocessor 25-9-2023
15 pages
The Function of IC in Laptops
No ratings yet
The Function of IC in Laptops
9 pages
Sharma 2017
No ratings yet
Sharma 2017
6 pages
Unit 2 Part A
No ratings yet
Unit 2 Part A
10 pages
Slide 1
No ratings yet
Slide 1
22 pages
Understanding Cache Memory Basics
No ratings yet
Understanding Cache Memory Basics
4 pages

ch4 3

Uploaded by

ch4 3

Uploaded by

Data Hazards:

Forwarding versus Stalling

2a. MEM/WB.RegisterRd = ID/EX.RegisterRn1 Fwd from

FIGURE 4.55 The datapath modified to resolve hazards via forwarding

FIGURE 4.60 The impact of the pipeline on the branch instruction

FIGURE 4.62 The states in a 2-bit prediction scheme

• Save indication of the problem

FIGURE 4.65 Clock 6

FIGURE 4.65 Clock 7

• Pipelining: executing multiple instructions in parallel

• Common to static and dynamic multiple issue

Address Instruction type Pipeline Stages

Number of clock cycles between a

ALU/branch Load/store cycle

Reorders buffer for Results also sent to

Can supply operands

Apple M1 vs Intel Core i9-10980HK

You might also like