Pipe Lining

The document describes pipelining by dividing a task into stages that are executed synchronously, with the output of one stage becoming the input of the next. It provides examples of arithmetic pipelining for fixed-point and floating-point operations, breaking down operations like addition into stages like loading operands, performing the operation, and storing the result. The document also discusses concepts like pipeline performance, throughput, and ideal speedup that can be achieved through pipelining.

Uploaded by

Akshay Mahajan

Pipelining

Unit 2
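Example: compute An * Bn + Cn for n = 1, 2, 3, 4

Register transfers in each stage:

Stage 1: R1 ← An, R2 ← Bn (Load An & Bn)
Stage 2: R3 ← R1 * R2, R4 ← Cn (Multiply R1 & R2, Load Cn)
Stage 3: R5 ← R3 + R4 (Add R3 & R4)

[Figure: memory supplies An, Bn and Cn. Stage 1 holds registers R1 and R2, which feed the multiplier; Stage 2 holds registers R3 and R4, which feed the adder; Stage 3 holds register R5.]

Clock    Stage 1        Stage 2          Stage 3
Pulse    R1     R2      R3       R4      R5
1        A1     B1      -        -       -
2        A2     B2      A1*B1    C1      -
3        A3     B3      A2*B2    C2      A1*B1+C1
4        A4     B4      A3*B3    C3      A2*B2+C2
5        -      -       A4*B4    C4      A3*B3+C3
6        -      -       -        -       A4*B4+C4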
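The register table above can be reproduced with a small simulation. This is only a sketch: the function name `run_pipeline` and the use of `None` for an empty register are assumptions made for illustration, not part of the slides.

```python
def run_pipeline(a, b, c):
    """Simulate a three-stage pipeline that computes a[i] * b[i] + c[i]."""
    n = len(a)
    r1 = r2 = r3 = r4 = r5 = None       # all registers start empty
    results = []
    trace = []                           # register contents after each clock pulse
    for clock in range(1, n + 3):        # n items drain after n + 2 pulses
        # Every register is loaded on the same clock pulse, so the new
        # values are computed from the old contents before any update.
        new_r5 = r3 + r4 if r3 is not None else None
        if r1 is not None:
            new_r3, new_r4 = r1 * r2, c[clock - 2]
        else:
            new_r3 = new_r4 = None
        if clock <= n:
            new_r1, new_r2 = a[clock - 1], b[clock - 1]
        else:
            new_r1 = new_r2 = None
        r1, r2, r3, r4, r5 = new_r1, new_r2, new_r3, new_r4, new_r5
        if r5 is not None:
            results.append(r5)
        trace.append((clock, r1, r2, r3, r4, r5))
    return results, trace


results, trace = run_pipeline([1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12])
for row in trace:
    print(row)
```

Each printed row corresponds to one clock pulse of the table, with the first result appearing in R5 at pulse 3 and the last at pulse 6.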
Synchronous Pipelining

[Figure: Stage 1 -> Buffer -> Stage 2 -> Buffer -> ... -> Stage n]

• A task is divided into subtasks, which are performed synchronously by different hardware blocks known as stages.
• Each stage performs its operation, and the result it produces is stored in a latch or buffer.
• On the arrival of a clock pulse, all the latches transfer their data to the next stage simultaneously.
• Every stage has some delay, so the maximum stage delay determines the clock period of the pipeline.
• The clock period t of the pipeline equals the maximum stage delay plus the latch delay d, i.e. t = max(ts) + d
• Five tasks are executed by dividing each task into four subtasks.
• So how many stages? Four, one for each subtask.
• T12 -> Task 1 in Stage 2

Time Cycle   1     2     3     4     5     6     7     8
Task 1       T11   T12   T13   T14
Task 2             T21   T22   T23   T24
Task 3                   T31   T32   T33   T34
Task 4                         T41   T42   T43   T44
Task 5                               T51   T52   T53   T54
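The space-time diagram follows a simple rule: task i occupies stage j in time cycle i + j - 1. A minimal sketch of that rule is given below; the function name `space_time` is a hypothetical choice for illustration.

```python
def space_time(m, n):
    """Return the rows of the space-time diagram for m tasks on n stages."""
    total_cycles = n + m - 1
    rows = []
    for task in range(1, m + 1):
        row = [""] * total_cycles
        for stage in range(1, n + 1):
            cycle = task + stage - 1          # 1-based time cycle
            row[cycle - 1] = f"T{task}{stage}"
        rows.append(row)
    return rows


for task_number, row in enumerate(space_time(5, 4), start=1):
    cells = " ".join(f"{cell:<4}" for cell in row)
    print(f"Task {task_number}  {cells}")
```

With five tasks and four stages the diagram spans 4 + 5 - 1 = 8 time cycles, as in the slide.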

Asynchronous Pipelining

[Figure: Input -> Stage 1 -> Stage 2 -> ... -> Stage k. A Ready signal is passed forward and an Ack signal is returned at every stage boundary.]

• Data flow between adjacent stages is controlled by a handshaking protocol.
• When a stage is ready to transmit data, it sends a ready signal to the next stage.
• After receiving the data, the next stage sends an acknowledge (ack) signal back to the previous stage.
• Throughput is variable because the delay differs from stage to stage.
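The variable throughput can be seen in a simplified timing model. The sketch below assumes a stage starts an item as soon as the previous stage has signalled ready for it and the stage itself has finished the previous item; blocking while waiting for an ack is ignored. The function name and the delay values are illustrative assumptions.

```python
def async_finish_times(delays):
    """delays[s][j] is the time stage s needs for item j.

    Returns finish[s][j], the time at which stage s finishes item j.
    """
    stages = len(delays)
    items = len(delays[0])
    finish = [[0] * items for _ in range(stages)]
    for s in range(stages):
        for j in range(items):
            # Data becomes available when the previous stage is ready.
            ready_from_prev = finish[s - 1][j] if s > 0 else 0
            # The stage itself must have finished the previous item.
            stage_free = finish[s][j - 1] if j > 0 else 0
            finish[s][j] = max(ready_from_prev, stage_free) + delays[s][j]
    return finish


finish = async_finish_times([[2, 2, 2], [1, 5, 1]])
print(finish[-1])   # completion times of the items at the last stage
```

Because the second stage takes much longer on the middle item, the gaps between successive outputs are unequal, which is the variable throughput the last bullet describes.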
Pipeline Performance

• Consider execution of m tasks (instructions) using n stages (subtasks).

• Time to execute the m tasks in a non-pipelined implementation = m * n * t
• In an n-stage pipelined processor, the first task completes after n clock cycles.
• Each of the remaining m - 1 tasks completes one clock cycle after the previous one because of overlap.
• Time to execute m tasks in the pipelined implementation = (n + m - 1) * t
• Speedup S(n)

$$S(n) = \frac{m \cdot n \cdot t}{(n + m - 1) \cdot t} = \frac{m \cdot n}{n + m - 1}$$

When the number of tasks (instructions) is very large, m → ∞:

$$\lim_{m \to \infty} S(n) = n = \text{ideal speedup} = \text{maximum speedup}$$

The ideal speedup is equal to the number of pipeline stages. That is, when m is very large, a pipelined processor can produce output approximately n times faster than a non-pipelined processor.
• Efficiency of Pipeline E(n)

$$E(n) = \frac{S(n)}{n} = \frac{m \cdot n}{n \, (n + m - 1)} = \frac{m}{n + m - 1}$$

• Throughput U(n): number of tasks executed per unit time.

$$U(n) = \frac{m}{(n + m - 1) \cdot t} = \frac{E(n)}{t}$$
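The performance measures in this section can be checked numerically. The sketch below uses the symbols of the slides (m tasks, n stages, clock period t, latch delay d); the function names and the sample values are assumptions for illustration.

```python
def clock_period(stage_delays, latch_delay):
    """Clock period t = max(ts) + d."""
    return max(stage_delays) + latch_delay


def pipeline_metrics(m, n, t):
    """Speedup, efficiency and throughput for m tasks on an n-stage pipeline."""
    time_non_pipelined = m * n * t
    time_pipelined = (n + m - 1) * t
    speedup = time_non_pipelined / time_pipelined
    return {
        "time_non_pipelined": time_non_pipelined,
        "time_pipelined": time_pipelined,
        "speedup": speedup,
        "efficiency": speedup / n,            # fraction of the ideal speedup n
        "throughput": m / time_pipelined,     # tasks per unit time
    }


t = clock_period([10, 12, 8], latch_delay=1)
print(t)
print(pipeline_metrics(m=5, n=4, t=1))
print(pipeline_metrics(m=10**6, n=4, t=1)["speedup"])   # approaches n = 4
```

For the five-task, four-stage case, the pipelined time is 8 cycles against 20 without pipelining, and the speedup only approaches the ideal value n when m is very large.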
Types of Pipeline
• Arithmetic Pipeline
└ Fixed-Point Arithmetic Pipeline
└ Floating-Point Arithmetic Pipeline
• Instruction Pipeline
└ Two Stage Instruction Pipeline
└ Four Stage Instruction Pipeline
└ Six Stage Instruction Pipeline
ARITHMETIC PIPELINING

• Pipelines used for arithmetic computation are called arithmetic pipelines.
• Arithmetic pipelines are constructed to perform simple fixed-point and complex floating-point arithmetic operations.
• Basic operations such as addition, subtraction, division and multiplication can be efficiently partitioned into subtasks for the pipeline stages.
• FIXED POINT ARITHMETIC PIPELINE
• FLOATING POINT ARITHMETIC PIPELINE
Adders
• Two types of adders are used.
• Carry propagation adder: adds numbers such that the carry generated at each bit position is propagated to the next position.

eg      0101             101
      + 0100          + 1100
      ------          ------
        1001           10001

• Carry save adder: adds numbers such that the carry generated is not propagated; rather, the carries are saved in a carry vector.

eg       1010              101
       + 0110            +  11
       + 1111            +   1
       ------            -----
Sum     00011      Sum     111
Carry   11100      Carry   010

The carry vector is shifted one position to the left relative to the sum vector. The first example writes both vectors with n + 1 bits; the second keeps the width of the widest operand.
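The two adder types can be compared with a short sketch that works on Python integers as bit vectors. The function names are hypothetical; the carry save adder forms the sum vector with a bitwise XOR and the carry vector with a bitwise majority shifted left by one.

```python
def carry_save_add(x, y, z):
    """Add three numbers without propagating carries.

    Returns (sum_vector, carry_vector); x + y + z == sum_vector + carry_vector.
    """
    sum_vector = x ^ y ^ z
    carry_vector = ((x & y) | (y & z) | (x & z)) << 1
    return sum_vector, carry_vector


def carry_propagate_add(x, y, width):
    """Add two numbers bit by bit, rippling the carry to the next position."""
    carry = 0
    result = 0
    for i in range(width):
        a = (x >> i) & 1
        b = (y >> i) & 1
        result |= (a ^ b ^ carry) << i
        carry = (a & b) | (a & carry) | (b & carry)
    return result | (carry << width)          # final carry becomes the top bit


s, c = carry_save_add(0b1010, 0b0110, 0b1111)
print(format(s, "05b"), format(c, "05b"))
print(format(carry_propagate_add(s, c, 5), "b"))
```

A carry save adder does not produce the final sum by itself; the sum and carry vectors still have to be combined by a carry propagate adder, as in the last line of the example.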
FIXED POINT ARITHMETIC PIPELINE

[Figure: pipelined multiplier for two 8-bit fixed-point operands. Multiplier recoding logic takes the two 8-bit inputs and produces partial products 8 to 15 bits wide. Four successive levels of carry save adders (two adders, two adders, one adder, one adder) reduce them to a 16-bit sum vector and a 16-bit carry vector, which a carry propagate adder combines into the 16-bit product.]
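The structure of the multiplier pipeline can be sketched in software. The version below is a simplification: it assumes plain shift-and-AND partial products in place of the multiplier recoding logic, groups operands in threes at each carry save level, and uses Python's `+` for the final carry propagate addition. The names `csa` and `multiply_8bit` are hypothetical.

```python
def csa(x, y, z):
    """Carry save adder: three inputs in, sum and carry vectors out."""
    return x ^ y ^ z, ((x & y) | (y & z) | (x & z)) << 1


def multiply_8bit(a, b):
    """Multiply two unsigned 8-bit numbers with a tree of carry save adders.

    Returns (product, number_of_csa_levels).
    """
    assert 0 <= a < 256 and 0 <= b < 256
    # One partial product per multiplier bit (assumed, no recoding).
    operands = [(a << i) if (b >> i) & 1 else 0 for i in range(8)]
    levels = 0
    while len(operands) > 2:
        next_operands = []
        # Each group of three operands goes through one carry save adder.
        for i in range(0, len(operands) - 2, 3):
            next_operands.extend(csa(*operands[i:i + 3]))
        # Operands left over at this level pass through unchanged.
        next_operands.extend(operands[len(operands) - len(operands) % 3:])
        operands = next_operands
        levels += 1
    # Final stage: carry propagate addition of the last two vectors.
    return (operands[0] + operands[1]) & 0xFFFF, levels


print(multiply_8bit(13, 11))
print(multiply_8bit(255, 255))
```

With eight partial products this grouping needs four carry save levels before the final carry propagate addition, which matches the number of levels in the figure.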
Pipeline Floating Point Adder
Steps to add two floating point numbers
n1 = (f1, e1) eg n1 = 0.5 x 10^3
n2 = (f2, e2) eg n2 = 0.3 x 10^5
Step 1 : Compare the exponents (e1, e2) of the two numbers (n1, n2). Pick the fraction of the number with the smaller exponent, the difference k = |e1 - e2|, and the larger exponent.
n1 = 0.5 x 10^3 n2 = 0.3 x 10^5
Fraction with smaller exponent = 0.5 Larger exponent = 5
k = |e1 - e2| = |3 - 5| = 2

Step 2 : Right shift the fraction with the smaller exponent by k positions

i.e. right shifting n1 = 0.5 x 10^3 by k = 2 positions gives n1 = 0.005 x 10^5

Step 3 : Add the fractions: f1 + f2 = 0.005 + 0.3 = 0.305

Fraction part = 0.305 Exponent part = 5

Step 4 : Find M, the number of leading zeros in the mantissa of the result.

i.e. 0.305 has no leading zeros after the radix point. Therefore, M = 0

Step 5 : Subtract M from the exponent and left shift the mantissa of the result by M positions
M = 0, so the result is already normalized.
Fraction part = 0.305 & Exponent part = 5, i.e. 0.305 x 10^5
If the sum had been 0.00305 x 10^5 instead, M would be 2 and normalization would give 0.305 x 10^3.
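The five steps can be traced in code. The sketch below works in decimal with Python's `decimal` module and assumes the usual convention that a normalized fraction lies in the range 0.1 up to (but not including) 1; under this convention a sum such as 0.305 x 10^5 needs no shifting. The function name `fp_add` is hypothetical, and a sum of 1 or more (which would need a right shift) is not handled.

```python
from decimal import Decimal


def fp_add(f1, e1, f2, e2):
    """Add f1 x 10^e1 and f2 x 10^e2; return (fraction, exponent, M)."""
    f1, f2 = Decimal(f1), Decimal(f2)

    # Step 1: compare exponents, select the fraction with the smaller one.
    k = abs(e1 - e2)
    if e1 < e2:
        smaller, larger, exponent = f1, f2, e2
    else:
        smaller, larger, exponent = f2, f1, e1

    # Step 2: right shift the selected fraction by k decimal positions.
    smaller = smaller.scaleb(-k)

    # Step 3: add the aligned fractions.
    total = smaller + larger

    # Steps 4 and 5: count leading zeros (M) while left shifting them out,
    # then subtract M from the exponent.
    m = 0
    while total != 0 and abs(total) < Decimal("0.1"):
        total = total.scaleb(1)
        m += 1
    return total, exponent - m, m


print(fp_add("0.5", 3, "0.3", 5))
print(fp_add("0.512", 3, "-0.510", 3))
```

The second call shows a case where normalization does real work: the raw sum 0.002 x 10^3 has two leading zeros, so the fraction is shifted left twice and the exponent drops by two.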
[Figure: three-stage pipelined floating point adder with buffers between the stages. Inputs: e1, f1, e2, f2. Stage 1: an exponent comparator produces k and Max(e1,e2); a fraction selector picks the fraction with the smaller exponent, which a right shifter shifts by k. Stage 2: an adder adds the fraction with the larger exponent and the right-shifted fraction to give f1 + f2. Stage 3: a leading zero counter produces M; a left shifter shifts the fraction by M and an exponent subtractor subtracts M from Max(e1,e2). Outputs: exponent and fraction.]
INSTRUCTION PIPELINING

• Processing a typical instruction involves the following steps

1. Instruction Fetch
2. Instruction Decode
3. Address generation of operand
4. Operand fetch
5. Instruction Execution
6. Writing of Result
Two Stage Instruction Pipeline
• Instruction processing is subdivided into two stages
1. Fetch Instruction (IF)
2. Execute Instruction (EX)

[Figure: Buffer -> Stage 1 -> Buffer -> Stage 2 -> Buffer]

Time Cycle      1     2     3     4     5     6     7     8
Instruction 1   IF    EX
Instruction 2         IF    EX
Instruction 3               IF    EX
Instruction 4                     IF    EX
Four Stage Instruction Pipeline
• Instruction processing is subdivided into four stages
1. Instruction Fetch IF
2. Instruction Decode ID
3. Operand fetch OF
4. Instruction Execution EX

Time Cycle      1     2     3     4     5     6     7     8
Instruction 1   IF    ID    OF    EX
Instruction 2         IF    ID    OF    EX
Instruction 3               IF    ID    OF    EX
Instruction 4                     IF    ID    OF    EX
Six Stage Instruction Pipeline
• Instruction processing is subdivided into six stages
1. Instruction Fetch IF
2. Instruction Decode ID
3. Calculate operands CO
4. Operand fetch OF
5. Instruction Execution EX
6. Write Operand WO

Time Cycle      1     2     3     4     5     6     7     8     9
Instruction 1   IF    ID    CO    OF    EX    WO
Instruction 2         IF    ID    CO    OF    EX    WO
Instruction 3               IF    ID    CO    OF    EX    WO
Instruction 4                     IF    ID    CO    OF    EX    WO
Six Stage Instruction Pipeline

[Figure: space-time diagram of the six-stage pipeline over time cycles 1 to 14. Instructions 1 to 3 pass through all six stages (IF ID CO OF EX WO). Instruction 4 gets as far as EX, instruction 5 as far as OF, instruction 6 as far as CO and instruction 7 as far as ID. Instructions 15 and 16 then pass through all six stages.]
Pipeline Hazards
• Pipeline hazards prevent the next instruction in the instruction stream from executing during its designated clock cycle.
• The instruction is stalled, and all instructions later in the pipeline are also stalled.
• No new instructions are fetched during the stall.

• Hazards are classified into the following three major types:

1. Structural hazards
2. Data Hazards
3. Control Hazards
Structural hazards
• Occur when a resource is requested by more than one instruction at the same time.
• Arise from resource conflicts.
• The hardware cannot support all possible combinations of instructions.
• Example: no separate memories are available for instruction fetch and data access.

Time Cycle      1     2     3     4     5     6     7     8
Instruction 1   IF    ID    EX    MEM   WR
Instruction 2         IF    ID    EX    MEM   WR
Instruction 3               IF    ID    EX    MEM   WR
Instruction 4                     IF    ID    EX    MEM   WR

In time cycle 4, Instruction 1 (MEM) and Instruction 4 (IF) both need the single memory.

Techniques to eliminate hazards

• Duplicate resources
• Reorder the instructions
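A structural hazard of this kind can be found mechanically by recording which resource each stage uses in each cycle. The sketch below assumes a single memory shared by the IF and MEM stages and that every instruction uses the memory in MEM; the names `STAGES`, `RESOURCE` and `structural_hazards` are illustrative.

```python
from collections import defaultdict

STAGES = ["IF", "ID", "EX", "MEM", "WR"]
# Assumption: one memory serves both instruction fetch and data access.
RESOURCE = {"IF": "memory", "MEM": "memory"}


def structural_hazards(num_instructions):
    """Return {(cycle, resource): [(instruction, stage), ...]} for conflicts."""
    usage = defaultdict(list)
    for instr in range(1, num_instructions + 1):
        for offset, stage in enumerate(STAGES):
            resource = RESOURCE.get(stage)
            if resource is not None:
                # Instruction i enters IF in cycle i and advances one stage per cycle.
                usage[(instr + offset, resource)].append((instr, stage))
    return {key: users for key, users in sorted(usage.items()) if len(users) > 1}


for (cycle, resource), users in structural_hazards(4).items():
    print(f"cycle {cycle}: {resource} requested by {users}")
```

Replacing `RESOURCE` with separate entries for an instruction memory and a data memory models the "duplicate resources" remedy: no cycle then has two users of the same resource.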
Data Hazards
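• Caused by a dependency among instructions, i.e. a data dependency.
• Occur when an instruction in the pipeline affects the result of another instruction in the pipeline.
• I1 : ADD R3, R2 R3←R3+R2
• I2 : MUL R1, R3 R1←R1*R3

Each row lists the stages an instruction occupies in successive time cycles; each instruction enters the pipeline one cycle after the previous one.

Instruction 1   IF   ID   CO   OF   EX   WO
Instruction 2   IF   ID   CO   stall   stall   OF   EX   WO
Instruction 3   IF   ID   CO   OF   EX   WO

Instruction 2 waits two cycles before OF, until Instruction 1 has written R3 in its WO stage.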
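The two stall cycles in the diagram can be derived from the stage positions. The sketch below assumes the six-stage pipeline, no forwarding, and that an operand can be fetched no earlier than the cycle after it has been written; the function name `stalls_needed` is hypothetical.

```python
STAGES = ["IF", "ID", "CO", "OF", "EX", "WO"]


def stalls_needed(producer_start, consumer_start):
    """Stall cycles the consumer needs so that its OF follows the producer's WO.

    Both arguments are the time cycles in which the instructions enter IF.
    """
    write_cycle = producer_start + STAGES.index("WO")
    fetch_cycle = consumer_start + STAGES.index("OF")   # without any stall
    # Assumed rule: the fetch must happen after the write cycle has finished.
    return max(0, write_cycle + 1 - fetch_cycle)


print(stalls_needed(producer_start=1, consumer_start=2))
```

For I1 entering in cycle 1 and I2 in cycle 2, I1 writes R3 in cycle 6 while I2 would normally fetch operands in cycle 5, so two stall cycles push the fetch to cycle 7.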
Data Dependencies

• Consider two instructions, A and B, where A occurs before B.

• True Data Dependency

• Read-After-Write (RAW) Hazard
• Occurs when the value produced by an instruction is required by a subsequent instruction.
• I1 : ADD R3, R2, R1 R3←R2+R1
• I2 : SUB R4, R3, 1 R4←R3-1
• B tries to read a register before A has written it and gets the old value.
• This is common, and forwarding helps to solve it.
Data Dependencies
• Name Dependency
• Anti-Dependency
• Write-After-Read (WAR) Hazard

• Occurs when an instruction writes a location that is read by a previous instruction.
• Instruction i+1 tries to write an operand before it is read by instruction i, so instruction i incorrectly gets the new value.
• I1 : ADD R3, R2, R1 R3←R2+R1
• I2 : SUB R2, R5, 1 R2←R5-1

• WAR (write after read)
– B tries to write a register before A has read it.
– In this case, A uses the new (incorrect) value.
• Output Dependency
• Write-After-Write (WAW) Hazard
• Occurs when a location is written by two instructions.
• Instruction i+1 tries to write an operand before it is written by instruction i, so the writes end up being performed in the wrong order.
• B tries to write an operand before A has written it.
• After instruction B has executed, the value of the register should be B's result, but A's result is stored instead.

• I1 : ADD R3, R2, R1 R3←R2+R1
• I2 : SUB R2, R3, 1 R2←R3-1
• I3 : ADD R3, R2, R5 R3←R2+R5
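The three dependency types can be told apart by comparing the destination and source registers of an earlier and a later instruction. The sketch below reads each instruction as destination first, then sources (so ADD R3, R2, R1 is taken as R3 ← R2 + R1, which is an assumption about the operand order); the `classify` function and the tuple encoding are hypothetical.

```python
def classify(earlier, later):
    """Return the set of hazards the later instruction has with the earlier one.

    Each instruction is a (destination, sources) pair.
    """
    dest_a, sources_a = earlier
    dest_b, sources_b = later
    hazards = set()
    if dest_a in sources_b:
        hazards.add("RAW")      # later reads what earlier writes
    if dest_b in sources_a:
        hazards.add("WAR")      # later writes what earlier reads
    if dest_a == dest_b:
        hazards.add("WAW")      # both write the same location
    return hazards


I1 = ("R3", {"R2", "R1"})       # ADD R3, R2, R1
I2 = ("R2", {"R3"})             # SUB R2, R3, 1
I3 = ("R3", {"R2", "R5"})       # ADD R3, R2, R5

print(sorted(classify(I1, I2)))
print(sorted(classify(I1, I3)))
print(sorted(classify(I2, I3)))
```

For this three-instruction sequence, I1 and I3 both write R3, which is the output dependency; I2 additionally reads R3 from I1 and overwrites R2, which I1 reads.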
