SIMD Array Processor

The document describes two SIMD array processors - ILLIAC-IV and Burroughs Scientific Processor (BSP). ILLIAC-IV consists of multiple processing elements under a single control unit. Each processing element contains an ALU, registers and local memory. Vector instructions are sent to processing elements for distributed execution to achieve spatial parallelism. A masking scheme is used to control the status of processing elements during instruction execution. BSP has fewer processing units than ILLIAC-IV but with all processors having equal access to a common logical address space divided into separate memory modules. Each processing element is an arithmetic unit with input/output registers. The document also discusses various interconnection network topologies used

Uploaded by

Tejodeep Bose


SIMD ARRAY PROCESSORS
Chapter 4

NOTE: Refer to page 325 of the book by Kai Hwang and Briggs.
SIMD ARRAY PROCESSORS

• ILLIAC-IV

• Burroughs Scientific Processor(BSP)

ILLIAC IV
• Consists of multiple synchronized processing elements (PEs) under one control unit
• Each processing element contains an ALU, registers, and local memory
• User programs are loaded into the control unit memory from an external source
• The control unit then decodes each instruction and decides where it should be executed
ILLIAC IV
• Branch instructions are executed directly in the control unit
• Vector instructions are sent to the processing elements for distributed execution, achieving spatial parallelism through duplicated arithmetic units
• Data can be loaded into the processing elements' memories from an external source via the system bus, or through the control unit in broadcast mode
ILLIAC IV
• A masking scheme is used to control the status of each processing element during execution
• A PE may be either activated or deactivated during an instruction cycle
• Only enabled PEs execute the instruction
• Data exchange between PEs is done via an interconnection network that performs data routing and manipulation functions
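The masking scheme can be illustrated with a minimal Python sketch. It is not ILLIAC IV code: the function name `masked_vector_add` and the list-based model of PEs (one list element per PE) are assumptions made only for illustration.

```python
def masked_vector_add(a, b, mask):
    """Model one vector add broadcast to all PEs.

    PE i holds a[i] and b[i] in its local memory. Only PEs whose
    mask bit is 1 (enabled) execute the add; disabled PEs keep a[i].
    """
    return [x + y if m else x for x, y, m in zip(a, b, mask)]


a = [1, 2, 3, 4]
b = [10, 20, 30, 40]
mask = [1, 0, 1, 0]  # PEs 0 and 2 enabled, PEs 1 and 3 disabled
print(masked_vector_add(a, b, mask))  # [11, 2, 33, 4]
```

The same instruction reaches every PE; the mask alone decides which PEs change their state.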
ILLIAC IV
• The interconnection network is also under the supervision of the control unit
• A host computer is interfaced with the array processor through the control unit
• The host computer is a general-purpose machine that serves as the overall manager of the entire system
• Host: front-end machine; manages resources and I/O activity
• The array processor can be considered a back-end attached computer
• Note: each node has local memory
Burroughs Scientific Processor (BSP)
• Effectively a successor to the ILLIAC IV machine
• But with an architecture modified to reflect the fact that the BSP was intended to be a commercial product
• It has fewer processing units than ILLIAC IV
• All processors have equal access to a common logical address space, which is divided into a number of physically separate memory modules
• Each processing element is nothing more than an arithmetic unit with input and output registers; these units are homogeneous and non-pipelined
• Note: all nodes share the same memory, even though the memory is divided into modules
SIMD Interconnection Network

1D: Linear; 2D: Ring, Mesh, Star, Tree; 3D: Hypercube

SIMD Interconnection Network
• Data exchange among the PEs is done via the interconnection network
• The network performs all data routing and manipulation functions
• The architecture of an interconnection network is based on its topology
• Static: the connection pattern is fixed and cannot be reconfigured
• Dynamic: the connection pattern inside the network is not fixed
• Depending on the number of stages, networks are divided into two types
• Single stage
• Multistage

NOTE: Refer to page 334 of the book by Kai Hwang and Briggs.
Single Stage
• Only one stage is used
• A single-stage network is also known as a recirculating network
• Data may recirculate through the single stage many times before reaching its destination; how many times depends on the connections the stage provides
• E.g.: Crossbar network

NOTE: Refer to page 334 of the book by Kai Hwang and Briggs.
Multi Stage
• Consists of many stages of interconnected switches
• Characterized by the switch box and the network connectivity
• The connectivity is controlled by the choice of interstage connection pattern
• A switch box may be in any of the following four states:
1: Straight
2: Exchange
3: Lower Broadcast
4: Upper Broadcast

NOTE: Refer to page 337 of the book by Kai Hwang and Briggs.
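The four switch-box states can be written as a small Python sketch. It assumes a two-input, two-output switch box; the function name `switch_box` and the string labels for the states are hypothetical and chosen only for this illustration.

```python
def switch_box(upper, lower, state):
    """Return (upper_output, lower_output) of a 2x2 switch box."""
    if state == "straight":          # each input goes straight through
        return (upper, lower)
    if state == "exchange":          # the two inputs are swapped
        return (lower, upper)
    if state == "upper_broadcast":   # upper input is copied to both outputs
        return (upper, upper)
    if state == "lower_broadcast":   # lower input is copied to both outputs
        return (lower, lower)
    raise ValueError(f"unknown state: {state}")


for s in ("straight", "exchange", "upper_broadcast", "lower_broadcast"):
    print(s, switch_box("A", "B", s))
```

Straight and exchange are one-to-one settings; the two broadcast settings send a single input to both outputs.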
Mesh-Connected Illiac Network

Number of nodes: N = 16
r = √N = 4 (each node also has four links)
Maximum number of hops ≤ √N − 1 = 3
Routing functions for node i, where 0 ≤ i ≤ N − 1:
R+1(i) = (i + 1) mod N
R−1(i) = (i − 1) mod N
R+r(i) = (i + r) mod N
R−r(i) = (i − r) mod N
Example: node 3 (i = 3) is connected to:
R+1(3) = 3 + 1 = node 4
R−1(3) = 3 − 1 = node 2
R+r(3) = 3 + 4 = node 7
R−r(3) = (3 − 4) mod 16 = (−1) mod 16 = node 15
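The four Illiac routing functions, taken in the form (i ± 1) mod N and (i ± r) mod N, can be checked with a short Python sketch. The function names `illiac_neighbors` and `max_hops` are hypothetical; the breadth-first search is only a way to verify the hop bound for N = 16, not part of the original slides.

```python
from collections import deque
import math


def illiac_neighbors(i, n=16):
    """Nodes reachable from node i in one step: R+1, R-1, R+r, R-r."""
    r = math.isqrt(n)  # r = sqrt(N); N is assumed to be a perfect square
    return [(i + 1) % n, (i - 1) % n, (i + r) % n, (i - r) % n]


def max_hops(n=16):
    """Largest shortest-path length between any two nodes (BFS from each node)."""
    worst = 0
    for src in range(n):
        dist = {src: 0}
        queue = deque([src])
        while queue:
            node = queue.popleft()
            for nxt in illiac_neighbors(node, n):
                if nxt not in dist:
                    dist[nxt] = dist[node] + 1
                    queue.append(nxt)
        worst = max(worst, max(dist.values()))
    return worst


print(illiac_neighbors(3))  # [4, 2, 7, 15]
print(max_hops(16))         # 3, which equals sqrt(16) - 1
```

Python's `%` returns a non-negative result for a positive modulus, so (3 − 4) mod 16 evaluates to 15 without special handling.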
Shuffle-Exchange and Omega Networks:
• The class of shuffle-exchange networks is based on two routing functions: shuffle (S) and exchange (E).
• Two types of shuffle: perfect shuffle and inverse shuffle
• For the perfect shuffle:
• Let A = (a_{n−1}, a_{n−2}, …, a_1, a_0) be a PE address in binary.
• S(a_{n−1}, a_{n−2}, …, a_1, a_0) = (a_{n−2}, …, a_1, a_0, a_{n−1})
• N = number of PEs
• n = log2 N
• S cyclically shifts the bits of A one position to the left.
NOTE: Refer to page 350 of the book by Kai Hwang and Briggs.
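The perfect shuffle is a one-bit left rotation of the n-bit PE address, which a few lines of Python can show. The function name `perfect_shuffle` and the choice N = 8 (n = 3) are assumptions for illustration.

```python
def perfect_shuffle(addr, n_bits):
    """Rotate the n_bits-wide address left by one bit position."""
    mask = (1 << n_bits) - 1
    return ((addr << 1) | (addr >> (n_bits - 1))) & mask


# N = 8 PEs, so n = log2(8) = 3 address bits
print([perfect_shuffle(a, 3) for a in range(8)])  # [0, 2, 4, 6, 1, 3, 5, 7]
```

For example, address 100 (4) maps to 001 (1): the most significant bit wraps around to the least significant position.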
PERFECT SHUFFLE

NOTE: Refer to page 351 of the book by Kai Hwang and Briggs.

Shuffle-Exchange and Omega Networks:
• Inverse shuffle
• Let A = (a_{n−1}, a_{n−2}, …, a_1, a_0) be a PE address in binary.
• S⁻¹(a_{n−1}, a_{n−2}, …, a_1, a_0) = (a_0, a_{n−1}, a_{n−2}, …, a_1)
• N = number of PEs
• n = log2 N
• S⁻¹ cyclically shifts the bits of A one position to the right.

NOTE: Refer to page 350 of the book by Kai Hwang and Briggs.
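The inverse shuffle is the matching one-bit right rotation. The sketch below defines both rotations so that it is self-contained and can confirm that one undoes the other; the function names and the choice N = 8 are assumptions for illustration.

```python
def perfect_shuffle(addr, n_bits):
    """Rotate the n_bits-wide address left by one bit position."""
    mask = (1 << n_bits) - 1
    return ((addr << 1) | (addr >> (n_bits - 1))) & mask


def inverse_shuffle(addr, n_bits):
    """Rotate the n_bits-wide address right by one bit position."""
    return (addr >> 1) | ((addr & 1) << (n_bits - 1))


# N = 8 PEs, n = 3 address bits
print([inverse_shuffle(a, 3) for a in range(8)])  # [0, 4, 1, 5, 2, 6, 3, 7]

# Applying the shuffle and then its inverse returns every address unchanged
print(all(inverse_shuffle(perfect_shuffle(a, 3), 3) == a for a in range(8)))  # True
```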

Inverse Shuffle

NOTE: Refer to page 351 of the book by Kai Hwang and Briggs.

OMEGA NETWORK (by using perfect shuffle)

NOTE: Refer to the video.

BLOCKING STATE
• If the I/O ports are on the same side, the network is called a one-sided network, also known as a full switch
• A two-sided multistage network has an input side and an output side
• Two-sided networks can be further classified as blocking or non-blocking
• If simultaneous connections of more than one I/O pair may result in conflicts in the use of switches or links, the multistage network is known as a blocking network. E.g.: Omega network
• If the network can perform all possible connections between sources (inputs) and destinations (outputs) by rearranging its existing connections, it is known as a rearrangeable non-blocking network. E.g.: Benes network
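Blocking in an omega network can be demonstrated with a small Python sketch. It assumes an 8-input omega network built from three perfect-shuffle stages of 2x2 switches, and it assumes destination-tag routing (at each stage the switch picks its upper or lower output from one bit of the destination address, most significant bit first); that routing rule is a standard technique and is not stated in the slides. The names `omega_path` and `has_conflict` are hypothetical.

```python
def omega_path(src, dst, n_bits):
    """Output link used after each stage when routing src -> dst."""
    mask = (1 << n_bits) - 1
    pos = src
    path = []
    for stage in range(n_bits):
        # interstage pattern: perfect shuffle (one-bit left rotation)
        pos = ((pos << 1) | (pos >> (n_bits - 1))) & mask
        # switch setting: destination bit selects upper (0) or lower (1) output
        bit = (dst >> (n_bits - 1 - stage)) & 1
        pos = (pos & ~1) | bit
        path.append(pos)
    return path


def has_conflict(pairs, n_bits):
    """True if two connections need the same output link at some stage."""
    paths = [omega_path(s, d, n_bits) for s, d in pairs]
    for stage in range(n_bits):
        links = [p[stage] for p in paths]
        if len(set(links)) < len(links):
            return True
    return False


print(omega_path(4, 1, 3))                            # [0, 0, 1]
print(has_conflict([(0, 0), (4, 1)], 3))              # True: both need link 0 after stage 1
print(has_conflict([(i, i) for i in range(8)], 3))    # False: identity routes without conflict
```

The pair (0 to 0, 4 to 1) has distinct sources and distinct destinations, yet both connections enter the same first-stage switch and request its upper output, which is exactly the kind of conflict that makes the network blocking.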
Cube Interconnection Network
• A cube network can be implemented as either a recirculating network or a multistage network for SIMD machines.
• A single three-dimensional cube connects 8 PEs.
• Connecting 16 nodes requires two such cubes joined at corresponding vertices (a 4-cube), and so on.
• Representing 8 PE addresses requires 3 bits.
• Interconnections follow these rules:
• Vertical lines connect vertices (PEs) whose addresses differ in the most significant bit.
• Horizontal lines connect vertices whose addresses differ in the least significant bit.
• Vertices at both ends of a diagonal line differ in the middle bit position.

(Refer to page 343.)
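The rule that cube neighbors differ in exactly one address bit can be sketched in Python. The function names `cube_neighbors` and `cube_route` are hypothetical, and correcting differing bits from the least significant bit upward is just one possible routing order chosen for this illustration.

```python
def cube_neighbors(addr, n_bits):
    """Neighbors of a PE in an n-cube: flip one address bit at a time."""
    return [addr ^ (1 << i) for i in range(n_bits)]


def cube_route(src, dst, n_bits):
    """Path from src to dst, fixing differing bits from bit 0 upward."""
    path = [src]
    node = src
    for i in range(n_bits):
        if (node ^ dst) & (1 << i):  # bit i still differs from the destination
            node ^= 1 << i           # move along dimension i
            path.append(node)
    return path


# 3-cube with 8 PEs
print(cube_neighbors(0b000, 3))  # [1, 2, 4]
print([format(p, "03b") for p in cube_route(0b001, 0b110, 3)])
# ['001', '000', '010', '110']
```

The number of hops equals the number of bit positions in which the source and destination addresses differ, so it is at most n for an n-cube.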

Cube Interconnection Network
Assignment:
• Design a 4-cube network for an array processor consisting of 16
Processing Elements (PEs). Trace the path to route a packet of data
from the node 0110 to 1101.
• Barrel Shifter Network (refer page no:345)
• With the aid of an example explain the ‘Masking’ and ‘Data Routing’
mechanism in an SIMD Array processor.
Thank You
