Introduction to Multiprocessor Concepts
Introduction
Why parallel processing?
-to meet the increasing demand for higher performance, lower cost, and sustained
productivity in real-life applications
How is it achieved?
-multiprogramming, multiprocessing or multicomputing
-Parallelism can appear in various forms: lookahead, pipelining,
vectorization, concurrency, simultaneity, data parallelism, partitioning,
interleaving, overlapping, multiplicity, replication, time sharing,
multithreading, and distributed computing, at various processing levels.
Multiprocessors and Multicomputers
Parallel computers can be modeled as physical machines that either share a
common memory or use unshared, distributed memories
Shared-Memory Multiprocessors
Three main shared-memory multiprocessor models:
-Uniform memory access (UMA)
-Nonuniform memory access (NUMA)
-Cache-only memory architecture (COMA)
They differ in how the memory and peripheral resources are shared or
distributed
The UMA model
Physical memory is uniformly shared by all the processors
All processors have equal access time to all memory words
Peripherals are usually shared as well, though each processor may have a
private cache
Multiprocessors are called tightly coupled systems because of the high degree
of resource sharing
The system interconnect can be a common bus, a crossbar switch, or a
multistage network
UMA machines are suitable for general-purpose and time-sharing applications
by multiple users
Cont..
They are also used to speed up the execution of large time-critical programs
Parallel events are coordinated through shared variables in the common
memory (a minimal sketch follows after this list)
Symmetric multiprocessor: all processors have equal access to all peripheral
devices and are equally capable of running the OS kernel and I/O service
routines
Asymmetric multiprocessor: only one or a subset of processors are
executive-capable (master processors)
-The master processor executes the OS and handles I/O
-The other processors have no I/O capability and are called attached
processors (APs)
-APs execute user code under the supervision of the master processor
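The idea of coordinating parallel events through shared variables can be illustrated with a small POSIX-threads sketch; this is an assumed example rather than anything from the slides, but it shows two threads on a shared-memory (UMA-style) machine updating a variable that lives in the common memory, with a mutex serializing the updates.

    #include <pthread.h>
    #include <stdint.h>
    #include <stdio.h>

    static long shared_sum = 0;                          /* shared variable in common memory */
    static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;

    static void *worker(void *arg) {
        long my_part = (long)(intptr_t)arg;              /* this thread's contribution */
        pthread_mutex_lock(&lock);                       /* serialize access to shared_sum */
        shared_sum += my_part;
        pthread_mutex_unlock(&lock);
        return NULL;
    }

    int main(void) {
        pthread_t t1, t2;
        pthread_create(&t1, NULL, worker, (void *)(intptr_t)10);
        pthread_create(&t2, NULL, worker, (void *)(intptr_t)32);
        pthread_join(t1, NULL);
        pthread_join(t2, NULL);
        printf("shared_sum = %ld\n", shared_sum);        /* prints 42 */
        return 0;
    }

Both threads see the same physical shared_sum; on a UMA machine either thread reaches it in the same access time.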
Cont..
Approximate performance of a uniprocessor
Arrays A(I), B(I), and C(I) are each assumed to have N elements
Statements L2, L4, and L6 are assumed to take 1 machine cycle each; the time
for L1, L3, L5, and L7 is ignored
K cycles are needed for each interprocessor communication operation through
the shared memory
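The program fragment that the labels L1-L7 refer to is not reproduced on the slide; the sketch below is a plausible reconstruction in C (the array names come from the slide, the loop structure and everything else are assumed). L2 and L6 each execute N times at one cycle per execution, which is where the 2N-cycle uniprocessor estimate below comes from.

    #include <stdio.h>
    #define N 1024

    double A[N], B[N], C[N];

    int main(void) {
        double sum;
        int i, j;
        /* L1 */ for (i = 0; i < N; i++) {
        /* L2 */     A[i] = B[i] + C[i];      /* executed N times, 1 cycle each */
        /* L3 */ }
        /* L4 */ sum = 0.0;
        /* L5 */ for (j = 0; j < N; j++) {
        /* L6 */     sum = sum + A[j];        /* executed N times, 1 cycle each */
        /* L7 */ }
        printf("sum = %f\n", sum);
        return 0;
    }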
Cont..
Ignoring bus contention and memory-access conflicts:
The uniprocessor takes
-about 2N cycles (N for the I loop and N for the J loop)
A multiprocessor with M processors:
-the loop is partitioned into M sections, with L = N/M elements per section
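One way to complete the multiprocessor estimate, under the additional assumption (not stated on the slide) that the M partial sums are merged in a binary tree, each merge step costing K cycles of shared-memory communication plus one cycle for the addition:
-each processor spends L = N/M cycles on its slice of the I loop and another L cycles forming its partial sum, i.e. 2N/M cycles in total
-merging the M partial sums takes log2(M) steps, roughly (K + 1) * log2(M) cycles
-so the multiprocessor needs about 2N/M + (K + 1) * log2(M) cycles, and the speedup over the uniprocessor's 2N cycles is roughly 2N / (2N/M + (K + 1) * log2(M))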
NUMA model
A NUMA machine is a shared-memory system in which the access time varies with
the location of the memory word
The shared memory is physically distributed among the processors as local
memories
The collection of all local memories forms a global address space accessible
by all processors
Cont..
COMA
A special case of the NUMA model in which the distributed main memories are
converted into caches
There is no memory hierarchy at each processor node
All the caches together form the global address space
Remote cache access is assisted by distributed cache directories
Cont..
Multiprocessor systems are suitable for general-purpose multiuser
applications where programmability is the major concern
The main shortcoming of multiprocessors is their limited scalability
It is difficult to build a large machine around a centralized shared memory,
and latency tolerance for remote memory access is a further limitation
Hence distributed-memory multicomputers can be used instead: they scale
better but offer less programmability
Distributed memory computers
A distributed-memory multicomputer consists of multiple computers (nodes)
interconnected by a message-passing network
Each node is an independent computer with a processor and local memory;
disks and I/O peripherals are optional
The message-passing network provides point-to-point static connections
among the nodes
Since a processor can access only its private local memory, multicomputers
are called no-remote-memory-access (NORMA) machines
Internode communication is carried out by passing messages over the static
connection network (a minimal sketch follows below)
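A minimal message-passing sketch, using MPI as an assumed example (the slides do not name a particular message-passing library): node 0 sends an integer to node 1; each node touches only its own local memory, and all data exchange happens through explicit point-to-point messages.

    #include <mpi.h>
    #include <stdio.h>

    int main(int argc, char **argv) {
        int rank, value = 0;
        MPI_Init(&argc, &argv);
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);            /* which node am I? */
        if (rank == 0) {
            value = 42;
            /* point-to-point send from node 0 to node 1 */
            MPI_Send(&value, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
        } else if (rank == 1) {
            /* node 1 receives the message into its own local memory */
            MPI_Recv(&value, 1, MPI_INT, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
            printf("node 1 received %d from node 0\n", value);
        }
        MPI_Finalize();
        return 0;
    }

Run with at least two processes (e.g. mpirun -np 2); there is no shared address space, so the value can move between nodes only inside a message.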
Cont..
Conditions for Parallelism
To exploit parallelism in computing, the following key areas have been
identified:
-Computation models for parallel computing
-Interprocessor communication in parallel architectures
-System integration
Data dependencies
Executing several program segments in parallel requires each segment to be
independent of the others
Dependence graphs are used to describe the dependence relations among
statements
The nodes of a dependence graph correspond to instructions, and the directed
edges show the ordering relations between them
Analysis of the dependence graph shows where opportunities for
parallelization and vectorization exist
Data Dependence
Types of data dependence: flow dependence, antidependence, output
dependence, I/O dependence, and unknown dependence (the first three are
illustrated in the sketch below)
Data dependence in programs
Dependence is a partial ordering relation, i.e., not every pair of statements
is related
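A small illustrative fragment (an assumed example, not taken from the slides) showing the first three dependence types among ordinary assignment statements:

    #include <stdio.h>

    int main(void) {
        int a, b = 2, c = 3, d, e = 7;
        a = b + c;   /* S1 */
        d = a * 2;   /* S2: flow dependence on S1 (S2 reads a, which S1 writes)  */
        b = d - 1;   /* S3: antidependence on S1 (S3 writes b, which S1 reads)   */
        a = e + 5;   /* S4: output dependence on S1 (S1 and S4 both write a)     */
        printf("%d %d %d %d %d\n", a, b, c, d, e);
        return 0;
    }

Reordering any of these dependent pairs would change the values produced, which is exactly what the dependence relation forbids.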
I/O dependence arises from I/O operations
In the referenced example, statements S1 and S3 are I/O dependent because
both access tape unit 4
Hence data dependence relations must not be violated arbitrarily during
program execution, or erroneous results may be produced
In a uniprocessor system, repeated runs yield the same results because the
order of execution is fixed
Cont..
In a multiprocessor system, the program order may or may not be preserved,
depending on the memory model used
Determinism is obtained by
-programmer control
-constrained modification of writable data in shared memory
Control Dependencies
Control dependence arises when the order of execution of statements cannot
be determined before run time
The different paths taken after a conditional branch may introduce or
eliminate data dependences among instructions
Dependence may also exist between operations performed in successive
iterations of a loop
Cont..
In the referenced example, the successive iterations of the loop are
control-independent
Cont..
Control dependence example (see the sketch below)
Control dependence often restricts parallelism
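An illustrative pair of loops (an assumed example; the slide's own example is not reproduced here): in the first loop the branch in each iteration depends only on that iteration's data, so the iterations are control-independent; in the second loop the branch depends on a value the previous iteration may have written, which restricts parallel execution.

    #include <stdio.h>
    #define N 8

    int main(void) {
        int a[N] = {3, -1, 4, -1, 5, -9, 2, 6};
        int i;
        /* control-independent iterations: they can run in parallel */
        for (i = 0; i < N; i++) {
            if (a[i] < 0)
                a[i] = 0;
        }
        /* control-dependent iterations: iteration i tests a[i-1],
           which iteration i-1 may have just written */
        for (i = 1; i < N; i++) {
            if (a[i - 1] < 0)
                a[i] = 0;
        }
        for (i = 0; i < N; i++)
            printf("%d ", a[i]);
        printf("\n");
        return 0;
    }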
Resource Dependence
Resource dependence is concerned with conflicts in using shared resources
such as integer units, floating-point units, registers, memory areas, and
the ALU
Bernstein’s Conditions
A process is a software entity corresponding to the abstraction of a program
fragment defined at various processing levels
Ii: the set of all input variables needed to execute Pi (operands fetched
from memory or registers)
Oi: the set of all output variables generated by executing Pi (results to be
stored in registers or memory locations)
If P1 and P2 have no input/output dependences on each other, they can
execute in parallel, written P1 || P2; this is established by Bernstein's
conditions, i.e., flow independence, anti-independence, and output
independence
Cont..
P1 || P2 if and only if I1 ∩ O2 = ∅, I2 ∩ O1 = ∅, and O1 ∩ O2 = ∅
(each process's input set is disjoint from the other's output set, and the
two output sets are disjoint)
Example
Consider five processes P1, P2, P3, P4, P5 in program order
Assume each statement takes one step and no pipelining is used
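The slide's five statements are not shown, so the sketch below uses five hypothetical processes; it encodes each process's input and output sets as bitmasks over the variables a..g and checks Bernstein's conditions for every pair.

    #include <stdio.h>

    typedef struct { unsigned in, out; } Proc;   /* input/output sets as bitmasks */

    static int parallel(Proc p, Proc q) {
        return (p.in  & q.out) == 0 &&           /* q writes nothing that p reads */
               (q.in  & p.out) == 0 &&           /* p writes nothing that q reads */
               (p.out & q.out) == 0;             /* no common output variable     */
    }

    int main(void) {
        enum { a = 1 << 0, b = 1 << 1, c = 1 << 2, d = 1 << 3,
               e = 1 << 4, f = 1 << 5, g = 1 << 6 };
        Proc P[5] = {
            { b | c, a },   /* P1: a = b + c */
            { a,     d },   /* P2: d = a * 2 */
            { f | g, e },   /* P3: e = f - g */
            { d | e, b },   /* P4: b = d + e */
            { c,     f },   /* P5: f = c - 1 */
        };
        for (int i = 0; i < 5; i++)
            for (int j = i + 1; j < 5; j++)
                if (parallel(P[i], P[j]))
                    printf("P%d || P%d\n", i + 1, j + 1);
        return 0;
    }

Here P1 || P3, P1 || P5, P2 || P3, P2 || P5, and P4 || P5, while a pair such as P1 and P2 is not parallel because P2 reads the variable a that P1 writes.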
Cont..
The parallelism relation || is commutative but not transitive (Pi || Pj and
Pj || Pk do not imply Pi || Pk), so it is not an equivalence relation; it is,
however, associative
Detection of parallelism
Cont..
In general, a set of n processes can execute in parallel, P1 || P2 || ... || Pn,
if and only if Pi || Pj holds for every pair i ≠ j
A violation of Bernstein's conditions may prohibit parallelism for the whole
set or only for particular pairs
Hardware and Software Parallelism
Hardware parallelism is defined by the machine architecture and hardware
multiplicity
It is often a function of cost and performance trade-offs
It indicates the peak performance of the processor resources
Cont..
It can be characterized by the number of instructions issued per machine
cycle: a processor that issues k instructions per machine cycle is called a
k-issue processor
A conventional processor may take one or two machine cycles to issue a
single instruction
One Intel processor variant is a 3-issue processor, issuing, for example, one
arithmetic, one memory-access, and one branch instruction per machine cycle
An IBM processor variant issues 4: one arithmetic, one memory-access, one
floating-point, and one branch instruction per cycle
A multiprocessor system built with n k-issue processors should be able to
handle a maximum of nk threads of instructions simultaneously
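For example, a system of four 3-issue processors (n = 4, k = 3) could issue at
most 4 × 3 = 12 instructions in a single machine cycle.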
Software Parallelism
Software parallelism is defined by the control and data dependences in
programs
It is a function of the algorithm, the programming style, and compiler
optimization
The parallelism in a program varies during execution
It often limits the sustained performance of the processor
Mismatch between software parallelism and hardware parallelism