COA UNIT 5
Parallelism is a concept where multiple tasks or operations are executed at the same time
to increase the speed and efficiency of computing. This is done by using multiple processors
or parts of the CPU (like ALUs, or Arithmetic Logic Units) to handle different tasks
simultaneously. Let’s break it down in simple terms and go over each aspect of parallelism
mentioned:
What is Parallelism?
      Basic Definition: When two or more operations are performed simultaneously, it's
       called parallelism.
      Goal: The main aim is to make computing faster by handling multiple tasks at once,
       rather than one at a time.
      Parallel Computers: These are systems with multiple processors that work together
       to solve a big problem, effectively speeding up the overall process.
Goals of Parallelism
   1. Faster Computation: Parallelism reduces the time it takes to solve complex problems
      by splitting tasks across processors.
   2. Increased Throughput: More processing is done within the same time frame, making
      systems more efficient.
   3. Better Performance: By performing multiple operations at once, computers can
      achieve more with the same clock speed.
   4. Solving Bigger Problems: Parallelism allows systems to handle tasks that would be
      too large or slow for a single CPU.
Real-World Applications of Parallelism
Parallelism is widely used in applications that require large amounts of computation or data
processing. Examples include:
      Weather Forecasting: Complex models run faster using multiple processors.
      Socio-Economic Models: Parallel computing helps handle data for large populations.
      Finite Element Analysis: Engineering simulations can use parallelism for quicker and
       more accurate results.
      Artificial Intelligence: AI tasks, like image processing, rely on parallelism to handle
       complex calculations.
      Genetic Engineering: Processing genetic data requires a lot of computation, which
       parallelism can handle effectively.
      Defense and Medical Applications: Parallelism enables complex simulations and
       large-scale data analysis.
Types of Parallelism
Parallelism can be achieved through hardware or software.
1. Hardware Parallelism
      Objective: To increase the speed of processing by designing computers with multiple
       processors, cores, or threads.
      Processor Parallelism: Multiple CPUs, cores, or threads work together. For example,
       a multi-core processor can run different parts of a program on each core.
      Memory Parallelism: Shared or distributed memory configurations allow different
       processors to access data simultaneously. This structure is helpful for handling large,
       complex tasks.
      Pipelining: Overlapping instruction execution, where one instruction starts
       before the previous one finishes, so several instructions are in different stages at
       once (a small timing sketch follows this list).
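A rough way to see the benefit of pipelining (an idealized model that ignores stalls and
hazards; the stage and instruction counts are illustrative assumptions): with k pipeline
stages and n instructions, a non-pipelined processor needs about n * k stage-times, while
a pipelined one needs only about k + (n - 1). A minimal Python sketch of that arithmetic:

def pipeline_cycles(n_instructions, n_stages):
    # Non-pipelined: each instruction passes through every stage before the next begins.
    sequential = n_instructions * n_stages
    # Pipelined (idealized): once the pipeline is full, one instruction completes per stage-time.
    pipelined = n_stages + (n_instructions - 1)
    return sequential, pipelined

print(pipeline_cycles(n_instructions=100, n_stages=5))   # (500, 104)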
2. Software Parallelism
      Definition: Software parallelism depends on how a program is written, including how
       instructions are ordered and how data flows within the program.
      Control and Data Dependence: Programs are analyzed to see which parts can run
       independently (parallel) and which depend on other steps.
      Program Flow Graph: This graph shows which operations can be done
       simultaneously and which need to wait, helping identify the degree of parallelism in
       the software.
      Variable Parallelism: As a program runs, the level of parallelism can change
       depending on the tasks being executed.
Hardware Parallelism Details
      Instruction Issuing: A processor can issue multiple instructions per cycle (e.g., a 2-
       issue or 3-issue processor), which makes it capable of parallel processing.
      Multi-Issue Processors: If a system has multiple processors, each issuing multiple
       instructions per cycle, it can handle more tasks at once; for example, four 2-issue
       processors can together issue up to eight instructions per cycle, improving
       throughput and performance.
Software Parallelism Details
      Program Structure: How a program is structured affects its parallelism. For instance,
       a well-optimized program can have many instructions that can be executed
       simultaneously.
   Execution Variation: Software parallelism isn't always consistent—some parts of a
    program may run in parallel, while others must run sequentially, depending on
    dependencies.
Software parallelism is all about running different parts of a program simultaneously, and it
can be achieved at various levels. Each level of parallelism has a distinct approach and
applications, so let's explore the different types in detail.
1. Instruction Level Parallelism (ILP)
      What It Is: ILP is the degree to which individual instructions within a single program
       can be executed in parallel. It depends on finding and running independent
       instructions that do not rely on each other’s outcomes.
      How It Works: The processor and compiler work together to identify instructions
       that can run simultaneously or be reordered for more efficient execution.
      Example:
           o   Consider three instructions:
                   1. x = a + b
                   2. y = c - d
                   3. z = x * y
            o   Instructions 1 and 2 don’t depend on each other, so they can run
                simultaneously. However, instruction 3 depends on the results of 1 and 2, so
                it can only start once they’re finished (see the sketch after this list).
      Benefits: By overlapping the execution of independent instructions, ILP increases the
       speed of execution within a single processor. A high ILP indicates efficient use of the
       CPU, achieving better throughput at the same clock rate.
      Superscalar Architecture: Some modern processors, known as superscalar
       processors, implement ILP by issuing multiple instructions per clock cycle, allowing
       for faster processing within the same CPU.
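The dependency pattern in the example above can be imitated in software. This is only an
illustration (real ILP is exploited by the hardware and the compiler, not written by the
programmer): instructions 1 and 2 run concurrently, and instruction 3 waits for both results.

from concurrent.futures import ThreadPoolExecutor

a, b, c, d = 2, 3, 10, 4

with ThreadPoolExecutor() as pool:
    fx = pool.submit(lambda: a + b)    # instruction 1: x = a + b
    fy = pool.submit(lambda: c - d)    # instruction 2: y = c - d (independent of 1)
    x, y = fx.result(), fy.result()    # both must finish before instruction 3
    z = x * y                          # instruction 3: depends on x and y
print(z)                               # 30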
2. Data Level Parallelism (DLP)
      What It Is: DLP occurs when the same operation is applied to multiple data points
       simultaneously, particularly in parallel computing environments.
      How It Works: Data is distributed across different processing units or nodes, each
       performing the same operation on its portion of the data.
      Example:
           o   Suppose we want to add all elements in an array. In sequential execution, it
               would take n * Ta time units for an array of n elements, where Ta is the time
               for one addition.
            o   In a data-parallel setup with four processors, the time taken would reduce to
                (n/4) * Ta plus a small merging overhead (see the sketch after this list).
           o   Another example is matrix multiplication, where each processor can handle
               different parts of the matrix, significantly reducing computation time.
      Locality of Data: DLP's performance is affected by data locality, which refers to how
       data is accessed and managed in memory. When data is close in memory, it’s faster
       to access, especially with cache usage.
      Applications: Common in tasks that handle large data sets, such as scientific
       computing, image processing, and machine learning.
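A rough sketch of the array-sum example above (a demonstration, not a benchmark; the
four-way split and the multiprocessing Pool are assumptions made here for illustration):
each worker applies the same operation to its chunk, and the partial results are merged.

from multiprocessing import Pool

def partial_sum(chunk):
    # Every worker performs the same operation (addition) on its share of the data.
    return sum(chunk)

if __name__ == "__main__":
    data = list(range(1_000_000))
    n_workers = 4
    size = len(data) // n_workers
    chunks = [data[i * size:(i + 1) * size] for i in range(n_workers)]
    chunks[-1].extend(data[n_workers * size:])       # keep any leftover elements
    with Pool(n_workers) as pool:
        total = sum(pool.map(partial_sum, chunks))   # small merging overhead
    print(total == sum(data))                        # True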
3. Task Level Parallelism (TLP)
      What It Is: TLP divides a program into distinct tasks or functions that can run
       independently on separate processors.
      How It Works: Tasks are assigned to different processors or cores, and each
       processor handles its task independently, possibly with varying execution times.
      Example:
            o   In a web application, one task could be managing user input while another
                task processes data, and yet another communicates with the database (a
                threading sketch follows this list).
      Advantages: TLP allows for high concurrency and is well-suited for multi-core
       processors. Unlike ILP, which focuses on parallelizing instructions, TLP focuses on
       parallelizing entire tasks or threads.
      Applications: Useful in systems where different program parts can be divided into
       independent tasks, like operating systems, web servers, and real-time systems.
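A minimal sketch of the web-application idea above, using Python threads (the task names
and sleep times are made up for illustration): each task runs independently on its own thread.

import threading
import time

def handle_input():
    time.sleep(0.10)               # pretend to read user input
    print("input handled")

def process_data():
    time.sleep(0.20)               # pretend to crunch some data
    print("data processed")

def query_database():
    time.sleep(0.15)               # pretend to talk to the database
    print("database queried")

threads = [threading.Thread(target=t) for t in (handle_input, process_data, query_database)]
for t in threads:
    t.start()                      # the three tasks run concurrently
for t in threads:
    t.join()                       # wait for all of them to finish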
4. Transaction Level Parallelism
      What It Is: This parallelism type is specifically used in database systems or
       transactional environments where multiple transactions (distinct sets of operations)
       can run simultaneously.
      How It Works: Different transactions are processed in parallel, often without
       interference, as long as they don’t conflict over data.
      Example:
           o   In a bank database, one transaction might be updating a user’s balance, while
               another transaction checks the account status of a different user. These can
               happen simultaneously, as they don’t interfere with each other.
      Concurrency Control: Transaction-level parallelism often requires concurrency-control
       mechanisms to handle conflicts, such as two transactions trying to modify the same
       data simultaneously (a toy sketch follows this list).
      Applications: Primarily used in databases, financial systems, and any application
       where multiple independent transactions need to run concurrently.
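A toy sketch of the banking example (the accounts and the per-account lock are illustrative
assumptions, not how a real database engine works): two transactions that touch different
accounts proceed in parallel, while the lock serializes any two that touch the same account.

import threading

accounts = {"alice": 100, "bob": 50}
locks = {name: threading.Lock() for name in accounts}

def deposit(name, amount):
    with locks[name]:              # concurrency control: one writer per account at a time
        accounts[name] += amount

def check_balance(name):
    with locks[name]:
        print(name, "balance:", accounts[name])

t1 = threading.Thread(target=deposit, args=("alice", 25))    # transaction 1
t2 = threading.Thread(target=check_balance, args=("bob",))   # transaction 2, no conflict
t1.start(); t2.start()
t1.join(); t2.join()
print(accounts)                    # {'alice': 125, 'bob': 50}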
Flynn’s Classification of Parallelism
Flynn’s Classification is a way to categorize computer systems based on how they handle
instructions (commands) and data. It helps us understand the different ways computers can
work on multiple tasks or sets of data simultaneously. There are four types in this
classification: SISD, SIMD, MISD, and MIMD.
1. Single Instruction, Single Data (SISD)
      What It Is: The computer can process only one instruction on one set of data at a
       time.
      How It Works: Every step is performed one after another, like following a simple
       recipe with no shortcuts.
      Example: Traditional, single-core computers that perform one task at a time without
       parallelism.
This type of system is straightforward but slower for tasks that could benefit from parallel
processing.
2. Single Instruction, Multiple Data (SIMD)
      What It Is: The computer performs one instruction but applies it to multiple sets of
       data at the same time.
      How It Works: Think of it like a teacher (instruction) giving the same direction to a
       group of students (data points) all at once, with each student performing the action
       individually.
      Example: Graphics Processing Units (GPUs) use SIMD to process many pixels in
       parallel, making it great for tasks like image and video processing where the same
       operation needs to be repeated across lots of data.
SIMD is efficient when you need to perform the same operation on a large amount of similar
data.
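A loose software analogy (NumPy is used here purely as an illustration; real SIMD happens
in hardware vector units): one instruction, an add, is applied to many data elements at once.

import numpy as np

pixels = np.array([10, 20, 30, 40, 50], dtype=np.uint8)
brighter = pixels + 25             # one "instruction" applied to every element
print(brighter)                    # [35 45 55 65 75]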
3. Multiple Instruction, Single Data (MISD)
      What It Is: The computer can perform multiple instructions on the same set of data
       at the same time.
      How It Works: Imagine several doctors (instructions) looking at the same patient
       (data) to give different treatments or evaluations.
      Example: This setup is rare in practice but can be used in fault-tolerant systems,
       where multiple processors analyze the same data to detect errors.
MISD is uncommon but can be helpful in specialized systems that need to ensure data
accuracy or reliability.
4. Multiple Instruction, Multiple Data (MIMD)
      What It Is: The computer can perform different instructions on different sets of data
       at the same time.
      How It Works: This is like a team of chefs in a restaurant, each preparing different
       dishes using different ingredients at the same time.
      Example: Most modern multi-core computers are MIMD, where each core can work
       on its own task with its own data, making it ideal for running multiple programs or
       complex tasks that can be split into smaller parts.
MIMD systems are very powerful and flexible, allowing for true parallelism by handling
various tasks at once across multiple processors.
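A small sketch of the MIMD idea (the two functions and their inputs are made-up examples):
separate processes execute different instruction streams on different data at the same time.

from concurrent.futures import ProcessPoolExecutor

def total(numbers):                # one instruction stream: arithmetic on numbers
    return sum(numbers)

def longest(words):                # a different instruction stream: string handling
    return max(words, key=len)

if __name__ == "__main__":
    with ProcessPoolExecutor() as pool:
        f1 = pool.submit(total, [1, 2, 3, 4])            # one core, its own data
        f2 = pool.submit(longest, ["arm", "parallel"])   # another core, different data
        print(f1.result(), f2.result())                  # 10 parallel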
ARM PROCESSOR
The ARM (Advanced RISC Machine) processor is a type of CPU known for its energy
efficiency, simplicity, and versatility. ARM processors are widely used in mobile devices,
embedded systems, and increasingly in servers and laptops. Here are the main features of
ARM processors:
1. RISC Architecture (Reduced Instruction Set Computer)
      ARM processors use a simplified set of instructions compared to CISC (Complex
       Instruction Set Computer) architectures, such as x86.
      The reduced instruction set leads to faster processing and lower power
       consumption.
      This simplicity makes ARM processors ideal for battery-powered devices.
2. Energy Efficiency
      ARM CPUs are designed to be power-efficient, which is why they’re widely used in
       smartphones, tablets, and IoT devices.
      Their low power consumption allows for longer battery life, making them suitable
       for mobile and embedded applications.
3. 32-bit and 64-bit Variants
      ARM processors come in both 32-bit and 64-bit versions, allowing for flexibility
       depending on the application needs.
      The 64-bit ARM processors can handle larger data sizes and address more memory,
       which is essential for modern computing requirements.
4. Multiple Cores and High Scalability
      ARM processors are available in single-core to multi-core configurations, allowing
       for a range of performance needs.
      They can scale from simple devices like microcontrollers to powerful, multi-core
       processors for servers and desktops.
5. Thumb and Thumb-2 Instruction Set
      The ARM processor includes a Thumb instruction set, which provides 16-bit
       encoding for frequently used instructions, reducing memory usage.
      Thumb-2 combines 16-bit and 32-bit instructions, allowing for greater code density
       and efficiency, especially in memory-constrained environments.
6. Floating Point and SIMD Support
      ARM processors often have a Floating Point Unit (FPU) for mathematical operations,
       beneficial for media and scientific applications.
      Some ARM CPUs also support SIMD (Single Instruction, Multiple Data) operations,
       enhancing performance for data-intensive tasks like graphics processing.
7. ARMv8-A and ARMv9 Architecture Enhancements
      ARMv8-A introduced 64-bit processing, improved cryptographic extensions, and
       enhanced virtualization support.
      ARMv9 builds on ARMv8 with enhanced performance, improved security
       (Confidential Compute Architecture), and enhanced machine learning capabilities.
8. Security Features (TrustZone)
      ARM TrustZone technology provides a secure environment within the processor to
       run trusted code, separating it from the main operating system.
      This secure environment is essential for secure transactions, DRM, and other privacy-
       sensitive applications.
9. Vector Processing (NEON Technology)
      Many ARM processors feature NEON, a technology designed for accelerated
       multimedia and signal processing tasks.
      NEON supports parallel processing of data, ideal for image processing, video
       encoding, and audio applications.
10. Virtualization Support
      ARM processors, especially in the ARMv8 and newer architectures, include
       virtualization support.
      This allows for running multiple operating systems on a single processor, useful in
       server environments and development.