UNIT-1
Theory of parallelism
Q1. Give a brief note on parallel computers and its architectural model
Ans. Parallel computers are systems that can process multiple tasks simultaneously by
dividing a computational problem into smaller sub-problems and solving them
concurrently.
This approach significantly improves performance and efficiency, especially for tasks
involving large-scale computations.
Key characteristics of parallel computers include:
Multiple processors working together.
Concurrent execution of instructions.
High-speed interprocessor communication.
Why parallel computing?
● The real world is dynamic: many things happen at the same time in different places, generating enormous amounts of data that must be managed concurrently.
● Real-world data needs dynamic simulation and modeling, and parallel computing is the key to achieving this.
● Parallel computing provides concurrency and saves time and money.
● Managing large, complex datasets is practical only with a parallel computing approach.
● It ensures effective utilization of resources: the hardware is kept busy, whereas in serial computation only part of the hardware is used and the rest sits idle.
● It is also impractical to implement real-time systems using serial computing.
Architectural Models of Parallel Computers
Parallel computer architectures are categorized based on the arrangement of processors
and the way they communicate and share resources. Major architectural models include:
1.Flynn's Taxonomy
SISD (Single Instruction Single Data): A single processor executes one instruction at
a time on one data stream (traditional computers).
SIMD (Single Instruction Multiple Data): Multiple processors execute the same
instruction on different data streams simultaneously (e.g., GPUs).
MISD (Multiple Instruction Single Data): Rarely used, where multiple processors
execute different instructions on the same data stream.
MIMD (Multiple Instruction Multiple Data): Multiple processors execute different
instructions on different data streams (e.g., distributed systems).
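To make the SISD/SIMD contrast concrete, here is a toy Python sketch (the function names are illustrative, not from any real instruction set): SISD issues one instruction per data pair per step, while SIMD conceptually broadcasts one instruction across all data lanes at once.

```python
def sisd_add(a, b):
    """SISD style: one instruction operates on one data pair per time step."""
    result = []
    for x, y in zip(a, b):          # each iteration models one instruction issue
        result.append(x + y)
    return result

def simd_add(a, b):
    """SIMD style (conceptually): one broadcast 'add' applied to every lane.
    The lanes are simulated here; real SIMD hardware does this in one step."""
    return [x + y for x, y in zip(a, b)]

print(simd_add([1, 2, 3, 4], [10, 20, 30, 40]))  # [11, 22, 33, 44]
```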
2.Shared Memory Architecture
Processors share a common memory space.
Communication occurs through shared memory.
Suitable for tasks requiring frequent data sharing.
Example: Multi-core processors.
3.Distributed Memory Architecture
Each processor has its own local memory.
Communication occurs via message passing.
Scales better for large systems.
Example: Clusters.
4.Hybrid Architecture
Combines shared and distributed memory architectures.
Common in modern supercomputers.
Example: Systems with multi-core nodes connected via a high-speed network.
5.Dataflow Architecture
Execution is driven by the availability of data rather than a pre-determined
sequence of instructions.
Ideal for fine-grained parallelism.
NOTE: TO LEARN IN DETAIL ABOUT EACH ARCHITECTURE OF FLYNN'S TAXONOMY, REFER TO THE INSTRUCTOR'S PDF, UNIT-1, TOPIC "THE STATE OF COMPUTING", PG 3-6.
Q2. Discuss briefly about shared memory multiprocessor and its models.
Ans: Shared memory multiprocessor systems are a class of parallel computer
architectures where multiple processors share a common memory space. This
architecture allows all processors to access the same global memory,
facilitating interprocessor communication and data sharing. In shared memory
systems, multiple processors can read and write to a single, unified memory
space.
Memory is globally accessible by all processors. The processors are connected
to the shared memory through a bus or interconnection network.
Components
1. Processors: Multiple CPUs work simultaneously, each capable of
executing instructions independently.
2. Shared Memory: A single pool of memory accessible to all processors for
storing and retrieving data.
3. Interconnection Mechanism: A communication infrastructure (like buses
or networks) connects processors to memory.
4. Synchronization Mechanisms: Hardware/software tools (e.g., locks,
semaphores) to manage data consistency and prevent race conditions.
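As a small illustration of the synchronization mechanisms listed above, the following Python sketch uses a lock to protect a shared counter against race conditions; the worker function and thread count are made up for this example.

```python
import threading

counter = 0                      # shared memory location
lock = threading.Lock()          # synchronization mechanism (a lock)

def worker(increments):
    global counter
    for _ in range(increments):
        with lock:               # only one thread may enter at a time
            counter += 1         # the read-modify-write is now atomic

threads = [threading.Thread(target=worker, args=(10_000,)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

print(counter)  # 40000 — without the lock, some updates could be lost
```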
Shared memory machines can be divided into three categories based upon
memory access times
1.Uniform memory-access (UMA) model
The UMA Model: In a UMA multiprocessor model (below Fig.), the physical memory is uniformly shared by all the processors. All processors have equal access time to all memory words, which is why it is called uniform memory access. Each processor may use a private cache. Peripherals are also shared in some fashion.
Advantages: Simple to design, predictable memory access times.
Disadvantages: Limited scalability due to bus contention.
2.Non-uniform-memory-access (NUMA) model
Memory is physically distributed among processors, but processors can still
access memory across the system. Access times vary depending on the
memory's location. Two NUMA machine models are depicted in Fig. The
shared memory is physically distributed to all processors, called local
memories. The collection of all local memories forms a global address space
accessible by all processors.
Each processor has faster access to its local memory.
Example: Modern server-class systems.
Advantages: Scalable for larger systems.
Disadvantages: More complex memory management and potential
latency for remote memory access.
3.Cache-only memory architecture (COMA) model
COMA is a type of distributed shared memory architecture where the main
memory is not statically allocated. Instead, all memory in the system is treated
as a large, dynamically managed cache. This allows data to migrate to the
memory module of the processor that needs it, ensuring faster access and
reducing latency.
In COMA, the memory is highly flexible, and the system dynamically adjusts
the placement of data based on the demands of the processors. This
architecture is particularly beneficial for workloads with irregular or
unpredictable memory access patterns, as it brings data closer to the
processor, improving performance.
Q3. Explain the architecture of a vector supercomputer.
Ans. A vector supercomputer is a high-performance computing system
designed to perform operations on vectors (arrays of data) simultaneously. It is
optimized for handling large-scale computations in scientific and engineering
applications by leveraging vector processing, where a single instruction
operates on multiple data elements in parallel. This architecture is particularly
suited for tasks involving repetitive mathematical operations on large datasets,
such as simulations, weather modeling, and numerical analysis.
• A vector operand contains an ordered set of n elements, where n is called
the length of the vector. Each element in a vector is a scalar quantity, which
may be a floating-point number, an integer, a logical value or a character.
• A vector processor consists of a scalar processor and a vector unit, which
could be thought of as an independent functional unit capable of efficient
vector operations. A vector computer is often built on top of a scalar
processor. As shown in Fig, the vector processor is attached to the scalar
processor as an optional feature. Program and data are first loaded into the
main memory through a host computer. All instructions are first decoded by the scalar control unit. If the decoded instruction is a scalar operation or a
program control operation, it will be directly executed by the scalar processor
using the scalar functional pipelines.
Vector computers are designed with hardware that efficiently performs vector
operations. Instead of accessing operands directly from memory, data is first
loaded into registers, processed, and then stored back in registers. The
hardware uses pipelining to overlap the processing of operands. In pipelined
functional units, each stage handles a step of the operation on different data.
Once the pipeline is full, a new result is produced every clock cycle.
Vector Processor Classification
Vector processors are classified based on how they handle operands during
computations. The two main types are:
1. Memory-to-Memory Vector Processors:
Operands are fetched directly from memory for processing, and the results are
written back to memory without using intermediate registers.
o Operands reside in memory during computations.
o The processor directly interacts with memory for every operation.
Advantages:
o Simpler design as no vector registers are required.
o Suitable for large datasets that cannot fit in registers.
Disadvantages:
o Slower due to frequent memory accesses.
o Higher latency and increased memory bandwidth requirements.
2. Register-to-Register Vector Processors:
Data is first loaded from memory into vector registers, where it is processed.
The results are stored back in memory only after computation is complete.
o Operands are held in high-speed vector registers.
o Operations are performed entirely within the registers.
Advantages:
o Faster due to reduced memory access during computations.
o Efficient use of memory bandwidth.
Disadvantages:
o Requires large, high-speed vector registers, increasing hardware complexity.
o Limited by the size of the registers.
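The two operand-handling styles can be sketched in Python as follows; the flat `memory` list and the function names are illustrative, not a real ISA. The memory-to-memory version touches main memory on every operand access, while the register-to-register version does vector loads, computes entirely within registers, and stores once at the end.

```python
memory = [1.0, 2.0, 3.0, 4.0,      # vector A at addresses 0..3
          10.0, 20.0, 30.0, 40.0,  # vector B at addresses 4..7
          0.0, 0.0, 0.0, 0.0]      # result area at addresses 8..11

def mem_to_mem_add(src_a, src_b, dst, n):
    """Memory-to-memory style: every operand access goes to main memory."""
    for i in range(n):
        memory[dst + i] = memory[src_a + i] + memory[src_b + i]

def reg_to_reg_add(src_a, src_b, dst, n):
    """Register-to-register style: vector loads, compute in registers, one store."""
    v1 = [memory[src_a + i] for i in range(n)]   # vector load into register v1
    v2 = [memory[src_b + i] for i in range(n)]   # vector load into register v2
    v3 = [x + y for x, y in zip(v1, v2)]         # add entirely within registers
    for i in range(n):
        memory[dst + i] = v3[i]                  # single vector store at the end

reg_to_reg_add(0, 4, 8, 4)
print(memory[8:12])  # [11.0, 22.0, 33.0, 44.0]
```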
Q4.Briefly discuss the operational model of SIMD computers.
Ans: SIMD (Single Instruction, Multiple Data) computers are designed to
execute the same instruction simultaneously on multiple data elements. These
systems excel at performing data-parallel tasks, where the same operation
needs to be applied to large datasets, making them ideal for multimedia
processing, scientific simulations, and other data-intensive applications.
Modern SIMD examples include Intel’s SSE (Streaming SIMD Extensions) and
ARM’s NEON instruction sets.
SIMD Operational Model
An SIMD computer is represented by a 5-tuple: M = (N, C, I, M, R). Here's what each component means:
1. N:
o The number of Processing Elements (PEs) in the machine.
o Example: ILLIAC IV had 64 PEs; Connection Machine CM-2 had
65,536 PEs.
2. C:
o The set of instructions executed by the Control Unit (CU).
o Includes scalar operations and program flow controls.
3. I:
o The set of instructions broadcast by the CU to all PEs for parallel
execution.
o Examples: arithmetic, logic, data routing, masking, and local
operations.
4. M:
o Masking schemes used to enable or disable specific PEs during
execution.
o Masks allow only a subset of PEs to process data, providing
flexibility.
5. R:
o Data-routing functions that define communication patterns in the
interconnection network for inter-PE data transfer.
Vector Instructions in SIMD
A vector instruction specifies operations on multiple data elements stored in
vector registers. The fields in a vector instruction include:
1. Operation Code:
Specifies the operation to perform (e.g., addition, multiplication).
2. Base Address:
Refers to the memory location or vector register where operands are
fetched from or results are stored.
3. Address Increment:
Indicates how to locate the next element in a vector.
o Example: An increment of 1 means data is stored consecutively in
memory.
4. Address Offset:
Adds to the base address to compute the effective memory location.
5. Vector Length:
Specifies the number of elements in the vector to determine when to
stop processing.
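The address fields above can be illustrated with a small, hypothetical gather routine in Python (`gather_vector` is not a real instruction; the field names follow the list above):

```python
def gather_vector(memory, base, offset, increment, length):
    """Fetch `length` elements starting at base+offset, stepping by increment."""
    start = base + offset                    # base address + address offset
    return [memory[start + i * increment]    # address increment between elements
            for i in range(length)]          # vector length bounds the loop

memory = list(range(100))                    # fake linear memory: memory[i] == i

# increment of 1: elements are consecutive in memory
print(gather_vector(memory, base=10, offset=2, increment=1, length=4))  # [12, 13, 14, 15]
# increment of 5: strided access, e.g. one column of a 5-wide matrix
print(gather_vector(memory, base=0, offset=0, increment=5, length=4))   # [0, 5, 10, 15]
```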
Key Features of SIMD Computers
1. Parallel Processing: Multiple PEs operate in parallel under the control of
a single instruction stream.
2. Efficient for Large Data: Best suited for tasks involving repetitive
operations on large datasets.
3. Masking: Flexibility to enable/disable specific PEs for selective
processing.
4. Data Routing: Interconnection networks enable communication between
PEs for complex tasks.
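Masking (feature 3 above) can be sketched in a few lines of Python; the list-of-booleans mask encoding is made up for illustration, but the idea matches the text: only enabled PEs (lanes) apply the broadcast operation.

```python
def masked_simd_add(a, b, mask):
    """Apply 'a + b' only in lanes where mask is True; disabled lanes pass a through."""
    return [x + y if m else x for x, y, m in zip(a, b, mask)]

data  = [1, 2, 3, 4]
other = [10, 10, 10, 10]
mask  = [True, False, True, False]   # PEs 0 and 2 enabled, PEs 1 and 3 disabled

print(masked_simd_add(data, other, mask))  # [11, 2, 13, 4]
```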
Q5.Elaborate the PRAM and VLSI models
Ans: PRAM Model
The Parallel Random Access Machine (PRAM) model is a theoretical
framework used to analyze parallel algorithms and measure their
performance. It assumes an ideal parallel computer where multiple processors
can work together and access shared memory simultaneously in constant
time.
The PRAM model is ideal for studying theoretical parallel algorithms, focusing
on time and processor efficiency.
Key Features of PRAM
1. Processors:
o Contains n processors (or processing elements, PEs) working
together.
o All processors execute instructions in synchronization (SIMD: Single
Instruction, Multiple Data).
2. Shared Memory:
o A single, globally accessible memory shared by all processors.
o Memory access happens in constant time, regardless of the
number of processors.
3. Parallel Execution:
o Processors read from memory, compute, and write back to memory
in a synchronized manner (lockstep operation).
Memory Access Policies in PRAM
When multiple processors access memory, the system must define rules to
handle conflicts. These rules include:
1. Exclusive Read (ER):
o Only one processor can read from a specific memory location at a
time.
2. Exclusive Write (EW):
o Only one processor can write to a specific memory location at a
time.
3. Concurrent Read (CR):
o Multiple processors can read from the same memory location
simultaneously.
4. Concurrent Write (CW):
o Multiple processors can write to the same memory location
simultaneously. To avoid conflicts, a policy is needed to decide how
writes are handled.
Variants of PRAM Models
Different combinations of these memory policies lead to PRAM variants:
1. EREW (Exclusive Read, Exclusive Write):
o Most restrictive: Only one processor can read/write a memory
location at any given time.
2. CREW (Concurrent Read, Exclusive Write):
o Allows multiple processors to read, but only one can write to a
memory location.
3. CRCW (Concurrent Read, Concurrent Write):
o Most flexible: Allows multiple processors to read and write
simultaneously.
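To make the CW conflict policy concrete, here is a toy model of one CRCW write step using the "priority" rule (lowest-numbered processor wins), which is one common conflict-resolution policy; the encoding of writes as tuples is made up for illustration.

```python
def crcw_write_step(memory, writes):
    """One CRCW PRAM write step.
    writes: list of (processor_id, address, value) issued in the same step."""
    winners = {}
    for pid, addr, value in sorted(writes):  # sort => lowest pid is seen first
        if addr not in winners:              # first (highest-priority) write wins
            winners[addr] = value
    for addr, value in winners.items():      # commit one value per address
        memory[addr] = value
    return memory

mem = [0, 0, 0]
crcw_write_step(mem, [(2, 1, 99), (0, 1, 7), (1, 2, 5)])
print(mem)  # [0, 7, 5] — processor 0 beat processor 2 at address 1
```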
Why PRAM is Important
1. Idealized Model:
o It simplifies the study of parallel algorithms by ignoring
synchronization and memory access delays.
2. Algorithm Development:
o PRAM is widely used for designing and analyzing scalable parallel
algorithms.
VLSI Model
Very-Large-Scale Integration (VLSI) is the process of creating an integrated
circuit (IC) by combining thousands of electronic components, like transistors,
resistors, and capacitors, into a single chip.
The VLSI model deals with designing and arranging these components on a
chip while considering factors like power usage, speed, size, and
manufacturing limitations.
In parallel computers, VLSI chips are used to build processor arrays, memory
arrays, and large-scale switching networks. Modern VLSI chips are typically 2-
dimensional, and their size depends on the amount of memory they can store.
The efficiency of an algorithm on a VLSI chip is measured by its space complexity.
This is calculated using the chip area (A) and the time (T) it takes to run the algorithm. The product A × T represents the total number of bits the chip processes.
For certain tasks, there is a minimum value, f(s), depending on the problem size s, that limits how efficiently the chip can perform the computation, such that A · T² ≥ O(f(s)), where A = chip area and T = time.
Q6.Briefly discuss (a) Multivector & SIMD tracks (b) Multithreaded &
dataflow tracks.
Ans: (a) Multivector & SIMD tracks
Multivector Tracks:
These are traditional vector supercomputers. The CDC 7600 was the first
vector dual-processor system, and two subtracks were derived from it. The
Cray and Japanese supercomputers all followed the register-to-register
architecture, which focuses on vector processing, where operations are
performed on entire arrays (vectors) of data rather than on individual
elements. The Cray 1 pioneered the multivector development in 1978. The
Cray/MPP was a massively parallel system with distributed shared memory,
designed to work as a back-end accelerator engine compatible with the Cray
Y-MP series.
The other subtrack used the memory-to-memory architecture in building
vector supercomputers. Only the CDC Cyber 205 and its successor, the ETA10,
are identified here, for completeness in tracking different supercomputer
architectures.
SIMD Tracks:
In SIMD systems, a single instruction stream is broadcast to multiple
processors, which execute it simultaneously on different data elements.
The Illiac IV pioneered the construction of SIMD computers, although the
array-processor concept can be traced back far earlier, to the 1960s. The
subtrack consisting of the Goodyear MPP, the AMT/DAP610, and the TMC/CM-2
comprised SIMD machines built with bit-slice PEs. The CM-5 was a
synchronized MIMD machine executing in a multiple-SIMD mode.
The other subtrack corresponds to medium-grain SIMD computers using
word-wide PEs. The BSP [Kuck and Stokes, 1982] was a shared-memory SIMD
machine built with 16 processors updating a group of 17 memory modules
synchronously. The GF11 was developed at the IBM Watson Laboratory for
scientific simulation research use. The MasPar MP1 was the only medium-grain
SIMD computer to achieve production use in that time period.
(b) Multithreaded & dataflow tracks.
Multithreaded tracks:
Conventional von Neumann machines are built with processors that execute a
single context at a time; in other words, each processor maintains a single
thread of control with its hardware resources. In a multithreaded
architecture, each processor can execute multiple contexts at the same time.
The term multithreading implies that there are multiple threads of control
in each processor.
Multithreading offers an effective mechanism for hiding long latency in
building large-scale multiprocessors and is today a mature technology.
As shown in Fig., the multithreading idea was pioneered by Burton Smith
[1978] in the HEP system, which extended the concept of scoreboarding of
multiple functional units in the CDC 6400.
Dataflow Tracks:
Dataflow systems focus on executing instructions based on the availability of
data, rather than following a sequential order as in traditional architectures.
The key idea is to use a dataflow mechanism, instead of a control-flow
mechanism as in von Neumann machines, to direct the program flow. Fine-grain,
instruction-level parallelism is exploited in dataflow computers.
The dataflow concept was pioneered by Jack Dennis (1974) with a "static"
architecture. The concept later inspired the development of "dynamic"
dataflow computers. A series of tagged-token architectures was developed at
MIT by Arvind and coworkers.
Q7. Discuss in brief Data flow architecture.
Ans: In a dataflow computer, the execution of an instruction is driven by
data availability instead of being guided by a program counter. In theory,
any instruction should be ready for execution whenever its operands become
available.
The instructions in a data-driven program are not ordered in any way.
Instead of being stored separately in a main memory, data are directly held
inside instructions. Computational results (data tokens) are passed directly
between instructions.
The data generated by an instruction will be duplicated into many copies and
forwarded directly to all needy instructions. Data tokens, once consumed by
an instruction, will no longer be available for reuse by other instructions.
A Dataflow Architecture
As shown in Fig., the global architecture consists of n processing elements
(PEs) interconnected by an n × n routing network. The entire system supports
pipelined dataflow operations in all n PEs. Inter-PE communications are done
through the pipelined routing network.
Within each PE, the machine provides a low-level token-matching mechanism
which dispatches only those instructions whose input data (tokens) are
already available.
This data-driven scheme requires no program counter, and no control
sequencer. However, it requires special mechanisms to detect data availability,
to match data tokens with needy instructions, and to enable the chain reaction
of asynchronous instruction executions. No memory sharing between
instructions results in no side effects.
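The data-driven firing rule described above can be modeled in a few lines of Python; the node names and the graph encoding are made up for illustration. An instruction (node) fires as soon as all of its input tokens are available, with no program counter deciding the order.

```python
# Dataflow graph: each node lists its input tokens and the operation to apply.
graph = {
    "a+b": (["a", "b"], lambda x, y: x + y),
    "c*2": (["c"], lambda x: x * 2),
    "sum": (["a+b", "c*2"], lambda x, y: x + y),
}

def run_dataflow(graph, tokens):
    """tokens: initially available data; fire every node whose inputs are ready."""
    fired = set()
    while True:
        ready = [n for n, (ins, _) in graph.items()
                 if n not in fired and all(i in tokens for i in ins)]
        if not ready:                     # no instruction is enabled -> done
            return tokens
        for n in ready:                   # no program counter: order is data-driven
            ins, op = graph[n]
            tokens[n] = op(*(tokens[i] for i in ins))   # emit a result token
            fired.add(n)

out = run_dataflow(graph, {"a": 1, "b": 2, "c": 10})
print(out["sum"])  # 23, i.e. (1 + 2) + (10 * 2)
```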
Q8. Explain in detail about program flow mechanism.
Ans: Program flow mechanisms refer to the various methods and strategies
employed to control the sequence of execution of instructions in a computer
system. They determine how a program’s control moves from one instruction
to another and ensure that computations occur in the correct order,
respecting dependencies and achieving the desired output.
The main program flow mechanisms are:
Control-Driven Mechanism
Data-Driven Mechanism
Demand-Driven Mechanism
1. Control-Driven Mechanism
Conventional computers are based on a control flow mechanism by
which the order of program execution is explicitly stated in the user
programs. A program counter keeps track of the current instruction.
Instructions are executed sequentially unless a control flow instruction
(e.g., jump, call) changes the PC. Here sequential execution is default.
Flow control uses constructs like loops, conditionals, and jumps.
2. Data-Driven Mechanism
Dataflow computers are based on a data-driven mechanism which allows
the execution of any instruction to be driven by data (operand)
availability. Dataflow computers emphasize a high degree of parallelism
at the fine-grain instructional level.
Instructions are "enabled" for execution when their input data
(operands) are available.
Tokens carrying data are passed between instructions, and execution
proceeds asynchronously.
No program counter or central control mechanism.
3. Demand-Driven Mechanism:
Reduction computers are based on a demand-driven mechanism which
initiates an operation based on the demand for its results by other
computations.
Also known as lazy evaluation, it executes instructions only when their
results are required by another part of the program. Execution starts
from the final output requirement.
Instructions are triggered in reverse order of dependency.
Avoids unnecessary computation by focusing only on required results.
Execution is guided by demand propagation.
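The demand-driven idea can be sketched with thunks in Python, which is one common way to model lazy evaluation (the `thunk` helper and the `calls` log are made up for this example): nothing runs until its result is demanded, and demanding one result propagates demands to its inputs.

```python
calls = []  # records which computations actually ran, in demand order

def thunk(name, fn):
    """Wrap a computation so it only runs when its value is demanded."""
    cache = {}
    def force():
        if "v" not in cache:
            calls.append(name)      # the demand reached this computation
            cache["v"] = fn()       # run it once and cache the result
        return cache["v"]
    return force

a = thunk("a", lambda: 2 + 3)
b = thunk("b", lambda: 10 * 10)     # never demanded, so it never executes
c = thunk("c", lambda: a() * 4)     # demanding c propagates a demand to a

print(c())       # 20 — execution starts from the final output requirement
print(calls)     # ['c', 'a'] — b was never computed (lazy evaluation)
```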
Q9. Explain Bell's taxonomy of MIMD computers
Ans: Parallel computers appear as either SIMD or MIMD configurations. The
SIMDs appeal more to special-purpose applications. It is clear that SIMDs
are not size-scalable, but unclear whether large SIMDs are
generation-scalable. The fact that the CM-5 had an MIMD architecture, moving
away from the SIMD architecture of the CM-2, represents the architectural
trend. Furthermore, the boundary between multiprocessors and multicomputers
has become blurred in recent years.
The architectural trend for general-purpose parallel computers is in favor
of MIMD configurations with various memory configurations. Gordon Bell
(1992) has provided a taxonomy of MIMD machines, shown in Fig. He considers
shared-memory multiprocessors as having a single address space. Scalable
multiprocessors or multicomputers must use distributed memory.
Multiprocessors using centrally shared memory have limited scalability.
Multicomputers use distributed memories with multiple address spaces. They
are scalable with distributed memory. The evolution of fast LAN (local area
network)-connected workstations has created "commodity supercomputing".
Bell was the first to advocate high-speed workstation clusters
interconnected by high-speed switches in lieu of special-purpose
multicomputers. The CM-5 development was an early move in that direction.
NOTE: CAN ALSO REFER SPECTRUM Q20.
Q10. Explain system interconnect architectures.
Ans: System Interconnect Architectures refer to the frameworks used to
connect the various components of a computer system, like processors,
memory, and input/output devices. These architectures can be broadly
categorized into Static Interconnection and Dynamic Interconnection based on
their structure and flexibility.
1.Static Interconnection Architectures:
Static networks use point-to-point direct connections which do not alter
during execution. This means static interconnection systems have fixed
pathways for communication between components; these pathways do not change
during the system's operation. They are faster and simpler.
Types of Static interconnection architectures:
(a)Linear Array: Processors are arranged in a straight line, with each
connected to its immediate neighbors. The linear array is the simplest
connection topology: a one-dimensional network in which N nodes are
connected by N - 1 links in a line. Internal nodes have degree 2, and the
terminal nodes have degree 1.
(b)Ring: Processors form a circular topology, with each processor connected to
two neighbors (left and right). A ring is obtained by connecting the two
terminal nodes of a linear array with one extra link. A ring can be uni-
directional or bidirectional. It is symmetric with a constant node degree of 2.
(c)Chordal Ring: By increasing the node degree from 2 to 3 or 4, we obtain two
chordal rings with degree 3 and 4. One and two extra links are added to
produce the two chordal rings, respectively. In general, the more links added,
the higher the node degree and the shorter the network diameter.
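A quick check of the topology properties described above: the network diameter (the longest shortest path between any two nodes) of an N-node linear array versus a ring. The helper names here are illustrative.

```python
def linear_array_diameter(n):
    """Worst case is end to end: N - 1 hops."""
    return n - 1

def ring_diameter(n):
    """Messages can go either way around the ring, halving the worst case."""
    return n // 2

print(linear_array_diameter(8))  # 7
print(ring_diameter(8))          # 4 — one extra link roughly halves the diameter
```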
2.Dynamic Interconnection Architectures:
For multipurpose or general-purpose applications, we may need to use
dynamic connections which can implement all communication patterns based
on program demands. Instead of using fixed connections, switches or arbiters
must be used along the connecting paths to provide the dynamic connectivity.
In increasing order of cost and performance, dynamic connection networks
include bus systems, multistage interconnection networks (MIN), and crossbar
switch networks.
Types of Dynamic interconnection architectures:
(a)Bus System: A bus system is essentially a collection of wires and connectors
for data transactions among processors, memory modules, and peripheral
devices attached to the bus. The bus is used for only one transaction at a time
between source and destination.
In case of multiple requests, the bus arbitration logic must be able to allocate
or deallocate the bus, servicing the requests one at a time. A bus system has a
lower cost and provides a limited bandwidth compared to the other two
dynamic connection networks.
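The bus-arbitration step described above can be sketched as follows; the fixed-priority policy (lowest device id wins) is just one possible arbitration scheme, assumed here for illustration.

```python
def arbitrate(requests):
    """One arbitration round: grant the bus to one device, queue the rest.
    requests: list of device ids currently requesting the bus."""
    if not requests:
        return None, []
    granted = min(requests)                        # fixed priority: lowest id wins
    waiting = [r for r in requests if r != granted]  # serviced in later rounds
    return granted, waiting

granted, waiting = arbitrate([3, 1, 2])
print(granted, waiting)  # 1 [3, 2] — one transaction at a time on the bus
```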
(b)Multistage interconnection network: MINs have been used in both MIMD
and SIMD computers. A generalized multistage network is illustrated in the Fig
below. A number of a x b switches are used in each stage. Fixed interstage
connections are used between the switches in adjacent stages. The switches
can be dynamically set to establish the desired connections between the
inputs and outputs.
(c)Crossbar switch networks: The highest bandwidth and interconnection
capability are provided by crossbar networks. A crossbar network can be
visualized as a single-stage switch network. Like a telephone switchboard,
the crosspoint switches provide dynamic connections between
source-destination pairs. Each crosspoint switch can provide a dedicated
connection path between a pair, and can be set on or off dynamically upon
program demand.
Done!!