Parallel and Distributed Computing
CS3006
Lecture 10
OpenMP-III
6th April 2022
Dr. Rana Asif Rehman
Review of OpenMP Clause List
private
firstprivate, lastprivate
shared
default
private, shared, none
reduction
if clause
schedule
static, dynamic, guided, runtime
nowait
Synchronization in OpenMP
Barrier Directive
On encountering this directive, all threads in a team wait until every thread has reached it, and then all are released
#pragma omp barrier
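A minimal sketch of a two-phase computation where the barrier is required; the array, the phase structure, and the neighbour access are illustrative, not from the slides:
#include <stdio.h>
#include <omp.h>
int main(void) {
    int data[4] = {0};
    #pragma omp parallel num_threads(4)
    {
        int id = omp_get_thread_num();
        int n = omp_get_num_threads();   // may be fewer than 4
        data[id] = id * id;              // phase 1: each thread fills its own slot
        #pragma omp barrier              // no thread proceeds until all slots are written
        printf("thread %d sees neighbour value %d\n", id, data[(id + 1) % n]);  // phase 2
    }
    return 0;
}
Without the barrier, a thread could read a neighbour's slot before it has been written.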
Single Directive
A single directive specifies a structured block that is executed by a single (arbitrary) thread of a parallel region
Implicit barrier at the end of the block (unless nowait is specified)
#pragma omp single [clause list]
structured block
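A small sketch: a shared variable that must be initialized exactly once (the variable name and value are illustrative):
#include <stdio.h>
#include <omp.h>
int main(void) {
    int shared_value = 0;
    #pragma omp parallel
    {
        #pragma omp single
        shared_value = 42;   // done by one arbitrary thread
        // implicit barrier here: every thread sees the initialized value below
        printf("thread %d reads %d\n", omp_get_thread_num(), shared_value);
    }
    return 0;
}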
Master Directive
The master directive is a specialization of the single
directive in which only the master thread executes
the structured block
No implicit barrier
#pragma omp master
structured block
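The same pattern with master; note the missing barrier (a sketch with illustrative output):
#include <stdio.h>
#include <omp.h>
int main(void) {
    #pragma omp parallel
    {
        #pragma omp master
        printf("printed only by the master, thread %d\n", omp_get_thread_num());
        // no implicit barrier: the other threads do not wait here;
        // add "#pragma omp barrier" if they must see the master's work first
    }
    return 0;
}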
Critical Sections
(#pragma omp critical)
A critical section is a code segment that accesses a shared variable and needs to be executed as an atomic action.
It means that in a group of cooperating processes/threads, at any given point in time, at most one process may be executing its critical section
Forces threads to be mutually exclusive:
Only one thread at a time executes the given code section
double area, pi, x;
int i, n;
...
area = 0.0;
for (i = 0; i < n; i++) {
x = (i+0.5)/n; // can be calculated independently
area += 4.0/(1.0 + x*x); // requires mutual exclusion
}
pi = area / n;
Critical Sections
(#pragma omp critical)
If we simply parallelize the loop, a race condition may occur
double area, pi, x;
int i, n;
...
area = 0.0;
#pragma omp parallel for private(x)
for (i = 0; i < n; i++) {
x = (i+0.5)/n;
area += 4.0/(1.0 + x*x); //not atomic
}
pi = area / n;
Critical Sections
(#pragma omp critical)
Race Condition
Value of area    Thread A         Thread B
11.667           reads 11.667
11.667           + 3.765          reads 11.667
15.432           writes 15.432    + 3.563
15.230                            writes 15.230
Thread A reads value of area first
Thread B reads value of area before A can update its value
Thread A updates value of area
Thread B ignores update by A and writes its incorrect value to
area
Critical Sections
(#pragma omp critical)
Race Condition
A race condition is created when one process may
“race ahead” of another and overwrite the change
made by the first process to the shared variable
The final value of area is 15.230, but the answer should be 18.995
(Thread A wrote 15.432, then Thread B overwrote it with 15.230; both were executing area += 4.0/(1.0 + x*x))
Critical Sections
(#pragma omp critical)
Critical section: a portion of code that only one thread at a time may execute
We denote a critical section by putting the pragma
#pragma omp critical [(name)]
Optional identifier name can be used to identify a critical region
Solves the problem, but since only one thread at a time may execute the statement, the update becomes sequential code
double area, pi, x;
int i, n;
...
area = 0.0;
#pragma omp parallel for private(x)
for (i = 0; i < n; i++) {
x = (i+0.5)/n;
#pragma omp critical
area += 4.0/(1.0 + x*x);
}
pi = area / n;
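As a sketch of a better-performing alternative, the critical section above can be replaced by the reduction clause reviewed at the start of this lecture: each thread accumulates a private partial sum, and the partial sums are combined once at the end, so the additions stay parallel (the value of n is chosen arbitrarily here):
double area = 0.0, pi, x;
int i, n = 1000000;   // illustrative problem size
#pragma omp parallel for private(x) reduction(+ : area)
for (i = 0; i < n; i++) {
    x = (i + 0.5) / n;
    area += 4.0 / (1.0 + x * x);   // updates a private copy; no lock needed
}
pi = area / n;                     // private copies already combined here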
Atomic Directive
The atomic directive specifies that the single
memory location update should be performed as
an atomic operation
#pragma omp atomic
Update statement, e.g., x++;
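Applied to the earlier pi loop, atomic protects just the one memory update; a sketch (typically cheaper than critical for a single scalar update, though still slower than reduction):
#pragma omp parallel for private(x)
for (i = 0; i < n; i++) {
    x = (i + 0.5) / n;
    #pragma omp atomic
    area += 4.0 / (1.0 + x * x);   // the single memory update is performed atomically
}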
Environment Variables in OpenMP
Environment Variables in OpenMP
OpenMP provides additional environment variables
that help control execution of parallel programs
OMP_NUM_THREADS
OMP_DYNAMIC
OMP_SCHEDULE
OMP_NESTED
Environment Variables in OpenMP
OMP_NUM_THREADS
Specifies the default number of threads created upon entering a parallel region.
The number of threads can also be changed at run time using:
the omp_set_num_threads(int threads) routine [OR]
the num_threads(int threads) clause
Setting OMP_NUM_THREADS to 4 using bash:
export OMP_NUM_THREADS=4
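A sketch showing both mechanisms in one program; the num_threads clause takes precedence over omp_set_num_threads(), which in turn overrides OMP_NUM_THREADS:
#include <stdio.h>
#include <omp.h>
int main(void) {
    omp_set_num_threads(4);              // default for subsequent parallel regions
    #pragma omp parallel num_threads(2)  // clause overrides the routine for this region
    {
        #pragma omp single
        printf("team size: %d\n", omp_get_num_threads());  // prints 2 (if dynamic adjustment is off)
    }
    return 0;
}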
Environment Variables in OpenMP
OMP_DYNAMIC
When set to TRUE, allows the number of threads to be controlled at run time: OpenMP uses its dynamic adjustment algorithm to create a number of threads that may optimize system performance
When TRUE, the total number of threads created may not equal the number requested via the omp_set_num_threads() function or the num_threads clause.
When FALSE, the total number of threads created in a parallel region normally matches the number requested by the num_threads clause
OpenMP routines for setting/getting the dynamic status:
void omp_set_dynamic(int flag); // disables dynamic adjustment if flag = 0
Should be called from outside a parallel region
int omp_get_dynamic(); // returns the current dynamic status
Environment Variables in OpenMP
OMP_DYNAMIC [dynamic.c]
workers = omp_get_max_threads();   // could also use omp_get_num_procs()
printf("%d maximum allowed threads\n", workers);
printf("total number of available cores: %d\n", omp_get_num_procs());
omp_set_dynamic(1);                // enable dynamic adjustment
omp_set_num_threads(8);
printf("threads requested when dynamic is true: %d\n", 8);
#pragma omp parallel
{
    #pragma omp single nowait
    printf("total threads in parallel region 1: %d\n", omp_get_num_threads());
    #pragma omp for
    for (i = 0; i < mult; i++)
    { a = complex_func(); }
}
Environment Variables in OpenMP
OMP_DYNAMIC [dynamic.c]
omp_set_dynamic(0);                // disable dynamic adjustment
omp_set_num_threads(8);
printf("threads requested when dynamic is false: %d\n", 8);
#pragma omp parallel
{
    #pragma omp single nowait
    printf("total threads in parallel region 2: %d\n", omp_get_num_threads());
    #pragma omp for
    for (i = 0; i < mult; i++)
    { a = complex_func(); }
}
Environment Variables in OpenMP
OMP_SCHEDULE
Controls the assignment of iteration spaces associated
with for directives that use the runtime scheduling class
Possible values: static, dynamic, and guided
Can also be used along with chunk size [optional]
If chunk size is not specified, then a default chunk size of 1 is used.
Setting OMP_SCHEDULE to guided with a minimum chunk size of 4 in an Ubuntu-based terminal:
export OMP_SCHEDULE="guided,4"
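OMP_SCHEDULE only affects loops that request runtime scheduling; a minimal sketch (the loop bound n and the results array are illustrative, complex_func stands for any per-iteration work):
// run with: export OMP_SCHEDULE="guided,4" before launching the program
#pragma omp parallel for schedule(runtime)
for (i = 0; i < n; i++)
    results[i] = complex_func();   // iteration assignment chosen at run time from OMP_SCHEDULE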
Environment Variables in OpenMP
OMP_NESTED
Default value is FALSE
When a parallel pragma is nested inside another, the inner region is executed by a team of just one thread (the thread that encounters it) rather than by a new thread team.
When TRUE
Enables nested parallelism
When a parallel pragma is nested inside another, a new team of threads is created to execute the inner region.
Use omp_set_nested(int val) with a non-zero value to set this variable to TRUE.
When called with 0 as the argument, it sets the variable to FALSE
Environment Variables in OpenMP
OMP_NESTED [nested.c]
omp_set_nested(0);   // nested parallelism disabled
#pragma omp parallel num_threads(2)
{
    #pragma omp single
    printf("Level 1: number of threads in the team: %d\n", omp_get_num_threads());
    #pragma omp parallel num_threads(4)
    {
        #pragma omp single
        printf("Level 2: number of threads in the team: %d\n", omp_get_num_threads());
    }
}
Environment Variables in OpenMP
OMP_NESTED [nested.c]
omp_set_nested(1);   // nested parallelism enabled
#pragma omp parallel num_threads(2)
{
    #pragma omp single
    printf("Level 1: number of threads in the team: %d\n", omp_get_num_threads());
    #pragma omp parallel num_threads(4)
    {
        #pragma omp single
        printf("Level 2: number of threads in the team: %d\n", omp_get_num_threads());
    }
}
Example
Computing Pi using Monte Carlo method
Preliminary Idea:
Pi ≈ 4 × (points in circle / points in square)
Here the circle has centre (a, b) = (0.5, 0.5) and radius r = 0.5, so it is inscribed in the unit square
Computing Pi using Monte Carlo method
Steps
For all the random points:
1. Count the total points that fall inside the circle
2. Divide the number of points inside the circle by the number of points in the square
The total number of random points is also the total number of points inside the square
3. Multiply this fraction by 4
As the number of random points increases, the value of Pi approaches the real value (i.e., 3.14159...)
Computing Pi using Monte Carlo method
Sequential Implementation
int niter = 100000000;   // 100 million random points
int count = 0;
double x, y, z, pi;
srandom(time(0));        // seed the random-number generator
for (i = 0; i < niter; ++i)
{
    // get a random point in the unit square
    x = (double)random() / RAND_MAX;
    y = (double)random() / RAND_MAX;
    z = ((x - 0.5) * (x - 0.5)) + ((y - 0.5) * (y - 0.5));
    // check whether the point lies inside the circle (r*r = 0.25)
    if (z < 0.25)
    {
        ++count;
    }
}
pi = ((double)count / (double)niter) * 4.0;   // pi = 4 * (m/n)
printf("Seq_Pi: %f\n", pi);
Computing Pi using Monte Carlo method
(Parallel construct [parallel_pi.c])
#pragma omp parallel shared(niter) private(i, x, y, z, chunk_size, seed) reduction(+ : count)
{
    num_threads = omp_get_num_threads();
    chunk_size = niter / num_threads;
    seed = omp_get_thread_num();   // per-thread seed; must be declared unsigned int for rand_r
    #pragma omp master
    { printf("chunk_size=%ld\n", chunk_size); }
    count = 0;   // redundant: reduction already zero-initializes each private copy
    for (i = 0; i < chunk_size; i++)
    {
        // get a random point in the unit square
        x = (double)rand_r(&seed) / (double)RAND_MAX;
        y = (double)rand_r(&seed) / (double)RAND_MAX;
        z = ((x - 0.5) * (x - 0.5)) + ((y - 0.5) * (y - 0.5));
        // check whether the point lies inside the circle
        if (z < 0.25)
        {
            ++count;
        }
    }
}
pi = ((double)count/(double)niter)*4.0;
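Assuming gcc, the example can be compiled and run as follows (file name as given in the slides):
gcc -fopenmp parallel_pi.c -o parallel_pi
OMP_NUM_THREADS=4 ./parallel_pi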
Parallelizing linked lists
Consider the following code:
current = head;
while (current != NULL) {
    complex_func(current->key);   // complex consumer function
    current = current->next;
}
Assume that the complex function can be computed for each key value independently
The code can't be parallelized directly because:
OpenMP has no construct to parallelize while loops, and the equivalent for loop does not have canonical form
This is because we don't know the number of iterations in advance
If we simply put an 'omp parallel' pragma before the while loop, the program semantics will not be preserved
Parallelizing linked lists
[Naïve idea 1, with logical error]
Consider the following code:
current = head;
#pragma omp parallel firstprivate(current)
{
    while (current != NULL) {
        complex_func(current->key);   // complex consumer function
        current = current->next;
    }
}
Creates a team of threads, each with a private 'current' variable.
Each thread executes the loop for all the nodes in the list
This means every thread performs work equal to the sequential code
No speedup is achieved; this will rather increase execution time
Parallelizing linked lists
[Naïve idea 2, with logical error]
Consider the following code:
current = head;
#pragma omp parallel shared(current)
{
    while (current != NULL) {         // line 1
        complex_func(current->key);   // complex consumer function
        current = current->next;      // line 3
    }
}
Creates a team of threads sharing the same 'current' variable.
In the first while iteration, complex_func may be called by several threads with the same key value.
Semantics/atomicity will not be ensured (i.e., multiple threads executing line 3 can change the line-1 result for other threads)
So, the output may not be as expected
Parallelizing linked lists
[Naïve but Correct parallelization]
Observations:
1. We don't know in advance the number of nodes in the list
2. We also don't know how to access all the nodes of the list in parallel, because a linked list can only be traversed sequentially
3. We can parallelize it using the following steps
1. Count the number of nodes in the list; call it 'C'
2. Allocate a dynamic array of pointers-to-list of size 'C'. Using a loop, copy the address of the i-th node to the i-th element of the pointer array.
3. Use a for loop that iterates over this array of pointers. Furthermore, this for loop can be parallelized
Parallelizing linked lists
[Naïve but Correct parallelization]
1. Count the number of nodes in the list; call it 'C'
// struct LIST { int key; LIST *next; };
int C = 0;
LIST *p = head;   // head points to the start of the list
while (p != NULL) {
    p = p->next;
    C++;
}
Parallelizing linked lists
[Naïve but Correct parallelization]
2. Allocate a dynamic array of pointers-to-list of size 'C' and, using a loop, copy the address of the i-th node to the i-th element of the pointer array
LIST **Parray = new LIST*[C];
p = head;
int i = 0;
while (p != NULL) {
    Parray[i++] = p;   // store the address of the i-th node
    p = p->next;
}
Parallelizing linked lists
[Naïve but Correct parallelization]
3. Now we can use a for loop that iterates over this array of pointers. Furthermore, this for loop can be parallelized
#pragma omp parallel for schedule(static,1)
for (i = 0; i < C; i++) {
    complex_func(Parray[i]->key);
}
This method can yield speedups only if the tasks are complex enough to overcome the data-movement costs.
Usually, data movement is more costly than the computation
So, we need to devise another solution
Parallelizing linked lists
[A relatively better implementation]
// omptask.c and tasktime.c   // compile and execute using g++
#pragma omp parallel
{
    #pragma omp single   // a single thread enters this region and creates the tasks
    {
        current = head;
        while (current != NULL) {
            // the following pragma creates a task and adds it to a logical task pool
            #pragma omp task firstprivate(current)
            complex_func(current->key);
            current = current->next;
        }
    }
}
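Note that the implicit barrier at the end of the single block (and of the parallel region) already guarantees that all created tasks finish before the program continues. If the creating thread needed the results earlier, a taskwait could be added inside the single block after the while loop (a sketch):
#pragma omp taskwait   // wait for all tasks created so far by this thread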
Example run: total threads = 4, total complex iterations = 100 million, list size = 10 nodes
Parallelizing linked lists
[omp task illustration]
Questions