Point-to-Point - IV

Lecture 7
January 29, 2024
Performance of Send Modes
• MPI_Send: rendezvous
• MPI_Bsend: forced buffering
• MPI_Ssend: forced synchronization

Example

MPI_Bsend
The buffer size passed to MPI_Buffer_attach should be the sum of the sizes of all Bsends that may be outstanding at the same time,
plus MPI_BSEND_OVERHEAD for each of those Bsends.
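
The sizing rule above is plain arithmetic, and a minimal sketch of it follows. This is not part of the MPI API: `bsend_buffer_bytes` is a hypothetical helper, and its `overhead` parameter stands in for the `MPI_BSEND_OVERHEAD` constant from `mpi.h`, so the snippet compiles without an MPI installation. It also assumes every outstanding message has the same packed size. In a real program that size would normally come from `MPI_Pack_size`.

```c
#include <stddef.h>

/* Hypothetical helper: bytes to reserve for buffered sends.
 *   n_outstanding - number of Bsends that may be pending at the same time
 *   payload_bytes - packed size of one message (in MPI, from MPI_Pack_size)
 *   overhead      - per-message bookkeeping; stands in for MPI_BSEND_OVERHEAD
 * Assumes all outstanding messages have the same size. */
size_t bsend_buffer_bytes(size_t n_outstanding, size_t payload_bytes,
                          size_t overhead)
{
    /* Sum of all outstanding payloads plus one overhead term per Bsend. */
    return n_outstanding * (payload_bytes + overhead);
}
```

For instance, two outstanding Bsends of 8000 bytes each with an assumed overhead of 96 bytes would need 2 * (8000 + 96) = 16192 bytes. The real value of MPI_BSEND_OVERHEAD depends on the implementation.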

Nearest Neighbor (NN) Exchange

[Figure: processes with ranks 0 to P-1 arranged in a line]
Nearest Neighbor Pseudocode
Tags? Performance?
Option 1: Schedule right sends followed by left sends
if (myrank < P-1)
{
  // Send/recv right neighbor; each message is tagged with the receiver's rank
  MPI_Send (data, myArraySize, MPI_DOUBLE, myrank+1, myrank+1, MPI_COMM_WORLD);
  MPI_Recv (recvbuf, myArraySize, MPI_DOUBLE, myrank+1, myrank, MPI_COMM_WORLD, &status);
}

if (myrank > 0)
{
  // Recv/send left neighbor
  MPI_Recv (recvbuf, myArraySize, MPI_DOUBLE, myrank-1, myrank, MPI_COMM_WORLD, &status);
  MPI_Send (data, myArraySize, MPI_DOUBLE, myrank-1, myrank-1, MPI_COMM_WORLD);
}
Output

Nearest Neighbor Pseudocode
Option 2: Schedule odd and even ranks alternately

if (myrank % 2 == 0 && myrank < P-1)
{
  // Send/recv right neighbor from even ranks
  MPI_Send (data, myArraySize, MPI_DOUBLE, myrank+1, myrank+1, MPI_COMM_WORLD);
  MPI_Recv (recvbuf, myArraySize, MPI_DOUBLE, myrank+1, myrank, MPI_COMM_WORLD, &status);
}
else if (myrank % 2 != 0 && myrank > 0)
{
  // Recv/send left neighbor from odd ranks
  MPI_Recv (recvbuf, myArraySize, MPI_DOUBLE, myrank-1, myrank, MPI_COMM_WORLD, &status);
  MPI_Send (data, myArraySize, MPI_DOUBLE, myrank-1, myrank-1, MPI_COMM_WORLD);
}
Nearest Neighbor Pseudocode (Option 2, second phase)
if (myrank % 2 != 0 && myrank < P-1)
{
  // Send/recv right neighbor from odd ranks
  MPI_Send (data, myArraySize, MPI_DOUBLE, myrank+1, myrank+1, MPI_COMM_WORLD);
  MPI_Recv (recvbuf, myArraySize, MPI_DOUBLE, myrank+1, myrank, MPI_COMM_WORLD, &status);
}
else if (myrank % 2 == 0 && myrank > 0)
{
  // Recv/send left neighbor from even ranks
  MPI_Recv (recvbuf, myArraySize, MPI_DOUBLE, myrank-1, myrank, MPI_COMM_WORLD, &status);
  MPI_Send (data, myArraySize, MPI_DOUBLE, myrank-1, myrank-1, MPI_COMM_WORLD);
}
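
The pairing behind the two Option 2 phases can be checked without running MPI. The sketch below is only an illustration: `nn_partner` is a hypothetical helper, and numbering the phases 0 (even ranks initiate) and 1 (odd ranks initiate) is an assumption made for this example. It returns the rank a process exchanges with in a given phase, or -1 when the process sits that phase out.

```c
/* Hypothetical helper for the odd-even schedule (Option 2).
 *   phase 0: even ranks pair with their right neighbor
 *   phase 1: odd ranks pair with their right neighbor
 * Returns the partner's rank, or -1 if this rank is idle in the phase. */
int nn_partner(int myrank, int P, int phase)
{
    /* A rank whose parity matches the phase sends to the right first;
     * every other rank receives from the left first. */
    int sends_right = (myrank % 2 == phase);
    int partner = sends_right ? myrank + 1 : myrank - 1;

    if (partner < 0 || partner >= P)
        return -1;              /* no neighbor on that side */
    return partner;
}
```

With P = 4, phase 0 pairs (0,1) and (2,3), while phase 1 pairs (1,2) and leaves ranks 0 and 3 idle. In each phase every sender is matched with a rank that posts its receive first.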
Same Host (Option 1 vs. 2)
for i in `seq 1 5` ; do mpirun -np 4 ./nn-1 1000000 ; done
0.006751
0.006896
0.006518
0.006310
0.006356

for i in `seq 1 5` ; do mpirun -np 4 ./nn-2 1000000 ; done

0.006183
0.017730
0.006718
0.006862
0.006701
Two Hosts (Option 1 vs. 2)
for i in `seq 1 5` ; do mpirun -np 4 -hosts csews1,csews10 ./nn-1 1000000 ; done
0.450281
0.426031
0.419316
0.445110
0.416786

for i in `seq 1 5` ; do mpirun -np 4 -hosts csews1,csews10 ./nn-2 1000000 ; done

0.405743
0.423926
0.410813
0.420823
0.430066
Timing Option 1 vs. Option 2

Timing NN

P2P Blocking – Performance Bottleneck

• MPI_Send (buf, count, datatype, dest, tag, comm)

• MPI_Recv (buf, count, datatype, source, tag, comm, status)

Rank 0: MPI_Send (1)
Rank 1: MPI_Recv (0)
Safe but may delay sender
Computation Communication Overlap

[Figure: timeline for ranks 0 and 1. Rank 0 computes between its Send and the matching Wait; rank 1 computes before and after its Recv.]
Non-blocking Point-to-Point

• MPI_Isend (buf, count, datatype, dest, tag, comm, request)

• MPI_Irecv (buf, count, datatype, source, tag, comm, request)

• MPI_Wait (request, status)

• MPI_Waitall (count, array_of_requests, array_of_statuses)

Many-to-one Non-blocking P2P

Output

Non-blocking Performance
• The standard does not require communication and computation to overlap
• An implementation may use a thread to move data in parallel
• An implementation can delay the start of the data transfer until “Wait”
• MPI_Test – non-blocking, tests completion, starts progress
• MPIR_CVAR_ASYNC_PROGRESS (MPICH control variable for asynchronous progress)

Asynchronous Communication Progress

Non-blocking Point-to-Point Safety
• MPI_Isend (buf, count, datatype, dest, tag, comm, request)
• MPI_Irecv (buf, count, datatype, source, tag, comm, request)
• MPI_Wait (request, status)

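Rank 0: MPI_Isend, then MPI_Recv
Rank 1: MPI_Isend, then MPI_Recv
Safe: MPI_Isend returns without waiting for the matching receive, so both ranks reach MPI_Recv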
Homework: NN 1D using Non-blocking

[Figure: processes with ranks 0 to P-1 arranged in a line]
Process Mapping/Allocation

[Figure: ranks 0 to 11 shown as a line (0, 1, ..., 10, 11) and as a grid with rows 0-3, 4-7 and 8-11]
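
If the figure is read as twelve ranks laid out row by row on a 3 x 4 grid (an interpretation of the surviving labels, not something the slide states), the correspondence between a rank and its grid position can be sketched as below. The names `grid_row`, `grid_col` and `grid_rank` are illustrative and not part of MPI. In an MPI program the Cartesian topology routines (`MPI_Cart_create`, `MPI_Cart_coords`) provide this kind of bookkeeping.

```c
/* Row-major layout assumed: ranks 0..cols-1 fill row 0,
 * ranks cols..2*cols-1 fill row 1, and so on. */
int grid_row(int rank, int cols)
{
    return rank / cols;
}

int grid_col(int rank, int cols)
{
    return rank % cols;
}

/* Inverse mapping: grid position back to rank. */
int grid_rank(int row, int col, int cols)
{
    return row * cols + col;
}
```

With four columns, rank 7 sits at row 1, column 3, and rank 8 starts row 2. Under this layout, left and right neighbors differ by 1 in rank, while up and down neighbors differ by the number of columns.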

Attributes of Interconnects

• Topology
• Diameter
• Cost
• Anything else?
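
As a small worked illustration of the "Diameter" attribute: the diameter of a network is the largest number of hops on a shortest path between any two nodes. The sketch below computes it for two common topologies, chosen here only as examples (the slide does not name specific topologies). The function names are hypothetical.

```c
/* Diameter = largest hop count on a shortest path between any two nodes. */

/* Bidirectional ring of n nodes: the farthest node is halfway around. */
int ring_diameter(int n)
{
    return n / 2;
}

/* 2D mesh without wraparound links: corner-to-opposite-corner distance. */
int mesh2d_diameter(int rows, int cols)
{
    return (rows - 1) + (cols - 1);
}
```

For example, a ring of 8 nodes has diameter 4, and a 3 x 4 mesh has diameter 5.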
