Distributed Memory Programming with MPI

Dilum Bandara · 49 slides · Jan 08, 2024

About This Presentation

Introduction to distributed memory programming with MPI, with C examples of MPI functions, the Single-Program Multiple-Data (SPMD) model, and collective communication.


Slide Content

Distributed Memory Programming with MPI. Slides extended from An Introduction to Parallel Programming by Peter Pacheco. Dilum Bandara, [email protected]. (Slide content © 2010, Elsevier Inc. All rights reserved.)

Distributed Memory Systems. We discuss developing programs for these systems using MPI. MPI – Message Passing Interface – a set of library functions that can be called from C, C++, & Fortran.

Why MPI? Standardized & portable message-passing system. One of the oldest libraries, with widespread adoption. Minimal requirements on underlying hardware. Explicit parallelization: achieves high performance & scales to a large no. of processors, but is intellectually demanding.

Our First MPI Program
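The code figure for this slide is not included in this extract. Below is a minimal C sketch of an MPI hello-world consistent with the compilation and output shown on the next slides; names such as greeting and MAX_STRING are illustrative, not necessarily those on the original slide.

    #include <stdio.h>
    #include <string.h>
    #include <mpi.h>

    const int MAX_STRING = 100;

    int main(void) {
        char greeting[MAX_STRING];
        int comm_sz;   /* number of processes */
        int my_rank;   /* my process rank     */

        MPI_Init(NULL, NULL);
        MPI_Comm_size(MPI_COMM_WORLD, &comm_sz);
        MPI_Comm_rank(MPI_COMM_WORLD, &my_rank);

        if (my_rank != 0) {
            /* every non-zero rank sends a greeting to process 0 */
            sprintf(greeting, "Greetings from process %d of %d!", my_rank, comm_sz);
            MPI_Send(greeting, strlen(greeting) + 1, MPI_CHAR, 0, 0, MPI_COMM_WORLD);
        } else {
            /* process 0 prints its own greeting, then receives & prints the others */
            printf("Greetings from process %d of %d!\n", my_rank, comm_sz);
            for (int q = 1; q < comm_sz; q++) {
                MPI_Recv(greeting, MAX_STRING, MPI_CHAR, q, 0,
                         MPI_COMM_WORLD, MPI_STATUS_IGNORE);
                printf("%s\n", greeting);
            }
        }

        MPI_Finalize();
        return 0;
    }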

Compilation: mpicc -g -Wall -o mpi_hello mpi_hello.c. mpicc is a wrapper script that invokes the compiler; -g produces debugging information; -Wall turns on all warnings; -o mpi_hello creates an executable with this file name (as opposed to the default a.out); mpi_hello.c is the source file.

Execution: mpiexec -n <no of processes> <executable>. For example, mpiexec -n 1 ./mpi_hello runs with 1 process, and mpiexec -n 4 ./mpi_hello runs with 4 processes.

Execution (Cont.). mpiexec -n 1 ./mpi_hello prints: Greetings from process 0 of 1! With mpiexec -n 4 ./mpi_hello the output is: Greetings from process 0 of 4! Greetings from process 1 of 4! Greetings from process 2 of 4! Greetings from process 3 of 4!

MPI Programs. Need to include the mpi.h header file. Identifiers defined by MPI start with “MPI_”; the first letter following the underscore is uppercase for function names & MPI-defined types, which helps to avoid confusion.

6 Golden MPI Functions
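The table on this slide is not reproduced in this extract. Judging from the following slides, the six functions are MPI_Init, MPI_Finalize, MPI_Comm_size, MPI_Comm_rank, MPI_Send, & MPI_Recv. Their C prototypes are roughly as follows (parameter names are descriptive only):

    int MPI_Init(int *argc_p, char ***argv_p);
    int MPI_Finalize(void);
    int MPI_Comm_size(MPI_Comm comm, int *comm_sz_p);
    int MPI_Comm_rank(MPI_Comm comm, int *my_rank_p);
    int MPI_Send(void *msg_buf_p, int msg_size, MPI_Datatype msg_type,
                 int dest, int tag, MPI_Comm communicator);
    int MPI_Recv(void *msg_buf_p, int buf_size, MPI_Datatype buf_type,
                 int source, int tag, MPI_Comm communicator, MPI_Status *status_p);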

MPI Components. MPI_Init tells MPI to do all necessary setup, e.g., allocate storage for message buffers & decide the rank of each process. argc_p & argv_p are pointers to the argc & argv arguments of main( ). The function returns an error code.

MPI Components (Cont.). MPI_Finalize tells MPI we’re done, so it can clean up anything allocated for this program.

Communicators. A communicator is a collection of processes that can send messages to each other; messages from other communicators are ignored. MPI_Init defines a communicator that consists of all processes created when the program is started, called MPI_COMM_WORLD.

Communicators (Cont.). MPI_Comm_size returns the no. of processes in the communicator; MPI_Comm_rank returns the rank of the process making the call (my rank).

Single-Program Multiple-Data (SPMD). We compile one program. Process 0 does something different: it receives messages & prints them, while the other processes do the work. An if-else construct makes our program SPMD. We can run this program on any no. of processes, e.g., 4, 8, 32, 1000, …

Communication (MPI_Send). msg_buf_p, msg_size, & msg_type determine the content of the message. dest – the destination process’s rank. tag – used to distinguish messages that are identical in content.

Data Types
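The table of predefined MPI datatypes on this slide is not reproduced in this extract. For reference, the standard C-related ones (MPI datatype, corresponding C type) include:

    MPI_CHAR            signed char
    MPI_SHORT           signed short int
    MPI_INT             signed int
    MPI_LONG            signed long int
    MPI_LONG_LONG       signed long long int
    MPI_UNSIGNED_CHAR   unsigned char
    MPI_UNSIGNED_SHORT  unsigned short int
    MPI_UNSIGNED        unsigned int
    MPI_UNSIGNED_LONG   unsigned long int
    MPI_FLOAT           float
    MPI_DOUBLE          double
    MPI_LONG_DOUBLE     long double
    MPI_BYTE, MPI_PACKED  (no single corresponding C type)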

Communication (Cont.) (MPI_Recv). A receiver can pass MPI_ANY_SOURCE as the source argument to accept messages from any sender, in their order of arrival.

Message Matching. A message sent by process q (MPI_Send with dest = r) is received by process r (MPI_Recv with src = q) when the communicator & tag arguments also match.

Receiving Messages. A receiver can get a message without knowing the amount of data in the message, the sender of the message, or the tag of the message. How can those be found out?

How Much Data am I Receiving?
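The code on this slide is not reproduced here. The usual answer is the MPI_Status object filled in by MPI_Recv, together with MPI_Get_count, as in this sketch (recv_and_inspect and its variable names are illustrative):

    #include <stdio.h>
    #include <mpi.h>

    /* Receive up to max_count doubles from any sender; report what actually arrived. */
    void recv_and_inspect(double *recv_buf, int max_count) {
        MPI_Status status;
        int count;

        MPI_Recv(recv_buf, max_count, MPI_DOUBLE, MPI_ANY_SOURCE, MPI_ANY_TAG,
                 MPI_COMM_WORLD, &status);

        MPI_Get_count(&status, MPI_DOUBLE, &count);   /* amount of data in message */
        int sender = status.MPI_SOURCE;               /* sender of message         */
        int tag    = status.MPI_TAG;                  /* tag of message            */

        printf("received %d doubles from process %d with tag %d\n", count, sender, tag);
    }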

Issues With Send & Receive. Exact behavior is determined by the MPI implementation: MPI_Send may behave differently with regard to buffer size, cutoffs, & blocking. Typically there is a cutoff: if the message size is less than the cutoff, the message is buffered; if it is greater than or equal to the cutoff, MPI_Send will block. MPI_Recv always blocks until a matching message is received. MPI preserves message ordering from a given sender. Know your implementation; don’t make assumptions!

Trapezoidal Rule

Trapezoidal Rule (Cont.)
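The figures for these two slides are not reproduced in this extract. The underlying formula is the standard composite trapezoidal rule for approximating an integral with n trapezoids of width h:

    \[
    h = \frac{b-a}{n}, \qquad x_i = a + i\,h, \qquad
    \int_a^b f(x)\,dx \;\approx\; h\left[\frac{f(x_0)}{2} + f(x_1) + f(x_2) + \cdots + f(x_{n-1}) + \frac{f(x_n)}{2}\right].
    \]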

Serial Pseudo-code
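The pseudo-code itself is missing from this extract; a serial C sketch of the computation (function and variable names are illustrative) might look like:

    double f(double x) { return x * x; }   /* example integrand */

    /* Approximate the integral of f over [a, b] using n trapezoids of width h. */
    double Trap(double a, double b, int n, double h) {
        double approx = (f(a) + f(b)) / 2.0;
        for (int i = 1; i <= n - 1; i++)
            approx += f(a + i * h);
        return approx * h;
    }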

Parallel Pseudo-Code

Tasks & Communications for Trapezoidal Rule

First Version

First Version (Cont.)

First Version (Cont.)
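The code on the three “First Version” slides is not reproduced in this extract. A sketch of the overall structure, pieced together from the surrounding slides (names such as Trap, local_int, and the hard-coded a, b, n are illustrative), is roughly:

    #include <stdio.h>
    #include <mpi.h>

    double f(double x) { return x * x; }   /* example integrand */

    double Trap(double a, double b, int n, double h) {   /* serial rule on a sub-interval */
        double approx = (f(a) + f(b)) / 2.0;
        for (int i = 1; i <= n - 1; i++)
            approx += f(a + i * h);
        return approx * h;
    }

    int main(void) {
        int my_rank, comm_sz;
        int n = 1024;                       /* a, b, n hard-coded here for simplicity */
        double a = 0.0, b = 3.0;

        MPI_Init(NULL, NULL);
        MPI_Comm_rank(MPI_COMM_WORLD, &my_rank);
        MPI_Comm_size(MPI_COMM_WORLD, &comm_sz);

        double h = (b - a) / n;             /* width of each trapezoid                */
        int local_n = n / comm_sz;          /* trapezoids per process (assumes comm_sz divides n) */
        double local_a = a + my_rank * local_n * h;   /* this process's sub-interval  */
        double local_b = local_a + local_n * h;
        double local_int = Trap(local_a, local_b, local_n, h);

        if (my_rank != 0) {
            MPI_Send(&local_int, 1, MPI_DOUBLE, 0, 0, MPI_COMM_WORLD);
        } else {
            double total_int = local_int;
            for (int source = 1; source < comm_sz; source++) {
                MPI_Recv(&local_int, 1, MPI_DOUBLE, source, 0,
                         MPI_COMM_WORLD, MPI_STATUS_IGNORE);
                total_int += local_int;
            }
            printf("With n = %d trapezoids, the estimate of the integral from %f to %f is %.15e\n",
                   n, a, b, total_int);
        }

        MPI_Finalize();
        return 0;
    }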

Collective Communication

Collective Communication. A tree-structured global sum.

Alternative Tree-Structured Global Sum. Which is optimal? Can we do better?

MPI_Reduce
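The slide’s code is not shown in this extract. MPI_Reduce’s C prototype and a typical global-sum call (variable names are illustrative) are roughly:

    int MPI_Reduce(void *input_data_p, void *output_data_p, int count,
                   MPI_Datatype datatype, MPI_Op op,
                   int dest_process, MPI_Comm comm);

    /* e.g., add every process's local_int into total_int on process 0 */
    MPI_Reduce(&local_int, &total_int, 1, MPI_DOUBLE, MPI_SUM, 0, MPI_COMM_WORLD);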

Predefined Reduction Operators
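The table of operators on this slide is not reproduced in this extract. For reference, the reduction operators predefined by MPI include:

    MPI_MAX     maximum               MPI_LOR     logical or
    MPI_MIN     minimum               MPI_BOR     bitwise or
    MPI_SUM     sum                   MPI_LXOR    logical exclusive or
    MPI_PROD    product               MPI_BXOR    bitwise exclusive or
    MPI_LAND    logical and           MPI_MAXLOC  maximum & location of maximum
    MPI_BAND    bitwise and           MPI_MINLOC  minimum & location of minimum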

Collective vs. Point-to-Point Communications. All processes in the communicator must call the same collective function. e.g., a program that attempts to match a call to MPI_Reduce on one process with a call to MPI_Recv on another process is erroneous; the program will hang or crash. The arguments passed by each process to an MPI collective communication must be “compatible”. e.g., if one process passes in 0 as dest_process & another passes in 1, then the outcome of a call to MPI_Reduce is erroneous; the program is likely to hang or crash.

Collective vs. P-to-P Communications (Cont.). The output_data_p argument is only used on dest_process; however, all of the processes still need to pass in an actual argument corresponding to output_data_p, even if it’s just NULL. Point-to-point communications are matched on the basis of tags & communicators. Collective communications don’t use tags; they are matched solely on the basis of the communicator & the order in which they’re called.

MPI_Allreduce. Useful when all processes need the result of a global sum to complete some larger computation.

MPI_Allreduce (Cont.). A global sum followed by distribution of the result.
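For reference, MPI_Allreduce’s C prototype is like MPI_Reduce’s but with no dest_process argument, since every process receives the result; roughly:

    int MPI_Allreduce(void *input_data_p, void *output_data_p, int count,
                      MPI_Datatype datatype, MPI_Op op, MPI_Comm comm);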

Butterfly-Structured Global Sum. Processes exchange partial results.

Broadcast. Data belonging to a single process is sent to all of the processes in the communicator.
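The slide’s code is not included in this extract. MPI_Bcast’s C prototype and a typical use (broadcasting a problem size n from process 0, as an illustrative example) are roughly:

    int MPI_Bcast(void *data_p, int count, MPI_Datatype datatype,
                  int source_proc, MPI_Comm comm);

    /* every process calls this; afterwards all processes have process 0's value of n */
    MPI_Bcast(&n, 1, MPI_INT, 0, MPI_COMM_WORLD);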

Tree-Structured Broadcast

Data Distributions – Compute a Vector Sum. Serial implementation.
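The serial code is not reproduced here; a sketch (names are illustrative):

    /* Serial vector addition: x = y + z, all vectors of length n */
    void Vector_sum(double x[], double y[], double z[], int n) {
        for (int i = 0; i < n; i++)
            x[i] = y[i] + z[i];
    }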

Partitioning Options. Block partitioning – assign blocks of consecutive components to each process. Cyclic partitioning – assign components in a round-robin fashion. Block-cyclic partitioning – use a cyclic distribution of blocks of components.

Parallel Implementation
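The parallel code is missing from this extract. With block partitioning, each process simply adds its own local pieces, e.g. (a sketch, names illustrative):

    /* Each process adds its local_n components: local_x = local_y + local_z */
    void Parallel_vector_sum(double local_x[], double local_y[],
                             double local_z[], int local_n) {
        for (int local_i = 0; local_i < local_n; local_i++)
            local_x[local_i] = local_y[local_i] + local_z[local_i];
    }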

MPI_Scatter. Can be used in a function that reads in an entire vector on process 0 but sends only the needed components to each of the other processes.
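MPI_Scatter’s C prototype and a block-distribution call (variable names such as a, local_a, & local_n are illustrative) are roughly:

    int MPI_Scatter(void *send_buf_p, int send_count, MPI_Datatype send_type,
                    void *recv_buf_p, int recv_count, MPI_Datatype recv_type,
                    int src_proc, MPI_Comm comm);

    /* process 0 holds the whole vector a; every process receives its block of local_n elements */
    MPI_Scatter(a, local_n, MPI_DOUBLE,
                local_a, local_n, MPI_DOUBLE, 0, MPI_COMM_WORLD);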

Reading & Distributing a Vector

MPI_Gather. Collects all of the components of a vector onto process 0; process 0 can then process all of the components.
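MPI_Gather’s C prototype (recv_count is the no. of elements received from each process) is roughly:

    int MPI_Gather(void *send_buf_p, int send_count, MPI_Datatype send_type,
                   void *recv_buf_p, int recv_count, MPI_Datatype recv_type,
                   int dest_proc, MPI_Comm comm);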

MPI_Allgather. Concatenates the contents of each process’s send_buf_p & stores the result in each process’s recv_buf_p. recv_count is the amount of data being received from each process.
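Its C prototype (note there is no dest_proc, since every process receives the concatenated result) is roughly:

    int MPI_Allgather(void *send_buf_p, int send_count, MPI_Datatype send_type,
                      void *recv_buf_p, int recv_count, MPI_Datatype recv_type,
                      MPI_Comm comm);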

Summary. Source: https://computing.llnl.gov/tutorials/mpi/