当前位置：和泉文库 > 计算机 > 浏览文档

Parallel Algorithms Underlying MPI Implementations

Recursive Halving and Doubling Parallel Algorithm Examples

文件格式：PPT，文件大小：702KB，售价：15元

共54页，可试读18页，点击往前阅读 ↑↑

文档详细内容（约54页）

I Recursive Halving and Doubling p Figure 13.1. Summation in log(N) steps

Recursive Halving and Doubling Figure 13.1. Summation in log(N) steps

I Recursive Halving and Doubling Step 3: Processor 1 then must broadcast this sum to all other processors. This broadcast operation can be done using the same communication structure as the summation but in reverse. You will see pseudocode for this at the end of this section. Note that if the total number of processors is N, then only 2 log(N)(log base 2) steps are needed to complete the operation There is an even more efficient way to finish the job in only log(N) steps. By way of example, look at the next figure containing 8 processors. At each step, processor i and processor i+k send and receive data in a pairwise fashion and then perform the summation k is iterated from 1 through N/2 in powers of 2. If the total number of processors is N, then log(N) steps are needed. As an exercise, you should write out the necessary pseudocode for this example

Recursive Halving and Doubling • Step 3: Processor 1 then must broadcast this sum to all other processors. This broadcast operation can be done using the same communication structure as the summation, but in reverse. You will see pseudocode for this at the end of this section. Note that if the total number of processors is N, then only 2 log(N) (log base 2) steps are needed to complete the operation. • There is an even more efficient way to finish the job in only log(N) steps. By way of example, look at the next figure containing 8 processors. At each step, processor i and processor i+k send and receive data in a pairwise fashion and then perform the summation. k is iterated from 1 through N/2 in powers of 2. If the total number of processors is N, then log(N) steps are needed. As an exercise, you should write out the necessary pseudocode for this example

I Recursive Halving and Doubling pI p2 p3 p4 p5 p6 p7 Qoq9QsQo Figure 13. 2. Summation to all processors in log(N) steps

Recursive Halving and Doubling Figure 13.2. Summation to all processors in log(N) steps

I Recursive Halving and Doubling What about adding vectors? That is, how do you add several vectors component-wise to get a new vector? The answer is, you employ the method discussed earlier in a component-wise fashion This fascinating way to reduce the communications and to avoid abundant summations is described next. this method utilizes the recursive halving and doubling technique and is illustrated in Figure 13.3

Recursive Halving and Doubling • What about adding vectors? That is, how do you add several vectors component-wise to get a new vector? The answer is, you employ the method discussed earlier in a component-wise fashion. This fascinating way to reduce the communications and to avoid abundant summations is described next. This method utilizes the recursive halving and doubling technique and is illustrated in Figure 13.3

I Recursive Halving and Doubling Suppose there are 4 processors and the length of each vector is also 4 Step 1: Processor pO sends the first two components of the vector to rocessor p1, and p1 sends the last two components of the vector to pO. Then po gets the partial sums for the last two components, and p1 gets the partial sums for the first two components. So do p2 and p3 Step 2: Processor po sends the partial sum of the third component to processor p3. Processor p3 then adds to get the total sum of the third component. Similarly, processor 0, 1 and 2 find the total sums of the 4th, 2nd, and 1st components, respectively. Now the sum of the vectors are found and the components are stored in different processors Step 3: Broadcast the result using the reverse of the above communication process

Recursive Halving and Doubling • Suppose there are 4 processors and the length of each vector is also 4. • Step 1: Processor p0 sends the first two components of the vector to processor p1, and p1 sends the last two components of the vector to p0. Then p0 gets the partial sums for the last two components, and p1 gets the partial sums for the first two components. So do p2 and p3. • Step 2: Processor p0 sends the partial sum of the third component to processor p3. Processor p3 then adds to get the total sum of the third component. Similarly, processor 0,1 and 2 find the total sums of the 4th, 2nd, and 1st components, respectively. Now the sum of the vectors are found and the components are stored in different processors. • Step 3: Broadcast the result using the reverse of the above communication process

点击进入文档下载页（PPT格式）

共54页，可试读18页，点击继续阅读 ↓↓

您可能感兴趣的文档

《电子商务技术》课程教学资源（PPT课件讲稿）第五章电子商务安全技术
电子工业出版社：《计算机网络》课程教学资源（第五版，PPT课件讲稿）第六章应用层（谢希仁）
南京大学：《面向对象技术 OOT》课程教学资源（PPT课件讲稿）并发对象 Concurrent Objects
《数据库系统概论 An Introduction to Database System》课程教学资源（PPT课件讲稿）第六讲关系数据理论
《数据结构 Data Structure》课程教学资源（PPT课件讲稿）06 非二叉树 Non-Binary Trees
《数字图像处理》课程教学资源（PPT课件讲稿）第5章图像复原
《C语言程序设计》课程电子教案（PPT课件讲稿）Chapter 02 用C语言编写程序
山西国际商务职业学院：《数据库应用程序设计》课程教学资源（PPT课件）第三章数据与数据运算
《计算机网络》课程教学资源（PPT课件讲稿）第一章计算机网络概述
《大学计算机基础》课程教学资源：作业习题
中国医科大学：《计算机网络实用教程》课程教学资源（PPT讲稿）高速局域网技术、交换式局域网技术、虚拟局域网技术、主要的城域网技术
《TCP/IP协议及其应用》课程教学资源（PPT课件讲稿）第3章 IP寻址与地址解析
中国铁道出版社：《局域网技术与组网工程》课程教学资源（PPT课件讲稿）第5章 Linux网络工程
陕西师范大学：Neural Networks and Fuzzy Systems（PPT讲稿）Chapter 3 NEURONAL DYNAMICS II：ACTIVATION MODELS
《计算机系统安全》课程教学资源（PPT课件讲稿）第六章访问控制 Access Control
中国科学技术大学：《现代密码学理论与实践》课程教学资源（PPT课件讲稿）第2章传统加密技术 Classical Encryption Techniques
《计算机数据恢复技术》课程教学资源（PPT课件讲稿）第1章数据恢复技术概述
北京大学：《高级软件工程》课程教学资源（PPT课件讲稿）第六讲网络环境中的软件质量
《大学生计算机基础》课程教学资源（PPT讲稿）第三章字处理软件（Word 2003）
中国水利水电出版社：《计算机组装与维护实训教程》课程教学资源（PPT课件讲稿，共九章）
上海交通大学：《软件工程 Software Engineering》课程教学资源（PPT课件讲稿）软件开发过程 Software Development Processes
《大型机高级系统管理技术》课程教学资源（PPT课件讲稿）第4章作业控制子系统
《计算机软件及应用》课程教学资源（PPT课件讲稿）第2章 Photoshop CS入门基础
河南中医药大学（河南中医学院）：《计算机文化》课程教学资源（PPT课件讲稿）第二章计算机的前世今生（主讲：许成刚）

点击购买下载（PPT）

下载及服务说明

购买前请先查看本文档预览页，确认内容后再进行支付；
如遇文件无法下载、无法访问或其它任何问题，可发送电子邮件反馈，核实后将进行文件补发或退款等其它相关操作；
邮箱：

文档浏览记录