Titlebook: Compiling Parallel Loops for High Performance Computers; Partitioning, Data A David E. Hudak,Santosh G. Abraham Book 1993 Springer Science+

Posted 2025-3-21 16:05:57
Title: Compiling Parallel Loops for High Performance Computers
Subtitle: Partitioning, Data A
Editors: David E. Hudak, Santosh G. Abraham
Series: The Springer International Series in Engineering and Computer Science
Description (excerpt from the table of contents and list of figures):

4.2 Code Segments
4.3 Determining Communication Parameters
4.4 Multicast Communication Overhead
4.5 Partitioning
4.6 Experimental Results
4.7 Conclusion

5 COLLECTIVE PARTITIONING AND REMAPPING FOR MULTIPLE LOOP NESTS
5.1 Introduction
5.2 Program Enclosure Trees
5.3 The CPR Algorithm
5.4 Experimental Results
5.5 Conclusion

BIBLIOGRAPHY
INDEX

LIST OF FIGURES
1.1 The Butterfly Architecture
1.2 Example of an iterative data-parallel loop
1.3 Contiguous tiling and assignment of an iteration space
2.1 Communication along a line segment
2.2 Access pattern for the access offset (3,2)
2.3 Decomposing an access vector along an orthogonal basis set of vectors
2.4 An analysis of communication patterns
2.5 Decomposing a vector along two separate basis sets of vectors
2.6 Cache lines aligning with borders
2.7 Cache lines not aligned with borders
2.8 nh is the difference of nd and nb
2.9 nh is the sum o
Publication date: Book 1993
Keywords: Contig; Excel; algorithms; architecture; architectures; boundary element method; computer; design; function;
Edition: 1
DOI: https://doi.org/10.1007/978-1-4615-3164-7
ISBN (softcover): 978-1-4613-6386-6
ISBN (eBook): 978-1-4615-3164-7
Series ISSN: 0893-3405
Copyright: Springer Science+Business Media New York 1993

Posted 2025-3-22 04:56:34
n multiprocessor) rely on hundreds of commercially available microprocessors to provide computing power in a cost-effective manner. Microprocessor architectures and implementations are becoming increasingly sophisticated, e.g., the Alpha microprocessor introduced by Digital Equipment Corporation ope
Posted 2025-3-22 09:45:15
tain types of parallel loops. Static program partitioning is attractive because the program partition is specified during the compilation phase, thereby eliminating one source of run-time overhead. Furthermore, static partitionings reduce communication overhead relative to dynamic schemes that attempt
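To make the idea of a static partition concrete, here is a minimal sketch in Python. It is not taken from the book: the function name `block_partition` and the choice of contiguous, near-equal blocks are assumptions made for illustration. The point is only that every processor's iteration range is a pure function of the loop bounds and processor count, so it can be fixed before the loop runs and needs no run-time scheduling.

```python
def block_partition(n_iters, n_procs):
    """Split iterations 0..n_iters-1 into n_procs contiguous half-open ranges.

    Hypothetical helper: block sizes differ by at most one iteration.
    """
    base, extra = divmod(n_iters, n_procs)
    ranges = []
    start = 0
    for p in range(n_procs):
        # The first `extra` processors each take one leftover iteration.
        size = base + (1 if p < extra else 0)
        ranges.append((start, start + size))
        start += size
    return ranges


if __name__ == "__main__":
    # Each processor p would execute iterations ranges[p][0] .. ranges[p][1]-1.
    for p, (lo, hi) in enumerate(block_partition(10, 3)):
        print(f"processor {p}: iterations [{lo}, {hi})")
```

A dynamic scheme would instead hand out iterations while the loop executes, which is the run-time overhead the excerpt says static partitioning avoids.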
Posted 2025-3-22 17:37:11
 our analyses to code featuring iterative data-parallel loops and matrix data sets, we have observed several frequently occurring computation structures and communication characteristics. Applications that exhibit various computation structures and communication characteristics are given in Fig. 4.1
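For readers unfamiliar with the term, the following is a rough sketch of what an iterative data-parallel loop over a matrix can look like. It is an assumed example (a four-neighbour averaging sweep) and not one of the applications from the book; the name `jacobi_sweeps` is hypothetical. The outer loop is sequential, while every (i, j) update inside one sweep reads only the previous grid and could therefore run in parallel.

```python
def jacobi_sweeps(grid, n_sweeps):
    """Repeatedly replace each interior point with the mean of its 4 neighbours."""
    rows, cols = len(grid), len(grid[0])
    for _ in range(n_sweeps):              # outer loop: sequential iterations
        new = [row[:] for row in grid]     # boundary values are carried over
        for i in range(1, rows - 1):       # inner loops: data-parallel, since
            for j in range(1, cols - 1):   # each update reads only the old grid
                new[i][j] = 0.25 * (grid[i - 1][j] + grid[i + 1][j]
                                    + grid[i][j - 1] + grid[i][j + 1])
        grid = new
    return grid


if __name__ == "__main__":
    start = [[1.0, 1.0, 1.0],
             [1.0, 0.0, 1.0],
             [1.0, 1.0, 1.0]]
    print(jacobi_sweeps(start, 1))
```

The fixed neighbour offsets (here ±1 in each dimension) are what determine which elements a processor needs from its neighbours once the iteration space is partitioned.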
Posted 2025-3-23 00:16:55
ray access methods and loop structures. These methods have been restricted to an analysis of a set of nested data-parallel loops updating a single global array. In order for these methods to be applicable to a wide range of parallel applications, they must be capable of optimizing multiple data-pa
Posted 2025-3-23 08:48:01
D.91] and Stanford DASH multiprocessor systems [LLG.90] have non-uniform access latencies. The increased latency and reduced bandwidth of global memory have a substantial impact on performance. Restructuring of programs can reduce the number of global memory accesses and dramatically improve performance.
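As a rough illustration of why restructuring matters, the sketch below counts how many reads fall outside a processor's own rectangular tile for a given set of access offsets. This is a simplified model assumed for this post, not the book's analysis: it ignores the edges of the global array, counts reads rather than distinct elements, and the name `nonlocal_reads` is hypothetical.

```python
def nonlocal_reads(height, width, offsets):
    """Count reads that land outside a height x width tile.

    For each offset (a, b), iteration (i, j) reads element (i + a, j + b).
    The read stays local only where the tile overlaps its own shifted copy.
    """
    total = 0
    for a, b in offsets:
        overlap = max(height - abs(a), 0) * max(width - abs(b), 0)
        total += height * width - overlap
    return total


if __name__ == "__main__":
    four_point = [(1, 0), (-1, 0), (0, 1), (0, -1)]
    # Two ways to give one processor 64 iterations:
    print("1 x 64 strip :", nonlocal_reads(1, 64, four_point))
    print("8 x 8 square :", nonlocal_reads(8, 8, four_point))
```

Under this model, a 64-iteration strip of shape 1 x 64 makes 130 non-local reads per sweep, while an 8 x 8 square makes 32, so the shape of the partition alone changes the amount of remote traffic for the same amount of work.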