Title | Compiling Parallel Loops for High Performance Computers |
Subtitle | Partitioning, Data A |
Editors | David E. Hudak, Santosh G. Abraham |
Series title | The Springer International Series in Engineering and Computer Science |
Description | 4.2 Code Segments; 4.3 Determining Communication Parameters; 4.4 Multicast Communication Overhead; 4.5 Partitioning; 4.6 Experimental Results; 4.7 Conclusion. 5 Collective Partitioning and Remapping for Multiple Loop Nests: 5.1 Introduction; 5.2 Program Enclosure Trees; 5.3 The CPR Algorithm; 5.4 Experimental Results; 5.5 Conclusion. Bibliography. Index. List of Figures: 1.1 The Butterfly Architecture; 1.2 Example of an iterative data-parallel loop; 1.3 Contiguous tiling and assignment of an iteration space; 2.1 Communication along a line segment; 2.2 Access pattern for the access offset (3,2); 2.3 Decomposing an access vector along an orthogonal basis set of vectors; 2.4 An analysis of communication patterns; 2.5 Decomposing a vector along two separate basis sets of vectors; 2.6 Cache lines aligning with borders; 2.7 Cache lines not aligned with borders; 2.8 nh is the difference of nd and nb; 2.9 nh is the sum o |
Publication date | Book 1993 |
Keywords | Contig; Excel; algorithms; architecture; architectures; boundary element method; computer; design; function; |
Edition | 1 |
doi | https://doi.org/10.1007/978-1-4615-3164-7 |
isbn_softcover | 978-1-4613-6386-6 |
isbn_ebook | 978-1-4615-3164-7 |
issn_series | 0893-3405 |
copyright | Springer Science+Business Media New York 1993 |