放气 posted on 2025-3-28 14:52:15

http://reply.papertrans.cn/32/3166/316531/316531_41.png

修饰语 posted on 2025-3-28 22:12:42

http://reply.papertrans.cn/32/3166/316531/316531_42.png

NIP posted on 2025-3-29 01:57:49

MPI Trace Compression Using Event Flow Graphs
…rmance analysis is becoming increasingly difficult due to the growing complexity of scientific codes and the size of machines. Even though many tools have been developed over the past years to help in this task, current approaches either only offer an overview of the application, discarding temporal…

doxazosin posted on 2025-3-29 04:34:29

ScalaJack: Customized Scalable Tracing with In-situ Data Analysis
…easure. We address this problem by combining customized tracing and providing support for in-situ data analysis via ScalaJack, a framework with customizable instrumentation and pluggable extension capabilities for problem-directed instrumentation and in-situ data analysis. We further eliminate cros…

CODA posted on 2025-3-29 07:55:38

Performance Measurement and Analysis of Transactional Memory and Speculative Execution on IBM Blue G…
…le hardware. This in turn makes it increasingly challenging to achieve correct and efficient thread synchronization. To support the programmer in this task, IBM introduced hardware transactional memory (TM) and speculative execution (SE) in their Blue Gene/Q system with its 16-core processor, which…

斜谷 posted on 2025-3-29 13:06:36

http://reply.papertrans.cn/32/3166/316531/316531_46.png

尽责 posted on 2025-3-29 18:56:45

Modeling and Simulation of a Dynamic Task-Based Runtime System for Heterogeneous Multi-core Architec…
…formance of such heterogeneous machines is challenging as it requires carefully offloading computations and managing data movements between the different processing units. The most promising and successful approaches so far rely on task-based runtimes that abstract the machine and rely on opportunisti…

aqueduct posted on 2025-3-29 20:57:48

http://reply.papertrans.cn/32/3166/316531/316531_48.png

宫殿般 posted on 2025-3-30 03:31:19

ParaShares: Finding the Important Basic Blocks in Multithreaded Programs
…combing through program source or thread traces for pathologies including communication overheads, data dependencies, and load imbalances. This work takes a new approach: it ignores any underlying pathologies, and focuses instead on pinpointing the exact locations in source code that consume the lar…

减弱不好 posted on 2025-3-30 04:45:29

Multi-Objective Auto-Tuning with Insieme: Optimization and Trade-Off Analysis for Time, Energy and R…
…st, auto-tuners have been successfully applied to minimize execution time. However, besides execution time, additional optimization goals have recently arisen, such as energy consumption or computing costs. Therefore, more sophisticated methods capable of exploiting and identifying the trade-offs am…
View full version: Titlebook: Euro-Par 2014: Parallel Processing; 20th International C Fernando Silva, Inês Dutra, Vítor Santos Costa Conference proceedings 2014 Springer