隐士 posted on 2025-3-25 04:25:02

http://reply.papertrans.cn/32/3166/316542/316542_21.png

使闭塞 posted on 2025-3-25 08:39:51

https://doi.org/10.1007/978-3-322-86645-5
…o successfully reduce the number of hardware counters needed to characterize a parallel region, and that this set of counters can be measured at run time with high accuracy and low overhead using counter multiplexing.

nonplus posted on 2025-3-25 15:24:22

Werner R. Müller, Thomas M. Schwarb
…schedule on one cluster and then distributing it onto the other clusters might come in handy in practical approaches. We demonstrate this by presenting a practical algorithm with running time […], without hidden constants, that is an approximation algorithm with ratio 9/4 if the number […] of clusters is divisible by 3 and bounded by […] otherwise.

expository posted on 2025-3-25 19:02:34

http://reply.papertrans.cn/32/3166/316542/316542_24.png

密切关系 posted on 2025-3-25 23:32:37

Aufzüge mit stetig umlaufendem Zugmittel
…es, we are able to exclude obviously dominated solutions from the design space before scheduling and synthesis. Compared to a standard multi-criteria optimisation method, our approach shows runtime benefits at the design level.

鬼魂 posted on 2025-3-26 04:01:30

Einleitung und Abgrenzung des Themas
…ances. We evaluated the algorithm using data clustering, matrix multiplication, and bioinformatics applications and compared it with existing load-balancing algorithms. PLB-HAC obtained the highest performance gains with more heterogeneous clusters and larger problem sizes, where a more refined load distribution is required.

坚毅 posted on 2025-3-26 05:02:35

Accelerating Data-Dependence Profiling with Static Hints
…ffected source-code locations from instrumentation, allowing the profiler to skip them at runtime and avoiding the associated overhead. At the end, we merge static and dynamic dependences. We evaluated our approach with 38 benchmarks from two benchmark suites and obtained a median reduction in profiling time of 62% across all the benchmarks.

Mawkish posted on 2025-3-26 08:37:35

http://reply.papertrans.cn/32/3166/316542/316542_28.png

Spongy-Bone posted on 2025-3-26 16:24:52

Hardware Counters’ Space Reduction for Code Region Characterization
…o successfully reduce the number of hardware counters needed to characterize a parallel region, and that this set of counters can be measured at run time with high accuracy and low overhead using counter multiplexing.

无聊点好 posted on 2025-3-26 20:41:18

http://reply.papertrans.cn/32/3166/316542/316542_30.png
View full version: Titlebook: Euro-Par 2019: Parallel Processing; 25th International C Ramin Yahyapour Conference proceedings 2019 Springer Nature Switzerland AG 2019 ar