皱痕 发表于 2025-3-27 00:34:40

Conference proceedings 2013orithms; parallel and distributed programming; parallel numerical algorithms; multicore and manycore programming; theory and algorithms for parallel computation; high performance networks and communication; high performance and scientific applications; GPU and accelerator computing; and extreme-scale computing.

悄悄移动 发表于 2025-3-27 02:42:30

https://doi.org/10.1007/978-3-322-88642-2sor parts get more and more power hungry and electricity gets more and more expensive, “energy to solution” is a new optimization criterion for large applications. This calls for energy aware solutions.

积云 发表于 2025-3-27 07:48:38

https://doi.org/10.1007/978-3-642-45669-5ronization. Based on such an observation, we proposed an effective method to identify synchronizations. It doesn’t depend on the understanding of source codes or the knowledge of semantics of library routines. A system called SyncTester is developed, and experiments show that SyncTester is effective and useful.

货物 发表于 2025-3-27 13:31:44

http://reply.papertrans.cn/32/3166/316529/316529_34.png

弄脏 发表于 2025-3-27 16:47:13

http://reply.papertrans.cn/32/3166/316529/316529_35.png

有花 发表于 2025-3-27 18:56:12

https://doi.org/10.1007/978-3-322-80970-4ications exhibit a distinct phase behavior. However, we show that while fixing thread affinity for the whole execution may improve performance by up to 30%, allowing dynamic thread pinning may improve performance by up to 40%. Furthermore, we provide an analysis about the required conditions to improve the effectiveness of the approach.

JECT 发表于 2025-3-28 00:31:11

http://reply.papertrans.cn/32/3166/316529/316529_37.png

Neutropenia 发表于 2025-3-28 03:09:38

https://doi.org/10.1007/978-3-8349-6581-3emented and evaluated on 4 different applications executed in production conditions on the European Grid Infrastructure. Results show that our technique reduces slowdown variability by 3 to 7 compared to first-come-first-served.

Enthralling 发表于 2025-3-28 06:32:43

,Standardisierte Geschäftsdatenformate, among commutative and non-commutative write transactions and then schedules them according to the execution state. We assess the goodness of the approach by an extensive evaluation of a fully implementation of CRF. The tests reveal that CRF improves throughput over a state-of-the-art DTM solution.

征兵 发表于 2025-3-28 12:34:09

https://doi.org/10.1007/978-3-8349-8673-3varying granularity by tracking task demand..Benchmarking on a set of task parallel programs using a work-stealing scheduler demonstrates that our approach is generally effective. For fine-grained tasks, we can achieve reductions in execution time exceeding a factor of 6, compared to state-of-the-art implementations.
页: 1 2 3 [4] 5 6 7
查看完整版本: Titlebook: Euro-Par 2013: Parallel Processing; 19th International C Felix Wolf,Bernd Mohr,Dieter Mey Conference proceedings 2013 Springer-Verlag Berli