皱痕 发表于 2025-3-27 00:34:40
Conference proceedings 2013orithms; parallel and distributed programming; parallel numerical algorithms; multicore and manycore programming; theory and algorithms for parallel computation; high performance networks and communication; high performance and scientific applications; GPU and accelerator computing; and extreme-scale computing.悄悄移动 发表于 2025-3-27 02:42:30
https://doi.org/10.1007/978-3-322-88642-2sor parts get more and more power hungry and electricity gets more and more expensive, “energy to solution” is a new optimization criterion for large applications. This calls for energy aware solutions.积云 发表于 2025-3-27 07:48:38
https://doi.org/10.1007/978-3-642-45669-5ronization. Based on such an observation, we proposed an effective method to identify synchronizations. It doesn’t depend on the understanding of source codes or the knowledge of semantics of library routines. A system called SyncTester is developed, and experiments show that SyncTester is effective and useful.货物 发表于 2025-3-27 13:31:44
http://reply.papertrans.cn/32/3166/316529/316529_34.png弄脏 发表于 2025-3-27 16:47:13
http://reply.papertrans.cn/32/3166/316529/316529_35.png有花 发表于 2025-3-27 18:56:12
https://doi.org/10.1007/978-3-322-80970-4ications exhibit a distinct phase behavior. However, we show that while fixing thread affinity for the whole execution may improve performance by up to 30%, allowing dynamic thread pinning may improve performance by up to 40%. Furthermore, we provide an analysis about the required conditions to improve the effectiveness of the approach.JECT 发表于 2025-3-28 00:31:11
http://reply.papertrans.cn/32/3166/316529/316529_37.pngNeutropenia 发表于 2025-3-28 03:09:38
https://doi.org/10.1007/978-3-8349-6581-3emented and evaluated on 4 different applications executed in production conditions on the European Grid Infrastructure. Results show that our technique reduces slowdown variability by 3 to 7 compared to first-come-first-served.Enthralling 发表于 2025-3-28 06:32:43
,Standardisierte Geschäftsdatenformate, among commutative and non-commutative write transactions and then schedules them according to the execution state. We assess the goodness of the approach by an extensive evaluation of a fully implementation of CRF. The tests reveal that CRF improves throughput over a state-of-the-art DTM solution.征兵 发表于 2025-3-28 12:34:09
https://doi.org/10.1007/978-3-8349-8673-3varying granularity by tracking task demand..Benchmarking on a set of task parallel programs using a work-stealing scheduler demonstrates that our approach is generally effective. For fine-grained tasks, we can achieve reductions in execution time exceeding a factor of 6, compared to state-of-the-art implementations.