审问 发表于 2025-3-25 06:58:59
Performance Evaluation of the Intel Sandy Bridge Based NASA Pleiades Using Scientific and Engineerinnd compare it with the previous third generation Nehalem architecture. Several architectural features have been incorporated in Sandy Bridge: (a) four memory channels as opposed to three in Nehalem; (b) memory speed increased from 1333 MHz to 1600 MHz; (c) ring to connect on-chip L3 cache with cores弯曲道理 发表于 2025-3-25 09:35:15
http://reply.papertrans.cn/43/4264/426327/426327_22.pngNOVA 发表于 2025-3-25 13:28:36
Analysis of Data Reuse in Task-Parallel Runtimese method called . Reuse Distance (KRD). The metric is a low-overhead alternative designed to analyze data reuse at the socket level while minimizing perturbation to the parallel schedule. Using the KRD metric we show that reuse depends considerably on the system configuration (sockets, cores) and on疏忽 发表于 2025-3-25 16:59:25
http://reply.papertrans.cn/43/4264/426327/426327_24.png明智的人 发表于 2025-3-25 22:42:25
http://reply.papertrans.cn/43/4264/426327/426327_25.png极为愤怒 发表于 2025-3-26 00:08:12
Performance Modeling of Gyrokinetic Toroidal Simulations for a Many-Tasking Runtime Systemiency and scalability for many algorithms in scientific computing. One possible solution for improving efficiency and scalability in applications on this class of machines is the use of a many-tasking runtime system employing many lightweight, concurrent threads. Yet a priori estimation of the poten菊花 发表于 2025-3-26 05:46:28
Toward Better Simulation of MPI Applications on Ethernet/TCP Networksxt-generation exascale systems, and correctly modeling network behavior is essential for creating realistic simulations. In this article we describe an implementation of a flow-based hybrid network model that accounts for factors such as network topology and contention, which are commonly ignored byBET 发表于 2025-3-26 10:17:49
SESH Framework: A Space Exploration Framework for GPU Application and Hardware Codesignptive architecture and a variety of optimization options, it is often desirable to understand the dynamics between . application transformations and . hardware features when designing future GPUs for scientific workloads. However, current codesign efforts have been limited to manual investigation ofAsperity 发表于 2025-3-26 15:08:23
Optimal Checkpointing Period: Time vs. Energyde a model and detailed formulas for total execution time and consumed energy. We characterize the optimal period for both objectives, and we assess the range of time/energy trade-offs to be made by instantiating the model with a set of realistic scenarios for Exascale systems. We give a particularhegemony 发表于 2025-3-26 17:23:17
http://reply.papertrans.cn/43/4264/426327/426327_30.png