凶兆 发表于 2025-3-23 10:07:40
re needed for application developers to seamlessly migrate legacy code from today’s systems to tomorrow’s. Over the past decade and more, directives have been established as one of the promising paths to tackle programmatic challenges on emerging systems. This work focuses on applying and demonstrat特别容易碎 发表于 2025-3-23 15:09:14
http://reply.papertrans.cn/43/4227/422640/422640_12.png凹室 发表于 2025-3-23 19:40:47
Johanna Dorer,Matthias Marschik place on November 20, 2021. The workshop was initially planned to take place in Atlanta, GA, USA, and changed to an online format due to the COVID-19 pandemic..WACCPD is one of the major forums for bringing together users, developers, and the software and tools community to share knowledge and expeMalleable 发表于 2025-3-23 22:53:51
Udo Göttlichonstrated by many projects as well as our previous experience. In this work, OpenACC is leveraged to transform another Computational Fluid Dynamics (CFD) high order solver FINE/FR to be GPU-eligible. On the Summit supercomputer, impressive GPU speedup ranging from 6X to 80X has been achieved using uBronchial-Tubes 发表于 2025-3-24 04:11:03
http://reply.papertrans.cn/43/4227/422640/422640_15.pngamplitude 发表于 2025-3-24 07:37:28
Tanja Maier such as many-core CPUs and accelerators like GPUs. In this work, we implement a widely used block eigensolver, Locally Optimal Block Preconditioned Conjugate Gradient (LOBPCG), using two popular directive based programming models (OpenMP and OpenACC) for GPU-accelerated systems. Our work differs fr易发怒 发表于 2025-3-24 13:55:20
such as many-core CPUs and accelerators like GPUs. In this work, we implement a widely used block eigensolver, Locally Optimal Block Preconditioned Conjugate Gradient (LOBPCG), using two popular directive based programming models (OpenMP and OpenACC) for GPU-accelerated systems. Our work differs frdeciduous 发表于 2025-3-24 17:44:33
Friedrich Krotzchitecture-specific development and tuning. However, portability frameworks depend on compilers for auto-vectorization and may lack support for explicit vectorization on heterogeneous platforms. Alternatively, programmers can use intrinsics-based primitives to achieve more efficient vectorization, b渐强 发表于 2025-3-24 21:39:50
Tanja Thomasof-the-science earth system model development and simulation project and has gained national recognition. It has a large code base with over a million lines of code. How to make effective use of GPUs, however, remains a challenge. In this paper, we use the modal aerosol module (MAM) of E3SM as a driONYM 发表于 2025-3-25 02:12:00
Christoph Jacke,Martin Zieroldof-the-science earth system model development and simulation project and has gained national recognition. It has a large code base with over a million lines of code. How to make effective use of GPUs, however, remains a challenge. In this paper, we use the modal aerosol module (MAM) of E3SM as a dri