3.6
Component Concurrency: Coarse-Grained Parallelism for Earth System Models
Recently, the Modeling Services group undertook an even more ambitious, science-based project in which the chosen approach had the benefit of preparing the model infrastructure for current and future hardware paradigms. This effort, focused on coupling algorithms, rather than programming models, aimed at increasing coarse-grained concurrency at the granularity of model components. This allows each component to use timestepping based on its intrinsic timescales.
An initial effort focused on the atmospheric radiative transfer component. The scientific goal was to decrease the timescales for which the GFDL atmospheric radiation model was calculated, from some fixed time period to every atmospheric tme step, without impacting the SYPD performance of the model. The solution was to re-architect the flow of computation to enhance component level concurrency but doing so in a way that exploits the tightly-coupled, many-core architectures that are finally coming to fruition. Using OpenMP at a high level, the dynamics, radiation, and atmospheric physics can be run concurrently in dedicated shared-memory threads (MIMD) and all data exchange occurs via memory copies. The radiation and atmospheric physics are column-independent processes which can be further blocked and parallelized using a nested, second level of OpenMP (SIMD). The dynamics contain both column and slab computation and the nested OpenMP occurs differently for each and at a finer level. Performance data will be presented demonstrating the effectiveness of the approach and efficacy for planned self-booting, many-core systems. We will also show that despite changes to the coupling algorithms, the new systems produces valid climates running in AMIP mode.