
doi: 10.1002/nla.1808
handle: 10067/961930151162165141
SUMMARYThe basic building blocks of a classic multigrid algorithm, which are essentially stencil computations, all have a low ratio of executed floating point operations per byte fetched from memory. This important ratio can be identified as the arithmetic intensity. Applications with a low arithmetic intensity are typically bounded by memory traffic and achieve only a small percentage of the theoretical peak performance of the underlying hardware. We propose a polynomial Chebyshev smoother, which we implement using cache‐aware tiling, to increase the arithmetic intensity of a multigrid V‐cycle. This tiling approach involves a trade‐off between redundant computations and cache misses. Unlike common conception, we observe optimal performance for higher degrees of the smoother. The higher‐degree polynomial Chebyshev smoother can be used to smooth more than just the upper half of the error frequencies, leading to better V‐cycle convergence rates. Smoothing more than the upper half of the error spectrum allows a more aggressive coarsening approach where some levels in the multigrid hierarchy are skipped. Copyright © 2012 John Wiley & Sons, Ltd.
Multigrid methods; domain decomposition for boundary value problems involving PDEs, algorithm, convergence, bound by memory bandwidth, V-cycle, Chebyshev iteration, Stability and convergence of numerical methods for boundary value problems involving PDEs, multigrid, Mathematics, arithmetic intensity
Multigrid methods; domain decomposition for boundary value problems involving PDEs, algorithm, convergence, bound by memory bandwidth, V-cycle, Chebyshev iteration, Stability and convergence of numerical methods for boundary value problems involving PDEs, multigrid, Mathematics, arithmetic intensity
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 22 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
