Language-Centric Performance Analysis of OpenMP Programs with Aftermath

Conference object, Part of book or chapter of book, Contribution for newspaper or weekly magazine English OPEN
Drebes, Andi ; Bréjon, Jean-Baptiste ; Pop, Antoniu ; Heydemann, Karine ; Cohen, Albert (2016)
  • Publisher: HAL CCSD
  • Related identifiers: doi: 10.1007/978-3-319-45550-1_17
  • Subject: Tracing | [INFO.INFO-PL] Computer Science [cs]/Programming Languages [cs.PL] | [ INFO ] Computer Science [cs] | OpenMP | Performance Analysis

International audience; We present a new set of tools for the language-centric performance analysis and debugging of OpenMP programs that allows programmers to relate dynamic information from parallel execution to OpenMP constructs. Users can visualize execution traces, examine aggregate met-rics on parallel loops and tasks, such as load imbalance or synchronization overhead, and obtain detailed information on specific events, such as the partitioning of a loop's iteration space, its distribution to workers according to the scheduling policy and fine-grain synchronization. Our work is based on the Aftermath performance analysis tool and a ready-to-use, instrumented version of the LLVM/clang OpenMP run-time with negligible overhead for tracing. By analyzing the performance of the MG application of the NPB suite, we show that language-centric performance analysis in general and our tools in particular can help improve the performance of large-scale OpenMP applications significantly.
  • References (20)
    20 references, page 1 of 2

    1. http://vite.gforge.inria.fr. Accessed 05/2016.

    2. Intel openmp runtime library. https://www.openmprtl.org. Accessed 05/2016.

    3. LLVM OpenMP support. http://openmp.llvm.org. Accessed 05/2016.

    4. Omni compiler project. http://www.hpcs.cs.tsukuba.ac.jp/omnicompiler/download/download-benchmarks.html. Accessed 05/2016.

    5. D. Bailey, E. Barszcz, J. Barton, D. Browning, R. Carter, L. Dagum, R. Fatoohi, S. Fineberg, P. Frederickson, T. Lasinski, R. Schreiber, H. Simon, and V. Venkatakrishnan. The NAS Parallel Benchmarks. Technical report, 1994.

    6. Robert Bell, Allen D Malony, and Sameer Shende. Paraprof: A portable, extensible, and scalable tool for parallel performance profile analysis. In Euro-Par 2003 Par. Processing, pages 17-26. Springer, 2003.

    7. OpenMP Architecture Review Board. OpenMP Application Program Interface Version 4.5, November 2015.

    8. Andi Drebes, Antoniu Pop, Karine Heydemann, and Albert Cohen. Interactive visualization of cross-layer performance anomalies in dynamic task-parallel applications and systems. In 2016 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), April 2016.

    9. Alejandro Duran, Xavier Teruel, Roger Ferrer, Xavier Martorell, and Eduard Ayguade. Barcelona openmp tasks suite: A set of benchmarks targeting the exploitation of task parallelism in openmp. In Proceedings of the 2009 International Conference on Parallel Processing, ICPP '09, pages 124-131, Washington, DC, USA, 2009. IEEE Computer Society.

    10. Alexandre Eichenberger, John Mellor-Crummey, Martin Schulz, Nawal Copty, Jim Cownie, Robert Dietrich, Xu Liu, Eugene Loh, and Daniel Lorenz. OpenMP Technical Report 2 on the OMPT Interface. Technical report, 2014.

  • Metrics
    4
    views in OpenAIRE
    0
    views in local repository
    2
    downloads in local repository

    The information is available from the following content providers:

    From Number Of Views Number Of Downloads
    The University of Manchester - Institutional Repository - IRUS-UK 0 2
Share - Bookmark