Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Other literature type . 2026
License: CC BY
Data sources: ZENODO
ZENODO
Conference object . 2026
License: CC BY
Data sources: Datacite
ZENODO
Conference object . 2026
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

Performance Made Visible: A Tool-Based Exploration of HPC Applications

Authors: Manda, Krishna Sai Lakshmi Gayatri; Pare, Ayushya; Commer, Michael; Errenst, Martin; Hellmann, Matthias; Hermanns, Marc-André; Mutzel, Petra;

Performance Made Visible: A Tool-Based Exploration of HPC Applications

Abstract

Understanding how scientific applications utilize modern High-Performance Computing (HPC) systems is essential for achieving efficiency and scalability. Yet, performance analysis is often perceived as complex or reserved for HPC specialists. This poster challenges that perception by demonstrating that generating and interpreting performance insights can be both straightforward and highly informative — even without deep system expertise. The focus is not on code optimization itself, but to highlight how accessible and insightful performance exploration has become using modern HPC tools. A representative parallel scientific application is executed under a fixed configuration across multiple tools. This consistent setup provides a fair comparison of how each tool reports performance data and how these perspectives complement each other in revealing resource utilization, scalability, and inefficiencies. Instrumentation and trace generation are performed with Score-P, which captures CUBE runtime summaries and detailed OTF2 trace files with MPI communication or stalls. The traces are visualized in Vampir, providing intuitive timelines of computational regions, synchronization points, and communication patterns. Linaro Forge Performance Reports offer high-level summaries including CPU efficiency, vectorization rate, memory usage, and I/O utilization, presenting a concise overview of runtime efficiency across hardware resources. Complementary low-level profiling is performed using perf and LIKWID, which expose fine-grained architectural details. Metrics such as cache bandwidth, floating-point throughput and branch prediction accuracy help characterize how effectively the application utilizes CPU and memory resources. Meanwhile, ClusterCockpit monitors system-level parameters — CPU frequency, memory usage and power consumption — enabling a real-time overview of node-level behavior and resource distribution across jobs. For kernel level profiling, Intel VTune or NVIDIA Nsight extend this view by capturing kernel execution timelines and data transfer characteristics. Together, these tools form a layered and complementary performance exploration workflow applied to a single, reproducible workload: Linaro Forge Performance Reports — high-level performance overviewScore-P + Cube - high-level hotspot detectionScore-P + Vampir — detailed timeline visualizationperf and LIKWID — architectural insight and counter-based diagnosticsClusterCockpit - live job monitoringNsight or VTune — Kernel profiling This integrated approach shows that comprehensive performance evaluation can be achieved quickly and transparently. Each tool contributes a distinct but complementary perspective — from the high-level runtime overview down to individual hardware counter analysis — enabling users to connect “what happens” during execution with “why it happens” at the system level. The workflow illustrates that performance analysis can be an intuitive and routine part of research — not a specialized or final-phase task. By lowering the entry barrier, researchers can confidently explore the performance characteristics of their codes, identify scaling limitations, and make informed decisions on parallelization or resource allocation. The poster includes QR codes linking to example job scripts, visualizations plots and corresponding tool outputs. Using these resources, researchers can generate similar performance reports for their own codes or simulations, helping them understand and evaluate their application’s behavior. Ultimately, the message is that performance analysis is approachable. With today’s HPC tooling, understanding how computation uses the hardware becomes a natural and rewarding part of every HPC workflow.

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average