
The Dragon telemetry service is an easy-to-use, scalable means for users to visualize both hardware and custom metrics for complex workflows. Dragon is a high-performance distributed runtime for managing processes and data at-scale. It utilizes high-performance communication objects to enable efficient and transparent management of memory and movement of data. We demonstrate the use of the telemetry service for a multi-language AI-in-the-loop workflow where both built-in hardware metrics and custom user metrics are visualized in a Grafana dashboard.
distributed computing, HPC, visualizations, Telemetry, workflows, Python
distributed computing, HPC, visualizations, Telemetry, workflows, Python
| citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
