Downloads provided by UsageCounts
handle: 2117/125137
Shared memory systems are becoming increasingly complex as they typically integrate several storage devices. As a result, access latency and bandwidth vary with the proximity between the cores that issue memory accesses and the storage devices that hold the requested data. In this context, techniques to manage and mitigate non-uniform memory access (NUMA) effects consist of migrating threads, memory pages, or both, and are generally applied by the system software. We propose techniques at the runtime system level to further mitigate the impact of NUMA effects on parallel applications' performance. We leverage runtime system metadata expressed as a task dependency graph, where nodes are pieces of serial code and edges are control or data dependencies between them, to efficiently reduce data transfers. Our approach, based on graph partitioning, adds negligible overhead and provides performance improvements of up to 1.52× and average improvements of 1.12× with respect to the best state-of-the-art approach when deployed on a 288-core shared-memory system. It also reduces coherence traffic by 2.28× on average with respect to the state of the art.
Peer Reviewed
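The idea of partitioning a task dependency graph so that tasks sharing data land on the same NUMA node can be illustrated with a minimal sketch. This is not the authors' algorithm: it is a simple greedy heuristic written for illustration, and the names `partition_tasks`, `cut_bytes`, the per-edge byte weights, and the equal-capacity balance rule are all assumptions introduced here.

```python
from collections import defaultdict


def partition_tasks(tasks, edges, num_nodes):
    """Greedily map tasks to NUMA nodes (hypothetical helper, illustration only).

    tasks: task names listed in topological order (assumption).
    edges: dict mapping (producer, consumer) -> bytes passed along the dependency.
    """
    capacity = -(-len(tasks) // num_nodes)  # ceil(len / num_nodes) tasks per node
    preds = defaultdict(list)
    for (src, dst), nbytes in edges.items():
        preds[dst].append((src, nbytes))

    placement = {}
    load = [0] * num_nodes
    for task in tasks:
        # Bytes this task would read locally on each node, given placed producers.
        affinity = [0] * num_nodes
        for src, nbytes in preds[task]:
            affinity[placement[src]] += nbytes
        # Only nodes with spare capacity are eligible, which keeps the load balanced.
        candidates = [n for n in range(num_nodes) if load[n] < capacity]
        # Prefer highest affinity, then lowest load, then lowest node id.
        best = max(candidates, key=lambda n: (affinity[n], -load[n], -n))
        placement[task] = best
        load[best] += 1
    return placement


def cut_bytes(placement, edges):
    """Total bytes on dependencies whose endpoints sit on different nodes."""
    return sum(b for (s, d), b in edges.items() if placement[s] != placement[d])


# Toy graph: two producer/consumer chains plus one light cross dependency.
tasks = ["a", "b", "c", "d"]
edges = {("a", "c"): 100, ("b", "d"): 100, ("a", "d"): 1}
placement = partition_tasks(tasks, edges, num_nodes=2)
print(placement, cut_bytes(placement, edges))
```

In this toy graph the heavy dependencies stay inside one node each and only the light `a` to `d` edge crosses nodes. A production runtime would more likely hand the graph to a dedicated partitioner and would also account for where the data pages physically reside.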
Intelligent control, Data reduction, Scheduling, Parallel programming (Computer science), Non uniform memory access, Shared memory system, Parallel application, Task-based programming, Task-based programming model, Graph theory, NUMA, Shared memory, Graph Partitioning, Performance improvements, Data transfer, Memory architecture, Virtual storage, UPC subject areas::Computer science::Information systems::Information storage and retrieval
| Indicator | Description | Value |
| --- | --- | --- |
| selected citations | These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 15 |
| popularity | This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% |
| influence | This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% |
| impulse | This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
| views | | 41 |
| downloads | | 59 |

Views provided by UsageCounts