Name: Analysis of Model Parallelism for AI Applications on a 64-core RV64 Server CPU
Keywords: AI, Model parallelism, RISC-V, PyTorch, SOPHON SG2042

descriptionPublicationkeyboard_double_arrow_right Article 30 Jun 2025 English Publisher:Springer Science and Business Media LLCJournal:International Journal of Parallel Programming, volume 53 (issn: 0885-7458, eissn: 1573-7640,

Authors: Giulio Malenza; Adriano Marques Garcia; Robert Birke; Luca Benini; Marco Aldinucci;

doi: 10.1007/s10766-025-00802-6

handle: 2318/2090730

Analysis of Model Parallelism for AI Applications on a 64-core RV64 Server CPU

- Summary
- Subjects
- Related research
  (3)
- Metrics

Abstract

Massive Data Parallel workloads, driven by inference on large ML models, are pushing hardware vendors to develop efficient and cost-effective multi-core server CPUs. The RISC-V architecture plays a prominent role due to its open, extensible, and energy-friendly ISA. Despite significant progress in recent years, finding efficient methods to run AI applications in parallel on new architectures to fully harness their maximum performance remains a challenge. In this study, we investigate the impact of model parallelism on the inference of machine learning models on the SOPHON SG2042 SoC, the first server-grade CPU based on the RV64 ISA, composed of 64 cores arranged in a grid of 16 groups of 4 cores. Specifically, we aim to enhance performance via better data locality stemming from splitting and assigning parts of the model to specific (groups of) cores handling dependencies via a pipeline execution. We orchestrate execution using FastFlow, a low-level programming framework designed for multithreaded streaming applications. By comparing the results against the standard multi-core inference approach based on data parallelism and analyzing the effects of different submodel-to-core mapping strategies, we aim to provide a comprehensive understanding of how the model parallel approach can maximize efficiency and utilization of hardware resources. In our experiments, using model parallelism improved up to 8.4 times the performance over the native PyTorch parallelism.

Related Organizations

Alma Mater Studiorum University of Bologna
Italy
University of Turin
Italy

Keywords

AI, Model parallelism, RISC-V, PyTorch, SOPHON SG2042

3 Research products, page 1 of 1

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Average

Green

Funded by

EC| DYMAN

Analysis of Model Parallelism for AI Applications on a 64-core RV64 Server CPU

Analysis of Model Parallelism for AI Applications on a 64-core RV64 Server CPU

3 Research products, page 1 of 1

oneDNN software on GitHub

onnx software on GitHub

cereal software on GitHub