Automated Models for Predicting Software Defects in Hybrid Message Passing Interface (MPI) and Open Multi-Processing (OpenMP) Parallel Programs Using Deep Learning

Name: Automated Models for Predicting Software Defects in Hybrid Message Passing Interface (MPI) and Open Multi-Processing (OpenMP) Parallel Programs Using Deep Learning
Keywords: software defect prediction, race conditions, deep learning, MPI, OpenMP, Electrical engineering. Electronics. Nuclear engineering, Hybrid parallel programming, TK1-9971

Amani Saad Althiban; Hajar M. Alharbi; Lama A. Al Khuzayem; Fathy Elbouraey Eassa

Found an issue? Give us feedback

IEEE Accessarrow_drop_down

IEEE Access

Article . 2025 . Peer-reviewed

License: CC BY

Data sources: Crossref

IEEE Access

Article . 2025

Data sources: DOAJ

Automated Models for Predicting Software Defects in Hybrid Message Passing Interface (MPI) and Open Multi-Processing (OpenMP) Parallel Programs Using Deep Learning

descriptionPublicationkeyboard_double_arrow_right Article 01 Jan 2025Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Access, volume 13, pages 65,939-65,954 (eissn: 2169-3536,

Copyright policy )

Authors: Amani Saad Althiban; Hajar M. Alharbi; Lama A. Al Khuzayem; Fathy Elbouraey Eassa;

doi: 10.1109/access.2025.3559807

Automated Models for Predicting Software Defects in Hybrid Message Passing Interface (MPI) and Open Multi-Processing (OpenMP) Parallel Programs Using Deep Learning

- Summary
- Subjects
- Metrics

Abstract

Hybrid Message Passing Interface (MPI) and Open Multi-Processing (OpenMP) parallel programs are pivotal for scalability and efficiency in high-performance computing (HPC), especially as systems approach exa-scale operations. These programs leverage distributed and shared memory systems, making them indispensable for scientific simulations, data analysis, and numerical computations. However, synchronization defects such as deadlocks and race conditions pose significant challenges to reliability and performance, often eluding traditional static and dynamic analysis tools due to the complexity of hybrid systems. This study introduces a deep learning-based models for automated defect prediction in hybrid MPI and OpenMP programs. Using a balanced dataset of 1,500 C++ files, three neural architectures—Convolutional Neural Networks (CNNs), Long Short-Term Memory (LSTM) networks, and a hybrid CNN-LSTM model—were evaluated. To preprocess the code, Abstract Syntax Tree (AST)-based token extraction was employed, capturing structural and semantic relationships critical for detecting defects. Token extraction methods included detailed C++ syntax analysis via the Clang library and a custom regular expression approach. The results reveal that Clang-token-based representation provided the most effective input for defect prediction, enabling CNN models to achieve an accuracy of 97%. By integrating AST-based structural insights with the predictive power of deep learning, this research offers a scalable solution for enhancing the reliability of hybrid parallel programs. The findings establish a foundation for future advancements in automated defect detection, addressing the pressing needs of high-performance computing systems.

Related Organizations

King Abdulaziz University
Saudi Arabia

Keywords

software defect prediction, race conditions, deep learning, MPI, OpenMP, Electrical engineering. Electronics. Nuclear engineering, Hybrid parallel programming, TK1-9971

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	1
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

1

Average

gold