An Explainable Multimodal Deep Learning Framework for Automated Bone Fracture Detection-Review

Bone fracture diagnosis using radiographic imaging is a critical yet challenging task due to subtle fracture patterns, image quality variations, and increasing clinical workload on radiologists. Although deep learning–based methods have demonstrated promising performance in automated fracture detection, their clinical adoption remains limited due to black-box decision making, lack of explainability, and reliance on unimodal imaging data. This paper presents an explainable multimodal deep learning framework for automated bone fracture detection, severity assessment, and clinical decision support. The proposed framework integrates radiographic images with structured clinical metadata using an attention-based feature fusion strategy to enhance diagnostic accuracy and contextual understanding. Convolutional Neural Networks and Vision Transformer architectures are employed for image feature extraction, while clinical parameters are encoded through a multilayer perceptron. Model interpretability is achieved using Gradient-weighted Class Activation Mapping (Grad-CAM), enabling visual localization of fracture-relevant regions. Extensive analysis using publicly available musculoskeletal datasets demonstrates that the multimodal explainable approach outperforms conventional unimodal models in terms of accuracy, robustness, and clinical reliability. The results highlight the importance of explainable and context-aware AI systems in musculoskeletal imaging and support their potential integration into real-world clinical workflows for improved fracture diagnosis and decision support.

Keywords

Multimodal Deep Learning, Bone Fracture Detection

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now