
doi: 10.2139/ssrn.6510545
Real-time defect detection is critical for guaranteeing the integrity of Gas Metal Arc Welding (GMAW) joints. Conventional non-destructive testing (NDT) delivers high-quality diagnostics only after the weld has finished, precluding immediate corrective action. We present the Multi-modal BiFPN Gate Network (MBGN), a deep-learning framework that fuses two complementary data streams: (i) time-synchronized welding current and voltage waveforms processed by a one-dimensional CNN, and (ii) high-speed molten-pool imagery processed by a ResNet backbone. The two modalities are merged through a Bidirectional Feature Pyramid Network (BiFPN), and a lightweight Gate module adaptively recalibrates cross-modal interactions before a final classifier. Experiments on an industrially collected multimodal dataset demonstrate that MBGN attains 0.748 accuracy and 0.719 F1-score, outperforming state-of-the-art baselines by 8-12 % in F1. These results validate the efficacy of multimodal fusion for in-process defect detection and pave the way for autonomous real-time quality control in industrial welding.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
