
This paper presents a comprehensive machine learning (ML) framework for predicting liquefaction risk, a crucial aspect of seismic hazard assessment. A benchmark geotechnical dataset with multi-dimensional input features was used to evaluate several ML classifiers, followed by hyperparameter optimization through stratified 5-fold cross-validation. Optimized models were combined into a soft Voting Ensemble to enhance stability and accuracy of liquefaction potential prediction. The proposed ensemble model achieved a mean accuracy of 90.12% and a recall of 97.23%, outperforming individual models in most folds. The ensemble’s effectiveness was further evidenced by its precision-recall (PR) and receiver operating characteristic (ROC) curves, with areas under the curve (AUC) of 0.962 and 0.931, respectively—closely matching those of the Gradient Boosting classifier, indicating comparable discriminatory performance. Additionally, SHapley Additive exPlanations (SHAP) analysis was conducted on the ensemble model to assess contributions of each geotechnical inputs to the predictions, revealing that normalized shear wave velocity (VS1) as the most influential variable in liquefaction prediction. The proposed framework demonstrates a robust, interpretable, and performance-consistent approach for liquefaction risk assessment.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
