
Rainfall prediction is a critical field of study with several practical uses, including agriculture, water management, and disaster preparedness. In this work, we examine the performance of several machine learning models in forecasting rainfall using a dataset of Australian rainfall observations from Kaggle. Six models are compared: random forest (RF), logistic regression (LogReg), Gaussian Naive Bayes (GNB), k-nearest neighbours (kNN), support vector classifier (SVC), and XGBoost (XGB). Missing value imputation and feature selection were used to preprocess the dataset. To analyse the models, we employed cross-validation and performance indicators such as accuracy, precision, recall, and F1-score. According to our findings, the RF and XGB models fared the best, with accuracy ratings of 87% and 85%, respectively. With accuracy ratings below 70%, the GNB and SVC models performed the poorest. Our findings imply that machine learning algorithms can be useful tools for predicting rainfall, but careful model selection and evaluation are required for reliable results.
Rainfall, Atmospheric Science, Environmental Engineering, Rainfall-Runoff Modeling, Electricity Price and Load Forecasting Methods, Science, Gaussian Naive Bayes, Logistic regression, Machine Learning, Satellite-Based Precipitation Estimation and Validation, FOS: Economics and business, K-nearest neighbors, Engineering, Hydrological Modeling using Machine Learning Methods, QA1-939, FOS: Electrical engineering, electronic engineering, information engineering, FOS: Mathematics, Econometrics, Electrical and Electronic Engineering, Q, FOS: Environmental engineering, Load Forecasting, QA75.5-76.95, Computer science, Earth and Planetary Sciences, Model Performance, Electronic computers. Computer science, Physical Sciences, Environmental Science, Rainfall prediction, Randomforest, Mathematics, Forecasting
Rainfall, Atmospheric Science, Environmental Engineering, Rainfall-Runoff Modeling, Electricity Price and Load Forecasting Methods, Science, Gaussian Naive Bayes, Logistic regression, Machine Learning, Satellite-Based Precipitation Estimation and Validation, FOS: Economics and business, K-nearest neighbors, Engineering, Hydrological Modeling using Machine Learning Methods, QA1-939, FOS: Electrical engineering, electronic engineering, information engineering, FOS: Mathematics, Econometrics, Electrical and Electronic Engineering, Q, FOS: Environmental engineering, Load Forecasting, QA75.5-76.95, Computer science, Earth and Planetary Sciences, Model Performance, Electronic computers. Computer science, Physical Sciences, Environmental Science, Rainfall prediction, Randomforest, Mathematics, Forecasting
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 9 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
