A hybrid Machine Learning approach for air quality prediction in Morocco: combining CatBoost with metaheuristic optimization algorithms
Keywords:
Air Pollution, Hybrid Machine Learning, CatBoost Algorithm, Metaheuristic Optimization, Air Quality Index
Abstract
Air pollution poses serious risks to public health and environmental sustainability, particularly in rapidlyurbanizing areas of developing countries. This study investigates whether combining machine learning algorithms with metaheuristic optimization techniques can improve the accuracy and efficiency of air quality prediction in Morocco. The main objective is to compare direct classification of Air Quality Index (AQI) categories with a regression-based approach, and to evaluate the effectiveness of two optimization strategies—Arithmetic Optimization Algorithm (AOA) and Hunger Games Search (HGS)—in tuning the CatBoost model’s hyperparameters. Using five months of air quality data from two monitoring stations in Ait Melloul, we modeled concentrations of PM2.5, PM10, CO, and derived corresponding AQI classifications. The hybrid approach demonstrated that regression-based classification improved accuracy by nearly 30 percentage points over direct classification. Moreover, HGS achieved similar predictive performance to AOA but was over twice as computationally efficient. CO concentration predictions in residential areas achieved high accuracy (R2 > 0.95),while particulate matter predictions revealed limitations in capturing extreme pollution events. These findings suggest that combining gradient boosting with metaheuristic optimization is a promising strategy for developing scalable and accurate air quality forecasting systems in North African urban environments, with important implications for public health protection and environmental policy implementation.
Published
2025-08-20
How to Cite
ED-DAOUDI, R., EL KHAMLICHI, S., & ETTAKI, B. (2025). A hybrid Machine Learning approach for air quality prediction in Morocco: combining CatBoost with metaheuristic optimization algorithms. Statistics, Optimization & Information Computing, 14(5), 2445-2471. https://doi.org/10.19139/soic-2310-5070-2705
Issue
Section
Research Articles
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).