Multivariate Hourly Air Quality Forecasting with MES-LSTM as the Core Framework

Authors

  • Lilis Anggraini Faculty of Computer Science, Dian Nuswantoro University, Semarang; Indonesia, Faculty of Information Technology, Islamic University of Kalimantan Muhammad Arsyad Al Banjari, Banjarmasin, Indonesia
  • Edi Noersasongko Faculty of Computer Science, Dian Nuswantoro University, Semarang, Indonesia
  • Aris Marjuni Faculty of Computer Science, Dian Nuswantoro University, Semarang, Indonesia
  • Purwanto Faculty of Computer Science, Dian Nuswantoro University, Semarang, Indonesia

DOI:

https://doi.org/10.19139/soic-2310-5070-3766

Keywords:

air quality forecasting; multivariate time series; MES-LSTM; exponential smoothing; LSTM baseline; temporal fusion transformer baseline; hybrid modeling; Beijing air quality

Abstract

Accurate air quality forecasting plays an important role in supporting environmental management, protecting public health, and enabling timely early-warning interventions. However, many existing forecasting approaches rely on univariate modeling and may not adequately capture interactions among multiple pollutants. This study therefore explores multivariate hourly air quality forecasting using Multivariate Exponential Smoothing--Long Short-Term Memory (MES-LSTM) as the primary forecasting framework. Experiments were conducted using the Beijing Air Quality dataset to predict carbon monoxide (CO), nitric oxide (NO), nitrogen dioxide (NO$_2$), ozone (O$_3$), and sulfur dioxide (SO$_2$). A standardized evaluation protocol was adopted for all models, including identical preprocessing procedures, chronological data partitioning, a 24-hour lookback window, and a one-hour forecasting horizon. The proposed MES-LSTM model was evaluated against Long Short-Term Memory (LSTM), Temporal Fusion Transformer (TFT), and a hybrid MES-LSTM--TFT architecture. The results show that MES-LSTM achieved the lowest forecasting errors for several pollutants, particularly CO, NO$_2$, and O$_3$, while remaining competitive for NO and SO$_2$. In contrast, the hybrid MES-LSTM--TFT model did not improve forecasting performance and generally produced larger prediction errors than the corresponding single-stage models. Additional robustness analyses, including five-seed experiments, persistence baselines, and multivariate coupling ablations, indicate that the effectiveness of pollutant interaction modeling depends on the target pollutant being predicted. Overall, the findings suggest that combining exponential smoothing with residual learning provides an effective and robust solution for short-term multivariate air quality forecasting. The study further demonstrates that MES-LSTM provides a strong and competitive forecasting framework under a unified evaluation protocol, while increased architectural complexity does not necessarily lead to improved predictive performance. Additional robustness analyses, including five-seed experiments, persistence baselines, and coupling ablations, were conducted to assess stability and reproducibility. Computational cost was not formally benchmarked and remains an avenue for future investigation.

Downloads

Published

2026-06-03

How to Cite

Anggraini, L., Noersasongko, E. ., Marjuni, A., & Purwanto. (2026). Multivariate Hourly Air Quality Forecasting with MES-LSTM as the Core Framework. Statistics, Optimization & Information Computing. https://doi.org/10.19139/soic-2310-5070-3766

Issue

Section

Research Articles

Most read articles by the same author(s)