Indicating if water is safe for human consumption using an enhanced machine learning approach

  • Mourad Nachaoui Faculté de Sciences et Technique, Université Sultan Moulay Slimane
  • Soufiane Lyaqini Hassan First University of Settat, Ecole Nationale des Sciences Appliquees, LAMSAD Laboratory
  • Marouane Chaouch Faculté de Sciences et Technique, Université Sultan Moulay Slimane
Keywords: Supervised learning, Smooth approximation, Hing loss, Tikhonov regularization, Taylor polynomials, Conjugate gradient, Water quality.

Abstract

Predicting water quality accurately is critically important in real-life water resource management. This work proposes an approach based on supervised machine learning to predict water quality. Motivated, by the success of the non-smooth loss function for supervised learning problems [22], we reformulate the learning problem as a regularized optimization problem whose fidelity term is the hinge loss function and the hypothesis space is a polynomial approximation. To deal with the non-differentiability of the loss function, a special smoothing function is proposed. Then, the obtained optimization problem is solved by an improved conjugate gradient algorithm. Finally,some experiments results are presented.

References

Rochmi, MN. "Access to clean water is still very far from the target". Accessed from: https://beritagar.id/article/editorial/hapuskan-perda-causing-economy-high-cost. 2016.

UNESCO. "Global Climate Change". Accessed from: www.unesco.org. August 19, 2021.

Elysia, V. “Water and Sanitation where is Indonesia's position”. National Seminar on the Role of Mathematics, Science, and Technology in achieving sustainable development goals/SDGs, FMIPA Open University. 2018. Pages 157-159.

Farooqui, A. “Invertigation of a community outbreak of typhoid fever associated with drinking water”. BMC Public Health. 2009. 9:476.

Abrischamchi, A., Ebrahimian, A., Tajrishi, M., and Marino, M. “Case Study: Application of Multicriteria Decision Making to Urban Water Supply”. J Water Resources Plan Manage. 2005. 131(4) Pages 326-335.

Cronin, A.A. “Monitoring source and domestic water quality in parallel with sanitary risk identification in Northern Mozambique to prioritise protection interventions”. J Water Health. 2006. Volume 4 Pages 333-345.

Shyamala, R. “Physicochemical Analysis of Borwell Water Samples of Telungu Palayam Area in Coimvatore District, Tamilnadu, India”. E-Journal of Chemistry. 2008. Volume 5(4), Pages 924-929.

Momba, M.N.M. “Abundance of pathogenic eschericia coli, Salmonella typhimurium and vibrio cholerae in Nkonkobe drinking water source”. J Water Health. 2006. Pages 289-296.

Eschol, J. Is fecal contamination of drinking water after collection associated with household water handling and hygiene practices? A study of urban slum households in Hyderabad, India”. Journal of Water and Health. 2009. Pages 145-154.

C. Veesommai and Y. Kiyoki. “Critical Contaminate Detection, Classification of Multiple-water-quality-parameters Values and Real-time Notification by rSPA Processes”. Surabaya ; IEEE International Electronics Sysposium, 2015.

A. Sarkar and P. Pandey, "River Water Quality Modeling Using Artificial Neural Network Technique". Aquatic Procedia, Vol.4, 2015, Pages 1070-1077.

Y. R. Ding, Y. J. Cai, P. D. Sun and B. Chen. “The Use of Combined Neural Networks and Genetic Algorithms for Prediction of River Water Quality” Journal of Applied Research and Technology. Vol.12, Issue 3, June 2014, Pages 493-499.

Noori, Roohollah, Zhiqiang Deng, Amin Kiaghadi, and Fatemeh Torabi Kachoosangi. "How Reliable Are ANN, ANFIS, and SVM Techniques for Predicting Longitudinal Dispersion Coefficient in Natural Rivers?" Journal of Hydraulic Engineering, 2015: 04015039.

Shuxiu Liang, Songlin Han, Zhaochen Sun, “Parameter optimization method for the water quality dynamic model based on data-driven theory”. China, 2011.

Yue Liao, Jianyu Xu, Wenjing Wang, “A method of Water Quality Assessment Based on Biomonitoring and Multiclass Support Vector Machine”. International Conferences on ESIAT, 2011.

Cortes C, Vapnik V. “Support-vector networks”. Machine learning. 1995;20(3):273–97.

Wu X, Kumar V, Quinlan JR, Ghosh J, Yang Q, Motoda H, et al. “Top 10 algorithms in data mining”. Knowledge and information systems. 2008;14(1):1–37.

McCulloch WS, Pitts W. “A logical calculus of the ideas immanent in nervous activity”. The bulletin of mathematical biophysics. 1943;5(4):115–33.

Webb G.I. (2011) Naïve Bayes. In: Sammut C., Webb G.I. (eds) Encyclopedia of Machine Learning. Springer, Boston, MA.

Rosasco L., De Vito E., Caponnetto A., Piana M., Verri A. Are Loss Functions All the Same? Neural Computation. 16 (5), 1063-1076 (2004).

Lyaqini S., Quafafou M., Nachaoui M., Chakib A. Supervised learning as an inverse problem based on non-smooth loss function. Knowledge and Information Systems. 62, 3039-3058 (2020).

Lyaqini S., Nachaoui M., Quafafou M. Non-smooth classification model based on new smoothing technique. Journal of Physics: Conference Series. 1743, 012025 (2021).

S Lyaqini, M Nachaoui, A Hadri "An efficient primal-dual method for solving non-smooth machine learning problem" Chaos, Solitons & Fractals 155, 111754 (2022).

Nachaoui M. Parameter learning for combined first and second order total variation for image reconstruction. Advanced Mathematical Models and Applications. 5 (1), 53-69 (2020).

El Mourabit I., El Rhabi M., Hakim A., Laghrib A., Moreau E. A new denoising model for multi-frame super-resolution image reconstruction. Signal Processing. 132, 51-65 (2017).

Tikhonov, A. N., Arsenin, V. Y. Solutions to ill-posed problems. Winston-Wiley, New York (1977).

Hadamard, Jacques. Lectures on Cauchy's problem in linear partial differential equations. Courier Corporation, 2003.

S. Smale and D. Zhou. Shannon sampling II : Connections to learning theory. preprint, 2004.

Fletcher, R., and Reeves, C. M. (1964). Function minimization by conjugate gradients. The computer journal, 7(2), 149-154.

Lyaqini, S., Nachaoui, M., Quafafou, M. (2021). Non-smooth classification model based on new smoothing technique. In Journal of Physics: Conference Series (Vol. 1743, No. 1, p. 012025). IOP Publishing.

Lyaqini, S., Nachaoui, M. (2021, February). Identification of Genuine from Fake Banknotes Using an Enhanced Machine Learning Approach. In International Conference on Numerical Analysis and Optimization Days (pp. 59-70). Springer, Cham.

https://www.kaggle.com/adityakadiwal/water-potability.

Webb, G. I., Keogh, E., Miikkulainen, R. (2010). Naïve Bayes. Encyclopedia of machine learning, 15, 713-714.

Zhang, H., Jiang, L., Yu, L. (2021). Attribute and instance weighted naive Bayes. Pattern Recognition, 111, 107674.

Otchere, D. A., Ganat, T. O. A., Gholami, R., Ridha, S. (2021). Application of supervised machine learning paradigms in the prediction of petroleum reservoir properties: Comparative analysis of ANN and SVM models. Journal of Petroleum Science and Engineering, 200, 108182.

Khorshid, S. F., Abdulazeez, A. M. (2021). breast cancer diagnosis based on k-nearest neighbors: A review. PalArch's Journal of Archaeology of Egypt/Egyptology, 18(4), 1927-1951.

Ayyad, S. M., Saleh, A. I., Labib, L. M. (2019). Gene expression cancer classification using modified K-Nearest Neighbors technique. Biosystems, 176, 41-51.

Published
2023-01-09
How to Cite
Nachaoui, M., Lyaqini, S., & Chaouch, M. (2023). Indicating if water is safe for human consumption using an enhanced machine learning approach. Statistics, Optimization & Information Computing, 11(1), 70-81. https://doi.org/10.19139/soic-2310-5070-1703
Section
Research Articles