In the fast-changing world with increased water demand, water pollution, environmental problems, and related data, information on water quality and suitability for any purpose should be prompt and reliable. Traditional approaches often fail in the attempt to predict water quality classes and new ones are needed to handle a large amount or missing data to predict water quality in real-time. One of such approaches is machine-learning (ML) based prediction. This paper presents the results of the application of the Naïve Bayes, a widely used ML method, in creating the prediction model. The proposed model is based on nine water quality parameters: temperature, pH value, electrical conductivity, oxygen saturation, biological oxygen demand, suspended solids, nitrogen oxides, orthophosphates, and ammonium. It is created in software Netica and tested and verified using the data covering the period 2013–2019 from five locations in Vojvodina Province, Serbia. Forty-eight samples are used to train the model. Once trained, the Naïve Bayes model correctly predicted the class of water sample in 64 out of 68 cases, including cases with missing data. This recommends it as a trustful tool in the transition from traditional to digital water management.

  • The study tests efficiency of water quality prediction by the Naïve Bayes method.

  • Nine water quality parameters are analyzed, defined by SWQI methodology.

  • Water quality is assessed by the Naïve Bayes model at five locations in Serbia and 68 samples (cases) of data.

  • Prediction model is trained on 48 cases.

  • The model predicted the water quality class accurately in 64 out of 68 cases.

Graphical Abstract

Graphical Abstract
Graphical Abstract
This content is only available as a PDF.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (