TY - JOUR
T1 - Development of a neural-based forecasting tool to classify recreational water quality using fecal indicator organisms
AU - Motamarri, Srinivas
AU - Boccelli, Dominic L.
N1 - Funding Information:
The authors would like to thank the Ohio Water Resources Research Institute and US Geological Survey for providing partial financial support for this project, and the Charles River Watershed Association for graciously providing the meteorologic, hydrologic, and water quality data used in this study.
PY - 2012/9/15
Y1 - 2012/9/15
N2 - Users of recreational waters may be exposed to elevated pathogen levels through various point/non-point sources. Typical daily notifications rely on microbial analysis of indicator organisms (e.g., Escherichia coli) that require 18, or more, hours to provide an adequate response. Modeling approaches, such as multivariate linear regression (MLR) and artificial neural networks (ANN), have been utilized to provide quick predictions of microbial concentrations for classification purposes, but generally suffer from high false negative rates. This study introduces the use of learning vector quantization (LVQ) - a direct classification approach - for comparison with MLR and ANN approaches and integrates input selection for model development with respect to primary and secondary water quality standards within the Charles River Basin (Massachusetts, USA) using meteorologic, hydrologic, and microbial explanatory variables. Integrating input selection into model development showed that discharge variables were the most important explanatory variables while antecedent rainfall and time since previous events were also important. With respect to classification, all three models adequately represented the non-violated samples (>90%). The MLR approach had the highest false negative rates associated with classifying violated samples (41-62% vs 13-43% (ANN) and <16% (LVQ)) when using five or more explanatory variables. The ANN performance was more similar to LVQ when a larger number of explanatory variables were utilized, but the ANN performance degraded toward MLR performance as explanatory variables were removed. Overall, the use of LVQ as a direct classifier provided the best overall classification ability with respect to violated/non-violated samples for both standards.
AB - Users of recreational waters may be exposed to elevated pathogen levels through various point/non-point sources. Typical daily notifications rely on microbial analysis of indicator organisms (e.g., Escherichia coli) that require 18, or more, hours to provide an adequate response. Modeling approaches, such as multivariate linear regression (MLR) and artificial neural networks (ANN), have been utilized to provide quick predictions of microbial concentrations for classification purposes, but generally suffer from high false negative rates. This study introduces the use of learning vector quantization (LVQ) - a direct classification approach - for comparison with MLR and ANN approaches and integrates input selection for model development with respect to primary and secondary water quality standards within the Charles River Basin (Massachusetts, USA) using meteorologic, hydrologic, and microbial explanatory variables. Integrating input selection into model development showed that discharge variables were the most important explanatory variables while antecedent rainfall and time since previous events were also important. With respect to classification, all three models adequately represented the non-violated samples (>90%). The MLR approach had the highest false negative rates associated with classifying violated samples (41-62% vs 13-43% (ANN) and <16% (LVQ)) when using five or more explanatory variables. The ANN performance was more similar to LVQ when a larger number of explanatory variables were utilized, but the ANN performance degraded toward MLR performance as explanatory variables were removed. Overall, the use of LVQ as a direct classifier provided the best overall classification ability with respect to violated/non-violated samples for both standards.
KW - Artificial neural network
KW - Classification
KW - Fecal coliform
KW - Learning vector quantization
KW - Linear regression
KW - Surface water
UR - http://www.scopus.com/inward/record.url?scp=84863541341&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84863541341&partnerID=8YFLogxK
U2 - 10.1016/j.watres.2012.05.023
DO - 10.1016/j.watres.2012.05.023
M3 - Article
C2 - 22743163
AN - SCOPUS:84863541341
SN - 0043-1354
VL - 46
SP - 4508
EP - 4520
JO - Water research
JF - Water research
IS - 14
ER -