Estimating statistical power for detecting long term trends in surface water Escherichia coli concentrations
Cover photo: Llano River with fisherman. ©2018 Ray Uherek.
PDF

Keywords

trend detection
E. coli
statistical power
water quality

How to Cite

Schramm, M. P. (2021). Estimating statistical power for detecting long term trends in surface water Escherichia coli concentrations. Texas Water Journal, 12(1), 140–150. https://doi.org/10.21423/twj.v12i1.7126

Abstract

Water quality monitoring programs commonly use the Mann-Kendall test or linear regression to identify statistically significant monotonic trends in fecal indicator bacteria concentrations (typically Escherichia coli [E. coli]). The statistical power of these tests to detect trends of different magnitudes (effect size) is rarely communicated to stakeholders, and it is unlikely they are considered when designing monitoring schedules. The statistical power for detecting trends in surface water E. coli bacteria concentrations using Mann-Kendall and linear regression at water quality monitoring sites across Texas was estimated using Monte Carlo simulation. The probability that an individual water quality monitoring site in Texas had adequate statistical power was also estimated using logistic regression.
Mann-Kendall and linear regression trend tests show similar statistical power. Both tests are unlikely to achieve adequate statistical power when E. coli concentrations decrease by 20% or less over 7 years under most sampling frequencies. To adequately detect concentration decreases of 30% to 40% over 7 years, monthly sampling is required. Because many sites across Texas are sampled quarterly, monotonic trends tests will not be powerful enough to detect trends of moderate magnitudes. To better facilitate stakeholder decision-making, it is important to communicate the relative power of statistical tests and detectible magnitudes of changes. I suggest data analysts conduct power analyses to improve monitoring program designs and improve communication of trend test limitations. Software and training for water quality analysts could facilitate communication of power and effect sizes. Alternative trend assessment methods may be more reliable for describing changes in fecal indicator bacteria concentrations.

https://doi.org/10.21423/twj.v12i1.7126
PDF

References

Barry J, Maxwell D, Jennings S, Walker D, Murray J. 2017. Emon: An R‐package to support the design of marine ecological and environmental studies, surveys and monitoring programmes. Methods in Ecology and Evolution. 8(10):1342-1346. Available from: https://doi.org/10.1111/2041-210X.12748.

De Cicco LA, Lorenz D, Hirsch RM, Watkins W. 2018. dataRetrieval: R packages for discovering and retrieving water data available from U.S. Federal hydrologic web services. Reston, VA: U.S. Geological Survey; U.S. Geological Survey. Available from: https://code.usgs.gov/water/dataRetrieval.

Dufour AP. 1984. Bacterial indicators of recreational water quality. Canadian Journal of Public Health. 75(1):9. Available from: https://www.jstor.org/stable/41990233.

Fujioka R, Solo-Gabriele H, Byappanahalli M, Kirs M. 2015. U.S. Recreational water quality criteria: A vision for the future. International Journal of Environmental Research and Public Health. 12(7):7752-7776. Available from: https://doi.org/10.3390/ijerph120707752.

Gitter A, Mena K, Wagner K, Boellstorff D, Borel K, Gregory L, Gentry T, Karthikeyan R. 2020. Human health risks associated with recreational waters: Preliminary approach of integrating quantitative microbial risk assessment with microbial source tracking. Water. 12(2):327. Available from: https://doi.org/10.3390/w12020327.

Goodwin KD, Schriewer A, Jirik A, Curtis K, Crumpacker A. 2017. Consideration of natural sources in a bacteria TMDL—lines of evidence, including beach microbial source tracking. Environmental Science & Technology. 51(14):7775-7784. Available from: https://doi.org/10.1021/acs.est.6b05886.

Hanel PH, Mehler DM. 2019. Beyond reporting statistical significance: Identifying informative effect sizes to improve scientific communication. Public Understanding of Science.:18. Available from: https://doi.org/10.1177/0963662519834193.

Helsel D, Hirsch R. 2002. Statistical methods in water resources. U.S. Geological Survey (Techniques of water-resources investigations of the United States Geologic Survey). Available from: http://water.usgs.gov/pubs/twri/twri4a3/.

Hirsch RM, Moyer DL, Archfield SA. 2010. Weighted regressions on time, discharge, and season (WRTDS), with an application to Chesapeake Bay river inputs. JAWRA Journal of the American Water Resources Association. 46(5):857-880. Available from: https://doi.org/10.1111/j.1752-1688.2010.00482.x.

Ishii S, Sadowsky MJ. 2008. Escherichia coli in the environment: Implications for water quality and human health. Microbes and Environments. 23(2):101-108. Available from: https://doi.org/10.1264/jsme2.23.101.

Leach WD, Pelkey NW. 2001. Making watershed partnerships work: A review of the empirical literature. Journal of Water Resources Planning and Management. 127(6):378-385. Available from: https://doi.org/10.1061/(ASCE)0733-9496(2001)127:6(378).

Nakagawa S, Cuthill IC. 2007. Effect size, confidence interval and statistical significance: A practical guide for biologists. Biological Reviews. 82(4):591-605. Available from: https://doi.org/10.1111/j.1469-185X.2007.00027.x.

Novotny V. 2004. Simplified databased total maximum daily loads, or the world is log-normal. Journal of Environmental Engineering. 130(6):674-683. Available from: https://doi.org/10.1061/(ASCE)0733-9372(2004)130:6(674).

R Core Team. 2019. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. Available from: https://www.R-project.org/.

Runkel RL, Crawford CG, Cohn TA. 2004. Load estimator (LOADEST): A FORTRAN program for estimating constituent loads in streams and rivers. U.S. Geological Survey Techniques and methods Report No.: 4-A5. Available from: https://doi.org/10.3133/tm4A5.

Schoen ME, Ashbolt NJ. 2010. Assessing pathogen risk to swimmers at non-sewage impacted recreational beaches. Environmental Science & Technology. 44(7):2286-2291. Available from: https://doi.org/10.1021/es903523q.

Sigal MJ, Chalmers RP. 2016. Play it again: Teaching statistics with Monte Carlo simulation. Journal of Statistics Education. 24(3):136-156. Available from: https://doi.org/10.1080/10691898.2016.1246953.

Soller JA, Schoen ME, Bartrand T, Ravenscroft JE, Ashbolt NJ. 2010. Estimated human health risks from exposure to recreational waters impacted by human and non-human sources of faecal contamination. Water Research. 44(16):4674-4691. Available from: https://doi.org/10.1016/j.watres.2010.06.049.

[TCEQ] Texas Commission on Environmental Quality. 2019a. 2016 guidance for assessing and reporting surface water quality in Texas. Available from: https://www.tceq.texas.gov/assets/public/waterquality/swqm/assess/16txir/2016_guidance.pdf.

[TCEQ] Texas Commission on Environmental Quality. 2019b. Executive summary 2018 Texas integrated report for Clean Water Act Sections 305(b) and 303(d). Austin, TX: TCEQ. Available from: https://www.tceq.texas.gov/assets/public/waterquality/swqm/assess/18txir/2018_exec_summ.pdf.

[TCEQ] Texas Commission on Environmental Quality. 2020. List of segments with TMDLS. Segments with total maximum daily loads. [accessed 2020 Jul 6]. Available from: https://www.tceq.texas.gov/waterquality/tmdl/nav/tmdlsegments.

Wilcox R. 2013. Introduction to robust estimation and hypothesis testing. Third. Academic Press Elsevier. Available from: https://doi.org/10.1016/B978-0-12-386983-8.00015-9.

Wood SN. 2011. Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models: Estimation of semiparametric generalized linear models. Journal of the Royal Statistical Society: Series B (Statistical Methodology). 73(1):3-36. Available from: https://doi.org/10.1111/j.1467-9868.2010.00749.x.

Yen H, Daggupati P, White MJ, Srinivasan R, Gossel A, Wells D, Arnold JG. 2016. Application of large-scale, multi-resolution watershed modeling framework using the hydrologic and water quality system (HAWQS). Water. 8(4):164. Available from: https://doi.org/10.3390/w8040164.

Yue S, Pilon P, Cavadias G. 2002. Power of the Mann–Kendall and Spearman’s rho tests for detecting monotonic trends in hydrological series. Journal of Hydrology. 259(1):254-271. Available from: https://doi.org/10.1016/S0022-1694(01)00594-7.

Yue S, Wang CY. 2002. Regional streamflow trend detection with consideration of both temporal and spatial correlation. International Journal of Climatology. 22(8):933-946. Available from: https://doi.org/10.1002/joc.781.

Zhang Q, Webber JS, Moyer DL, Chanat JG. 2020. An approach for decomposing river water-quality trends into different flow classes. Science of The Total Environment. Available from: https://doi.org/10.1016/j.scitotenv.2020.143562.

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright (c) 2021 Michael Paul Schramm