Automated River Substrate Mapping From Sonar Imagery With Machine Learning

Image credit: Bodine, Buscombe, Hocking (2024)

Abstract

Knowledge of the variation and distribution of substrates at large spatial extents in aquatic systems, particularly rivers, is severely lacking, impeding species conservation and ecosystem restoration efforts. Recreation-grade side-scan sonar (SSS) instruments have demonstrated their unparalleled value as a low-cost scientific instrument capable of efficient and rapid imaging of the benthic environment. However, existing methods for generating georeferenced data sets from these instruments, especially substrate maps, remain a barrier of adoption for scientific inquiry due to the high degree of human intervention and required expertise. To address this shortcoming, we introduced PING-Mapper, an open-source and freely available Python-based software for automatically generating geospatial benthic data sets from popular Humminbird® instruments reproducibly. The previously released Version 1.0 of the software provided automated workflows for exporting georeferenced sonar imagery. This study extends functionality with version 2.0 by incorporating semantic segmentation with deep neural networks to reproducibly map substrates at large spatial extents. We present a novel approach for generating label-ready sonar data sets, creating label-image training sets, and model training with transfer learning; all with open-source tools. The six-class substrate model achieves an overall accuracy of 78% and the best performing class achieves 91%. Grouping substrates into three classes further improves the overall accuracy (87%) and best performing class accuracy (94%). Additional workflows enable masking sonar shadows, calculating independent bed picks and correcting attenuation effects in the imagery to improve interpretability. This software provides an improved mechanism for automatically mapping substrate distribution from recreation-grade SSS systems, thereby lowering the barrier for inclusion in wider aquatic research.

Publication
JGR: Machine Learning and Computation