“…Given the large magnitude of data, a key step is to build appropriate data storage and processing infrastructure, including automated ML pipelines (maintainable and reusable across multiple data collecting devices) that will replace the annotation currently done largely by hand by marine biologists. ML-based methods are already being used for detection and classification among marine mammals ( Gillespie et al., 2009 ; Shiu et al., 2020 ) and for sperm whale click detection and classification ( Bermant et al., 2019 ; Ferrari et al., 2020 ; Glotin et al., 2018 ; Jiang et al., 2018 ); such methods are potentially scalable to large datasets containing years of recording that would otherwise be beyond reach with previous manual approaches.…”