“…The goal of this paper is to expand on the successes of this ongoing collective research programme. The algorithm described below shares many aspects with past work, such as vector embedding (Powers 1997, Calderone 2009, Goldsmith & Xanthos 2009, Nazarov 2014, 2016, Silfverberg et al 2018, Mirea & Bicknell 2019), normalisation (Powers 1997, Silfverberg et al 2018), matrix decomposition (Powers 1997, Calderone 2009, Goldsmith & Xanthos 2009, Silfverberg et al 2018) and clustering algorithms (Powers 1997, Nazarov 2014, 2016, Mirea & Bicknell 2019). The innovations that will be presented below are largely in the combination and extension of these techniques, but the clustering methodology presented is relatively novel.…”