“…For example, the English word joblessness can be segmented as job+less+ness. When processing morphologically-rich languages, this helps reduce the sparsity created by the higher OOV rate due to productive morphology, and, empirically, has shown to be beneficial in a diverse variety of down-stream tasks, e.g., machine translation (Clifton and Sarkar, 2011), speech recognition (Afify et al, 2006), keyword spotting (Narasimhan et al, 2014) and parsing (Seeker and Özlem Çetinoğlu, 2015 and unsupervised approaches have been successful, but, when annotated data is available, supervised approaches typically greatly outperform unsupervised approaches (Ruokolainen et al, 2013).…”