“…Document clustering uses algorithms to partition collections of documents into groups with semantically similar information to make analysis of documents more manageable (Subakti et al, 2022 ). Document clustering has been applied to text in many contexts, for example social media (Curiskis et al, 2020 ), medicine (Sandhiya & Sundarambal, 2019 ), law (Bhattacharya et al, 2022 ; Dhanani et al, 2021 ), hospitality (Kaya et al, 2022 ), patents (Choi & Jun, 2014 ; Kim et al, 2020 ), regulatory data (Levine et al, 2022 ), and engineering documents (Arnarsson et al, 2021 ). There are many unique challenges associated with document clustering.…”