“…The numbers represent some information about each word in the text, for example, the term frequency (TF) (Baeza-Yates & Ribeiro-Neto, 1999). Beyond the BOW model, there are word embeddings (Pennington, Socher & Manning, 2014; Bojanowski et al., 2016), topic modeling (Blei, Ng & Jordan, 2003; Kherwa & Bansal, 2017), and many others (Devlin et al., 2018; Peters et al., 2018; Brown et al., 2020; Pittaras et al., 2020; Dhanani, Mehta & Rana, 2022; Martino, Pio & Ceci, 2021; Chalkidis et al., 2020).…”
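
To make the term-frequency idea concrete, the following is a minimal sketch of a bag-of-words representation in which each number is the raw count of a word in a document. The function name `term_frequency_vectors`, the lowercase whitespace tokenization, the use of raw counts rather than a normalized TF variant, and the two sample sentences are all assumptions made for illustration; the excerpt does not specify any of them.

```python
from collections import Counter


def term_frequency_vectors(documents):
    """Return (vocabulary, vectors), one raw term-count vector per document.

    Assumption: tokens are lowercase, whitespace-separated words.
    """
    tokenized = [doc.lower().split() for doc in documents]
    # Shared vocabulary, sorted so every vector uses the same column order.
    vocabulary = sorted({token for tokens in tokenized for token in tokens})
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        # Counter returns 0 for terms absent from this document.
        vectors.append([counts[term] for term in vocabulary])
    return vocabulary, vectors


# Hypothetical sample documents, not taken from the source.
docs = ["the court ruled the case", "the case was closed"]
vocab, vecs = term_frequency_vectors(docs)
print(vocab)
for vec in vecs:
    print(vec)
```

Each vector has one position per vocabulary term, so word order is discarded and only the counts remain, which is the defining property of the BOW model the excerpt refers to.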