“…The text was extracted from the documents and statements and pre-processed to standardise the words to enhance the effectiveness of the analysis. The pre-processing techniques were employed using natural language processing procedures: remove stopwords (connectives); tokenisation , which separates each word from the text; and, most important, lemmatisation , which transforms each inflected form of the words to its lemma, so words like “changing” or “changes” are transformed into “change,” and therefore the Jaccard index can capture more clearly the concepts expressed in the words (Skorkovská, 2012).…”