“…We identified twelve preprocessing steps indicating (a) the influence of software engineering on the written text, (b) platform-specific features (e.g., Markdown), and (c) identifying/personal information. We identified removing (1) numbers [3,5,49,51], (2) hashtags [5,11], (3) URLs [3,5,11,49], and (4) @-references [3] from previous studies. Relating to the terms incorrectly represented in software engineering, we identified removing (5) quotes, (6) code blocks, and (7) images as part of Markdown characteristics.…”
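
The seven removal steps named in the excerpt can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `preprocess`, every regular expression, and the order of application are assumptions chosen for illustration, and the remaining five of the twelve steps are not named in the excerpt and are therefore not covered.

```python
import re

# Hypothetical patterns, one per step named in the excerpt (assumptions,
# not taken from the paper). Markdown structures are stripped first so
# that, e.g., a URL inside an image link disappears with the image.
FENCED_CODE = re.compile(r"```.*?```", re.DOTALL)       # (6) fenced code blocks
INLINE_CODE = re.compile(r"`[^`\n]+`")                  # (6) inline code spans
IMAGE = re.compile(r"!\[[^\]]*\]\([^)]*\)")             # (7) Markdown images
QUOTE = re.compile(r"^[ \t]*>.*$", re.MULTILINE)        # (5) quoted lines
URL = re.compile(r"https?://\S+")                       # (3) URLs
MENTION = re.compile(r"(?<!\w)@[\w-]+")                 # (4) @-references
HASHTAG = re.compile(r"(?<!\w)#\w+")                    # (2) hashtags
NUMBER = re.compile(r"\b\d+(?:\.\d+)?\b")               # (1) numbers

PIPELINE = (FENCED_CODE, INLINE_CODE, IMAGE, QUOTE,
            URL, MENTION, HASHTAG, NUMBER)


def preprocess(text: str) -> str:
    """Apply each removal pattern in order, then collapse whitespace."""
    for pattern in PIPELINE:
        text = pattern.sub(" ", text)
    return " ".join(text.split())


sample = (
    "Ping @alice about https://example.com/issue with 3 items #bug\n"
    "> quoted reply\n"
    "```\nprint(42)\n```\n"
    "![shot](https://example.com/a.png) done"
)
print(preprocess(sample))  # prints "Ping about with items done"
```

Under these assumed patterns, a token such as `#123` (an issue reference on some platforms) is treated as a hashtag and removed, while the `@` inside an e-mail address is left alone because it is preceded by a word character. Whether that matches the behaviour intended in the study cannot be determined from the excerpt.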