“…Following Nagai et al. (2019), we set both α and η to .01; these are the Dirichlet priors on the per-document topic distribution and the per-topic word distribution, respectively (Gangadharan & Gupta, 2020). Seed confidence, that is, the probability with which seed words are biased toward their seeded topics, was set at .7 (Nagai et al., 2019). Researchers running guided LDA models have included topics beyond the seeded ones (12 in this case) to cover documents that do not fall under any seeded topic (e.g., Li et al., 2019; Ramesh et al., 2014; Shanthakumar et al., 2020), and we followed this practice as well.…”
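
The parameter choices described in the passage can be illustrated with a minimal sketch. It is not the authors' code and does not reproduce any particular guided LDA library. It only shows how the reported values (α = .01, η = .01, seed confidence = .7, 12 seeded topics) might enter a seeded topic initialization, assuming that seed confidence is applied as the probability of assigning a seed word to its seeded topic at initialization. The function names, the vocabulary, the seed words, and the number of extra topics are hypothetical; the passage does not state how many additional topics were used.

```python
import random

# Values reported in the passage.
ALPHA = 0.01            # Dirichlet prior on the per-document topic distribution
ETA = 0.01              # Dirichlet prior on the per-topic word distribution
SEED_CONFIDENCE = 0.7   # probability of biasing a seed word toward its topic
N_SEEDED_TOPICS = 12    # number of seeded topics


def total_topics(n_extra_topics):
    """Seeded topics plus unseeded ones (n_extra_topics is a hypothetical input)."""
    return N_SEEDED_TOPICS + n_extra_topics


def build_seed_map(seed_words_by_topic, vocabulary):
    """Map each seed word's vocabulary index to the index of its seeded topic."""
    word_to_id = {word: i for i, word in enumerate(vocabulary)}
    seed_map = {}
    for topic_id, words in enumerate(seed_words_by_topic):
        for word in words:
            if word in word_to_id:  # skip seed words absent from the corpus
                seed_map[word_to_id[word]] = topic_id
    return seed_map


def initial_topic(word_id, seed_map, n_topics, rng,
                  seed_confidence=SEED_CONFIDENCE):
    """Assumed initialization rule: a seed word goes to its seeded topic with
    probability seed_confidence; otherwise the topic is drawn uniformly."""
    if word_id in seed_map and rng.random() < seed_confidence:
        return seed_map[word_id]
    return rng.randrange(n_topics)


# Hypothetical toy data: two of the seeded topics, each with two seed words.
vocabulary = ["price", "cost", "staff", "service", "window"]
seed_words = [["price", "cost"], ["staff", "service"]]
seed_map = build_seed_map(seed_words, vocabulary)

rng = random.Random(0)
n_topics = total_topics(n_extra_topics=2)  # placeholder value, not from the text
draws = [initial_topic(0, seed_map, n_topics, rng) for _ in range(10000)]
share_seeded = draws.count(seed_map[0]) / len(draws)
```

Under this assumed rule, `share_seeded` should sit slightly above .7, because the uniform fallback also lands on the seeded topic occasionally. The priors `ALPHA` and `ETA` are listed only for completeness; they would be passed to whichever sampler performs the actual inference.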