“…The weighted mutual information between a subject descriptor h and an item i in document d is applied as in Lu and Mao (): where δ(i, h) is the weight of the pair <i, h>, which is obtained by: where tf i is the frequency of the item i in the document, N is the total number of documents in the corpus, df i & df h are the document frequencies (i.e., number of documents) of item i and subject descriptor h , respectively. The probabilities p(i,h) , p(i) , and p(h) are estimated by Maximum Likelihood Estimator (MLE) at the document level: …”