“…This theory has found many applications in pattern recognition, phylogeny, clustering and classification. For objects that are represented as computer files such applications range from weather forecasting, software, earthquake prediction, music, literature, ocr, bioinformatics, to internet [1], [2], [5], [8]- [10], [12], [19], [20], [22], [23], [25], [31]- [33], [40]. For objects that are only represented by name, or objects that are abstract like "red," "Einstein," "three," the normalized information distance uses background information provided by Google, or any search engine that produces aggregate page counts.…”