Abstract:We introduce the problem of diverse dimension decomposition in transactional databases. A dimension is a set of mutually-exclusive itemsets, and our problem is to find a decomposition of the itemset space into dimensions, which are orthogonal to each other, and that provide high coverage of the input database. The mining framework we propose effectively represents a dimensionality-reducing transformation from the space of all items to the space of orthogonal dimensions. Our approach relies on information-theor… Show more
“…The information theoretic mining framework we have proposed together with Yahoo! Research can be interpreted as a dimensionalityreducing transformation from the space of all tags to the space of orthogonal dimensions [53,54].…”
Section: Analytics On Non-structured Datamentioning
“…The information theoretic mining framework we have proposed together with Yahoo! Research can be interpreted as a dimensionalityreducing transformation from the space of all tags to the space of orthogonal dimensions [53,54].…”
Section: Analytics On Non-structured Datamentioning
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.