K. V. Vorontsov scite author profile

K. V. Vorontsov

3Publications

81Citation Statements Received

27Citation Statements Given

How they've been cited

203

How they cite others

Affiliations

Lomonosov Moscow State University, Moscow Institute of Physics and Technology, Dorodnitsyn Computing Centre

Publications

Order By: Most citations

Additive regularization of topic models

Vorontsov

Potapenko

2014

Mach Learn

View full text Add to dashboard Cite

Probabilistic topic modeling of text collections has been recently developed mainly within the framework of graphical models and Bayesian inference. In this paper we introduce an alternative semi-probabilistic approach, which we call additive regularization of topic models (ARTM). Instead of building a purely probabilistic generative model of text we regularize an ill-posed problem of stochastic matrix factorization by maximizing a weighted sum of the log-likelihood and additional criteria. This approach enables us to combine probabilistic assumptions with linguistic and problem-specific requirements in a single multi-objective topic model. In the theoretical part of the work we derive the regularized EM-algorithm and provide a pool of regularizers, which can be applied together in any combination. We show that many models previously developed within Bayesian framework can be inferred easier within ARTM and in some cases generalized. In the experimental part we show that a combination of sparsing, smoothing, and decorrelation improves several quality measures at once with almost no loss of the likelihood.

show abstract

BigARTM: Open Source Library for Regularized Multimodal Topic Modeling of Large Collections

Vorontsov

Frei

Apishev

et al. 2015

View full text Add to dashboard Cite

Additive regularization for topic models of text collections

Vorontsov

2014

Dokl. Math.

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

K. V. Vorontsov

Additive regularization of topic models

BigARTM: Open Source Library for Regularized Multimodal Topic Modeling of Large Collections

Additive regularization for topic models of text collections

Contact Info

Product

Resources

About