Mining Text Data 2012
DOI: 10.1007/978-1-4614-3223-4_8
|View full text |Cite
|
Sign up to set email alerts
|

Probabilistic Models for Text Mining

Abstract: A number of probabilistic methods such as LDA, hidden Markov models, Markov random fields have arisen in recent years for probabilistic analysis of text data. This chapter provides an overview of a variety of probabilistic models for text mining. The chapter focuses more on the fundamental probabilistic techniques, and also covers their various applications to different text mining problems. Some examples of such applications include topic modeling, language modeling, document classification, document clusteri… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2013
2013
2020
2020

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 10 publications
(6 citation statements)
references
References 49 publications
0
6
0
Order By: Relevance
“…In this field, words appearing in documents are related to discrete latent variables, which in turn are called topics. Comprehensive descriptions of topic models and typical applications can be found in the text mining literature (see, e.g., Blei, Ng, and Jordan 2003;Steyvers and Griffiths 2007;Sun, Deng, and Han 2012).…”
Section: Latent Dirichlet Allocationmentioning
confidence: 99%
“…In this field, words appearing in documents are related to discrete latent variables, which in turn are called topics. Comprehensive descriptions of topic models and typical applications can be found in the text mining literature (see, e.g., Blei, Ng, and Jordan 2003;Steyvers and Griffiths 2007;Sun, Deng, and Han 2012).…”
Section: Latent Dirichlet Allocationmentioning
confidence: 99%
“…Among probabilistic models, mixture models can represent subpopulations within a population without explicating to which data aggrupation (or their observed samples) a point belongs (Marin et al., ; Sun et al., ). These models represent the population‘s pdf and they are useful to make estimations.…”
Section: Mathematical Modelmentioning
confidence: 99%
“…This is accomplished by a combination of tools and insights from natural language processing and computational linguistics, augmented by computational intelligence. They involve both rule-based and probability-based approaches 7 and a detailed survey of di↵erent tools that are employed in text mining can be found in Nenkova and McKeown (2012), Aggarwal and Zhai (2012b), and Sun, Deng and Han (2012). The potential applications are diverse, ranging from extracting information regarding new discoveries in biomedical research, gathering information and outlooks that may be useful for finance professionals, to crucial information gathering for intelligence and security services.…”
Section: Text Mining and Sentiment Analysismentioning
confidence: 99%