2013 Asilomar Conference on Signals, Systems and Computers 2013
DOI: 10.1109/acssc.2013.6810463
|View full text |Cite
|
Sign up to set email alerts
|

Dynamic topic discovery through sequential projections

Abstract: We present algorithms for topic modeling based on the geometry of cross-document word-frequency patterns. This perspective gains significance under the so called separability condition. This is a condition on existence of novel-words that are unique to each topic. We present a suite of highly efficient algorithms based on data-dependent and random projections of word-frequency patterns to identify novel words and associated topics. We will also discuss the statistical guarantees of the data-dependent projectio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
43
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 20 publications
(43 citation statements)
references
References 18 publications
(27 reference statements)
0
43
0
Order By: Relevance
“…It would therefore be an interesting direction of further research to analyze and apply this model in other contexts. Further it would be of interest to study the robustness of Algorithm 1 if combined with random projections (see [10,4]), and to consider non-additive noise models, e.g., spectral variability in the context of HSI [33].…”
Section: Conclusion and Further Workmentioning
confidence: 99%
“…It would therefore be an interesting direction of further research to analyze and apply this model in other contexts. Further it would be of interest to study the robustness of Algorithm 1 if combined with random projections (see [10,4]), and to consider non-additive noise models, e.g., spectral variability in the context of HSI [33].…”
Section: Conclusion and Further Workmentioning
confidence: 99%
“…As can be seen, the estimated topics do form recognizable themes that can be assigned meaningful labels. The full list of all K = 100 topics estimated on the NYT dataset can be found in [3]. 3 The zzz prefix in the NYT vocabulary is used to annotate certain special named entities.…”
Section: B Real-world Datamentioning
confidence: 99%
“…The full list of all K = 100 topics estimated on the NYT dataset can be found in [3]. 3 The zzz prefix in the NYT vocabulary is used to annotate certain special named entities. For example, zzz nfl annotates NFL.…”
Section: B Real-world Datamentioning
confidence: 99%
“…One of the proposal is in [5], where the incremental primal dual gradient update is proposed. Another proposal is shown in [14], in which the random projection scenario is employed to randomly select a small set of the constraints in solving the optimization problem in (15).…”
Section: Future Workmentioning
confidence: 99%