Proceedings of the 10th ACM Symposium on Document Engineering 2010
DOI: 10.1145/1860559.1860624
|View full text |Cite
|
Sign up to set email alerts
|

On helmholtz's principle for documents processing

Abstract: Keyword extraction is a fundamental problem in text data mining and document processing. A large number of document processing applications directly depend on the quality and speed of keyword extraction algorithms. In this article, a novel approach to rapid change detection in data streams and documents is developed. It is based on ideas from image processing and especially on the Helmholtz Principle from the Gestalt Theory of human perception. Applied to the problem of keywords extraction, it delivers fast an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
29
0
2

Year Published

2012
2012
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 20 publications
(31 citation statements)
references
References 6 publications
0
29
0
2
Order By: Relevance
“…Any 100-sequence of heads and tails can be generated with probability of (½)100 and Fig. 3 is generated where 1 represents heads and 0 represents tails (Balinsky et al, 2010). First sequence, s 1 is expectable for unbiased coin but second output, s 2 is highly unexpected.…”
Section: Helmholtz Principle From Gestalt Theory and Its Applicationsmentioning
confidence: 99%
See 3 more Smart Citations
“…Any 100-sequence of heads and tails can be generated with probability of (½)100 and Fig. 3 is generated where 1 represents heads and 0 represents tails (Balinsky et al, 2010). First sequence, s 1 is expectable for unbiased coin but second output, s 2 is highly unexpected.…”
Section: Helmholtz Principle From Gestalt Theory and Its Applicationsmentioning
confidence: 99%
“…This can be explained by using methods from statistical physics where we observe macro parameters but we don't know the particular configuration. For instance expectation calculations can be used for this purpose (Balinsky et al, 2010).…”
Section: Helmholtz Principle From Gestalt Theory and Its Applicationsmentioning
confidence: 99%
See 2 more Smart Citations
“…The corpus has been used in many text mining studies (see e.g. [17], [18], [19]) but according to our knowledge the present analysis is unique in its kind.…”
Section: B Text Mining Of Subjectivitymentioning
confidence: 99%