2009
DOI: 10.1007/s10032-009-0089-5

An effective coherence measure to determine topical consistency in user-generated content

Abstract: When searching for blogs on a specific topic, information seekers prefer blogs that place a central focus on that topic over blogs whose mention of the topic is diffuse or incidental. In order to present users with better blog feed search results, we developed a measure of topical consistency that is able to capture whether or not a blog is topically focused. The measure, called the coherence score, is inspired by the genetics literature and captures the tightness of the clustering structure of a data set rela…
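To make the idea of "tightness of the clustering structure" concrete, here is a minimal sketch of one way such a score could be computed. It is not taken from the paper, whose abstract is truncated here. It assumes the score is the fraction of document pairs whose cosine similarity exceeds a threshold, and that the threshold is a percentile of pairwise similarities in a background collection. The function names, the toy vectors, and the use of 95% as that percentile (a value mentioned in one of the citation statements quoted on this page) are all assumptions made for illustration.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity of two equal-length numeric vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def pairwise_similarities(vectors):
    """Cosine similarity for every unordered pair of vectors."""
    return [cosine(u, v) for u, v in combinations(vectors, 2)]


def similarity_threshold(background_vectors, percentile=0.95):
    """Assumed threshold rule: the similarity value at the given percentile
    of all pairwise similarities in a background collection."""
    sims = sorted(pairwise_similarities(background_vectors))
    if not sims:
        raise ValueError("need at least two background vectors")
    index = min(len(sims) - 1, int(percentile * len(sims)))
    return sims[index]


def coherence_score(vectors, threshold):
    """Assumed definition: fraction of document pairs more similar than the threshold."""
    sims = pairwise_similarities(vectors)
    if not sims:
        return 0.0
    return sum(1 for s in sims if s > threshold) / len(sims)


# Hypothetical term-weight vectors: one topically focused set, one diffuse set.
focused = [[1.0, 0.0, 0.0], [1.0, 0.1, 0.0], [0.9, 0.0, 0.1]]
diffuse = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

print(coherence_score(focused, threshold=0.5))
print(coherence_score(diffuse, threshold=0.5))
```

Under these assumptions the focused set scores higher than the diffuse one, because every pair of its documents points in nearly the same direction. A pairwise fraction of this kind depends on how many pairs are close rather than on the average similarity, so a set made of several tight clusters can still earn a reasonable score.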

Citations: cited by 21 publications (18 citation statements)
References: 22 publications
“…Normalized TF video representation appears to be more robust to parameter setting than TF-IDF, since it shows consistent improvement for various values of parameter θ. In [8], the suggested parameter value is 95%, but here it seems that the indicator calculated on concept-based features may be even more robust than the one calculated using conventional (text-based) TF or TF-IDF document representations. Regarding the choice for x cv , we investigate for which choice statistically significant improvements are obtained.…”
Section: Robustness To Parameter Setting (mentioning)
confidence: 96%
“…The approach is low in computational complexity and requires no labeled training data. Further, the coherence-based approach is appealing, because it goes beyond measuring the similarity of the top documents in a results list to measuring their topical clustering structure [8]. The coherence score is thus able to identify a results list as high-quality even in the face of relatively large diversity among the topical clusters in the top of results list.…”
Section: Query Performance Prediction (mentioning)
confidence: 99%
“…This representation is deployed by a QPP framework to evaluate the coherence (e.g. [29]) of the candidate video search list and to select the list which is most likely to respond to a given topical query. The potential power of this hybrid solution can be observed from the fact that the proposed approach is able to select the most suitable video search list for 30% more queries than in the cases where only textual information is used to compare the videos.…”
Section: Advanced Semantic Inference: Inferring the Aboutness Of The (mentioning)
confidence: 99%