2015
DOI: 10.1111/coin.12064
|View full text |Cite
|
Sign up to set email alerts
|

Document Clustering With Dual Supervision Through Feature Reweighting

Abstract: Traditional semi‐supervised clustering uses only limited user supervision in the form of instance seeds for clusters and pairwise instance constraints to aid unsupervised clustering. However, user supervision can also be provided in alternative forms for document clustering, such as labeling a feature by indicating whether it discriminates among clusters. This article thus fills this void by enhancing traditional semi‐supervised clustering with feature supervision, which asks the user to label discriminating f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 20 publications
(13 citation statements)
references
References 14 publications
0
13
0
Order By: Relevance
“…Arguably, human expert judgment is needed to balance these requirements by steering the clustering process and evaluating the results [54]. In the same context, in Bekkerman et al [16], Choo et al [30], Corrêa et al [34], El-Assady et al [42], Hoque and Carenini [53], Hu et al [56][57][58], the interaction supports generating clusters that are based on user's domain knowledge in accordance with their understanding to get clusters that fit user's expectations well. In Awasthi and Zadeh [10], the following scenario is given as motivation: "Consider documents representing news articles that could be clustered as {politics, sports, entertainment, other}; however, perhaps the user would like these articles to be clustered into {news articles, opinion pieces}" ([10], p. 1).…”
Section: Subjective Clusteringmentioning
confidence: 99%
See 4 more Smart Citations
“…Arguably, human expert judgment is needed to balance these requirements by steering the clustering process and evaluating the results [54]. In the same context, in Bekkerman et al [16], Choo et al [30], Corrêa et al [34], El-Assady et al [42], Hoque and Carenini [53], Hu et al [56][57][58], the interaction supports generating clusters that are based on user's domain knowledge in accordance with their understanding to get clusters that fit user's expectations well. In Awasthi and Zadeh [10], the following scenario is given as motivation: "Consider documents representing news articles that could be clustered as {politics, sports, entertainment, other}; however, perhaps the user would like these articles to be clustered into {news articles, opinion pieces}" ([10], p. 1).…”
Section: Subjective Clusteringmentioning
confidence: 99%
“…In several cases, user feedback involves updating the weights, either different data instances or different features [34,[56][57][58]. For example, Babaee et al [11] discovers the semantic structure of synthetic-aperture radar (SAR) image collections by letting users give higher weights to the most influential image in each cluster.…”
Section: Interacting With the Model's Parametersmentioning
confidence: 99%
See 3 more Smart Citations