2007
DOI: 10.1007/s10618-006-0053-7
|View full text |Cite
|
Sign up to set email alerts
|

The complexity of non-hierarchical clustering with instance and cluster level constraints

Abstract: Recent work has looked at extending clustering algorithms with instance level must-link (ML) and cannot-link (CL) background information. Our work introduces δ and ǫ cluster level constraints that influence inter-cluster distances and cluster composition. The addition of background information, though useful at providing better clustering results, raises the important feasibility question: Given a collection of constraints and a set of data, does there exist at least one partition of the data set satisfying al… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
50
0
2

Year Published

2007
2007
2020
2020

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 54 publications
(53 citation statements)
references
References 16 publications
1
50
0
2
Order By: Relevance
“…The use of hard constraints for clustering tasks in purely statistical approaches to learning, as opposed to the ubiquitous use of "soft" prior knowledge, has been approached in, e.g., [15]. A common characteristic of these approaches is that they work with a relatively narrow, semi-formal notion of constraints and do not relate constraints to relational learning.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The use of hard constraints for clustering tasks in purely statistical approaches to learning, as opposed to the ubiquitous use of "soft" prior knowledge, has been approached in, e.g., [15]. A common characteristic of these approaches is that they work with a relatively narrow, semi-formal notion of constraints and do not relate constraints to relational learning.…”
Section: Related Workmentioning
confidence: 99%
“…In contrast to these efforts, our approach allows for rich constraints which take the form of a OWL DL knowledge base (with much higher expressivity). The notion of forbidden pairings of data points (cannot-link constraints [15]) is replaced with the more general notion of logical (un-)satisfiability w.r.t. formal background knowledge.…”
Section: Related Workmentioning
confidence: 99%
“…To provide a proper perspective for our contributions, we mention the assumptions used in this work. Firstly, our earlier work on the feasibility problem [4,5,7] addressed the question of determining whether there is a partition that satisfies a given set of constraints. In this paper, we assume that a clustering satisfying all the initial constraints (C) is given and that the addition of new constraints (C ′ ) may give rise to a combined set of constraints for which no feasible clustering exists for the given value of k (the number of clusters).…”
Section: Problem Definitionsmentioning
confidence: 99%
“…It is not that the graph is not 3-colorable, rather the ordering of the nodes presents a problem. This is of pragmatic importance [7] since the ordering of the points as processed by clustering algorithms is typically fixed apriori and does not change.…”
Section: Brooks's Theoremmentioning
confidence: 99%
See 1 more Smart Citation