2019
DOI: 10.1002/widm.1335
|View full text |Cite
|
Sign up to set email alerts
|

Core‐sets: An updated survey

Abstract: In optimization or machine learning problems we are given a set of items, usually points in some metric space, and the goal is to minimize or maximize an objective function over some space of candidate solutions. For example, in clustering problems, the input is a set of points in some metric space, and a common goal is to compute a set of centers in some other space (points, lines) that will minimize the sum of distances to these points. In database queries, we may need to compute such a some for a specific q… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
27
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
1

Relationship

2
6

Authors

Journals

citations
Cited by 39 publications
(27 citation statements)
references
References 91 publications
0
27
0
Order By: Relevance
“…The primary application of coresets is to create a compact representation of a large dataset, to allow for fast inference on downstream tasks (see [28] for a recent survey). However, such compact representations have also proved beneficial in interpretation of both models and datasets.…”
Section: Coresets For Understanding Datasets and Modelsmentioning
confidence: 99%
“…The primary application of coresets is to create a compact representation of a large dataset, to allow for fast inference on downstream tasks (see [28] for a recent survey). However, such compact representations have also proved beneficial in interpretation of both models and datasets.…”
Section: Coresets For Understanding Datasets and Modelsmentioning
confidence: 99%
“…A different approach that is to use data summarization techniques. Coresets in particular were first used to solve problems in computational geometry [1] and got increasing attention in both the industry [3,4,5,17,35] and academy [6,8,23,24] over the recent years; see surveys in [20,44,47]. Informally, coreset is a small weighted subset of the input points (unlike e.g.…”
Section: Modern Machine Learningmentioning
confidence: 99%
“…The size of the coreset is usually polynomial in 1/ε but independent or near-logarithmic in the size of the input. Since such a coreset approximates every query (and not just the optimal one), it supports constraint optimization, and the above computation models using merge-and-reduce trees; see details in [20]. Moreover, coresets may be computed in time that is near-linear in the input, even for NP-hard optimization problems.…”
Section: Modern Machine Learningmentioning
confidence: 99%
“…While this overview covers the accurate coreset constructions in literature, as well as the required background for understanding their correctness and proofs, there are a small number of other, different but related, overviews and surveys. The most related recent survey we are aware of is (Feldman, 2020), which: (a) covers the main advantages and disadvantages of (both accurate and non‐accurate) coresets in general, (b) discusses the applications that coresets can make feasible (e.g., the streaming and distributed data models), (c) discusses the different coreset types, and (d) dives in detail into a general framework for (non‐accurate) coreset construction. However, it does not provide the background needed for understanding those techniques and algorithms, and does not discuss concrete coreset construction algorithms and their correctness.…”
Section: Introductionmentioning
confidence: 99%