Data sampling methods have been investigated for decades in the context of machine learning and statistical algorithms, with significant progress in recent years driven by strong interest in big data and distributed computing. Recent work can be broadly categorized into random sampling, including density-biased and other nonuniform sampling methods; active learning, a form of semi-supervised learning and an area of intense research; and progressive sampling, which can be viewed as a combination of the two preceding approaches (a minimal sketch of this last idea follows below). This article presents a unified view of these scaling-down sampling methods, complemented with descriptions of the relevant published literature.
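
To make the progressive sampling idea concrete, the sketch below grows a uniform random training sample on a geometric schedule and stops once held-out accuracy plateaus. It is only an illustration of the general scheme described above: the growth factor, stopping tolerance, and the scikit-learn classifier are assumptions chosen for brevity, not choices prescribed by the literature surveyed here.

```python
# A minimal sketch of progressive sampling, assuming scikit-learn and a
# geometric sampling schedule; the growth factor, tolerance, and classifier
# are illustrative assumptions only.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split


def progressive_sample(X, y, n0=100, growth=2.0, tol=1e-3, seed=0):
    """Grow a uniform random sample until held-out accuracy stops improving."""
    rng = np.random.default_rng(seed)
    X_pool, X_val, y_pool, y_val = train_test_split(
        X, y, test_size=0.25, random_state=seed
    )
    n, prev_acc = n0, -np.inf
    while True:
        # Draw a uniform random subsample of the current schedule size.
        idx = rng.choice(len(X_pool), size=min(n, len(X_pool)), replace=False)
        model = LogisticRegression(max_iter=1000).fit(X_pool[idx], y_pool[idx])
        acc = accuracy_score(y_val, model.predict(X_val))
        if acc - prev_acc < tol or n >= len(X_pool):
            return model, n, acc  # accuracy plateaued (or pool exhausted)
        prev_acc, n = acc, int(n * growth)  # geometric schedule: enlarge sample


if __name__ == "__main__":
    X, y = make_classification(n_samples=20_000, n_features=20, random_state=0)
    model, n_used, acc = progressive_sample(X, y)
    print(f"stopped at sample size {n_used} with validation accuracy {acc:.3f}")
```

In this sketch the sample is drawn uniformly at random; a density-biased or active-learning variant would instead weight or select points by estimated density or model uncertainty, which is where the combination of approaches mentioned above comes in.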