Bucket Selection: A Model-Independent Diverse Selection Strategy for Widening

Fillbrunn

Siebes

2021

Data Min Knowl Disc

Self Cite

This paper provides a unified description of Widening, a framework for the use of parallel (or otherwise abundant) computational resources to improve model quality. We discuss different theoretical approaches to Widening with and without consideration of diversity. We then soften some of the underlying constraints so that Widening can be implemented in real-world algorithms. We summarize earlier experimental results demonstrating the potential impact as well as promising implementation strategies before concluding with a survey of related work.

mentioning

confidence: 88%

Section: Implicit Diversity: Hashed Bucket Selector

mentioning

confidence: 99%

Widening: using parallel resources to improve model quality

Fillbrunn

Siebes

2021

Data Min Knowl Disc

Self Cite

“…Diversity is an important issue in bio- and chem-informatics and has been studied regarding protein and molecular similarity in [11]. In data mining the effect of diversity on the parallel exploration of the solution space was studied in [12]-[16].…”

Section: Sets Of Diverse Portfolios

mentioning

confidence: 99%

Widened Learning of Index Tracking Portfolios

Gavriushina

Sampson

2019 18th IEEE International Conference on Machine Learning and Applications (ICMLA)

et al. 2019

Self Cite

Index investing has an advantage over active investment strategies, because less frequent trading results in lower expenses, yielding higher long-term returns. Index tracking is a popular investment strategy that attempts to find a portfolio replicating the performance of a collection of investment vehicles. This paper considers index tracking from the perspective of solution space exploration. Three search space heuristics in combination with three portfolio tracking error methods are compared in order to select a tracking portfolio with returns that mimic a benchmark index. Experiments conducted on real-world datasets show that Widening, a metaheuristic using diverse parallel search paths, finds solutions superior to those found by the reference heuristics. Presented here are the first results using Widening on time-series data.

“…The widening framework terms this Top-k-widening, i.e., $M_{i+1} = s_{\text{Top-}k}(r(M_i)) : |M_{i+1}| = k$. WIDENING begins to widen the search paths beyond a simple greedy mechanism when diversity is brought into play. The notion of diversity can be implemented in either the refining step as in [24,25] or in the selection step as in [11,12]. Given a diverse refinement operator, $r_\Delta(\cdot)$, as in [24,25], where a diversity function, $\Delta$, is imposed on the output, DIVERSE TOP-K WIDENING is described by…”

Section: Widening

mentioning

confidence: 99%
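
The Top-k-widening step quoted above, M_{i+1} = s_Top-k(r(M_i)) with |M_{i+1}| = k, can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the cited papers: the function name `top_k_widening`, the toy models (subsets of item indices), the refinement operator `refine` (add one unused item), and the scoring function `score` (closeness of the subset sum to a target). The diverse variant is elided in the quote and is not reconstructed here.

```python
from typing import Callable, FrozenSet, Iterable, List

Model = FrozenSet[int]  # hypothetical model type: a set of chosen item indices


def top_k_widening(
    start: Model,
    refine: Callable[[Model], Iterable[Model]],
    score: Callable[[Model], float],
    k: int,
    steps: int,
) -> List[Model]:
    """Iterate M_{i+1} = s_topk(r(M_i)), keeping at most k models per step."""
    models = [start]
    for _ in range(steps):
        # Refinement r(.): collect all refinements of all current models.
        candidates = {c for m in models for c in refine(m)}
        if not candidates:
            break
        # Selection s_topk(.): keep the k best-scoring candidates.
        models = sorted(candidates, key=score, reverse=True)[:k]
    return models


# Toy problem (an assumption, for illustration only): pick items whose
# values sum as close as possible to TARGET.
VALUES = [5, 4, 3, 3]
TARGET = 6


def refine(model: Model) -> List[Model]:
    """Add one item that is not yet in the model."""
    return [model | {i} for i in range(len(VALUES)) if i not in model]


def score(model: Model) -> float:
    """Higher is better; 0 means the target sum is hit exactly."""
    return -abs(sum(VALUES[i] for i in model) - TARGET)


greedy = top_k_widening(frozenset(), refine, score, k=1, steps=2)
widened = top_k_widening(frozenset(), refine, score, k=3, steps=2)
print(score(greedy[0]), score(widened[0]))
```

On this toy problem the greedy run (k = 1) commits to the item with value 5 and ends with a score of -2, while the widened run (k = 3) keeps weaker first-step models alive and reaches the exact target with a score of 0.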

“…Not Faster." Although the demonstrated examples, such as WIDENED KRIMP [24], WIDENED HIERARCHICAL CLUSTERING [11], WIDENED BAYESIAN NETWORKS [25] and BUCKET SELECTION [12] have been able to find superior solutions, i.e., "better," they have been unable to demonstrate this ability in a run-time that is comparable to the standard versions of the greedy algorithms. "Not faster" is not intended to mean "slower.…”

Section: Introduction

mentioning

confidence: 99%

Communication-Free Widened Learning of Bayesian Network Classifiers Using Hashed Fiedler Vectors

Sampson

Borgelt

Advances in Intelligent Data Analysis XVII

2018

Self Cite

Widening is a method where parallel resources are used to find better solutions from greedy algorithms instead of merely trying to find the same solutions more quickly. To date, every example of Widening has used some form of communication between the parallel workers to maintain their distances from one another in the model space. For the first time, we present a communication-free, widened extension to a standard machine learning algorithm. By using Locality Sensitive Hashing on the Bayesian networks' Fiedler vectors, we demonstrate the ability to learn classifiers superior to those of standard implementations and to those generated with a greedy heuristic alone.
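
The idea of hashing Fiedler vectors with Locality Sensitive Hashing can be sketched as follows. This is a rough illustration under stated assumptions, not the method of the paper: the abstract does not say how the graph Laplacian is built from a Bayesian network or which hash family is used. Here the graph is a plain undirected, connected adjacency matrix, the Fiedler vector is approximated by power iteration, and the hash is sign random projection. The names `fiedler_vector`, `lsh_bucket`, and `PATH4` are hypothetical.

```python
import random
from typing import List


def fiedler_vector(adj: List[List[int]], iters: int = 500) -> List[float]:
    """Approximate the Fiedler vector of a connected undirected graph.

    Runs power iteration on (c*I - L), with L = D - A the graph Laplacian,
    while projecting out the constant vector (eigenvalue 0 of L).
    """
    n = len(adj)
    deg = [sum(row) for row in adj]
    c = 2.0 * max(deg)  # upper bound on the largest Laplacian eigenvalue
    v = [float(i) for i in range(n)]  # deterministic, non-constant start
    for _ in range(iters):
        mean = sum(v) / n
        v = [x - mean for x in v]  # stay orthogonal to the constant vector
        w = [
            (c - deg[i]) * v[i] + sum(adj[i][j] * v[j] for j in range(n))
            for i in range(n)
        ]
        norm = sum(x * x for x in w) ** 0.5
        v = [x / norm for x in w]
    # Eigenvectors are defined up to sign; fix it so hashes are reproducible.
    return [-x for x in v] if v[0] < 0 else v


def lsh_bucket(vec: List[float], n_bits: int = 3, seed: int = 0) -> int:
    """Sign-random-projection LSH: one bit per random hyperplane."""
    rng = random.Random(seed)  # all workers must share the same seed
    bucket = 0
    for _ in range(n_bits):
        plane = [rng.gauss(0.0, 1.0) for _ in vec]
        bit = sum(p * x for p, x in zip(plane, vec)) >= 0.0
        bucket = (bucket << 1) | int(bit)
    return bucket


# Path graph on four nodes: 0 - 1 - 2 - 3
PATH4 = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]

v = fiedler_vector(PATH4)
print(v, lsh_bucket(v))
```

One way such a hash could support communication-free operation, again as an assumption, is for each parallel worker to keep only the candidate models whose bucket matches its own worker id, so that workers stay in different regions of the model space without exchanging messages.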