FLAG: Faster Learning on Anchor Graph with Label Predictor Optimization

Fu, Weijie; Wang, Meng; Hao, Shijie; Mu, Tingting

doi:10.1109/tbdata.2017.2757522

Cited by 6 publications

(4 citation statements)

References 34 publications


“…Graph-based Models: label propagation on sparse graph (approximate search [110], [227], [232]; division and conquer [45], [197]); optimization with anchor graph (single layer [132], [201], [228]; hierarchical layers [77], [200]).…”

Section: Strategies Representative Methods (mentioning)

confidence: 99%

“…After that, hierarchical anchor graphs propose to retain sparse similarities over all instances while keeping a small number of anchors for label inference [200]. In case that the smallest set of anchors still needs to be large and brings considerable computations, FLAG develops label optimizers for further acceleration [77]. Besides, EAGR proposes to perform label smoothness over anchors with pruned adjacency [201].…”

Section: For Graph-based Models (mentioning)

confidence: 99%

“…The reason is that, once a set of anchors can be stored in the memory, anchor graphs can be efficiently constructed with the memory cost of O(md+nk) rather than O(nd). For example, with hierarchical anchor graphs, the classification on 8 million instances could be implemented on a personal computer within 2 mins [77], [200].…”

Section: For Graph-based Models (mentioning)

confidence: 99%


A Survey on Large-Scale Machine Learning

Wang et al. 2020

IEEE Trans. Knowl. Data Eng.

Self Cite


Machine learning can provide deep insights into data, allowing machines to make high-quality predictions, and has been widely used in real-world applications such as text mining, visual classification, and recommender systems. However, most sophisticated machine learning approaches suffer from huge time costs when operating on large-scale data. This issue calls for Large-scale Machine Learning (LML), which aims to learn patterns from big data efficiently and with comparable performance. In this paper, we offer a systematic survey of existing LML methods to provide a blueprint for future developments in this area. We first divide these LML methods according to the ways of improving the scalability: 1) model simplification on computational complexities, 2) optimization approximation on computational efficiency, and 3) computation parallelism on computational capabilities. Then we categorize the methods in each perspective according to their targeted scenarios and introduce representative methods in line with intrinsic strategies. Lastly, we analyze their limitations and discuss potential directions as well as open issues that are promising to address in the future.


“…As a result, the computational complexity can be greatly reduced. While there are different ways to build the adjacency matrix S in AGR [24][25][26], we argue that most of them are developed intuitively and lack a probability explanation. In addition, AGR cannot directly infer the class labels of incoming data.…”

Section: Introduction (mentioning)

confidence: 92%

Sub-Graph Regularization on Kernel Regression for Robust Semi-Supervised Dimensionality Reduction

Liu, Zhao, Kong 2019

Entropy


Dimensionality reduction has always been a major problem when handling high-dimensional datasets. Because they use labeled data, supervised dimensionality reduction methods such as Linear Discriminant Analysis tend to achieve better classification performance than unsupervised methods. However, supervised methods need sufficient labeled data in order to achieve satisfying results. Therefore, semi-supervised learning (SSL) methods can be a practical choice compared with relying on labeled data alone. In this paper, we develop a novel SSL method by extending anchor graph regularization (AGR) for dimensionality reduction. In detail, AGR is an accelerated semi-supervised learning method that propagates class labels to unlabeled data. However, it cannot handle new incoming samples. We thereby improve AGR by adding kernel regression to the basic objective function of AGR. Therefore, the proposed method can not only estimate the class labels of unlabeled data but also achieve dimensionality reduction. Extensive simulations on several benchmark datasets are conducted, and the simulation results verify the effectiveness of the proposed work.
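
For readers unfamiliar with AGR, the label propagation step mentioned in the abstract above can be sketched roughly as follows. This is a minimal illustration of the commonly described anchor graph regularization formulation (sparse data-to-anchor weights, a reduced Laplacian over anchors, and a small linear solve for anchor labels), not the method of FLAG or of the citing paper. The function names `anchor_weights` and `agr_fit`, the Gaussian kernel, and the parameters `s`, `sigma`, and `gamma` are assumptions made for this example.

```python
import numpy as np

def anchor_weights(X, anchors, s=2, sigma=1.0):
    """Row-normalized kernel weights from each point to its s nearest anchors."""
    # Squared Euclidean distances between all points and anchors, shape (n, m).
    d2 = ((X[:, None, :] - anchors[None, :, :]) ** 2).sum(axis=2)
    idx = np.argsort(d2, axis=1)[:, :s]          # s nearest anchors per point
    rows = np.arange(X.shape[0])[:, None]
    w = np.exp(-d2[rows, idx] / (2.0 * sigma ** 2))
    Z = np.zeros_like(d2)
    Z[rows, idx] = w / w.sum(axis=1, keepdims=True)
    return Z

def agr_fit(Z, labeled_idx, y_labeled, n_classes, gamma=0.01):
    """Solve for anchor label scores A so that Z @ A gives per-point scores."""
    Zl = Z[labeled_idx]
    Y = np.eye(n_classes)[y_labeled]             # one-hot labels
    ZtZ = Z.T @ Z
    lam = np.maximum(Z.sum(axis=0), 1e-12)       # anchor degrees (guard zeros)
    L_red = ZtZ - ZtZ @ np.diag(1.0 / lam) @ ZtZ  # reduced Laplacian, (m, m)
    M = Zl.T @ Zl + gamma * L_red
    # Least squares instead of a plain solve, in case M is rank deficient.
    return np.linalg.lstsq(M, Zl.T @ Y, rcond=None)[0]

# Toy data (assumed for illustration): two well-separated 2-D clusters.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 0.3, (20, 2)), rng.normal(5.0, 0.3, (20, 2))])
anchors = np.array([[-0.2, 0.0], [0.2, 0.0], [4.8, 5.0], [5.2, 5.0]])

Z = anchor_weights(X, anchors, s=2)
A = agr_fit(Z, labeled_idx=np.array([0, 20]), y_labeled=np.array([0, 1]), n_classes=2)
pred = (Z @ A).argmax(axis=1)                    # predicted class per point
```

Only one labeled point per cluster is used here; the anchor-level solve involves an m × m system (m = 4 anchors) rather than an n × n one, which is the source of the speed-up the abstract refers to. Labeling a new sample would require computing its own row of `Z`, which is the limitation the abstract says the kernel regression extension is meant to address.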
