Unsupervised Domain Adaptation (UDA) aims to leverage knowledge from a labeled source domain to help the task on an unlabeled target domain. A key step in UDA is minimizing the cross-domain distribution divergence. In this paper, we first propose a novel discrepancy metric, referred to as Cross Domain Mean Approximation (CDMA) discrepancy, to evaluate the distribution difference between the source and target domains; it computes the sum of the squared distances from the samples of each domain to the mean of the other domain. Second, Joint Distribution Adaptation based on Cross Domain Mean Approximation (JDA-CDMA) is developed on the basis of CDMA to extract shared features and simultaneously reduce the marginal and conditional distribution discrepancies between domains during the label refinement process. Third, we construct a classifier utilizing the CDMA metric and neighbor information. Finally, the proposed feature extraction approach and classifier are combined to realize transfer learning. Results from extensive experiments on five visual benchmarks, including object, face, and digit images, show that the proposed methods outperform state-of-the-art unsupervised domain adaptation methods.

I. INTRODUCTION

In machine vision, many machine learning methods, such as Linear Regression [1], Logistic Regression (LR) [2], k-Nearest Neighbor (k-NN) [3], Bayesian [4], Decision Tree [5], and Support Vector Machine (SVM) [6], are applied to image classification tasks. However, when the image feature representation is too redundant or of poor quality, their accuracy is lowered. Therefore, it is of great importance to extract high-quality image features. Feature extraction, as an important means of mining latent image knowledge, is not only conducive to an in-depth understanding of image content but also crucial to improving the accuracy of image classification and recognition [7]. Consequently, it has attracted much attention from researchers.
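As a rough illustration of the CDMA discrepancy described above, the following sketch computes the sum of squared distances from each source sample to the target-domain mean and from each target sample to the source-domain mean. This is inferred from the textual definition only; the paper's exact weighting and normalization may differ.

```python
import numpy as np

def cdma_discrepancy(Xs, Xt):
    """Illustrative CDMA discrepancy between a source sample matrix Xs
    (n_s x d) and a target sample matrix Xt (n_t x d): squared distances
    from each domain's samples to the mean of the other domain.
    Hypothetical sketch, not the authors' reference implementation."""
    mu_s = Xs.mean(axis=0)              # source-domain mean vector
    mu_t = Xt.mean(axis=0)              # target-domain mean vector
    d_s = np.sum((Xs - mu_t) ** 2)      # source samples to target mean
    d_t = np.sum((Xt - mu_s) ** 2)      # target samples to source mean
    return d_s + d_t
```

Under this reading, the discrepancy is zero only when every sample of each domain coincides with the other domain's mean, and it grows as the two domains drift apart.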
Principal Component Analysis (PCA) [8], Independent Component Analysis (ICA) [9], Linear Discriminant Analysis (LDA) [10], Maximum Margin Criterion (MMC) [11], and other algorithms are often used for feature extraction. In order to discover the nonlinear structure hidden in high-dimensional data and mine the local geometric structure of the data, Laplacian Eigen-maps (LE) [12], Locality Linear Embedding