Coupling learning is designed to estimate, discover, and extract the interactions and relationships among learning components. It provides insights into complex interactive data and has been extensively incorporated into recommender systems to enhance the interpretability of sophisticated relationships between users and items. Coupling learning can be further fostered once trending collaborative learning is engaged to take advantage of cross-platform data. To facilitate this, privacy-preserving solutions are in high demand: the collaboration should expose neither the private data of each individual owner nor the model parameters trained on their datasets. In this work, we develop a distributed collaborative coupling learning system that enables differential privacy. The proposed system defends against an adversary who has gained full knowledge of the training mechanism and access to the collaboratively trained model. It also addresses the privacy-utility tradeoff via a provably tight sensitivity bound. Our experiments demonstrate that the proposed system guarantees favourable privacy gains at a modest cost in recommendation quality, even in scenarios with a large number of training epochs.

Index Terms: Coupling learning, differential privacy, collaborative learning

COUPLING LEARNING is an emerging research topic that refers to understanding, formalizing, and quantifying the complex relations and interactions, i.e., couplings, hidden in complex data. Effective discovery and extraction of the
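The abstract ties the privacy-utility tradeoff to a provable tight sensitivity bound. As general background only (this is the standard Laplace mechanism, not the authors' system; the function names and parameters below are illustrative assumptions), a minimal sketch shows why a tighter sensitivity bound helps: the noise scale is sensitivity divided by the privacy budget epsilon, so bounding sensitivity more tightly means less noise for the same privacy guarantee.

```python
import math
import random

def laplace_noise(scale, rng):
    """Draw Laplace(0, scale) noise via the inverse-CDF method."""
    u = rng.random() - 0.5  # uniform on [-0.5, 0.5)
    return -scale * math.copysign(1.0, u) * math.log(1.0 - 2.0 * abs(u))

def privatize(value, sensitivity, epsilon, rng):
    """Release `value` with epsilon-differential privacy.

    A tighter sensitivity bound lowers `scale`, so less noise is
    added for the same epsilon -- the privacy-utility tradeoff the
    abstract refers to.
    """
    scale = sensitivity / epsilon
    return value + laplace_noise(scale, rng)

rng = random.Random(0)
true_value = 10.0
# Averaging many noisy releases concentrates around the true value.
releases = [privatize(true_value, sensitivity=1.0, epsilon=1.0, rng=rng)
            for _ in range(10_000)]
print(sum(releases) / len(releases))  # close to 10.0
```

Each individual release is perturbed, but utility (here, the average) degrades only modestly, mirroring the paper's claim of favourable privacy at modest cost.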
Genome-wide analysis has demonstrated both health and social benefits. However, large-scale sharing of such data may reveal sensitive information about individuals. One of the emerging challenges is the identity tracing attack, which exploits correlations among genomic data to reveal the identity of DNA samples. In this paper, we first demonstrate that the adversary can narrow down a sample's identity by detecting his/her genetic relatives, and we quantify this privacy threat with a Shannon entropy-based measurement. For example, we show that when the dataset size reaches 30% of the population, for any target from that population, the uncertainty of the target's identity is reduced to merely 2.3 bits of entropy (i.e., the identity is pinned down to within 5 people). Direct application of existing approaches such as differential privacy (DP), secure multiparty computation (MPC), and homomorphic encryption (HE) may not be suitable for this challenge in genome-wide analysis because of their compromise on utility (i.e., accuracy or efficiency). To address this challenge, this paper proposes a framework named υFRAG to facilitate privacy-preserving data sharing and computation in genome-wide analysis. υFRAG mitigates privacy risks by using vertical fragmentation to disrupt the genetic architecture on which the adversary relies for identity tracing, without sacrificing the capability of genome-wide analysis. We theoretically prove that it preserves the correctness of primitive functionalities and algorithms ranging from basic summary statistics to advanced neural networks. Our experiments demonstrate that υFRAG outperforms MPC and HE protocols, with a speedup of more than 221x for training neural networks, and also outperforms traditional non-private algorithms and a state-of-the-art noise-based DP solution in most settings.
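The 2.3-bit figure in the abstract follows from the usual entropy-to-candidate-set conversion: an uncertainty of H bits corresponds to roughly 2^H equally likely candidate identities, and conversely a pool of N candidates carries log2(N) bits. A quick arithmetic check (illustrative only; the function name is not from the paper):

```python
import math

def candidates_from_entropy(bits):
    """Number of equally likely identities consistent with `bits` of uncertainty."""
    return 2 ** bits

# 2.3 bits of remaining entropy pins the target's identity
# down to about five equally likely individuals: 2^2.3 ~= 4.9.
print(round(candidates_from_entropy(2.3)))  # 5

# Conversely, a candidate pool of 32 people carries log2(32) bits.
print(math.log2(32))  # 5.0
```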