2020
DOI: 10.1101/2020.03.19.998898
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Facing small and biased data dilemma in drug discovery with federated learning

Abstract: Artificial intelligence (AI) models usually require large amounts of high quality training data, which is in striking contrast to the situation of small and biased data faced by current drug discovery pipelines. The concept of federated learning has been proposed to utilize distributed data from different sources without leaking sensitive information of these data. This emerging decentralized machine learning paradigm is expected to dramatically improve the success of AI-powered drug discovery. We here simulat… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
8
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 18 publications
(9 citation statements)
references
References 43 publications
1
8
0
Order By: Relevance
“…Surprisingly, FL has also been used for drug discovery. At least two works, one by [161] and another by [162]. Both works cover FL's applicability for drug discovery, however, these two works tackle specific problems related to drug discovery and apply FL to solve them.…”
Section: ) Ranking Browser History Suggestionsmentioning
confidence: 99%
See 1 more Smart Citation
“…Surprisingly, FL has also been used for drug discovery. At least two works, one by [161] and another by [162]. Both works cover FL's applicability for drug discovery, however, these two works tackle specific problems related to drug discovery and apply FL to solve them.…”
Section: ) Ranking Browser History Suggestionsmentioning
confidence: 99%
“…Both works cover FL's applicability for drug discovery, however, these two works tackle specific problems related to drug discovery and apply FL to solve them. In [161], the focus on incorporating FL for drug discovery deals with datasets with high biases. Biased data can be a huge problem for model training because the data itself can be skewed due to these biases.…”
Section: ) Ranking Browser History Suggestionsmentioning
confidence: 99%
“…There are several works focusing on federated graph neural networks 9 , 39 , 40 , 41 , 42 , 43 , 44 and federated molecular-property prediction. 45 , 46 GraphFL applies MAML to improve the robustness of training. 43 The method in Xie et al.…”
Section: Related Workmentioning
confidence: 99%
“…Other applications of FL include ranking browser history suggestions based on user-interactions [43], visual object detection [44], patient clustering to predict hospital stay time as well as mortality [45], drug discovery [46,47] and brain tumor segmentation [48,49,50]. The application fields of FL are quickly increasing and are a promising research direction of ML.…”
Section: Applicationsmentioning
confidence: 99%