2021
DOI: 10.1186/s12920-020-00869-9
|View full text |Cite
|
Sign up to set email alerts
|

High performance logistic regression for privacy-preserving genome analysis

Abstract: Background In biomedical applications, valuable data is often split between owners who cannot openly share the data because of privacy regulations and concerns. Training machine learning models on the joint data without violating privacy is a major technology challenge that can be addressed by combining techniques from machine learning and cryptography. When collaboratively training machine learning models with the cryptographic technique named secure multi-party computation, the price paid for… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
54
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 41 publications
(54 citation statements)
references
References 31 publications
0
54
0
Order By: Relevance
“…XGBoost used the same parameters as random forests with an additional learning rate parameter that was varied with the following values (0.01, 0.05,0.1). LR algorithm estimates the maximum probability of datapoints pertaining to a particular label based on the values of the laboratory markers that are independent in nature [85,86]. The model can then be used to make predictions that a data-point belongs to a particular label.…”
Section: Methodsmentioning
confidence: 99%
“…XGBoost used the same parameters as random forests with an additional learning rate parameter that was varied with the following values (0.01, 0.05,0.1). LR algorithm estimates the maximum probability of datapoints pertaining to a particular label based on the values of the laboratory markers that are independent in nature [85,86]. The model can then be used to make predictions that a data-point belongs to a particular label.…”
Section: Methodsmentioning
confidence: 99%
“…The authors used the 5% Bot-IoT dataset, which they split in an 80:20 ratio for training and testing. The logistic cost function [32] was utilized, as it has been shown to be efficient at separating normal from attack traffic. The cost function is defined by the following equation [29]:…”
Section: Related Workmentioning
confidence: 99%
“…LR is a predictive model that maps the relationship between a dependent variable (y) and one or more ratio-level, interval, nominal, or ordinal independent variables [38]. LR is primarily used for binary dependent variables, however, the model can be extended to scenarios involving 3 or more dependent variables (ordinal, multinomial).…”
Section: Logistic Regressionmentioning
confidence: 99%