2022
DOI: 10.3233/cbm-210306
|View full text |Cite
|
Sign up to set email alerts
|

Optimal vocabulary selection approaches for privacy-preserving deep NLP model training for information extraction and cancer epidemiology

Abstract: BACKGROUND: With the use of artificial intelligence and machine learning techniques for biomedical informatics, security and privacy concerns over the data and subject identities have also become an important issue and essential research topic. Without intentional safeguards, machine learning models may find patterns and features to improve task performance that are associated with private personal information. OBJECTIVE: The privacy vulnerability of deep learning models for information extraction from medical… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(7 citation statements)
references
References 30 publications
0
7
0
Order By: Relevance
“…Automated approaches to clinical epidemiology include “Data extraction for epidemiological research” or DExtER [ 35 ]. NLP approaches to information extraction include analyzing social media to analyze drug abuse epidemiology[ 36 ] and detecting cancer cases to calculate epidemiologic prevalence [ 37 , 38 ].…”
Section: Introductionmentioning
confidence: 99%
“…Automated approaches to clinical epidemiology include “Data extraction for epidemiological research” or DExtER [ 35 ]. NLP approaches to information extraction include analyzing social media to analyze drug abuse epidemiology[ 36 ] and detecting cancer cases to calculate epidemiologic prevalence [ 37 , 38 ].…”
Section: Introductionmentioning
confidence: 99%
“…Also, the sharing of good practices for data linkage within EU member states has been proposed ( 41 ). Although information technology has played a positive role in quality improvement and facility of cancer registries, more strict restriction strategies, such as identifying authentication levels, controlling and coding data approaches, tailored de-identification methods, and other technical measures, had to be developed to secure the patients’ privacy ( 22 , 42 , 43 ).…”
Section: Discussionmentioning
confidence: 99%
“…In addition, data from patients are governed by privacy laws ( 194 ). The lack of supervision of these data may lead to breaching patient privacy rules; therefore, appropriate intervention and the improvement of laws and regulations are required.…”
Section: Current Challenges and Future Prospectsmentioning
confidence: 99%
“…Certainly, studies have focused on solving the privacy problem with regards to patient data. Under the same performance, the privacy vulnerability is reduced by vocabulary selection means ( 194 ). At the same time, fusing these data should comply with the principles of medical ethics.…”
Section: Current Challenges and Future Prospectsmentioning
confidence: 99%