Crosslingual named entity recognition for clinical de-identification applied to a COVID-19 Italian data set

Catelli, Rosario; Gargiulo, Francesco; Casola, Valentina; Pietro, Giuseppe De; Fujita, Hamido; Esposito, Massimo

doi:10.1016/j.asoc.2020.106779

Cited by 62 publications

(40 citation statements)

References 63 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…In the lone study that integrated heterogeneous data for modeling, Abdalla et al integrated 43 sociodemographic variables from multiple sources (eg, Census Bureau, US Department of Agriculture, Centers for Disease Control and Prevention) and built elastic net models to examine how sociodemographics impacted county-level social distancing ( Table 4 ). 130 Of the remaining studies, 1 used ANN to perform a drive-through mass vaccination simulation, 138 while the other 4 used NLP methods and tools on various research topics, including cross-lingual clinical deidentification in electronic health records (EHRs), 139 dream reports analysis, 140 drug safety analysis by mining the FDA adverse event system, 141 COVID-19 clinical concept (signs and symptoms) identification, and normalization in EHRs. 142 …”

Section: Resultsmentioning

confidence: 99%

The application of artificial intelligence and data integration in COVID-19 studies: a scoping review

Guo

Zhang

Lyu

et al. 2021

Journal of the American Medical Informatics Association

View full text Add to dashboard Cite

Objective To summarize how artificial intelligence (AI) is being applied in COVID-19 research and determine whether these AI applications integrated heterogenous data from different sources for modeling. Materials and Methods We searched 2 major COVID-19 literature databases, the National Institutes of Health’s LitCovid and the World Health Organization’s COVID-19 database on March 9, 2021. Following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guideline, 2 reviewers independently reviewed all the articles in 2 rounds of screening. Results In the 794 studies included in the final qualitative analysis, we identified 7 key COVID-19 research areas in which AI was applied, including disease forecasting, medical imaging-based diagnosis and prognosis, early detection and prognosis (non-imaging), drug repurposing and early drug discovery, social media data analysis, genomic, transcriptomic, and proteomic data analysis, and other COVID-19 research topics. We also found that there was a lack of heterogenous data integration in these AI applications. Discussion Risk factors relevant to COVID-19 outcomes exist in heterogeneous data sources, including electronic health records, surveillance systems, sociodemographic datasets, and many more. However, most AI applications in COVID-19 research adopted a single-sourced approach that could omit important risk factors and thus lead to biased algorithms. Integrating heterogeneous data for modeling will help realize the full potential of AI algorithms, improve precision, and reduce bias. Conclusion There is a lack of data integration in the AI applications in COVID-19 research and a need for a multilevel AI framework that supports the analysis of heterogeneous data from different sources.

show abstract

Section: Resultsmentioning

confidence: 99%

The application of artificial intelligence and data integration in COVID-19 studies: a scoping review

Guo

Zhang

Lyu

et al. 2021

Journal of the American Medical Informatics Association

View full text Add to dashboard Cite

show abstract

“…Different models have been proposed in the literature for predicting COVID-19 spread such as Fong et al [ 14 , 15 ], Hernandez et al [ 16 ]. It should be noted that Catelli et al [ 10 , 11 ] performed interesting studies on Italy dataset. We compare our results for Italy and Turkey with the results of [ 16 ], who used an ARIMA-based model.…”

Section: Resultsmentioning

confidence: 99%

Data driven covid-19 spread prediction based on mobility and mask mandate information

Banerjee

Lian

2021

Appl Intell

View full text Add to dashboard Cite

COVID-19 is one of the largest spreading pandemic diseases faced in the documented history of mankind. Human to human interaction is the most prolific method of transmission of this virus. Nations all across the globe started to issue stay at home orders and mandating to wear masks or a form of face-covering in public to minimize the transmission by reducing contact between majority of the populace. The epidemiological models used in the literature have considerable drawbacks in the assumption of homogeneous mixing among the populace. Moreover, the effect of mitigation strategies such as mask mandate and stay at home orders cannot be efficiently accounted for in these models. In this work, we propose a novel data driven approach using LSTM (Long Short Term Memory) neural network model to form a functional mapping of daily new confirmed cases with mobility data which has been quantified from cell phone traffic information and mask mandate information. With this approach no pre-defined equations are used to predict the spread, no homogeneous mixing assumption is made, and the effect of mitigation strategies can be accounted for. The model learns the spread of the virus based on factual data from verified resources. A study of the number of cases for the state of New York (NY) and state of Florida (FL) in the USA are performed using the model. The model correctly predicts that with higher mobility the cases would increase and vice-versa. It further predicts the rate of new cases would see a decline if a mask mandate is administered. Both these predictions are in agreement with the opinions of leading medical and immunological experts. The model also predicts that with the mask mandate option even a higher mobility would reduce the daily cases than lower mobility without masks. We additionally generate results and provide RMSE (Root Mean Square Error) comparison with ARIMA based model of other published work for Italy, Turkey, Australia, Brazil, Canada, Egypt, Japan, and the UK. Our model reports lower RMSE than the ARIMA based work for all eight countries which were tested. The proposed model would provide administrations with a quantifiable basis of how mobility, mask mandates are related to new confirmed cases; so far no epidemiological models provide that information. It gives fast and relatively accurate prediction of the number of cases and would enable the administrations to make informed decisions and make plans for mitigation strategies and changes in hospital resources.

show abstract

“…Also, a DL based drug detection pipeline has been applied to intend and produce new drug-like compounds against COVID-19 [ 65 ] respectively. At present, many attempts [ 124 ][ 125 ] have been made to enhance new analytical advances with both ML as well as DL. Some of the examples include: ML-based transmission of SARS-CoV-2 analyzing designs utilizing a CRISPR-based virus recognition system was confirmed with high sensitivity and speed [ 27 ].…”

Section: Discussionmentioning

confidence: 99%

Intelligent system for COVID-19 prognosis: a state-of-the-art survey

Nayak

Naik

Dinesh³

et al. 2021

Appl Intell

View full text Add to dashboard Cite

Crosslingual named entity recognition for clinical de-identification applied to a COVID-19 Italian data set

Cited by 62 publications

References 63 publications

The application of artificial intelligence and data integration in COVID-19 studies: a scoping review

The application of artificial intelligence and data integration in COVID-19 studies: a scoping review

Data driven covid-19 spread prediction based on mobility and mask mandate information

Intelligent system for COVID-19 prognosis: a state-of-the-art survey

Contact Info

Product

Resources

About