Effectively and efficiently diagnosing COVID-19 patients with accurate clinical type is essential to achieve optimal outcomes of the patients as well as reducing the risk of overloading the healthcare system. Currently, severe and non-severe COVID-19 types are differentiated by only a few clinical features, which do not comprehensively characterize complicated pathological, physiological, and immunological responses to SARS-CoV-2 invasion in different types. In this study, we recruited 214 confirmed COVID-19 patients in non-severe and 148 in severe type, from Wuhan, China. The patients' comorbidity and symptoms (26 features), and blood biochemistry (26 features) upon admission were acquired as two input modalities. Exploratory analyses demonstrated that these features differed substantially between two clinical types. Machine learning random forest (RF) models using features in each modality were developed and validated to classify COVID-19 clinical types. Using comorbidity/symptom and biochemistry as input independently, RF models achieved >90% and >95% predictive accuracy, respectively. Input features' importance based on Gini impurity were further evaluated and top five features from each modality were identified (age, hypertension, cardiovascular disease, gender, diabetes; D-Dimer, hsTNI, neutrophil, IL-6, and LDH). Combining top 10 multimodal features, RF model achieved >99% predictive accuracy. These findings shed light on how the human body reacts to SARS-CoV-2 invasion as a unity and provide insights on effectively evaluating COVID-19 patient's severity and developing treatment plans accordingly. We suggest that symptoms and comorbidities can be used as an initial screening tool for triaging, while biochemistry and features combined are applied when accuracy is the priority.
SignificanceWe adapted natural language processing to the biological literature and demonstrated end-to-end automated knowledge discovery by exploring subtle word connections. General text mining scanned 21 million publication abstracts and selected a reliable 130,000 from which hypothesis generation algorithms predicted kinases not known to phosphorylate p53, but likely to do so. Six of these p53 kinase candidates passed experimental validation. Among them NEK2 was examined in depth and shown to repress p53 and promote cell division. This work demonstrates the possibility of integrating a vast corpora of written knowledge to compute valuable hypotheses that will often test true and fuel discovery.
IntroductionRegulations are increasing the scope of activities that fall under the remit of drug safety. Currently, individual case safety report (ICSR) collection and collation is done manually, requiring pharmacovigilance professionals to perform many transactional activities before data are available for assessment and aggregated analyses. For a biopharmaceutical company to meet its responsibilities to patients and regulatory bodies regarding the safe use and distribution of its products, improved business processes must be implemented to drive the industry forward in the best interest of patients globally. Augmented intelligent capabilities have already demonstrated success in capturing adverse events from diverse data sources. It has potential to provide a scalable solution for handling the ever-increasing ICSR volumes experienced within the industry by supporting pharmacovigilance professionals’ decision-making.ObjectiveThe aim of this study was to train and evaluate a consortium of cognitive services to identify key characteristics of spontaneous ICSRs satisfying an acceptable level of accuracy determined by considering business requirements and effective use in a real-world setting. The results of this study will serve as supporting evidence for or against implementing augmented intelligence in case processing to increase operational efficiency and data quality consistency.MethodsA consortium of ten cognitive services to augment aspects of ICSR processing were identified and trained through deep-learning approaches. The input data for model training were 20,000 ICSRs received by Celgene drug safety over a 2-year period. The data were manually made machine-readable through the process of transcription, which converts images into text. The machine-readable documents were manually annotated for pharmacovigilance data elements to facilitate the training and testing of the cognitive services. Once trained by cognitive developers, the cognitive services’ output was reviewed by pharmacovigilance subject-matter experts against the accepted ground-truth for correctness and completeness. To be considered adequately trained and functional, each cognitive service was required to reach a threshold of F1 or accuracy score ≥ 75%.ResultsAll ten cognitive services under development have reached an evaluative score ≥ 75% for spontaneous ICSRs.ConclusionAll cognitive services under development have achieved the minimum evaluative threshold to be considered adequately trained, demonstrating how machine-learning and natural language processing techniques together provide accurate outputs that may augment pharmacovigilance professionals’ processing of spontaneous ICSRs quickly and accurately. The intention of augmented intelligence is not to replace the pharmacovigilance professional, but rather support them in their consistent decision-making so that they may better handle the overwhelming amount of data otherwise manually curated and monitored for ongoing drug surveillance requirements. Through this supported decision-making...
The objective of this research was to eliminate the influence of environmental stray light on the measured spectral reflectance in a spectral sensor system used on a weeding teleoperated robot for rape fields. The system mainly consists of a light source, signal acquisition, and data processing. In this study, optical modulation technology and discrete Fourier transform method were used to carry out experiments based on four characteristic wavelengths. The results show that the experimental system can obtain a stable reflectivity value no matter whether the stray light changes slowly or dramatically in the field environment. Furthermore, the calibration equation of spectral reflectance was obtained by curve fitting based on a FieldSpec3 portable spectrometer. The determination coefficients are all close to 1, and the root-meansquare errors are small. Through verification experiments, the results show that the average measurement error is less than 5%. Hence, this proposed method would be very helpful in improving the accuracy and efficiency of the spectral reflectance measurement, which in turn provides a theoretical basis for weed identification during in-field weeding by teleoperated robots.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.