Objective. Wearable devices equipped with plethysmography (PPG) sensors provided a low-cost, long-term solution to early diagnosis and continuous screening of heart conditions. However PPG signals collected from such devices often suffer from corruption caused by artifacts. The objective of this study is to develop an effective supervised algorithm to locate the regions of artifacts within PPG signals. Approach. We treat artifact detection as a 1D segmentation problem. We solve it via a novel combination of an active-contour-based loss and an adapted U-Net architecture. The proposed algorithm was trained on the PPG DaLiA training set, and further evaluated on the PPG DaLiA testing set, WESAD dataset and TROIKA dataset. Main results. We evaluated with the DICE score, a well-established metric for segmentation accuracy evaluation in the field of computer vision. The proposed method outperforms baseline methods on all three datasets by a large margin (≈ 7 percentage points above the next best method). On the PPG DaLiA testing set, WESAD dataset and TROIKA dataset, the proposed method achieved 0.8734±0.0018, 0.9114±0.0033 and 0.8050±0.0116 respectively. The next best method only achieved 0.8068±0.0014, 0.8446±0.0013 and 0.7247±0.0050. Significance. The proposed method is able to pinpoint exact locations of artifacts with high precision; in the past, we had only a binary classification of whether a PPG signal has good or poor quality. This more nuanced information will be critical to further inform the design of algorithms to detect cardiac arrhythmia.
Photoplethysmography (PPG) is a noninvasive way to monitor various aspects of the circulatory system, and is becoming more and more widespread in biomedical processing. Recently, deep learning methods for analyzing PPG have also become prevalent, achieving state of the art results on heart rate estimation, atrial fibrillation detection, and motion artifact identification. Consequently, a need for interpretable deep learning has arisen within the field of biomedical signal processing. In this paper, we pioneer novel explanatory metrics which leverage domain-expert knowledge to validate a deep learning model. We visualize model attention over a whole testset using saliency methods and compare it to human expert annotations. Congruence, our first metric, measures the proportion of model attention within expert-annotated regions. Our second metric, Annotation Classification, measures how much of the expert annotations our deep learning model pays attention to. Finally, we apply our metrics to compare between a signal based model and an image based model for PPG signal quality classification. Both models are deep convolutional networks based on the ResNet architectures. We show that our signal-based one dimensional model acts in a more explainable manner than our image based model; on average 50.78% of the one dimensional model's attention are within expert annotations, whereas 36.03% of the two dimensional model's attention are within expert annotations. Similarly, when thresholding the one dimensional model attention, one can more accurately predict if each pixel of the PPG is annotated as artifactual by an expert. Through this testcase, we demonstrate how our metrics can provide a quantitative and dataset-wide analysis of how explainable the model is.
Bacterial
identification is of great importance in clinical diagnosis,
environmental monitoring, and food safety control. Among various strategies,
matrix-assisted laser desorption/ionization time-of-flight mass spectrometry
(MALDI-TOF MS) has drawn significant interest and has been clinically
used. Nevertheless, current bioinformatics solutions use spectral
libraries for the identification of bacterial strains. Spectral library
generation requires acquisition of MALDI-TOF spectra from monoculture
bacterial colonies, which is time-consuming and not possible for many
species and strains. We propose a strategy for bacterial typing by
MALDI-TOF using protein sequences from public database, that is, UniProt.
Ten genes were identified to encode proteins most often observed by
MALD-TOF from bacteria through 500 times repeated a 10-fold double
cross-validation procedure, using 403 MALDI-TOF spectra corresponding
to 14 genera, 81 species, and 403 strains, and the protein sequences
of 1276 species in UniProt. The 10 genes were then used to annotate
peaks on MALDI-TOF spectra of bacteria for bacterial identification.
With the approach, bacteria can be identified at the genus level by
searching against a database containing the protein sequences of 42
genera of bacteria from UniProt. Our approach identified 84.1% of
the 403 spectra correctly at the genus level. Source code of the algorithm
is available at .
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.