2021
DOI: 10.1038/s43856-021-00008-0
A BERT model generates diagnostically relevant semantic embeddings from pathology synopses with active learning

Abstract: Background: Pathology synopses consist of semi-structured or unstructured text summarizing visual information obtained by observing human tissue. Experts write and interpret these synopses with high domain-specific knowledge to extract tissue semantics and formulate a diagnosis in the context of ancillary testing and clinical information. The limited number of specialists available to interpret pathology synopses restricts the utility of the inherent information. Deep learning offers a tool for informati…
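The abstract describes encoding free-text pathology synopses into semantic embeddings with a BERT model. Below is a minimal sketch of that embedding step, assuming a generic Hugging Face checkpoint and mean pooling; the model name, example synopsis, and pooling choice are illustrative assumptions, not the authors' exact configuration.

```python
# Sketch: encode a pathology synopsis with a BERT encoder and mean-pool the
# token vectors into a single semantic embedding (an assumption; the paper
# may use a different checkpoint or pooling strategy).
import torch
from transformers import AutoModel, AutoTokenizer

MODEL_NAME = "bert-base-uncased"  # assumption: a domain-specific checkpoint may be used instead

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModel.from_pretrained(MODEL_NAME)
model.eval()

# Hypothetical synopsis text, for illustration only.
synopsis = "Hypercellular marrow with trilineage hematopoiesis and increased blasts."

with torch.no_grad():
    inputs = tokenizer(synopsis, return_tensors="pt", truncation=True, max_length=512)
    outputs = model(**inputs)
    # Mean-pool over real tokens (mask out padding) to get one vector per synopsis.
    mask = inputs["attention_mask"].unsqueeze(-1)           # (1, seq_len, 1)
    summed = (outputs.last_hidden_state * mask).sum(dim=1)  # (1, hidden_size)
    embedding = summed / mask.sum(dim=1)                    # (1, 768) for bert-base

print(embedding.shape)  # torch.Size([1, 768])
```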

Cited by 11 publications (15 citation statements)
References 41 publications
“…In terms of the weaknesses of our system, WSI labels came from our previously published BERT model's predictions [46] rather than experts, which are not perfectly correct. This was implemented as it is not practically feasible to manually label many hundreds of WSI with keywords from semi-structured diagnostic synopses.…”
Section: Discussion
confidence: 99%
“…Labels were created by simplifying the predictions of a fine-tuned BERT model on WSI synopses, as previously reported by our group [46]. This BERT model’s predictions took the form of a multi-label task.…”
Section: Methods
confidence: 99%
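The citing paper notes that the fine-tuned BERT model's predictions took the form of a multi-label task, i.e., each synopsis can receive several keyword labels at once. A hedged sketch of that setup follows, using independent sigmoid outputs with per-label thresholding; the label names and the 0.5 threshold are illustrative assumptions.

```python
# Sketch: multi-label keyword prediction with a BERT classification head.
# Each label gets an independent sigmoid probability; labels above a
# threshold are predicted. Labels and threshold are hypothetical.
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

LABELS = ["normal", "acute leukemia", "plasma cell neoplasm"]  # hypothetical keywords

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased",
    num_labels=len(LABELS),
    problem_type="multi_label_classification",  # trains with BCE over independent sigmoids
)
model.eval()

text = "Markedly increased blasts consistent with acute leukemia."
with torch.no_grad():
    logits = model(**tokenizer(text, return_tensors="pt")).logits
probs = torch.sigmoid(logits)[0]

# With an untrained head the output is arbitrary; after fine-tuning,
# this yields the synopsis's predicted keyword set.
predicted = [label for label, p in zip(LABELS, probs) if p > 0.5]
print(predicted)
```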
“…When the data is textual, the extraction of features is a bit different where the aim is to create word or text embeddings. Generally in medical NLP and at the feature extraction level, the BERT (as a state-of-the-art model) was used in several studies as in [14] , [15] , [16] , [17] , whereas, the Word2Vec model was used in [18] , [19] , [20] . In contrast, and at the algorithmic level, various deep learning models were widely used in medical NLP.…”
Section: Literature Review
confidence: 99%
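For contrast with the contextual BERT embeddings above, here is a minimal Word2Vec feature-extraction sketch of the kind the review groups under [18], [19], [20]: static word vectors averaged into a text embedding. The toy corpus and the averaging strategy are assumptions for illustration only.

```python
# Sketch: Word2Vec text features via averaged static word vectors (gensim 4.x).
# Unlike BERT, each word gets one fixed vector regardless of context.
import numpy as np
from gensim.models import Word2Vec

# Tiny hypothetical corpus of tokenized synopses.
corpus = [
    ["hypercellular", "marrow", "with", "increased", "blasts"],
    ["normocellular", "marrow", "with", "trilineage", "hematopoiesis"],
]
model = Word2Vec(sentences=corpus, vector_size=50, window=3, min_count=1, epochs=20)

def embed(tokens):
    """Average the word vectors of in-vocabulary tokens into one text embedding."""
    vectors = [model.wv[t] for t in tokens if t in model.wv]
    return np.mean(vectors, axis=0) if vectors else np.zeros(model.vector_size)

print(embed(["increased", "blasts"]).shape)  # (50,)
```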