2022
DOI: 10.3390/diagnostics12061491
|View full text |Cite
|
Sign up to set email alerts
|

Machine Learning of Raman Spectroscopy Data for Classifying Cancers: A Review of the Recent Literature

Abstract: Raman Spectroscopy has long been anticipated to augment clinical decision making, such as classifying oncological samples. Unfortunately, the complexity of Raman data has thus far inhibited their routine use in clinical settings. Traditional machine learning models have been used to help exploit this information, but recent advances in deep learning have the potential to improve the field. However, there are a number of potential pitfalls with both traditional and deep learning models. We conduct a literature … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
27
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 32 publications
(27 citation statements)
references
References 59 publications
0
27
0
Order By: Relevance
“…A Python library BaselineRemoval contains several methods to remove the noise, caused by fluorescence in Raman spectra, and according to our previous findings [ 54 ], the airPLS algorithm [ 51 ] has a good performance. Principal component analysis (PCA) was used to extract informative features [ 55 , 56 ].…”
Section: Methodsmentioning
confidence: 99%
“…A Python library BaselineRemoval contains several methods to remove the noise, caused by fluorescence in Raman spectra, and according to our previous findings [ 54 ], the airPLS algorithm [ 51 ] has a good performance. Principal component analysis (PCA) was used to extract informative features [ 55 , 56 ].…”
Section: Methodsmentioning
confidence: 99%
“…This allows for hyperparameters to be optimised in an inner CV loop, and then tested against held-out data in an outer CV loop ( Figure 1 ). Nested CV has been shown to reduce a model’s estimated accuracy by as much as 20% in oncological applications of RS, giving a more realistic assessment of the model’s generalisability [ 8 ]. A single metric needs to be selected to optimise: the log loss was chosen as it is a proper scoring metric which utilises distributional information, compared to typical binarised metrics such as accuracy [ 17 ].…”
Section: Materials and Methodsmentioning
confidence: 99%
“…A Raman spectrum represents the change in photon wavenumber from a monochromatic light source along the x-axis and the intensity (i.e., number of photons) thus scattered on the y-axis. RS has been successfully applied to discriminate between numerous cancer types in human tissues, most recently to the brain, breast, cervical, colon, lung, nasopharyngeal, prostate, skin and tongue [ 8 ]. The applications include early diagnosis, biopsy guidance and intraoperative tumour margin detection.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…A strength of the technique is that this information can be reinterpreted as a molecular fingerprint lending an interpretation of the material’s composition in terms of the relative concentration of proteins and specific amino acids, lipids, deoxyribonucleic acid, and ribonucleic acid, as well as water and other metabolites. 4 , 5 …”
Section: Introductionmentioning
confidence: 99%