2022
DOI: 10.1109/mbits.2022.3205143
|View full text |Cite
|
Sign up to set email alerts
|

Use and Misuse of Machine Learning in Anthropology

Abstract: Machine learning (ML), being now widely accessible to the research community at large, has fostered a proliferation of new and striking applications of these emergent mathematical techniques across a wide range of disciplines. In this paper, we will focus on a particular case study: the field of paleoanthropology, which seeks to understand the evolution of the human species based on biological (e.g. bones, genetics) and cultural (e.g. stone tools) evidence. As we will show, the easy availability of ML algorith… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
4
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 38 publications
0
4
0
Order By: Relevance
“…2, Table 5), which included the separation of training data and testing data prior to the calculation of PCA, algorithms maintain high classification rates (Table 5), with even lower loss values (RMSE = 0.017). This proves that even when trying to reduce the possible contamination of the testing set related to PCA preprocessing as described by Calder et al (2022), MT approaches presented here are able to produce highly powerful classification algorithms that may be more realistically applicable to the fossil record.…”
Section: Machine Teachingmentioning
confidence: 80%
See 3 more Smart Citations
“…2, Table 5), which included the separation of training data and testing data prior to the calculation of PCA, algorithms maintain high classification rates (Table 5), with even lower loss values (RMSE = 0.017). This proves that even when trying to reduce the possible contamination of the testing set related to PCA preprocessing as described by Calder et al (2022), MT approaches presented here are able to produce highly powerful classification algorithms that may be more realistically applicable to the fossil record.…”
Section: Machine Teachingmentioning
confidence: 80%
“…Experiment 1 performed train:test splits after PCA had been performed on the entire dataset, and was then subjected to MT, following the approaches that other studies have used in the past (Courtenay et al, 2021). Experiment 2, however, performed the train:test split on the Procrustes Superimposed coordinates, so as to avoid the possible contamination implied by Calder et al (2022). The train set was then used to calculate PC scores, which were then augmented and the synthetic values used to train a SVM.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations