2020
DOI: 10.1111/pala.12512
|View full text |Cite
|
Sign up to set email alerts
|

Learning to see the wood for the trees: machine learning, decision trees, and the classification of isolated theropod teeth

Abstract: Taxonomic identification of fossils based on morphometric data traditionally relies on the use of standard linear models to classify such data. Machine learning and decision trees offer powerful alternative approaches to this problem but are not widely used in palaeontology. Here, we apply these techniques to published morphometric data of isolated theropod teeth in order to explore their utility in tackling taxonomic problems. We chose two published datasets consisting of 886 teeth from 14 taxa and 3020 teeth… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 14 publications
(13 citation statements)
references
References 100 publications
0
13
0
Order By: Relevance
“…2019; Wills et al . 2021; MacLeod et al . 2022) we chose to use these 2D linear measurements because these variables are common to most published analyses of isolated theropod tooth datasets (e.g.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…2019; Wills et al . 2021; MacLeod et al . 2022) we chose to use these 2D linear measurements because these variables are common to most published analyses of isolated theropod tooth datasets (e.g.…”
Section: Methodsmentioning
confidence: 99%
“…2020) and machine learning analysis (Wills et al . 2021). Given this and the lack of comparative digital image‐based theropod tooth datasets we feel that the approach we have taken is appropriate.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…It is mainly used for robotics, navigation, and gaming [ 58 ]. Moreover, ML techniques have specific requirements for training data [ 59 , 60 ]. ML can build regression to find the relationship between variables, classify unlabeled datasets, and cluster the data into different groups [ 61 , 62 ].…”
Section: ML Overviewmentioning
confidence: 99%
“…The leaf node represents the target class or classification label, which is the final decision or prediction made after following the path from the root to the leaf (classification rule) [17,18]. The main advantages of DT are as follows: (1) missing data can be accommodated; (2) the data do not need to conform to a normal distribution; (3) outliers have almost no effect on the final classification; (4) categorical data and numerical data can be used as predictors; (5) Transforming predictors have no effect on the tree structure [19]. In this study, when using the DT algorithm to establish the prediction model of yellow leaf disease of arecanut, the Gini diversity index was selected as the splitting criterion, the maximum splitting number was set to 100, and the system default values were used for other parameters.…”
Section: Dt Algorithm Modelmentioning
confidence: 99%