2020
DOI: 10.20944/preprints202011.0696.v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Geometric Morphometric Data Augmentation using Generative Computational Learning Algorithms

Abstract: The fossil record is notorious for being incomplete and distorted, frequently conditioning the type of knowledge that can be extracted from it. In many cases, this often leads to issues when performing complex statistical analyses, such as classification tasks, predictive modelling, and variance analyses, such as those used in Geometric Morphometrics. Here different Generative Adversarial Network architectures are experimented with, testing the effects of sample size and domain dimensionality on model performa… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
10
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(10 citation statements)
references
References 52 publications
(67 reference statements)
0
10
0
Order By: Relevance
“…Firstly, prior augmentation of each dataset provided both algorithms with enough information to learn from, obtaining above average accuracy when used to classify the original samples. While the present datasets are unable to reach the 100% accuracy reported originally using SVMs 20 , this is likely due to the use of bootstrapping in the original study 41 . Here, more robust data augmentation techniques produced completely new synthetic data from which to learn from, providing a more general overview of the target domain.…”
Section: Discussionmentioning
confidence: 70%
See 4 more Smart Citations
“…Firstly, prior augmentation of each dataset provided both algorithms with enough information to learn from, obtaining above average accuracy when used to classify the original samples. While the present datasets are unable to reach the 100% accuracy reported originally using SVMs 20 , this is likely due to the use of bootstrapping in the original study 41 . Here, more robust data augmentation techniques produced completely new synthetic data from which to learn from, providing a more general overview of the target domain.…”
Section: Discussionmentioning
confidence: 70%
“…Here we have shown how a number of different data science tools can be employed for GM analyses. From one perspective, unsupervised computational learning approaches were able to produce highly realistic augmented datasets, using both neural network based approaches 38 – 41 , as well as Bayesian Inference Engines 42 – 45 . While the use of Graphics Processing Units (GPUs) are likely to speed up GAN performance, MCMC can still be considered the fastest approach to modelling these datasets with exceptional synthetic-data quality.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations