2021
DOI: 10.3897/bdj.9.e71378
|View full text |Cite
|
Sign up to set email alerts
|

Molecular Acquisition, Cleaning and Evaluation in R (MACER) - A tool to assemble molecular marker datasets from BOLD and GenBank

Abstract: Molecular sequence data is an essential component for many biological fields of study. The strength of these data is in their ability to be centralised and compared across research studies. There are many online repositories for molecular sequence data, some of which are very large accumulations of varying data types like NCBI’s GenBank. Due to the size and the complexity of the data in these repositories, challenges arise in searching for data of interest. While data repositories exist for molecular markers, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 30 publications
0
3
0
Order By: Relevance
“…Similar computational and statistical tools, in the form of MATLAB packages, R packages, Python packages and methodological pipelines, used to assess anomalies in DNA (meta)barcodes, have been released. Examples include divisive hierarchical clustering: DADA ( Rosen et al 2012 ) and DADA2 ( Callahan et al 2016 ); artificial neural networks: ( Ma et al 2018 ); Profile Hidden Markov Models: coil ( Nugent et al 2020 ), debar ( Nugent et al 2021 and Porter and Hajibabaei 2021 ); distribution sample quantiles: MACER ( Young et al 2021 ); and Shannon entropy: SequenceBouncer ( Dunn 2021 ), A2G2 ( Hleap et al 2020 ), DnoisE ( Antich et al 2022 and Turon et al 2020 ). These methods and programmes are beginning to see widespread use within the biodiversity and regulatory science communities.…”
Section: Discussionmentioning
confidence: 99%
“…Similar computational and statistical tools, in the form of MATLAB packages, R packages, Python packages and methodological pipelines, used to assess anomalies in DNA (meta)barcodes, have been released. Examples include divisive hierarchical clustering: DADA ( Rosen et al 2012 ) and DADA2 ( Callahan et al 2016 ); artificial neural networks: ( Ma et al 2018 ); Profile Hidden Markov Models: coil ( Nugent et al 2020 ), debar ( Nugent et al 2021 and Porter and Hajibabaei 2021 ); distribution sample quantiles: MACER ( Young et al 2021 ); and Shannon entropy: SequenceBouncer ( Dunn 2021 ), A2G2 ( Hleap et al 2020 ), DnoisE ( Antich et al 2022 and Turon et al 2020 ). These methods and programmes are beginning to see widespread use within the biodiversity and regulatory science communities.…”
Section: Discussionmentioning
confidence: 99%
“…DNA sequences were obtained for selected taxa from NCBI GenBank and the Barcode of Life Datasystems with the R (R Core Team, 2020) package MACER (V 2.1) (Young et al, 2021) using the auto_seq_download() function with default settings. A total of 19 species of tuna and common substitutes were included in this study as outlined in Table 3.…”
Section: Inclusion/exclusion Criteriamentioning
confidence: 99%
“…In addition, many computational resources and software have been developed to accommodate the expanding role of DNA barcodes. Some of these packages (e.g., MDOP [19]) help researchers to organize DNA barcoding data before uploading to databases, such as BOLD and NCBI's Genbank, and still others are designed to assess the quality of data that have already been made publicly available (e.g., BAGS [20] and MACER [21]). The quality of DNA barcode data can be impacted by a number of factors, including poor sequence annotation, a lack of physical specimen vouchers, poor sequence quality, and incorrect consensus sequence building.…”
Section: Novel Computational Resources and Softwarementioning
confidence: 99%