Species occurrence records provide the basis for many biodiversity studies. They derive from georeferenced specimens deposited in natural history collections and visual observations, such as those obtained through various mobile applications. Given the rapid increase in availability of such data, the control of quality and accuracy constitutes a particular concern. Automatic filtering is a scalable and reproducible means to identify potentially problematic records and tailor datasets from public databases such as the Global Biodiversity Information Facility (GBIF; http://www.gbif.org), for biodiversity analyses. However, it is unclear how much data may be lost by filtering, whether the same filters should be applied across all taxonomic groups, and what the effect of filtering is on common downstream analyses. Here, we evaluate the effect of 13 recently proposed filters on the inference of species richness patterns and automated conservation assessments for 18 Neotropical taxa, including terrestrial and marine animals, fungi, and plants downloaded from GBIF. We find that a total of 44.3% of the records are potentially problematic, with large variation across taxonomic groups (25–90%). A small fraction of records was identified as erroneous in the strict sense (4.2%), and a much larger proportion as unfit for most downstream analyses (41.7%). Filters of duplicated information, collection year, and basis of record, as well as coordinates in urban areas, or for terrestrial taxa in the sea or marine taxa on land, have the greatest effect. Automated filtering can help in identifying problematic records, but requires customization of which tests and thresholds should be applied to the taxonomic group and geographic area under focus. Our results stress the importance of thorough recording and exploration of the meta-data associated with species records for biodiversity research.
28Species occurrence records provide the basis for many biodiversity studies. They derive from geo-referenced specimens deposited in natural history collections and visual observations, such as those obtained through various mobile applications. Given the rapid increase in availability of such data, the control of quality and accuracy constitutes a particular concern. Automatic flagging and filtering are a scalable and reproducible means to identify potentially problematic records in datasets from public databases such as the Global Biodiversity Information Facility (GBIF; www.gbif.org). However, it is unclear how much data may be lost by filtering, whether the same tests should be applied across all taxonomic groups, and what is the effect of filtering for common downstream analyses. Here, we evaluate the effect of 13 recently proposed filters on the inference of species richness patterns and automated conservation assessments for 18 Neotropical taxa including animals, fungi, and plants, terrestrial and marine, downloaded from GBIF. We find that 29-90% of the records are potentially erroneous, with large variation across taxonomic groups. Tests for duplicated information, collection year, basis of record as well as urban areas and coordinates for terrestrial taxa in the sea or marine taxa on land have the greatest effect. While many flagged records might not be de facto erroneous, they could be overly imprecise and increase uncertainty in downstream analyses. Automated flagging can help in identifying problematic records, but requires customization of which tests and thresholds should be applied to the taxonomic group and geographic area under focus. Our results stress the importance of thorough exploration of the meta-data associated with species records for biodiversity research. 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44Publicly available species distribution data have become a crucial resource in biodiversity research, including studies in 46 ecology, biogeography, systematics and conservation biology. In particular, the availability of digitized collections from 47 museums and herbaria, and citizen science observations has increased drastically over the last few years. As of today, 48 the largest public aggregator for geo-referenced species occurrences data, the Global Biodiversity Information Facility 49 (www.gbif.org), provides access to more than 1.3 billion geo-referenced occurrence records for species from across the 50 globe and the tree of life. 51A central challenge to the use of these publicly available species occurrence data in research are erroneous geographic 52 coordinates (Anderson et al. 2016). Errors mostly arise because public databases integrate records collected with 53 different methodologies in different places, at different times; often without centralized curation and only rudimentary 54 meta-data. For instance, erroneous coordinates caused by data-entry errors or automated geo-referencing from vague 55 locality descriptions are common (Maldonado et al. 2015; Yesson et al. 2007)...
Herein, eyeless Pseudosinella species from Brazilian caves are reviewed, including the description of 23 new species, new records plus additional notes on the descriptions of P. ambigua Zeppelini, Brito, and Lima and of P. guanhaensis Zeppelini, Brito, and Lima. We also provide an identification key to 27 eyeless species recorded from Brazil. To organize the 26 Brazilian eyeless taxa analyzed in this work, we organize them in apparently artificial groups: 11 species have one larger tooth on the unguiculus outer lamella (petterseni group); one presents unguiculus outer lamella smooth or serrated (never with a larger tooth), with 9 held prelabral chaetae undivided and the last 6 held prelabral chaetae bifurcated. The Brazilian species of eyeless Pseudosinella herein described present a remarkably conservate dorsal chaetotaxy; therefore, the main diagnostic characters are related to other features like prelabral, labral, and ventral head chaetotaxy and empodial complex morphology. In addition, our study suggests that Brazilian caves possibly shelter a great diversity of Pseudosinella taxa, several of them potentially cave dependent.
Herein we describe two new species of Brazilian springtails, both from Rio Grande do Norte, Brazil. Brachystomella nordestina sp.n. resembles other Neotropical congeners with 8+8 eyes, postantennal organ with four lobes and 19/19/18 chaetae on tibiotarsi I–III, but it is unique in the combination of trilobed apical bulb, 2+2 dorsal chaetae on thorax I, male’s genital plate with 4+4 modified eugenital chaetae and 2+2 hr chaetae on lateral anal valves. Seira (Lepidocyrtinus) dapeste sp.n. is similar to other Neotropical species of Lepidocyrtinus Börner, 1903 in long antennae, mesonotum projected over head and presence of modified blunt macrochaetae on dorsal furca, but differs in the combination of dorsal head chaetotaxy, 14–19 a macrochaetae on thorax II, 7–9 central macrochaetae on thorax III and 4 central macrochaetae on abdomen II. We also present a detailed comparison and an updated key of the Neotropical species of Brachystomella Ågren, 1903.
Mitogenomes represent useful tools for investigating the phylogeny of many metazoan clades. Regarding Collembola, the use of mitogenomics has already shown promising results, but few published works include sufficient taxon sampling to study its evolution and systematics on a broader scale. Here, we present a phylogenetic study based on the mitogenomes of 124 species from 24 subfamilies, 16 families, and four orders—one of the most comprehensive datasets used in a molecular study of Collembola evolution to date—and compare our results with the trees from recently published papers and traditional systematic hypotheses. Our main analysis supported the validity of the four orders and the clustering of Poduromorpha with Entomobryomorpha (the traditional Arthropleona). Our data also supported the split of Symphypleona s. str. into the Appendiciphora and Sminthuridida suborders, and the division of the Neelipleona into two subfamilies: Neelinae and Neelidinae subfam. nov. On the other hand, the traditional Symphypleona s. lat., Isotomoidea, and all the Isotomidae subfamilies were refuted by our analyses, indicating a need for a systematic revision of the latter family. Though our results are endorsed by many traditional and recent systematic findings, we highlight a need for additional mitogenomic data for some key taxa and the inclusion of nuclear markers to resolve some residual problematic relationships.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.