Large-scale analysis of the fossil record requires aggregation of palaeontological data from individual fossil localities. Prior to computers, these synoptic datasets were compiled by hand, a laborious undertaking that took years of effort and forced palaeontologists to make difficult choices about what types of data to tabulate. The advent of desktop computers ushered in palaeontology's first digital revolution-online literature-based databases, such as the Paleobiology Database (PBDB). However, the published literature represents only a small proportion of the palaeontological data housed in museum collections. Although this issue has long been appreciated, the magnitude, and thus potential significance, of these so-called 'dark data' has been difficult to determine. Here, in the early phases of a second digital revolution in palaeontology--the digitization of museum collections-we provide an estimate of the magnitude of palaeontology's dark data. Digitization of our nine institutions' holdings of Cenozoic marine invertebrate collections from California, Oregon and Washington in the USA reveals that they represent 23 times the number of unique localities than are currently available in the PBDB. These data, and the vast quantity of similarly untapped dark data in other museum collections, will, when digitally mobilized, enhance palaeontologists' ability to make inferences about the patterns and processes of past evolutionary and ecological changes.
Body size distributions can vary widely among communities, with important implications for ecological dynamics, energetics, and evolutionary history. Here we present a dataset of body size and shape for 12,035 extant Patellogastropoda (true limpet) specimens from the collections of the University of California Museum of Paleontology, compiled using a novel high-throughput morphometric imaging method. These specimens were collected over the past 150 years at 355 localities along a latitudinal gradient ranging from Alaska to Baja California, Mexico and are presented here with individual images, 2D outline coordinates, and 2D measurements of body size and shape. This dataset provides a resource for assemblage-scale macroecological questions and documents the size and diversity of recent patellogastropods in the northeastern Pacific.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.