There is an urgent need to improve the infrastructure supporting the reuse of scholarly data. A diverse set of stakeholders—representing academia, industry, funding agencies, and scholarly publishers—have come together to design and jointly endorse a concise and measureable set of principles that we refer to as the FAIR Data Principles. The intent is that these may act as a guideline for those wishing to enhance the reusability of their data holdings. Distinct from peer initiatives that focus on the human scholar, the FAIR Principles put specific emphasis on enhancing the ability of machines to automatically find and use the data, in addition to supporting its reuse by individuals. This Comment is the first formal publication of the FAIR Principles, and includes the rationale behind them, and some exemplar implementations in the community.
Some cases of Alzheimer's disease are inherited as an autosomal dominant trait. Genetic linkage studies have mapped a locus (AD3) associated with susceptibility to a very aggressive form of Alzheimer's disease to chromosome 14q24.3. We have defined a minimal cosegregating region containing the AD3 gene, and isolated at least 19 different transcripts encoded within this region. One of these transcripts (S182) corresponds to a novel gene whose product is predicted to contain multiple transmembrane domains and resembles an integral membrane protein. Five different missense mutations have been found that cosegregate with early-onset familial Alzheimer's disease. Because these changes occurred in conserved domains of this gene, and are not present in normal controls, they are likely to be causative of AD3.
Systemic lupus erythematosus (SLE, OMIM 152700) is a complex autoimmune disease that affects 0.05% of the Western population, predominantly women. A number of susceptibility loci for SLE have been suggested in different populations, but the nature of the susceptibility genes and mutations is yet to be identified. We previously reported a susceptibility locus (SLEB2) for Nordic multi-case families. Within this locus, the programmed cell death 1 gene (PDCD1, also called PD-1) was considered the strongest candidate for association with the disease. Here, we analyzed 2,510 individuals, including members of five independent sets of families as well as unrelated individuals affected with SLE, for single-nucleotide polymorphisms (SNPs) that we identified in PDCD1. We show that one intronic SNP in PDCD1 is associated with development of SLE in Europeans (found in 12% of affected individuals versus 5% of controls; P = 0.00001, r.r. (relative risk) = 2.6) and Mexicans (found in 7% of affected individuals versus 2% of controls; P = 0.0009, r.r. = 3.5). The associated allele of this SNP alters a binding site for the runt-related transcription factor 1 (RUNX1, also called AML1) located in an intronic enhancer, suggesting a mechanism through which it can contribute to the development of SLE in humans.
The BioMart Community Portal (www.biomart.org) is a community-driven effort to provide a unified interface to biomedical databases that are distributed worldwide. The portal provides access to numerous database projects supported by 30 scientific organizations. It includes over 800 different biological datasets spanning genomics, proteomics, model organisms, cancer data, ontology information and more. All resources available through the portal are independently administered and funded by their host organizations. The BioMart data federation technology provides a unified interface to all the available data. The latest version of the portal comes with many new databases that have been created by our ever-growing community. It also comes with better support and extensibility for data analysis and visualization tools. A new addition to our toolbox, the enrichment analysis tool is now accessible through graphical and web service interface. The BioMart community portal averages over one million requests per day. Building on this level of service and the wealth of information that has become available, the BioMart Community Portal has introduced a new, more scalable and cheaper alternative to the large data stores maintained by specialized organizations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.