In many databases, biocuration primarily involves literature curation, which usually involves retrieving relevant articles, extracting information that will translate into annotations and identifying new incoming literature. As the volume of biological literature increases, the use of text mining to assist in biocuration becomes increasingly relevant. A number of groups have developed tools for text mining from a computer science/linguistics perspective, and there are many initiatives to curate some aspect of biology from the literature. Some biocuration efforts already make use of a text mining tool, but there have not been many broad-based systematic efforts to study which aspects of a text mining tool contribute to its usefulness for a curation task. Here, we report on an effort to bring together text mining tool developers and database biocurators to test the utility and usability of tools. Six text mining systems presenting diverse biocuration tasks participated in a formal evaluation, and appropriate biocurators were recruited for testing. The performance results from this evaluation indicate that some of the systems were able to improve efficiency of curation by speeding up the curation task significantly (∼1.7- to 2.5-fold) over manual curation. In addition, some of the systems were able to improve annotation accuracy when compared with the performance on the manually curated set. In terms of inter-annotator agreement, the factors that contributed to significant differences for some of the systems included the expertise of the biocurator on the given curation task, the inherent difficulty of the curation and attention to annotation guidelines. After the task, annotators were asked to complete a survey to help identify strengths and weaknesses of the various systems. 
The analysis of this survey highlights how important task completion is to the biocurators’ overall experience of a system, regardless of the system’s high score on design, learnability and usability. In addition, strategies to refine the annotation guidelines and systems documentation, to adapt the tools to the needs and query types the end user might have and to evaluate performance in terms of efficiency, user interface, result export and traditional evaluation metrics have been analyzed during this task. This analysis will help to plan for a more intense study in BioCreative IV.
Background
The development and homeostasis of multicellular organisms depend on sheets of epithelial cells. Bazooka (Baz; PAR-3) localizes to the apical circumference of epithelial cells and is a key hub in the protein interaction network regulating epithelial structure. We sought to identify additional proteins that function with Baz to regulate epithelial structure in the Drosophila embryo.

Methodology/Principal Findings
The baz zygotic mutant cuticle phenotype could be dominantly enhanced by loss of known interaction partners. To identify additional enhancers, we screened molecularly defined chromosome 2 and 3 deficiencies. Thirty-seven deficiencies acted as strong dominant enhancers. Using deficiency mapping, bioinformatics, and available single gene mutations, we identified 17 interacting genes encoding known and predicted polarity, cytoskeletal, transmembrane, trafficking and signaling proteins. For each gene, loss of function enhanced adherens junction defects in zygotic baz mutants during early embryogenesis. To further evaluate involvement in epithelial polarity, we generated GFP fusion proteins for 15 of the genes that had not previously been found to localize to the apical domain. We found that GFP fusion proteins for Drosophila ASAP, Arf79F, CG11210, Septin 5 and Sds22 could be recruited to the apical circumference of epithelial cells. Nine of the other proteins showed various intracellular distributions, and one was not detected.

Conclusions/Significance
Our enhancer screen identified 17 genes that function with Baz to regulate epithelial structure in the Drosophila embryo. Our secondary localization screen indicated that some of the proteins may affect epithelial cell polarity by acting at the apical cell cortex, while others may act through intracellular processes. For 13 of the 17 genes, this is the first report of a link to baz or the regulation of epithelial structure.