Aashish Jain scite author profile

Background: The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function.

Results: Here, we report on the results of the third CAFA challenge, CAFA3, that featured an expanded analysis over the previous CAFA rounds, both in terms of volume of data analyzed and the types of analysis performed. In a novel and major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes. Specifically, we performed experimental whole-genome mutation screening in Candida albicans and Pseudomonas aeruginosa genomes, which provided us with genome-wide experimental data for genes associated with biofilm formation and motility. We further performed targeted assays on selected genes in Drosophila melanogaster, which we suspected of being involved in long-term memory.

Conclusion: We conclude that while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not. Term-centric prediction of experimental annotations remains equally challenging; although the performance of the top methods is significantly better than the expectations set by baseline methods in C. albicans and D. melanogaster, it leaves considerable room and need for improvement. Finally, we report that the CAFA community now involves a broad range of participants with expertise in bioinformatics, biological experimentation, biocuration, and bio-ontologies, working together to improve functional annotation, computational function prediction, and our ability to manage big data in the era of large experimental screens.


The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens

Zhou, Jiang, Bergquist et al. 2019

Preprint


The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function. Here we report on the results of the third CAFA challenge, CAFA3, that featured an expanded analysis over the previous CAFA rounds, both in terms of volume of data analyzed and the types of analysis performed. In a novel and major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes. Specifically, we performed experimental whole-genome mutation screening in Candida albicans and Pseudomonas aeruginosa genomes, which provided us with genome-wide experimental data for genes associated with biofilm formation and motility (P. aeruginosa only). We further performed targeted assays on selected genes in Drosophila melanogaster, which we suspected of being involved in long-term memory. We conclude that, while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not. Term-centric prediction of experimental annotations remains equally challenging; although the performance of the top methods is significantly better than expectations set by baseline methods in C. albicans and D. melanogaster, it leaves considerable room and need for improvement. We finally report that the CAFA community now involves a broad range of participants with expertise in bioinformatics, biological experimentation, biocuration, and bioontologies, working together to improve functional annotation, computational function prediction, and our ability to manage big data in the era of large experimental screens.
Predicting GO terms for a protein (protein-centric) and predicting which proteins are associated with a given function (term-centric) are related but different computational problems: the former is a multi-label classification problem with a structured output, while the latter is a binary classification task. Predicting the results of a genome-wide screen for a single or a small number of functions fits the term-centric formulation. To see how well all participating CAFA methods perform term-centric predictions, we mapped results from the protein-centric CAFA3 methods onto these terms. In addition, we held a separate CAFA challenge, CAFA-π, whose purpose was to attract additional submissions from algorithms that specialize in term-centric tasks.

We performed screens for three functions in three species, which we then used to assess protein function prediction. In the bacterium Pseudomonas aeruginosa and the fungus Candida albicans we performed genome-wide screens capable of uncovering genes with two functions, biofilm formation (GO:0042710) and motility (for P. aeruginosa only) (GO:0001539), as described in Methods. In Drosophila melanogaster we performed targeted assays, guided by previous CAFA submissions, of a ...
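
The difference between the protein-centric and term-centric views described in this excerpt can be illustrated with a small sketch. It is not the CAFA3 assessment code: the protein IDs, scores, the `term_centric_view` helper, and the choice to assign a default score of 0.0 to proteins with no prediction for a term are all assumptions made for illustration. Only the two GO identifiers come from the text.

```python
from typing import Dict, List, Tuple

# Hypothetical protein-centric predictions: protein ID -> {GO term: score}.
# Protein IDs and scores are invented; only the GO IDs appear in the text.
protein_centric: Dict[str, Dict[str, float]] = {
    "P1": {"GO:0042710": 0.9, "GO:0001539": 0.2},
    "P2": {"GO:0042710": 0.1},
    "P3": {"GO:0001539": 0.7},
}


def term_centric_view(
    predictions: Dict[str, Dict[str, float]],
    term: str,
    default: float = 0.0,
) -> List[Tuple[str, float]]:
    """Rank all proteins for a single GO term (a binary, per-term problem).

    Proteins without a prediction for `term` receive `default` (an
    assumption of this sketch). Ties are broken by protein ID.
    """
    scored = [(protein, terms.get(term, default))
              for protein, terms in predictions.items()]
    return sorted(scored, key=lambda item: (-item[1], item[0]))


# Term-centric rankings for biofilm formation and motility.
biofilm_ranking = term_centric_view(protein_centric, "GO:0042710")
motility_ranking = term_centric_view(protein_centric, "GO:0001539")
```

Each ranking could then be compared against the outcome of a genome-wide screen for that one function, which is why screen data fit the term-centric formulation.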


Dynamic Crossover Scaling in Polymer Solutions

2012


The crossover region in the phase diagram of polymer solutions, in the regime above the overlap concentration, is explored by Brownian Dynamics simulations, to map out the universal crossover scaling functions for the gyration radius and the single-chain diffusion constant. Scaling considerations, our simulation results, and recently reported data on the polymer contribution to the viscosity obtained from rheological measurements on DNA systems, support the assumption that there are simple relations between these functions, such that they can be inferred from one another.


Optimization of a Brownian-dynamics algorithm for semidilute polymer solutions

et al. 2012


Simulating the static and dynamic properties of semidilute polymer solutions with Brownian dynamics (BD) requires the computation of a large system of polymer chains coupled to one another through excluded-volume and hydrodynamic interactions. In the presence of periodic boundary conditions, long-ranged hydrodynamic interactions are frequently summed with the Ewald summation technique. By performing detailed simulations that shed light on the influence of several tuning parameters involved both in the Ewald summation method and in the efficient treatment of Brownian forces, we develop a BD algorithm in which the computational cost scales as O(N^1.8), where N is the number of monomers in the simulation box. We show that Beenakker's original implementation of the Ewald sum, which is only valid for systems without bead overlap, can be modified so that θ-solutions can be simulated by switching off excluded-volume interactions. A comparison of the predictions of the radius of gyration, the end-to-end vector, and the self-diffusion coefficient by BD, at a range of concentrations, with the hybrid Lattice Boltzmann/Molecular Dynamics (LB/MD) method shows excellent agreement between the two methods. In contrast to the situation for dilute solutions, the LB/MD method is shown to be significantly more computationally efficient than the current implementation of BD for simulating semidilute solutions. We argue, however, that further optimisations should be possible.
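
To give a feel for what a cost that scales as N to the power 1.8 means in practice, the following sketch computes how the cost grows when the number of monomers changes. It is plain arithmetic on the exponent quoted in the abstract; the `cost_ratio` helper is hypothetical, and the prefactor of the real algorithm (which the abstract does not give) cancels out of the ratio.

```python
def cost_ratio(n_old: int, n_new: int, exponent: float = 1.8) -> float:
    """Relative cost of a method whose cost scales as N**exponent.

    The unknown prefactor cancels, so only the ratio of system sizes matters.
    """
    if n_old <= 0 or n_new <= 0:
        raise ValueError("monomer counts must be positive")
    return (n_new / n_old) ** exponent


# Doubling the number of monomers in the simulation box.
doubling = cost_ratio(1000, 2000)

# The same doubling under a hypothetical quadratic scaling, for comparison.
doubling_quadratic = cost_ratio(1000, 2000, exponent=2.0)
```

Under these assumptions, doubling N raises the cost by a factor of 2^1.8, roughly 3.5, compared with a factor of 4 for a method that scales quadratically.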


The Balancing Act of Intrinsically Disordered Proteins: Enabling Functional Diversity while Minimizing Promiscuity

Macossay-Castillo, Marvelli, Guharoy et al. 2019

Journal of Molecular Biology

