Despite our rapidly growing knowledge about the human genome, we do not know all of the genes required for some of the most basic functions of life. To start to fill this gap we developed a high-throughput phenotypic screening platform combining potent gene silencing by RNA interference, time-lapse microscopy and computational image processing. We carried out a genome-wide phenotypic profiling of each of the ,21,000 human protein-coding genes by two-day live imaging of fluorescently labelled chromosomes. Phenotypes were scored quantitatively by computational image processing, which allowed us to identify hundreds of human genes involved in diverse biological functions including cell division, migration and survival. As part of the Mitocheck consortium, this study provides an in-depth analysis of cell division phenotypes and makes the entire high-content data set available as a resource to the community.To target the ,21,000 protein-coding genes in the human genome, we used a chemically synthesized short interfering RNA (siRNA) library designed to uniquely target each gene with 2-3 independent sequences (Supplementary Methods). The siRNAs in this library were tested individually and reduced the messenger RNAs of targeted genes to below 30% of original levels (to an average of 13%) for 97% of more than 1,000 genes tested (Supplementary Table 1). To allow high-throughput phenotyping of each individual siRNA in triplicates by live-cell imaging, we used a previously established workflow for solid-phase transfection using siRNA microarrays coupled to automatic time-lapse microscopy 1 . As a high-content phenotypic assay we chose to monitor fluorescent chromosomes in a human cell line stably expressing core histone 2B tagged with green fluorescent protein (GFP) 1 . After seeding on the siRNA microarrays, on average 67 (630) cells for each siRNA of the library were imaged in triplicates for 2 days, thus documenting many of their basic functions such as cell division, proliferation, survival and migration. Image processing reveals mitotic hitsThis resulted in a large data set of ,190,000 time-lapse movies providing time-resolved records of over 19 million cell divisions. To automatically score and annotate phenotypes in this large data set, we developed a computational pipeline 2 ( Fig. 1) extending previously established methods of morphology recognition by supervised machine learning [3][4][5][6] . In brief, after segmentation, about 200 quantitative features were extracted from each nucleus and used for classification into one of 16 morphological classes ( Fig. 1 and Supplementary Movies 1-30) by a support vector machine classifier previously trained on a set of ,3,000 manually annotated nuclei (Supplementary Methods). This classifier automatically recognizes changes in nuclear morphology due to the cell cycle, cell death or other phenotypic changes with an overall accuracy of 87% (Supplementary Fig. 1) and allows us to convert each time-lapse movie into a phenotypic profile that quantifies the response to each siRNA ...
The interpretation of genome sequences requires reliable and standardized methods to assess protein function at high throughput. Here we describe a fast and reliable pipeline to study protein function in mammalian cells based on protein tagging in bacterial artificial chromosomes (BACs). The large size of the BAC transgenes ensures the presence of most, if not all, regulatory elements and results in expression that closely matches that of the endogenous gene. We show that BAC transgenes can be rapidly and reliably generated using 96-well-format recombineering. After stable transfection of these transgenes into human tissue culture cells or mouse embryonic stem URL.The BACFinder clone search and oligo design tool is available online at http://www.mitocheck.org/cgi-bin/BACfinder.Database accession codes. The ChIP/chip data has been submitted to the Gene Expression Omnibus database with accession number GSE10845. COMPETING INTERESTS STATEMENTThe authors declare competing financial interests: details accompany the full-text HTML version of the paper at http:// www.nature.com/naturemethods/. Europe PMC Funders GroupAuthor Manuscript Nat Methods. Author manuscript; available in PMC 2010 May 17. Europe PMC Funders Author ManuscriptsEurope PMC Funders Author Manuscripts cells, the localization, protein-protein and/or protein-DNA interactions of the tagged protein are studied using generic, tag-based assays. The same high-throughput approach will be generally applicable to other model systems.At a time when the 'thousand-dollar genome' seems a realistic goal for the near future, methods for dissecting the functions of the encoded genetic information lag far behind the genome sequence, both in throughput and in quality of the produced data. Genome sequencing and subsequent bioinformatics analysis have made it possible to study the function of genes in mammalian tissue culture cells using systematic reverse-genetic approaches1-3 and have radically improved researchers' ability to identify human disease genes. Such studies typically identify single genes, whose biological function has often not yet been described. In order to place the proteins these genes encode in pathways, these studies must be followed by detailed molecular-level analysis, of which the most powerful types are protein localization and protein-protein interaction. The power of protein localization and protein-protein interaction studies can be seen from the genome-wide application of GFP localization and tandem affinity tag-based complex purification in the yeast Saccharomyces cerevisiae, which has produced a comprehensive picture of the core proteome of a simple, well-studied model system4-8. The key advantage of yeast for these studies was their efficient intrinsic homologous recombination, which allowed the same tagcoding sequence to be introduced at the endogenous locus of nearly every gene of the genome. The tagged proteins were then systematically analyzed through standardized, generic, tag-based assays.To transfer this approach to mammali...
Chromosome segregation and cell division are essential, highly ordered processes that depend on numerous protein complexes. Results from recent RNA interference (RNAi) screens indicate that the identity and composition of these protein complexes is incompletely understood. Using gene tagging on bacterial artificial chromosomes, protein localization and tandem affinity purificationmass spectrometry, the MitoCheck consortium has analyzed about 100 human protein complexes, many of which had not or only incompletely been characterized. This work has led to the discovery of previously unknown, evolutionarily conserved subunits of the anaphase-promoting complex (APC/C) and the γ-tubulin ring complex (γ-TuRC), large complexes which are essential †
TreeFam (http://www.treefam.org) was developed to provide curated phylogenetic trees for all animal gene families, as well as orthologue and paralogue assignments. Release 4.0 of TreeFam contains curated trees for 1314 families and automatically generated trees for another 14 351 families. We have expanded TreeFam to include 25 fully sequenced animal genomes, as well as four genomes from plant and fungal outgroup species. We have also introduced more accurate approaches for automatically grouping genes into families, for building phylogenetic trees, and for inferring orthologues and paralogues. The user interface for viewing phylogenetic trees and family information has been improved. Furthermore, a new perl API lets users easily extract data from the TreeFam mysql database.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.