Enhancer sequences control gene expression and comprise binding sites (motifs) for different transcription factors (TFs). Despite extensive genetic and computational studies, the relationship between DNA sequence and regulatory activity is poorly understood and enhancer de novo design is considered impossible. Here we built a deep learning model, DeepSTARR, to quantitatively predict the activities of thousands of developmental and housekeeping enhancers directly from DNA sequence in Drosophila melanogaster S2 cells. The model learned relevant TF motifs and higher-order syntax rules, including functionally non-equivalent instances of the same TF motif that are determined by motif-flanking sequence and inter-motif distances. We validated these rules experimentally and demonstrated their conservation in human by testing more than 40,000 wildtype and mutant Drosophila and human enhancers. Finally, we designed and functionally validated synthetic enhancers with desired activities de novo..
Centrosomes are the major microtubule organising centres of animal cells. Deregulation in their number occurs in cancer and was shown to trigger tumorigenesis in mice. However, the incidence, consequence and origins of this abnormality are poorly understood. Here, we screened the NCI-60 panel of human cancer cell lines to systematically analyse centriole number and structure. Our screen shows that centriole amplification is widespread in cancer cell lines and highly prevalent in aggressive breast carcinomas. Moreover, we identify another recurrent feature of cancer cells: centriole size deregulation. Further experiments demonstrate that severe centriole over-elongation can promote amplification through both centriole fragmentation and ectopic procentriole formation. Furthermore, we show that overly long centrioles form over-active centrosomes that nucleate more microtubules, a known cause of invasiveness, and perturb chromosome segregation. Our screen establishes centriole amplification and size deregulation as recurrent features of cancer cells and identifies novel causes and consequences of those abnormalities.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.