Background: Tuberculosis remains one of the leading causes of morbidity and mortality worldwide. Therefore, understanding the pathophysiology of Mycobacterium tuberculosis is imperative for developing new drugs. Posttranscriptional regulation plays a significant role in microbial adaptation to different growth conditions. While the proteins associated with gene expression regulation have been extensively studied in the pathogenic strain M. tuberculosis H37Rv, post-transcriptional regulation involving small RNAs (sRNAs) remains poorly understood. Results:We developed a novel moving-window based approach to detect sRNA expression using RNA-Seq data.Overlaying ChIP-seq data of RNAP (RNA Polymerase) and NusA suggest that these putative sRNA coding regions are significantly bound by the transcription machinery. Besides capturing many experimentally validated sRNAs, we observe the context-dependent expression of novel sRNAs in the intergenic regions of M. tuberculosis genome. For example, ncRv11806 shows expression only in the stationary phase, suggesting its role in mycobacterial latency which is a key attribute to long term pathogenicity. Also, ncRv11875C showed expression in the iron-limited condition, which is prevalent inside the macrophages of the host cells. Conclusion: The systems level analysis of sRNA highlights the condition-specific expression of sRNAs which might enable the pathogen survival by rewiring regulatory circuits.
The overall function of a multi-domain protein is determined by the functional and structural interplay of its constituent domains. Traditional sequence alignment-based methods commonly utilize domain-level information and provide classification only at the level of domains. Such methods are not capable of taking into account the contributions of other domains in the proteins, and domain-linker regions and classify multi-domain proteins. An alignment-free protein sequence comparison tool, CLAP (CLAssification of Proteins) was previously developed in our laboratory to especially handle multi-domain protein sequences without a requirement of defining domain boundaries and sequential order of domains. Through this method we aim to achieve a biologically meaningful classification scheme for multi-domain protein sequences. In this article, CLAP-based classification has been explored on 5 datasets of multi-domain proteins and we present detailed analysis for proteins containing (1) Tyrosine phosphatase and (2) SH3 domain. At the domain-level CLAP-based classification scheme resulted in a clustering similar to that obtained from an alignment-based method. CLAP-based clusters obtained for full-length datasets were shown to comprise of proteins with similar functions and domain architectures. Our study demonstrates that multi-domain proteins could be classified effectively by considering full-length sequences without a requirement of identification of domains in the sequence.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.