23Escherichia coli uses two-component systems (TCSs) to respond to environmental 24 signals. TCSs affect gene expression and are parts of E. coli's global transcriptional regulatory 25 network (TRN). Here, we identified the regulons of five TCSs in E. coli MG1655: BaeSR and 26 CpxAR, which were stimulated by ethanol stress; KdpDE and PhoRB, induced by limiting 27 potassium and phosphate, respectively; and ZraSR, stimulated by zinc. We analyzed RNA-seq 28 data using independent component analysis (ICA). ChIP-exo data was used to validate condition-29 specific target gene binding sites. Based on this data we (1) identify the target genes for each 30 TCS; (2) show how the target genes are transcribed in response to stimulus; and (3) reveal novel 31 relationships between TCSs, which indicate non-cognate inducers for various response 32 regulators, such as BaeR to iron starvation, CpxR to phosphate limitation, and PhoB and ZraR to 33 cell envelope stress. Our understanding of the TRN in E. coli is thus notably expanded.
35Importance 36 E. coli is a common commensal microbe found in human gut microenvironment; 37 however, some strains cause diseases like diarrhea, urinary tract infections and meningitis. E. 38 coli's two-component system (TCS) modulates target gene expression, specially related to 39 virulence, pathogenesis and anti-microbial peptides, in response to environmental stimuli. Thus, 40 it is of utmost importance to understand the transcriptional regulation of the TCSs to infer its 41 environmental adaptation and disease pathogenicity. Utilizing a combinatorial approach 42 integrating RNAseq, independent component analysis, ChIP-exo and data mining, we show that 43 TCSs have five different modes of transcriptional regulation. Our data further highlights non-44 cognate inducers of TCSs emphasizing cross-regulatory nature of TCSs in E. coli and suggests 45 that TCSs may have a role beyond their cognate functionalities. In summary, these results when 46 further incorporated with genome scale metabolic models can lead to understanding of metabolic 47 capabilities of bacteria and correctly predict complex phenotype under diverse conditions. 48 49 Keywords 50 51 Two-component systems, E. coli, independent component analysis, transcriptomics, ChIP-exo, 52 transcriptional regulatory network, gene targets 53 54 Introduction 55 56Bacterial survival and resilience across diverse conditions relies upon environmental 57 sensing and a corresponding response. One pervasive biological design towards this goal consists 58 of a histidine kinase unit to sense the environment and a related response regulator unit to receive 59 the signal and translate it into gene expression changes. This signaling process is known as a 60 two-component system (TCS) (1). In the case of Escherichia coli (E. coli) strain K12 MG1655, 61 there are 30 histidine kinases and 32 response regulators involved in 29 complete two-62 component systems that mediate responses to various environmental stimuli such as metal 63 sen...