“…We initially focused on six types of features for describing a training set of promoters (Bar-Joseph et al, 2003;Beer and Tavazoie, 2004;Li et al, 2002;Zwir et al, 2005b): submotifs, which model the studied transcription factor-binding motifs; orientation, which characterizes the binding boxes as either in direct or opposite orientation relative to the open reading frame; RNA pol sites, which characterize the RNA polymerase motif (Cotik et al, 2005), the class of σ70 promoter (Romero Zaliz et al, 2004) that differentiates class I from class II promoters, and distance distributions (close, medium, and remote) between RNA polymerase and transcription factor-binding sites in activated and repressed promoters (Salgado et al, 2004); activated/repressed, where we learn activation and repression distributions by compiling distances between binding sites for RNA polymerase and a transcription factor; interactions, where we evaluate motifs for several transcription factor-binding sites and model the distance distributions between motifs colocated in the same promoter regions; and expression, which considers gene expression levels.…”