Of the various genetic subtypes of human immunodeficiency virus types 1 and 2 (HIV-1 and HIV-2) and simian immunodeficiency virus (SIV), only in subtype C of HIV-1 is a genetically variant NF-B binding site found at the core of the viral promoter in association with a subtype-specific Sp1III motif. How the subtype-associated variations in the core transcription factor binding sites (TFBS) influence gene expression from the viral promoter has not been examined previously. Using panels of infectious viral molecular clones, we demonstrate that subtype-specific NF-B and Sp1III motifs have evolved for optimal gene expression, and neither of the motifs can be replaced by a corresponding TFBS variant. The variant NF-B motif binds NF-B with an affinity 2-fold higher than that of the generic NF-B site. Importantly, in the context of an infectious virus, the subtype-specific
Based on genetic variation, human immunodeficiency virus type 1 (HIV-1) is classified into four distinct groups (M, N, O, and P), and group M is subclassified into nine molecular subtypes (A, B, C, D, F, G, H, J, and K) and numerous circulating recombinant forms, including A/E (1). The global distribution of HIV-1 subtypes is uneven, with C, A, and B being the most widespread subtypes. Subtype C is predominant in southern and eastern African countries, India, and Nepal and in recombinant forms in China, and it is currently emerging in southern Brazil. Subtype C is responsible for approximately half of the global HIV-1 infections and more than 95% of the infections in India (2). The factors that contribute to the widespread expansion of subtype C are not well understood. Diverse viral subtypes differ from one another up to 30 to 35% in certain gene segments, such as the envelope (3). A genetic variation to this large an extent is expected to have a significant impact on the biological properties of the subtypes influencing their relative fitness properties. Subtype-specific genetic variations in elements such as the viral promoter (enhancer and other regulatory elements), regulatory proteins, and structural proteins may underlie the biological differences, although drawing such a correlation between these factors may not always be possible.The individual components of the viral promoter, including the modulator region, the enhancer, and the core promoter, are characterized by several subtype-specific molecular variations. Such differences have been mapped to many transcription factor binding sites (TFBS), including USF, c-Myb, NF-AT, Ap-1, NF-B, and Sp1, and regulatory elements such as the TATA box and