In addition to the major ectodomain, the gp41 transmembrane glycoprotein of HIV-1 is now known to have a minor ectodomain that is part of the long C-terminal tail. Both ectodomains are highly antigenic, carry neutralizing and non-neutralizing epitopes, and are involved in virus-mediated fusion activity. However, data have so far been biologically based, and derived solely from T cell line-adapted (TCLA), B clade viruses. Here we have carried out sequence and theoretically based structural analyses of 357 gp41 C-terminal sequences of mainly primary isolates of HIV-1 clades A, B, C, and D. Data show that all these viruses have the potential to form a tail loop structure (the minor ectodomain) supported by three, beta-sheet, membrane-spanning domains (MSDs). This means that the first (N-terminal) tyrosine-based sorting signal of the gp41 tail is situated outside the cell membrane and is non-functional, and that gp41 that reaches the cell surface may be recycled back into the cytoplasm through the activity of the second tyrosine-sorting signal. However, we suggest that only a minority of cell-associated gp41 molecules - those destined for incorporation into virions - has 3 MSDs and the minor ectodomain. Most intracellular gp41 has the conventional single MSD, no minor ectodomain, a functional first tyrosine-based sorting signal, and in line with current thinking is degraded intracellularly. The gp41 structural diversity suggested here can be viewed as an evolutionary strategy to minimize HIV-1 envelope glycoprotein expression on the cell surface, and hence possible cytotoxicity and immune attack on the infected cell.