SRY
, the Y‐chromosome‐encoded male sex‐determining factor, is the signature member of a larger family of related transcription factors, the
SOX
proteins.
SOX
proteins are related by a DNA (
deoxyribonucleic acid
)‐binding domain known as the
HMG
(
high mobility group
) domain, which in turn unites a larger superfamily of proteins.
SRY
and
SOX
proteins are thought to regulate gene transcription through binding to specific DNA sequence motifs and causing the DNA to bend – a property of so‐called architectural transcription factors. Similarly, a related family of proteins, including
TCF‐1
(
T‐cell factor 1
),
LEF‐1
(
lymphoid enhancer factor 1
) and their relatives, regulates transcription by virtue of sequence‐specific DNA binding. A third family, the true or canonical
HMG
proteins, differs by containing multiple
HMG
domains, not binding to specific target DNA sequences, and having poorly defined roles. Here, we examine the diversity, origins, molecular functions and biological roles of the
HMG
domain superfamily.
Key Concepts:
The 80 amino acid HMG domain characterizes a superfamily of proteins found in plants and animals.
HMG domain proteins associate with DNA and usually bend it, thereby influencing transcription of genes.
The most ancient family of these proteins, represented by the canonical HMG proteins HMG‐1 and HMG‐2, have multiple HMG domains, and do not bind to any specific DNA sequences.
Genes encoding canonical HMG proteins may have evolved into two further families of genes encoding proteins that have single HMG domains and bind to specific DNA sequence motifs.
These families include SRY (the mammalian Y‐linked sex‐determining factor) and the SOX, TCF‐1 and LEF‐1 transcription factors.
Unlike canonical HMG proteins, these other HMG domain proteins have specific, important and well‐defined biological roles, particularly in embryonic development.