We previously demonstrated expression of full-length transcripts for sublingual mucin apoprotein, Muc19, of approximately 24 kb (Fallon MA, Latchney LR, Hand AR, Johar A, Denny PA, Georgel PT, Denny PC, and Culp DJ. Physiol Genomics 14: 95-106, 2003). We now describe the complete sequence and genomic organization of the apomucin encoded by 43 exons. Southern analyses indicate a central exon of approximately 18 kb containing 36 tandem repeats, each encoding 163 residues rich in serine and threonine. Full-length transcripts are an estimated 22,795 bp in length that span 106 kb of genomic DNA. The transcriptional start site is 24 bp downstream of a TATA box and 42 bp upstream of the conceptual translational start codon. The putative apoprotein has an estimated mass of 693.4 kDa and contains 7,524 amino acids (80% serine, threonine, glycine, alanine, and proline). We present a model for rat Muc19 transcripts and compare the conceptually translated Muc19 proteins for mouse, rat, pig, and the 3' end of human Muc19. Conserved among these apoproteins are a signal peptide, a large tandem repeat region, von Willebrand factor type C and D domains, a trypsin inhibitor-like Cys-rich domain, and a COOH-terminal cystine knot-like domain. Southern blot analyses indicate transcripts for Muc19 and Smgc (submandibular gland protein C) are splice variants of a larger gene, Muc19/Smgc. Comparative Northern analyses between the major salivary glands demonstrate highly selective Muc19 expression in neonatal and adult sublingual glands, whereas Smgc is expressed in neonatal submandibular and sublingual glands. Regulation of Muc19/Smgc gene expression is discussed with respect to alternative splicing and mucous cell cytodifferentiation.