Human genomic clones that span the entire protein S expressed gene (PS alpha) and the 3' two-thirds of the protein S pseudogene (PS beta) have been isolated and characterized. The PS alpha gene is greater than 80 kilobases in length and contains 14 introns and 15 exons, as well as 6 repetitive "Alu" sequences. Exons I and XV contain 112 and 1139 bp 5' and 3' noncoding segments in addition to the amino and carboxyl termini, respectively. Exons I-VIII encode protein segments that are homologous to the vitamin K dependent clotting proteins and are bounded by introns whose position and type are identical with other members of this protein family. Exons IX-XV encode protein segments homologous to sex hormone binding globulin (SHBG) and are bounded by introns of identical type and position as in the SHBG gene. Genomic clones for the PS beta gene cover a distance of greater than 55 kilobases and contain segments corresponding to amino acids 46-635 of the mature protein and the 1.1-kb 3' noncoding region of the cDNA. The presence of multiple base changes in the coding portions of this gene, resulting in termination codons and frame shifts, suggests that it is a pseudogene. Comparison of DNA sequences for the two genes reveals 97% identity for coding and 3' noncoding, and 95.4% for intronic regions, suggesting divergence of the two genes is a relatively recent event.
Genomic DNA samples from 12 protein S-deficient families with hereditary thrombophilia were analyzed by Southern hybridization using protein S cDNA probes. Protein S-deficient members of families A and B possessed identical restriction fragment length polymorphisms, which suggests the absence of 5.3 kb from one of their protein S alpha alleles. The abnormal alleles from individuals A7 and B1 were amplified by the polymerase chain reaction using a forward primer in intron K and a reverse primer in exon XIV. The amplified DNA was cloned and sequenced. Sequence comparison with the normal protein S alpha gene showed that most of intron L (roughly 4.7 kb), the entire exon XIII (151 bp), and about a quarter of intron M (407 bp) were missing from both the A7 and B1 clones. Exon XIII contains all three potential N-glycosylation sites in human protein S. This deletion may result in RNA transcripts in which exon XII is spliced to exon XIV. Such an arrangement would generate a stop codon at position 463 and consequently produce a nonglycosylated protein S molecule truncated by 173 amino acids.
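The figures quoted above can be cross-checked with simple arithmetic. The sketch below is only a consistency check, not part of the reported analysis. It assumes the mature protein is 635 residues long (inferred from the "amino acids 46-635" range given for the PS beta clones) and treats the "roughly 4.7 kb" of intron L as exactly 4700 bp. All constant and function names are hypothetical.

```python
# Figures quoted in the abstract. The intron L value is approximate
# ("roughly 4.7 kb"), so the total is approximate as well.
INTRON_L_DELETED_BP = 4700
EXON_XIII_BP = 151
INTRON_M_DELETED_BP = 407

# Assumption: mature protein S has 635 residues, inferred from the
# "amino acids 46-635" range quoted for the PS beta clones.
MATURE_PROTEIN_AA = 635
TRUNCATED_BY_AA = 173


def deletion_size_bp() -> int:
    """Sum the three deleted segments reported for the A7 and B1 clones."""
    return INTRON_L_DELETED_BP + EXON_XIII_BP + INTRON_M_DELETED_BP


def exon_skip_shifts_frame(exon_bp: int) -> bool:
    """Removing an exon whose length is not a multiple of 3 shifts the reading frame."""
    return exon_bp % 3 != 0


def truncated_length_aa() -> int:
    """Number of residues left after losing the reported C-terminal stretch."""
    return MATURE_PROTEIN_AA - TRUNCATED_BY_AA


if __name__ == "__main__":
    print(f"Deleted segment: {deletion_size_bp()} bp (~{deletion_size_bp() / 1000:.1f} kb)")
    print(f"Skipping exon XIII shifts the frame: {exon_skip_shifts_frame(EXON_XIII_BP)}")
    print(f"Residues remaining: {truncated_length_aa()}")
```

Under these assumptions the three segments sum to about 5.3 kb, matching the Southern blot estimate. The 151-bp exon is not a multiple of three, so joining exon XII to exon XIV would shift the reading frame, which is consistent with a premature stop codon. The 462 remaining residues place the stop at position 463.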