24Genomes can vary within individual organisms. Programmed DNA elimination leads to dramatic 25 changes in genome organisation during the germline-soma differentiation of ciliates 1 , lampreys 2 , 26 nematodes 3,4 , and various other eukaryotes 5 . A particularly remarkable example of tissue-specific 27 genome differentiation is the germline-restricted chromosome (GRC) in the zebra finch which is 28 consistently absent from somatic cells 6 . Although the zebra finch is an important animal model 29 system 7 , molecular evidence from its large GRC (>150 megabases) is limited to a short intergenic 30 region 8 and a single mRNA 9 . Here, we combined cytogenetic, genomic, transcriptomic, and 31 proteomic evidence to resolve the evolutionary origin and functional significance of the GRC.
32First, by generating tissue-specific de-novo linked-read genome assemblies and re-sequencing 33 two additional germline and soma samples, we found that the GRC contains at least 115 genes 34 which are paralogous to single-copy genes on 18 autosomes and the Z chromosome. We detected 35 an amplification of ≥38 GRC-linked genes into high copy numbers (up to 308 copies) but, 36 surprisingly, no enrichment of transposable elements on the GRC. Second, transcriptome and 37 proteome data provided evidence for functional expression of GRC genes at the RNA and protein 38 levels in testes and ovaries. Interestingly, the GRC is enriched for genes with highly expressed 39 orthologs in chicken gonads and gene ontologies involved in female gonad development. Third, 40 we detected evolutionary strata of GRC-linked genes. Developmental genes such as bicc1 and 41 trim71 have resided on the GRC for tens of millions of years, whereas dozens have become 42 GRC-linked very recently. The GRC is thus likely widespread in songbirds (half of all bird 43 species) and its rapid evolution may have contributed to their diversification. Together, our 44 results demonstrate a highly dynamic evolutionary history of the songbird GRC leading to 45 dramatic germline-soma genome differences as a novel mechanism to minimise genetic conflict 46 between germline and soma. Text 48 Not all cells of an organism must contain the same genome. Some eukaryotes exhibit dramatic 49 differences between their germline and somatic genomes, resulting from programmed DNA 50 elimination of chromosomes or fragments thereof during germline-soma differentiation 5 . Here 51 we present the first comprehensive analyses of a germline-restricted chromosome (GRC). The 52 zebra finch (Taeniopygia guttata) GRC is the largest chromosome of this songbird 6 and likely 53 comprises >10% of the genome (>150 megabases) 7,10 . Cytogenetic evidence suggests the GRC is 54 inherited through the female germline, expelled late during spermatogenesis, and eliminated from 55 the soma during early embryo development 6,11 . Previous analyses of a 19-kb intergenic region 56 suggested that the GRC contains sequences with high similarity to regular chromosomes ('A 57 chromosomes') 8 . 58 59In order to reliably ide...