Key findings• Simultaneous inference of mutational signatures across mutation types and genomic features refines signature spectra and defines their genomic determinants • Two distinct mutational signatures of UV exposure found in active and quiescent chromatin, which may be attributed to differential activity of nucleotide excision repair • Transcription-associated mutagenesis manifesting as A[T>C] mutations is found in a range of cancer types • APOBEC mutagenesis produces two signatures reflecting highly clustered, double strand break repair initiated and lowly clustered replication-driven mutagenesis, respectively • Somatic hypermutation produces a strongly clustered, TSS-associated signature in lymphoid cancers, which is distinct from a weakly clustered TLS signature found in multiple tumour types.
AbstractMutational signature analysis is an essential part of the cancer genome analysis toolkit. Conventionally, mutational signature analysis extracts patterns of different mutation types across many cancer genomes. Here we present TensorSignatures, an algorithm to learn mutational signatures jointly across all variant categories and their genomic context. The analysis of 2,778 cancer genomes of the PCAWG consortium shows that practically all signatures operate dynamically in response to various genomic and epigenomic states. The analysis pins differential spectra of UV mutagenesis found in active and inactive chromatin to global genome nucleotide excision repair. TensorSignatures accurately characterises transcription-associated mutagenesis, which is detected in 7 different cancer types. The analysis also unmasks replication and double strand break repair driven APOBEC mutagenesis, which manifests with differential numbers and length of mutation clusters indicating a differential processivity of the two triggers. As a fourth example, TensorSignatures detects a signature of somatic hypermutation generating highly clustered variants around the transcription start sites of active genes in lymphoid leukaemia, distinct from a more general and less clustered signature of Polη-driven TLS found in a broad range of cancer types.
Directional effectsMutation rates may differ between template and coding strand, because RNA polymerase II recruits transcription coupled nucleotide excision repair (TC-NER) upon lesion recognition