21Understanding genome organization and gene regulation requires insight into RNA transcription, 22 processing and modification. We adapted nanopore direct RNA sequencing to examine RNA from a 23 wild-type accession of the model plant Arabidopsis thaliana and a mutant defective in mRNA 24 methylation (m 6 A). Here we show that m 6 A can be mapped in full-length mRNAs transcriptome-wide 25 and reveal the combinatorial diversity of cap-associated transcription start sites, splicing events, 26 poly(A) site choice and poly(A) tail length. Loss of m 6 A from 3' untranslated regions is associated 27 with decreased relative transcript abundance and defective RNA 3′ end formation. A functional 28 consequence of disrupted m 6 A is a lengthening of the circadian period. We conclude that nanopore 29 direct RNA sequencing can reveal the complexity of mRNA processing and modification in full-length 30 single molecule reads. These findings can refine Arabidopsis genome annotation. Further, applying 31 this approach to less well-studied species could transform our understanding of what their genomes 32 encode. 33 34 misidentification of 3′ ends through internal priming 3 , spurious antisense and splicing events 46 produced by RT template switching 4,5 , and the inability to detect all base modifications in the 47 copying process 6 . The fragmentation of RNA prior to short-read sequencing makes it difficult to 48 interpret the combination of authentic RNA processing events and remains an unsolved problem 7 . 49We investigated whether long-read direct RNA sequencing (DRS) with nanopores 8 could 50 reveal the complexity of Arabidopsis mRNA processing and modifications. In nanopore DRS, the 51 protein pore (nanopore) sits in a membrane through which an electrical current is passed, and intact 52 RNA is fed through the nanopore by a motor protein 8 . Each RNA sequence within the nanopore 53 (5 bases) can be identified by the magnitude of signal it produces. Arabidopsis is a pathfinder model 54 in plant biology, and its genome annotation strongly influences the annotation and our 55 understanding of what other plant genomes encode. We applied nanopore DRS and Illumina RNAseq 56 to wild-type Arabidopsis (Col-0) and mutants defective in m 6 A 9 and exosome-mediated RNA decay 10 . 57We reveal m 6 A and combinations of RNA processing events (alternative patterns of 5′ capped 58 transcription start sites, splicing, 3′ polyadenylation and poly(A) tail length) in full-length Arabidopsis 59 mRNAs transcriptome-wide. 60 61
Results 62Nanopore DRS detects long, complex mRNAs and short, structured non-coding RNAs 63We purified poly (A)+ RNA from four biological replicates of 14-day-old Arabidopsis Col-0 seedlings. 64We incorporated synthetic External RNA Controls Consortium (ERCC) RNA Spike-In mixes into all 65 replicates 11,12 and carried out nanopore DRS. Illumina RNAseq was performed in parallel on similar 66 material. Using Guppy base-calling (Oxford Nanopore Technologies) and minimap2 alignment 67 software 13 , we identified around 1 mi...