2022
DOI: 10.1101/2022.02.19.481167
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Genome assembly and isoform analysis of a highly heterozygous New Zealand fisheries species, the tarakihi (Nemadactylus macropterus)

Abstract: Although being some of the most valuable and heavily exploited wild organisms, few fisheries species have been studied at the whole-genome level. This is especially the case in New Zealand, where genomics resources are urgently needed to assist fisheries management attains its sustainability goals. Here we generated 55 Gb of short Illumina reads (92 coverage) and 73 Gb of long Nanopore reads (122) to produce the first genome assembly of the marine teleost tarakihi (Nemadactylus macropterus), a highly valuable … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
17
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
2
1
1

Relationship

3
1

Authors

Journals

citations
Cited by 4 publications
(17 citation statements)
references
References 122 publications
(131 reference statements)
0
17
0
Order By: Relevance
“…Trimmed and filtered paired reads were mapped to the tarakihi reference genome (1,214 scaffolds) assembled in a previous study (Papa et al, 2021c) by using the Burrows-Wheeler Alignment (BWA) method with bwa-kit v0.7.15 (Li and Durbin, 2009). In brief, a BWA index was created for the reference genome, as well as a SAM index with SAMtools v1.9 (Li et al, 2009).…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…Trimmed and filtered paired reads were mapped to the tarakihi reference genome (1,214 scaffolds) assembled in a previous study (Papa et al, 2021c) by using the Burrows-Wheeler Alignment (BWA) method with bwa-kit v0.7.15 (Li and Durbin, 2009). In brief, a BWA index was created for the reference genome, as well as a SAM index with SAMtools v1.9 (Li et al, 2009).…”
Section: Methodsmentioning
confidence: 99%
“…SNP filtering for the neutral dataset was identical to the pruned dataset but with an additional step of filtering out potentially adaptive outliers (see below). The 250 longest scaffolds were retained because (1) they contained more than 90% of the total number of bases in the tarakihi reference genome (L90 = 219 (Papa et al, 2021c)), (2) for computational efficiency, and (3) because the default parameters used for the OutFLANK analysis (see below) were not optimal anymore to fit the F ST curve in some scaffolds past that number, which means they would have had to be manually tuned for the c. 1,000 remaining scaffolds. Chatham Islands, Fiordland and the juvenile TBGB samples were discarded at that stage because of their low number of samples (two, three, and five, respectively), which was considered too low to confidently predict population allele frequencies.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Repetitive elements in the five genomes were identified using the same method and tools as reported in Papa, Wellenreuther, et al (2021). During the repeat identification step, the Actinopterygii homology-based repeat library produced in Papa, Wellenreuther, et al (2021) was combined with each of the de novo repeat libraries produced separately for each species. Genome annotation was carried out with the MAKER v2.31.10 (Holt & Yandell, 2011) pipeline on the unmasked genomes.…”
Section: Methodsmentioning
confidence: 99%
“…To achieve this, the five new genomes were assembled using Illumina short read sequences. The high-coverage genome assembly and annotation of tarakihi produced in Papa, Wellenreuther, et al (2021) was combined with these five genomes to explore the diversity of repeat elements and genic features. High-quality genomic data from eight other fishes retrieved from Ensembl were then added to identify genes under positive selection in tarakihi and Latridae in a phylogenomic framework.…”
Section: Introductionmentioning
confidence: 99%