2015
DOI: 10.1371/journal.pone.0119815
|View full text |Cite
|
Sign up to set email alerts
|

Mapping the Space of Genomic Signatures

Abstract: We propose a computational method to measure and visualize interrelationships among any number of DNA sequences allowing, for example, the examination of hundreds or thousands of complete mitochondrial genomes. An "image distance" is computed for each pair of graphical representations of DNA sequences, and the distances are visualized as a Molecular Distance Map: Each point on the map represents a DNA sequence, and the spatial proximity between any two points reflects the degree of structural similarity betwee… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
44
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
3
1

Relationship

3
6

Authors

Journals

citations
Cited by 34 publications
(44 citation statements)
references
References 48 publications
0
44
0
Order By: Relevance
“…Moreover, the alignment 62 demands the sequences to be continuously homologous which is not always the case. 63 Alignment-free methods [48][49][50][51][52] have been proposed in the past as an alternative to 64 address the limitations of the alignment-based methods. Comparative genomics beyond 65 alignment-based approaches have benefited from the computational power of machine 66 learning.…”
mentioning
confidence: 99%
“…Moreover, the alignment 62 demands the sequences to be continuously homologous which is not always the case. 63 Alignment-free methods [48][49][50][51][52] have been proposed in the past as an alternative to 64 address the limitations of the alignment-based methods. Comparative genomics beyond 65 alignment-based approaches have benefited from the computational power of machine 66 learning.…”
mentioning
confidence: 99%
“…The third limitation is the computational power required. 31, 32 Future whole-genome sequences will need to be analyzed when NGS analysis is expanded into non-coding regions of the genome, and this will require “super computers”. Currently, these are the barriers, but it is also expected that these issues may be resolved with the advancement and widespread use of the technology, just like personal computers have become cheaper and faster.…”
Section: Discussionmentioning
confidence: 99%
“…In addition, with next-generation sequencing (NGS) playing an increasingly important role, it may not always be possible to align many short reads coming from different parts of genomes [9]. To address situations where alignment-based methods fail or are insufficient, alignment-free methods have been proposed [10], including approaches based on Chaos Game Representation of DNA sequences [11,12,13], random walk [14], graph theory [15], iterated maps [16], information theory [17], category-position-frequency [18], spaced-words frequencies [19], Markov-model [20], thermal melting profiles [21], word analysis [22], among others. Software implementations of alignment-free methods also exist, among them COMET [23], CASTOR [24], SCUEAL [25], REGA [26], KAMERIS [27], and FFP (Feature Frequency Profile) [28].…”
Section: Introductionmentioning
confidence: 99%