2016
DOI: 10.1016/j.compbiomed.2015.05.022
|View full text |Cite
|
Sign up to set email alerts
|

Set of rules for genomic signal downsampling

Abstract: Comparison and classification of organisms based on molecular data is an important task of computational biology, since at least parts of DNA sequences for many organisms are available. Unfortunately, methods for comparison are computationally very demanding, suitable only for short sequences. In this paper, we focus on the redundancy of genetic information stored in DNA sequences. We proposed rules for downsampling of DNA signals of cumulated phase. According to the length of an original sequence, we are able… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0
2

Year Published

2016
2016
2022
2022

Publication Types

Select...
6
1
1

Relationship

2
6

Authors

Journals

citations
Cited by 12 publications
(7 citation statements)
references
References 32 publications
0
5
0
2
Order By: Relevance
“…The purpose of subtracting the mean is to suppress the zero-frequency component [ 34 ], since direct current component is not significant in the context of detecting the three-base periodicity. The magnitude spectrum of ‘ y(n) ’ is computed using the DFT as depicted in Eqs.…”
Section: Methodsmentioning
confidence: 99%
“…The purpose of subtracting the mean is to suppress the zero-frequency component [ 34 ], since direct current component is not significant in the context of detecting the three-base periodicity. The magnitude spectrum of ‘ y(n) ’ is computed using the DFT as depicted in Eqs.…”
Section: Methodsmentioning
confidence: 99%
“…Different methods for length-normalizing digital signals were tested: down-sampling [55], up-sampling to the maximum length using zero padding [30], even scaling extension [56], periodic extension, symmetric padding, or anti-symmetric padding [57]. For example, zero-padding, which adds zeroes to all of the sequences shorter than the maximum length, was used in [30], e.g., for taxonomic classifications of ribosomal S18 subunit genes from twelve organisms.…”
Section: Methods and Implementationmentioning
confidence: 99%
“…Different methods for length-normalizing digital signals were tested: downsampling [55], up-sampling to the maximum length using zero padding [15], even scaling extension [56], periodic extension, symmetric padding, or anti-symmetric padding [57]. For example, zero-padding, which adds zeroes to all of the sequences shorter than the maximum length, was used in [30], e.g., for taxonomic classifications of ribosomal S18 subunit genes from twelve organisms.…”
Section: Pearson Correlation Coefficient (Pcc)mentioning
confidence: 99%