1992
DOI: 10.1007/bf00160262
|View full text |Cite
|
Sign up to set email alerts
|

Finite-state models in the alignment of macromolecules

Abstract: Minimum message length encoding is a technique of inductive inference with theoretical and practical advantages. It allows the posterior odds-ratio of two theories or hypotheses to be calculated. Here it is applied to problems of aligning or relating two strings, in particular two biological macromolecules. We compare the r-theory, that the strings are related, with the null-theory, that they are not related. If they are related, the probabilities of the various alignments can be calculated. This is done for o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
56
0

Year Published

1994
1994
2010
2010

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 57 publications
(57 citation statements)
references
References 34 publications
1
56
0
Order By: Relevance
“…The machine could be made to match or recognize a particular family of sequences but only if it were given one or more examples prepended to the data to be searched. To an extent, it models the process by which a sequence could be generated and it is natural to use the term machine for this reason, and also because it is common in compression, makes a distinction with the other kind of HMM and is consistent with earlier work (Allison et al, 1992).…”
Section: Approximate Repeatsmentioning
confidence: 74%
See 3 more Smart Citations
“…The machine could be made to match or recognize a particular family of sequences but only if it were given one or more examples prepended to the data to be searched. To an extent, it models the process by which a sequence could be generated and it is natural to use the term machine for this reason, and also because it is common in compression, makes a distinction with the other kind of HMM and is consistent with earlier work (Allison et al, 1992).…”
Section: Approximate Repeatsmentioning
confidence: 74%
“…The possibility of changes, insertions and deletions in a repeat allow instances to differ. In essence, states R, R2 and R3 embody a simple mutation machine, as can be used in the sequence alignment problem (Allison et al, 1992), here used to allow local alignments of the sequence with itself. For the analysis of DNA, approximate reverse complementary repeats are allowed by a further set of states R%, R2% and R3% and corresponding operations, not shown.…”
Section: Approximate Repeatsmentioning
confidence: 99%
See 2 more Smart Citations
“…As in [25,26], our work is based on the premise that if two sequences are related, one sequence must tell something useful about the other: A predictive model can predict a sequence better if a related sequence is known. The information content of a sequence is measured by lossless compression.…”
Section: Introductionmentioning
confidence: 99%