2007
DOI: 10.1186/1471-2105-8-278
|View full text |Cite
|
Sign up to set email alerts
|

Computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants

Abstract: Background: Single nucleotide polymorphisms (SNPs) and genes that exhibit presence/absence variation have provided informative marker sets for bacterial and viral genotyping. Identification of marker sets optimised for these purposes has been based on maximal generalized discriminatory power as measured by Simpson's Index of Diversity, or on the ability to identify specific variants. Here we describe the Not-N algorithm, which is designed to identify small sets of genetic markers diagnostic for user-specified … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
28
0

Year Published

2011
2011
2021
2021

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 24 publications
(28 citation statements)
references
References 47 publications
0
28
0
Order By: Relevance
“…The problem to be addressed searching for minimal sets of SNPs with an optimal pairwise differentiation among OTUs is often called “feature selection” (e.g., [30]) and is used to address similar questions, e.g., to find tag SNPs for capturing haplotype patterns used in studies on disease association and drug design (e.g., [31,32]; see [33] and references therein), or for computer-aided identification of polymorphism sets for bacteria and viruses [34]). However, none of these were designed for our exact purpose.…”
Section: Resultsmentioning
confidence: 99%
“…The problem to be addressed searching for minimal sets of SNPs with an optimal pairwise differentiation among OTUs is often called “feature selection” (e.g., [30]) and is used to address similar questions, e.g., to find tag SNPs for capturing haplotype patterns used in studies on disease association and drug design (e.g., [31,32]; see [33] and references therein), or for computer-aided identification of polymorphism sets for bacteria and viruses [34]). However, none of these were designed for our exact purpose.…”
Section: Resultsmentioning
confidence: 99%
“…We used the program ‘Minimum SNPs’, with incorporated Not-N algorithm [28], to find population-specific SNPs from both Structure and BAPS defined populations. Using STs with ≥95% population assignment from Structure , we identified a set of 25 SNPs that were needed to discriminate STs from Population 2 from all other STs, albeit with a confidence of only 92.5%.…”
Section: Resultsmentioning
confidence: 99%
“…[27] and would be useful to assess the discriminatory power of combinations of candidate targets for typing systems for other pathogens. It can be used for a wide range of data types, but for interrogation of informative SNPs, we recommend Minimum SNPs, which has been designed specifically for this purpose [6,7]. Minimum SNPs should be used to examine input data in the form of multiple sequence alignments.…”
Section: Resultsmentioning
confidence: 99%
“…Therefore, alternative approaches are required. Software has been developed to interrogate informative single nucleotide polymorphisms (SNPs) in sequence based data (Minimum SNPs) but it is not designed to handle other forms of typing data [6,7]. Furthermore, while it can be used to identify SNPs, which are most predictive of a user-nominated sequence type, it does not consider overall measures of concordance between typing systems.…”
Section: Introductionmentioning
confidence: 99%