2022
DOI: 10.26434/chemrxiv-2022-27gqn
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Machine Learning Directed Aptamer Search from Conserved Primary Sequence and Secondary Structure

Abstract: Computer-aided prediction of aptamer sequences has been focused on primary sequence alignment and motif comparison. We observed that many aptamers have a conserved hairpin, yet the sequence of the hairpin can be highly variable. Taking such a secondary structure information into consideration, a new algorithm combining conserved primary sequences and secondary structures is developed, that combines three scores based on sequence abundance, stability, and structure, respectively. This algorithm was used in the … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 23 publications
0
1
0
Order By: Relevance
“…The scarcity of data hampers the development of reliable algorithms for aptamer research. Machine learning has shown promise in predicting aptamer-protein interactions [26], generating aptamer structures [27], optimizing SELEX protocols [28], and understanding aptamer folding behavior [29]. Yet, the lack of data, with only ∼ 1400 data points in public databases (https://sites.utexas.edu/aptamerdatabase/), limits the full exploitation of modern deep learning architectures that require larger datasets for reliable generalization [30,31,32].…”
Section: Data Scarcity Obstructs the Development Of Reliable Algorithmsmentioning
confidence: 99%
“…The scarcity of data hampers the development of reliable algorithms for aptamer research. Machine learning has shown promise in predicting aptamer-protein interactions [26], generating aptamer structures [27], optimizing SELEX protocols [28], and understanding aptamer folding behavior [29]. Yet, the lack of data, with only ∼ 1400 data points in public databases (https://sites.utexas.edu/aptamerdatabase/), limits the full exploitation of modern deep learning architectures that require larger datasets for reliable generalization [30,31,32].…”
Section: Data Scarcity Obstructs the Development Of Reliable Algorithmsmentioning
confidence: 99%