2022
DOI: 10.1038/s41467-022-31955-4
|View full text |Cite
|
Sign up to set email alerts
|

Generating experimentally unrelated target molecule-binding highly functionalized nucleic-acid polymers using machine learning

Abstract: In vitro selection queries large combinatorial libraries for sequence-defined polymers with target binding and reaction catalysis activity. While the total sequence space of these libraries can extend beyond 1022 sequences, practical considerations limit starting sequences to ≤~1015 distinct molecules. Selection-induced sequence convergence and limited sequencing depth further constrain experimentally observable sequence space. To address these limitations, we integrate experimental and machine learning approa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
9
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(9 citation statements)
references
References 71 publications
0
9
0
Order By: Relevance
“…Although the interaction of self‐assembled DNA/RNA nanostructures to other small molecules, cells, and proteins (including Cas protein [27] ) has been studied previously,[ 27 , 28 ] the origin of binding affinity between Cas12a and λ DNA requires further investigation. In this perspective, a modeling approach [29] could help identify the specific regions of λ DNA that show an affinity to Cas12a in future.…”
Section: Resultsmentioning
confidence: 99%
“…Although the interaction of self‐assembled DNA/RNA nanostructures to other small molecules, cells, and proteins (including Cas protein [27] ) has been studied previously,[ 27 , 28 ] the origin of binding affinity between Cas12a and λ DNA requires further investigation. In this perspective, a modeling approach [29] could help identify the specific regions of λ DNA that show an affinity to Cas12a in future.…”
Section: Resultsmentioning
confidence: 99%
“…Recently, with the development of artificial intelligence, machine learning (ML) has shown excellent application potential in natural sciences. For instance, ML-driven approaches have been applied to molecular recognition, 13–16 materials design and discovery, 17–20 and synthesis reaction prediction. 21–24 However, the low interpretability of ML models obscures the distribution and importance of parameters and the interaction between parameters, which affects the credibility and dependency of ML models.…”
Section: Introductionmentioning
confidence: 99%
“…The types of molecules comprising the libraries can include the oligonucleotides themselves, functionalized nucleic acid polymers 4 , small molecules 5,6 , linear, cyclic or chemically modified peptides [7][8][9][10][11] , and novel hybrid molecules 12 . A method for selecting target binders from libraries of oligonucleotides of varying sequences 13,14 is called SELEX (systematic evolution of ligands by exponential enrichment).…”
Section: Introductionmentioning
confidence: 99%
“…With the advent of artificial intelligence (AI) approaches, there is a large interest in using experimental selection datasets for training machine learning (ML) models to predict oligonucleotides, peptide-based and other molecules with high affinity for targets 4,22 or the highest ability to impart functional response to the target, such as the induction of a fluorescence signal [23][24][25] . Bioinformatics analyses can be used to understand sequence composition of experimental datasets and also to assess the sequence patterns in sets of molecules predicted to have the highest affinity for targets by the ML models.…”
Section: Introductionmentioning
confidence: 99%