2022
DOI: 10.1007/s11227-022-04673-3
|View full text |Cite
|
Sign up to set email alerts
|

A new fast technique for pattern matching in biological sequences

Abstract: At numerous phases of the computational process, pattern matching is essential. It enables users to search for specific DNA subsequences or DNA sequences in a database. In addition, some of these rapidly expanding biological databases are updated on a regular basis. Pattern searches can be improved by using high-speed pattern matching algorithms. Researchers are striving to improve solutions in numerous areas of computational bioinformatics as biological data grows exponentially. Faster algorithms with a low e… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 17 publications
(7 citation statements)
references
References 31 publications
0
2
0
Order By: Relevance
“…Table 4 provides a comparison of the machine learning techniques discussed above with traditional techniques, including our two proposed methods (EFLPM and EPAPM) [47] based on their execution time for different pattern lengths in DNA sequences.…”
Section: The Experimental Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Table 4 provides a comparison of the machine learning techniques discussed above with traditional techniques, including our two proposed methods (EFLPM and EPAPM) [47] based on their execution time for different pattern lengths in DNA sequences.…”
Section: The Experimental Resultsmentioning
confidence: 99%
“…We opted to build databases on DNA to examine the machine learning algorithms discussed in the following paragraphs. The rationale behind this decision was to sample some of the genes that we had worked on in our previous research endeavors [15], Our objective was to integrate automated learning algorithms and pattern-matching algorithms that are based on specific DNA sequences, in order to create a biological data collection that could be utilized in a classification process. We conducted experiments on a dataset that included DNA sequences, where we compared the effectiveness of searching for a specific pattern with other classification models, such as Random Forest [3,16], KNN [16][17][18][19][20], Naïve Bayes [21][22][23][24], Decision tree [23,[25][26][27][28][29][30], and Support Vector Machine [18,[31][32][33][34][35][36] with Linear [37,38], RBF [37,39], and sigmoid [21,40] classifiers, the results of these classifiers models are calculated by F1 score, recall, precision rate, execution time, and with the accuracy which calculates the most effective patternmatching classifier.…”
Section: Methodology For Pm From Dna Sequencesmentioning
confidence: 99%
“…//Create substring based on pattern length (5) for (i � 0 to m) do (6) sum ⟵ ASCII (s i ) (7) end for loop (8) //Create hash value using predefned prime number (9) h (s) ⟵ sum mod q (10) //Create quotient value using predefned prime number (11) r (s) ⟵ sum divide q (12) if h(p) � h(s) and r(p) � r(s)…”
Section: Searching Phasementioning
confidence: 99%
“…To comprehend biological data, mainly when the datasets are enormous and complicated, the interdisciplinary discipline of bioinformatics develops techniques and software tools [5]. Pattern matching issues appear in many computational bioinformatics tasks, including basic local synchronization search, biomarker discovery, sequence matching, homologous sequence identifcation, and proteogenomic mapping [6,7]. Pattern matching can be used in biotechnology, forensics, medical, and agricultural research to look into probable disease or anomaly diagnoses [8].…”
Section: Introductionmentioning
confidence: 99%
“…Computational methods, on the other hand, are efficient and effective, and they play an important role in many areas of bioinformatics. For example, in silico techniques are being used rapidly in research on diseasegene interactions [25,26], protein structure prediction [27], peptide therapeutic function, gene editing experiments [28], meaningful pattern detection [29], and drug repurposing [30,31]. Previously, researchers have been proposed a few computational models for predicting the 2-OM sites based on single machine learning (ML) and deep learning (DL) approaches [32][33][34][35][36][37][38][39].…”
Section: Introductionmentioning
confidence: 99%