Identifying Neuropeptides via Evolutionary and Sequential Based Multi-Perspective Descriptors by Incorporation With Ensemble Classification Strategy

Akbar, Shahid; Mohamed, Heba G.; Ali, Hashim; Saeed, Aamir; Ahmed, Aftab; Gul, Sarah; Ahmad, Ashfaq; Ali, Farman; Ghadi, Yazeed Yasin; Assam, Muhammad

doi:10.1109/access.2023.3274601

Cited by 26 publications

(7 citation statements)

References 54 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…To evaluate the practical effectiveness of current BP classification methods, CICERON was tested against state-of-the-art (SOTA) classifiers specific to the various peptide functional classes, which were selected upon careful literature inspection [62] , [63] , [64] , [65] , [66] , [67] . The main characteristics of SOTA models and their development dataset are summarized in Table 2 .…”

Section: Resultsmentioning

confidence: 99%

Classification of bioactive peptides: A systematic benchmark of models and encodings

Bizzotto,

Zampieri,

Treu

et al. 2024

Computational and Structural Biotechnology Journal

View full text Add to dashboard Cite

Section: Resultsmentioning

confidence: 99%

Classification of bioactive peptides: A systematic benchmark of models and encodings

Bizzotto,

Zampieri,

Treu

et al. 2024

Computational and Structural Biotechnology Journal

View full text Add to dashboard Cite

“…In the future work, we further plan to use another recent predictors such as pAtbP-EnC 45 , AIPs-SnTCN 46 , AFP-CMBPred 47 , cACP-DeepGram 48 , iACP-GAEnsC 49 , and Target-ensC_NP. Furthermore, we intended to used the CD-HIT tool was utilized to eliminate redundant peptide samples with homology 50 .…”

Section: Discussionmentioning

confidence: 99%

Optimizing classification of diseases through language model analysis of symptoms

Hassan,

Abd El-Hafeez,

Shams

2024

Sci Rep

View full text Add to dashboard Cite

This paper investigated the use of language models and deep learning techniques for automating disease prediction from symptoms. Specifically, we explored the use of two Medical Concept Normalization—Bidirectional Encoder Representations from Transformers (MCN-BERT) models and a Bidirectional Long Short-Term Memory (BiLSTM) model, each optimized with a different hyperparameter optimization method, to predict diseases from symptom descriptions. In this paper, we utilized two distinct dataset called Dataset-1, and Dataset-2. Dataset-1 consists of 1,200 data points, with each point representing a unique combination of disease labels and symptom descriptions. While, Dataset-2 is designed to identify Adverse Drug Reactions (ADRs) from Twitter data, comprising 23,516 rows categorized as ADR (1) or Non-ADR (0) tweets. The results indicate that the MCN-BERT model optimized with AdamP achieved 99.58% accuracy for Dataset-1 and 96.15% accuracy for Dataset-2. The MCN-BERT model optimized with AdamW performed well with 98.33% accuracy for Dataset-1 and 95.15% for Dataset-2, while the BiLSTM model optimized with Hyperopt achieved 97.08% accuracy for Dataset-1 and 94.15% for Dataset-2. Our findings suggest that language models and deep learning techniques have promise for supporting earlier detection and more prompt treatment of diseases, as well as expanding remote diagnostic capabilities. The MCN-BERT and BiLSTM models demonstrated robust performance in accurately predicting diseases from symptoms, indicating the potential for further related research.

show abstract

“…96−98 A confusion matrix is initially generated to evaluate a model that captures true-positive, true-negative, false-positive, and false-negative outcomes in the training process. 99 While accuracy is commonly used as a reliable metric for evaluating classification models, it may not be sufficient when dealing with imbalanced training data sets. 72…”

Section: Performance Evaluation Parametersmentioning

confidence: 99%

“…In deep learning, various performance metrics are applied to assess the effectiveness of computational models across different aspects. − A confusion matrix is initially generated to evaluate a model that captures true-positive, true-negative, false-positive, and false-negative outcomes in the training process . While accuracy is commonly used as a reliable metric for evaluating classification models, it may not be sufficient when dealing with imbalanced training data sets. ,− Hence, we also incorporated additional performance assessment parameters such as sensitivity, specificity, Matthews’s correlation coefficient (MCC), and area under the curve (AUC) to evaluate our proposed model comprehensively. a c c u r a c y = 1 − A i p − + + A i p + − A i p + + A i p − s e n s i t i v i t y = 1 − A i p − + A i p + s p e c i f i c i t y =…”

Section: Performance Evaluation Parametersmentioning

confidence: 99%

AIPs-SnTCN: Predicting Anti-Inflammatory Peptides Using fastText and Transformer Encoder-Based Hybrid Word Embedding with Self-Normalized Temporal Convolutional Networks

Raza,

Uddin,

Almuhaimeed

et al. 2023

J. Chem. Inf. Model.

Self Cite

View full text Add to dashboard Cite

Inflammation is a biologically resistant response to harmful stimuli, such as infection, damaged cells, toxic chemicals, or tissue injuries. Its purpose is to eradicate pathogenic micro-organisms or irritants and facilitate tissue repair. Prolonged inflammation can result in chronic inflammatory diseases. However, wet-laboratory-based treatments are costly and time-consuming and may have adverse side effects on normal cells. In the past decade, peptide therapeutics have gained significant attention due to their high specificity in targeting affected cells without affecting healthy cells. Motivated by the significance of peptide-based therapies, we developed a highly discriminative prediction model called AIPs-SnTCN to predict anti-inflammatory peptides accurately. The peptide samples are encoded using word embedding techniques such as skip-gram and attention-based bidirectional encoder representation using a transformer (BERT). The conjoint triad feature (CTF) also collects structure-based cluster profile features. The fused vector of word embedding and sequential features is formed to compensate for the limitations of single encoding methods. Support vector machine-based recursive feature elimination (SVM-RFE) is applied to choose the ranking-based optimal space. The optimized feature space is trained by using an improved self-normalized temporal convolutional network (SnTCN). The AIPs-SnTCN model achieved a predictive accuracy of 95.86% and an AUC of 0.97 by using training samples. In the case of the alternate training data set, our model obtained an accuracy of 92.04% and an AUC of 0.96. The proposed AIPs-SnTCN model outperformed existing models with an ∼19% higher accuracy and an ∼14% higher AUC value. The reliability and efficacy of our AIPs-SnTCN model make it a valuable tool for scientists and may play a beneficial role in pharmaceutical design and research academia.

show abstract

Identifying Neuropeptides via Evolutionary and Sequential Based Multi-Perspective Descriptors by Incorporation With Ensemble Classification Strategy

Cited by 26 publications

References 54 publications

Classification of bioactive peptides: A systematic benchmark of models and encodings

Classification of bioactive peptides: A systematic benchmark of models and encodings

Optimizing classification of diseases through language model analysis of symptoms

AIPs-SnTCN: Predicting Anti-Inflammatory Peptides Using fastText and Transformer Encoder-Based Hybrid Word Embedding with Self-Normalized Temporal Convolutional Networks

Contact Info

Product

Resources

About