2017
DOI: 10.1155/2017/9852063
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Data Mining Algorithms for Screening Potential Proteins of Drug Target

Abstract: The past few decades have witnessed the boom in pharmacology as well as the dilemma of drug development. Playing a crucial role in drug design, the screening of potential human proteins of drug targets from open access database with well-measured physical and chemical properties is a task of challenge but significance. In this paper, the screening of potential drug target proteins (DTPs) from a fine collected dataset containing 5376 unlabeled proteins and 517 known DTPs was researched. Our objective is to scre… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
4

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 26 publications
0
4
0
Order By: Relevance
“…Because at present the chemical and physical properties of known drug targets can be found but nondrug targets cannot, the prediction of drug targets is one classification problem for which there is no good solution. Currently, most studies usually use pending test proteins as nondrug targets, but they inevitably contain proteins that subsequently turn out to be drug targets although the majority of the proteins are normal proteins [ 28 ]. Therefore, there is an alternatively reasonable way to collect a more accurate nondrug targets dataset based on the drug targets structure characteristics.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Because at present the chemical and physical properties of known drug targets can be found but nondrug targets cannot, the prediction of drug targets is one classification problem for which there is no good solution. Currently, most studies usually use pending test proteins as nondrug targets, but they inevitably contain proteins that subsequently turn out to be drug targets although the majority of the proteins are normal proteins [ 28 ]. Therefore, there is an alternatively reasonable way to collect a more accurate nondrug targets dataset based on the drug targets structure characteristics.…”
Section: Discussionmentioning
confidence: 99%
“…Therefore, they can provide us with information that whether a protein is suitable to be a drug target protein. The chemical and physical properties which were used in our analysis are 26 amino acids (counted by Mole%) and number of charged residues, basic residues, acidic average molecular weight, and isoelectric point are used to train the model and predict the potential drug targets [ 28 ]. Besides, they are important clues as well as the PPI topological features for the judgement of which proteins could be targets.…”
Section: Data Collectionmentioning
confidence: 99%
“…However, with the in-depth research and theoretical development, especially in the 21st century, with the significant improvement of computer ability, learning natural language based on deep learning technology has gradually matured. In 2013, Wang and others proposed the word2vec algorithm and a bag-of-words model was constructed using neural network, and the word vector representation of the target language was calculated according to the context word distribution of the target language in a large-scale corpus [7]. Word2vec algorithm completes the transformation from text representation to static number vector.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Some studies were dedicated to developing a more efficient statistical inference method. With the boom in machine learning methods and high credibility biological database [ 7 ], new methods are in great need to help identify novel pathway regulation relationships of protein interactions.…”
Section: Introductionmentioning
confidence: 99%