2021
DOI: 10.1186/s13321-020-00483-y
|View full text |Cite
|
Sign up to set email alerts
|

Profiling and analysis of chemical compounds using pointwise mutual information

Abstract: Pointwise mutual information (PMI) is a measure of association used in information theory. In this paper, PMI is used to characterize several publicly available databases (DrugBank, ChEMBL, PubChem and ZINC) in terms of association strength between compound structural features resulting in database PMI interrelation profiles. As structural features, substructure fragments obtained by coding individual compounds as MACCS, PubChemKey and ECFP fingerprints are used. The analysis of publicly available databases re… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 61 publications
(104 reference statements)
0
2
0
Order By: Relevance
“…The PMI (pointwise mutual information) measure was used to find the collocations and relationships of related words in the interview recordings [88]. The optimum value of the frequency threshold of binary (bigrams) and ternary (trigrams) relations was set as 5 and n-grams with frequencies lower than the threshold were removed.…”
Section: Text Mining Analysis Of Interview Datamentioning
confidence: 99%
“…The PMI (pointwise mutual information) measure was used to find the collocations and relationships of related words in the interview recordings [88]. The optimum value of the frequency threshold of binary (bigrams) and ternary (trigrams) relations was set as 5 and n-grams with frequencies lower than the threshold were removed.…”
Section: Text Mining Analysis Of Interview Datamentioning
confidence: 99%
“…This algorithm is a complementary (testing) method, very recently employed in chemistry, which measures the structural similarity between compounds by computing the mutual information (from information theory) between the InChIKey codes. Such approaches were recently used to characterize structure similarity [37] and drug-target compatibility [38]. This phase is necessary in order to narrow the search range for a probable mechanism of action characteristic of the studied lupane derivatives.…”
Section: Machine Learning Antitubercular Activity and Compound-drug Similarity Predictionmentioning
confidence: 99%