2019
DOI: 10.1093/database/bay139
|View full text |Cite
|
Sign up to set email alerts
|

Enhanced taxonomy annotation of antiviral activity data from ChEMBL

Abstract: The discovery of antiviral drugs is a rapidly developing area of medicinal chemistry research. The emergence of resistant variants and outbreaks of poorly studied viral diseases make this area constantly developing. The amount of antiviral activity data available in ChEMBL consistently grows, but virus taxonomy annotation of these data is not sufficient for thorough studies of antiviral chemical space. We developed a procedure for semi-automatic extraction of antiviral activity data from ChEMBL and mapped them… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
24
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 22 publications
(25 citation statements)
references
References 31 publications
0
24
0
1
Order By: Relevance
“…About 71 K structures and 787 K bioactivity data points were available for 6128 formulae found in ChEMBL. Antiviral activity data points (21,559 entries) linked to 7958 structures were extracted from thoroughly curated subset of antiviral activity data (ViralChEMBL) 44 to reveal that they were tested against viruses belonging to 25 distinct families. Among the most studied viruses were HIV-1, HCV, and Influenza virus A, from Retroviridae , Flaviviridae , and Orthomyxoviridae families, respectively (Supplementary Fig.…”
Section: Resultsmentioning
confidence: 99%
“…About 71 K structures and 787 K bioactivity data points were available for 6128 formulae found in ChEMBL. Antiviral activity data points (21,559 entries) linked to 7958 structures were extracted from thoroughly curated subset of antiviral activity data (ViralChEMBL) 44 to reveal that they were tested against viruses belonging to 25 distinct families. Among the most studied viruses were HIV-1, HCV, and Influenza virus A, from Retroviridae , Flaviviridae , and Orthomyxoviridae families, respectively (Supplementary Fig.…”
Section: Resultsmentioning
confidence: 99%
“…Such models can be built using publicly and commercially available large-scale sources of information about antiretroviral agents including the NIAID HIV/OI/TB Therapeutics [17], EBI ChEMBL [18], and Clarivate Analytics Integrity [19] databases. It is well-known that in any particular drug discovery project, careful curation of the information from the databases is needed [20][21][22]. Thus, our study consisted of the preparation of cleaned versions of the training sets on HIV-1 integrase (IN), protease (PR), and reverse transcriptase (RT) inhibitors; creation and validation of (Q)SAR models using the PASS [23] and GUSAR [24] software; investigation of the possibilities and limitations of practical application of the developed models; and creation of a freely available web service for prediction of anti-HIV activity using the integrated training set.…”
Section: Introductionmentioning
confidence: 99%
“…The subsets FlaviChEMBL and EnteroChEMBL (Supplementary File 1), containing compounds tested against members of Flavivirus and Enterovirus genera, respectively, were extracted from ViralChEMBL . The statistics of the FlaviChEMBL and EnteroChEMBL datasets content is summarized in Table .…”
Section: Resultsmentioning
confidence: 99%
“…The subset of ViralChEMBL dataset containing 7455 compounds tested against members of Flavivirus (FlaviChEMBL) and Enterovirus (EnteroChEMBL) genera (Supplementary File 1) was extracted from ViralChEMBL …”
Section: Methodsmentioning
confidence: 99%