2022
DOI: 10.1093/database/baac019
|View full text |Cite
|
Sign up to set email alerts
|

Diseases 2.0: a weekly updated database of disease–gene associations from text mining and data integration

Abstract: The scientific knowledge about which genes are involved in which diseases grows rapidly, which makes it difficult to keep up with new publications and genetics datasets. The DISEASES database aims to provide a comprehensive overview by systematically integrating and assigning confidence scores to evidence for disease–gene associations from curated databases, genome-wide association studies (GWAS) and automatic text mining of the biomedical literature. Here, we present a major update to this resource, which gre… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
44
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 60 publications
(44 citation statements)
references
References 41 publications
0
44
0
Order By: Relevance
“…g. PGSEA analysis for unaffected control (n = 5) and NA-GBM (n = 5) samples against the Jensen disease database 41 , scale bar indicates the Z-score for each sample. Left plot shows the top 5 hits (p < 0.02), and the right plot shows the results for selected neurodegenerative diseases (p > 0.02).…”
Section: Resultsmentioning
confidence: 99%
“…g. PGSEA analysis for unaffected control (n = 5) and NA-GBM (n = 5) samples against the Jensen disease database 41 , scale bar indicates the Z-score for each sample. Left plot shows the top 5 hits (p < 0.02), and the right plot shows the results for selected neurodegenerative diseases (p > 0.02).…”
Section: Resultsmentioning
confidence: 99%
“…The online tool DISEASES (18) was used to analyze the associations of ATP5MG with diseases. Only the human gene ATP5MG was selected, and the Z-score data were downloaded by selecting "text mining."…”
Section: Methodsmentioning
confidence: 99%
“…Very briefly, the TDL is one of four potential values: Tclin, Tchem, Tbio or Tdark. Tclin are targets for which an approved drug exists ( 5 , 6 ), which currently includes 704 human proteins; Tchem are proteins that are not Tclin, but are known to bind small molecules with high potency (currently N = 1971); Tbio includes proteins that have Gene Ontology ( 7 ) leaf term annotations based on experimental evidence; or meet two of the following three conditions: A fractional publication count ( 8 ) >5, three or more Gene RIF, ‘Reference Into Function’ annotations ( https://www.ncbi.nlm.nih.gov/gene/about-generif ), or 50 or more commercial antibodies, as counted in the Antibodypedia portal ( 9 ). The fourth category, Tdark, currently includes ∼31% of the human proteins that were manually curated at the primary sequence level in UniProt, but do not meet any of the Tclin, Tchem or Tbio criteria.…”
Section: Introductionmentioning
confidence: 99%
“…Pharos typically links to external sites for more information, and to TCRD’s primary data sources, when a website is available. Many other sites link to Pharos as well, including: PDB ( 12 ), ChEMBL ( 13 ), DISEASES ( 8 ), DrugCentral ( 14 ), MARRVEL ( 15 ), Reactome ( 16 ), KEGG ( 17 ), Guide to Pharmacology ( 18 ), GlyGen ( 19 ), UniProt ( 20 ) and more. Data is accessible by full TCRD download ( http://juniper.health.unm.edu/tcrd/ ), or via the GraphQL API ( https://pharos-api.ncats.io/graphql ).…”
Section: Introductionmentioning
confidence: 99%