2019
DOI: 10.1093/database/baz059
|View full text |Cite
|
Sign up to set email alerts
|

Using association rule mining and ontologies to generate metadata recommendations from multiple biomedical databases

Abstract: Metadata—the machine-readable descriptions of the data—are increasingly seen as crucial for describing the vast array of biomedical datasets that are currently being deposited in public repositories. While most public repositories have firm requirements that metadata must accompany submitted datasets, the quality of those metadata is generally very poor. A key problem is that the typical metadata acquisition process is onerous and time consuming, with little interactive guidance or assistance provided to users… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
27
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 24 publications
(27 citation statements)
references
References 30 publications
0
27
0
Order By: Relevance
“…Detailed machine-readable and actionable descriptions to enable data processing without human guidance [ 10 , 25 ]…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Detailed machine-readable and actionable descriptions to enable data processing without human guidance [ 10 , 25 ]…”
Section: Resultsmentioning
confidence: 99%
“…As an example, matching can be understood in various ways. Metadata matches to instance data [ 8 ] or to semantic attributes [ 9 ] or other metadata [ 10 ]. The general understanding is ambiguous.…”
Section: Methodsmentioning
confidence: 99%
“…7 Final dataset with reconstructions and all experimental groups added in the metadata portal or smart recommendation systems to further expedite and improve the critical bottleneck of database curation. Recently, machine learning algorithms have been successfully deployed for metadata extraction [27]. In particular, text mining tools, such as named entity recognition, can learn, identify, and label crucial elements of neuroscience documents like neuron names, brain regions, and experimental conditions [5,37] .…”
Section: Resultsmentioning
confidence: 99%
“…Effective and efficient management of increasingly complex genomic datasets requires addressing challenges with these emerging approaches as well as innovations in the use of hardware, algorithms, software, standards, and platforms 40 . Current barriers include the lack of interoperable genomic data resources (which limits downstream access, integration, and analyses) and the absence of controlled and consistently adopted data and metadata vocabularies and ontologies 41,42 . User-friendly systems that capture metadata in a scalable, intelligent, and cost-effective manner and that allow for intuitive data visualizations are essential.…”
Section: Boxmentioning
confidence: 99%