Background Metadata are created to describe the corresponding data in a detailed and unambiguous way and is used for various applications in different research areas, for example, data identification and classification. However, a clear definition of metadata is crucial for further use. Unfortunately, extensive experience with the processing and management of metadata has shown that the term “metadata” and its use is not always unambiguous. Objective This study aimed to understand the definition of metadata and the challenges resulting from metadata reuse. Methods A systematic literature search was performed in this study following the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines for reporting on systematic reviews. Five research questions were identified to streamline the review process, addressing metadata characteristics, metadata standards, use cases, and problems encountered. This review was preceded by a harmonization process to achieve a general understanding of the terms used. Results The harmonization process resulted in a clear set of definitions for metadata processing focusing on data integration. The following literature review was conducted by 10 reviewers with different backgrounds and using the harmonized definitions. This study included 81 peer-reviewed papers from the last decade after applying various filtering steps to identify the most relevant papers. The 5 research questions could be answered, resulting in a broad overview of the standards, use cases, problems, and corresponding solutions for the application of metadata in different research areas. Conclusions Metadata can be a powerful tool for identifying, describing, and processing information, but its meaningful creation is costly and challenging. This review process uncovered many standards, use cases, problems, and solutions for dealing with metadata. The presented harmonized definitions and the new schema have the potential to improve the classification and generation of metadata by creating a shared understanding of metadata and its context.
With the steady increase in the connectivity of the healthcare system, new requirements and challenges are emerging. In addition to the seamless exchange of data between service providers on a national level, the local legacy data must also meet the new requirements. For this purpose, the applications used must be tested securely and sufficiently. However, the availability of suitable and realistic test data is not always given. Therefore, this study deals with the creation of test data based on real electronic health record data provided by the Medical Information Mart for Intensive Care (MIMIC-IV) database. In addition to converting the data to the current FHIR R4, conversion to the core data sets of the German Medical Informatics Initiative was also presented and made available. The test data was generated to simulate a legacy data transfer. Moreover, four different FHIR servers were tested for performance. This study is the first step toward comparable test scenarios around shared datasets and promotes comparability among providers on a national level.
Clinical trials are carried out to prove the safety and effectiveness of new interventions and therapies. As diseases and their causes continue to become more specific, so do inclusion and exclusion criteria for trials. Patient recruitment has always been a challenge, but with medical progress, it becomes increasingly difficult to achieve the necessary number of cases. In Germany, the Medical Informatics Initiative is planning to use the central application and registration office to conduct feasibility analyses at an early stage and thus to identify suitable project partners. This approach aims to technically adapt/integrate the envisioned infrastructure in such a way that it can be used for trial case number estimation for the planning of multicenter clinical trials. We have developed a fully automated solution called APERITIF that can identify the number of eligible patients based on free-text eligibility criteria, taking into account the MII core data set and based on the FHIR standard. The evaluation showed a precision of 62.64 % for inclusion criteria and a precision of 66.45 % for exclusion criteria.
BackgroundHeterogeneous healthcare instance data can hardly be integrated without harmonizing its schema-level metadata. Many medical research projects and organizations use metadata repositories to edit, store and reuse data elements. However, existing metadata repositories differ regarding software implementation and have shortcomings when it comes to exchanging metadata. This work aims to define a uniform interface with a technical interlingua between the different MDR implementations in order to enable and facilitate the exchange of metadata, to query over distributed systems and to promote cooperation. To design a unified interface for multiple existing MDRs, a standardized data model must be agreed on. The ISO 11179 is an international standard for the representation of metadata, and since most MDR systems claim to be at least partially compliant, it is suitable for defining an interface thereupon. Therefore, each repository must be able to define which parts can be served and the interface must be able to handle highly linked data. GraphQL is a data access layer and defines query techniques designed to navigate easily through complex data structures.ResultsWe propose QL4MDR, an ISO 11179-3 compatible GraphQL query language. The GraphQL schema for QL4MDR is derived from the ISO 11179 standard and defines objects, fields, queries and mutation types. Entry points within the schema define the path through the graph to enable search functionalities, but also the exchange is promoted by mutation types, which allow creating, updating and deleting of metadata. QL4MDR is the foundation for the uniform interface, which is implemented in a modern web-based interface prototype.ConclusionsWe have introduced a uniform query interface for metadata repositories combining the ISO 11179 standard for metadata repositories and the GraphQL query language. A reference implementation based on the existing Samply.MDR was implemented. The interface facilitates access to metadata, enables better interaction with metadata as well as a basis for connecting existing repositories. We invite other ISO 11179-based metadata repositories to take this approach into account.
Summary Background Secondary use of routine medical data relies on a shared understanding of given information. This understanding is achieved through metadata and their interconnections, which can be stored in metadata repositories (MDRs). The necessity of an MDR is well understood, but the local work on metadata is a time-consuming and challenging process for domain experts. Objective To support the identification, collection, and provision of metadata in a predefined structured manner to foster consolidation. A particular focus is placed on user acceptance. Methods We propose a software pipeline MDRBridge as a practical intermediary for metadata capture and processing, based on MDRSheet, an ISO 11179–3 compliant template using popular spreadsheet software. It serves as a practical mediator for metadata acquisition and processing in a broader pipeline. Due to the different origins of the metadata, both manual entry and automatic extractions from application systems are supported. To enable the export of collected metadata into external MDRs, a mapping of ISO 11179 to Clinical Data Interchange Standards Consortium (CDISC) Operational Data Model (ODM) was developed. Results MDRSheet is embedded in the processing pipeline MDRBridge and delivers metadata in the CDISC ODM format for further use in MDRs. This approach is used to interactively unify core datasets, import existing standard datasets, and automatically extract all defined data elements from source systems. The involvement of clinical domain experts improved significantly due to minimal changes within their usual work routine. Conclusion A high degree of acceptance was achieved by adapting the working methods of clinical domain experts. The designed process is capable of transforming all relevant data elements according to the ISO 11179-3 format. MDRSheet is used as an intermediate format to present the information at a glance and to allow editing or supplementing by domain experts.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.