Background The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) can be used to transform observational health data to a common format. CDM transformation allows for analysis across disparate databases for the generation of new, real-word evidence, which is especially important in rare disease where data are limited. Pulmonary hypertension (PH) is a progressive, life-threatening disease, with rare subgroups such as pulmonary arterial hypertension (PAH), for which generating real-world evidence is challenging. Our objective is to document the process and outcomes of transforming registry data in PH to the OMOP CDM, and highlight challenges and our potential solutions. Methods Three observational studies were transformed from the Clinical Data Interchange Standards Consortium study data tabulation model (SDTM) to OMOP CDM format. OPUS was a prospective, multi-centre registry (2014–2020) and OrPHeUS was a retrospective, multi-centre chart review (2013–2017); both enrolled patients newly treated with macitentan in the US. EXPOSURE is a prospective, multi-centre cohort study (2017–ongoing) of patients newly treated with selexipag or any PAH-specific therapy in Europe and Canada. OMOP CDM version 5.3.1 with recent OMOP CDM vocabulary was used. Imputation rules were defined and applied for missing dates to avoid exclusion of data. Custom target concepts were introduced when existing concepts did not provide sufficient granularity. Results Of the 6622 patients in the three registry studies, records were mapped for 6457. Custom target concepts were introduced for PAH subgroups (by combining SNOMED concepts or creating custom concepts) and World Health Organization functional class. Per the OMOP CDM convention, records about the absence of an event, or the lack of information, were not mapped. Excluding these non-event records, 4% (OPUS), 2% (OrPHeUS) and 1% (EXPOSURE) of records were not mapped. Conclusions SDTM data from three registries were transformed to the OMOP CDM with limited exclusion of data and deviation from the SDTM database content. Future researchers can apply our strategy and methods in different disease areas, with tailoring as necessary. Mapping registry data to the OMOP CDM facilitates more efficient collaborations between researchers and establishment of federated data networks, which is an unmet need in rare diseases.
Large administrative healthcare (including insurance claims) databases are used for various retrospective real-world evidence studies. However, in pulmonary arterial hypertension (PAH) and chronic thromboembolic pulmonary hypertension (CTEPH), identifying patients retrospectively based on administrative codes remains challenging, as it relies on code combinations ('algorithms') and the accuracy for patient identification of most of them is unknown. This study aimed to assess the performance of various algorithms in correctly identifying patients with PAH or CTEPH in administrative databases. A systematic literature review was performed to find publications detailing code-based algorithms used to identify PAH and CTEPH patients. PheValuator, a diagnostic predictive modelling tool, was applied to three US claims databases, yielding models that estimated the probability of a patient having the disease. These models were used to evaluate the performance characteristics of selected PAH and CTEPH algorithms. With increasing algorithm complexity, average positive predictive value increased (PAH: 13.4–66.0%; CTEPH: 10.3–75.1%) and average sensitivity decreased (PAH: 61.5–2.7%; CTEPH: 20.7–0.2%). Specificities and negative predictive values were high (≥97.5%) for all algorithms. Several of the algorithms performed well overall when considering all of these four performance parameters, and all algorithms performed with similar accuracy across the three claims databases studied, even though most were designed for patient identification in a specific database. Therefore, it is the objective of a study that will determine which algorithm may be most suitable; one- or two-component algorithms are most inclusive and three- or four-component algorithms identify most precise PAH or CTEPH populations, respectively.
The main aim of this analysis was to investigate time from symptom onset (chronic unexplained dyspnoea [CUD]) to diagnosis of Group 1 pulmonary hypertension (PH)—pulmonary arterial hypertension (PAH)—and to characterize healthcare resource utilization leading up to diagnosis using a nationwide US claims and an electronic health record (EHR) database from Optum © . Eligible patients were ≥18 years old at first CUD diagnosis (index event) and had a PAH diagnosis on or after index date. Based on administrative codes, PAH was defined as right heart catheterization (RHC), ≥ 2 PAH diagnoses (1 within a year of RHC), and ≥1 post‐RHC prescription for PAH treatment. All values are median (1st quartile–3rd quartile) unless otherwise stated. Of 854,722 patients with CUD in the claims database, 582 (0.1%) had PAH. Time from CUD to PAH diagnosis was 2.26 (0.73–4.22) years. PAH patients experienced 3 (2–4) transthoracic echocardiograms (TTEs), 6 (3–12) specialist visits, and 2 (1–4) hospitalizations during the diagnostic interval. Almost one‐third of patients (29%) waited 10 months or more to have a TTE. Findings from the EHR database were broadly similar. Resource utilization during the diagnostic interval was also analyzed in an overall PH cohort: findings were generally similar to the PAH cohort (2 [1–3] TTEs, 4 [2–9] specialist visits and 2 [1–4] hospitalizations). These data indicate a delay in the diagnostic pathway for PAH, and illustrate the burden associated with PAH diagnosis.
The approach performed well in the RA case study, but further work is required to better define its strengths and limitations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.