Bridging Health Data: OMOP and Genomics
Discover how OMOP CDM transforms health data sharing and precision medicine.
Manuel Rueda, Juan Manuel Ramírez-Anguita, Victoria López-Sánchez, Sergi Aguiló-Castillo, Maria Eugenia Gas López, Alberto Labarga, Miguel-Ángel Mayer, Javier Ripoll Esteve, Ivo G. Gut
― 6 min read
Table of Contents
- What’s the Deal with OMOP CDM?
- Enter OHDSI Community and Its Heroes
- A Spanish Initiative for Precision Medicine
- Turning OMOP CDM Data Into Beacon v2
- The File-Based Approach: Cooking Ahead
- The On-the-Fly Approach: No Waiting
- Testing the Waters: Real-World Data Conversion
- File-Based Conversion at the CNAG
- File-Based Conversion at IIS La Fe
- File-Based Conversion at Hospital del Mar
- A Comparison of Cooking Methods
- When to Cook Ahead
- When to Order On-the-Fly
- Conclusion: A Partnership for Health
- Original Source
- Reference Links
In today’s world, health data is growing at a staggering pace, especially when it comes to genomic and clinical information. This data can help doctors provide more personalized treatments tailored to each patient’s needs. However, there is a catch: sharing this data between different systems can be as tricky as trying to assemble a jigsaw puzzle with missing pieces. This is where a framework called the OMOP Common Data Model (CDM) comes into play. Think of it as a universal language for clinical data that makes it easier for different systems to talk to each other.
What’s the Deal with OMOP CDM?
The OMOP CDM aims to organize clinical data in a standard way. It is widely accepted and supported by a group known as OHDSI. They have a standardized vocabulary that helps categorize a wide array of health information. The goal? To make sure that regardless of where the data comes from (a research hospital in Spain or a clinic in the US), everyone is speaking the same language. Imagine conversations flowing without awkward pauses or misunderstandings.
But OMOP CDM has a little hiccup: it doesn’t handle genomic data all that well. And that’s kind of a big deal because genomic information is essential for personalized medicine, which aims to tailor treatments to individual genetic profiles.
Enter OHDSI Community and Its Heroes
To tackle this shortcoming, the OHDSI community assembled a team of experts dedicated to enhancing OMOP CDM. They want to ensure that genomic data can blend seamlessly into the mix. This effort is part of a larger global mission to improve health data sharing and collaboration, thanks to initiatives like the Global Alliance for Genomics and Health (GA4GH).
GA4GH introduced some handy tools like Beacon v2 and Phenopacket v2 standards for sharing genomic and phenotypic data. Think of these like standardized text messages that make sure your emojis don’t look weird when sent from one phone model to another.
A Spanish Initiative for Precision Medicine
In Spain, the IMPaCT program aims to take precision medicine seriously. It looks to incorporate the latest genomic healthcare advancements into the National Health System, ensuring that everyone has access to top-notch treatments. One part of this program is called IMPaCT-Data, which mashes together diverse datasets, making it easier to find the right genomic and phenotypic data.
Turning OMOP CDM Data Into Beacon v2
So, how do we transform OMOP CDM data to match the Beacon v2 format? Well, this article reveals two main approaches: a file-based method and an on-the-fly method. One of them is like preparing a hearty stew in advance, while the other is cooking up a fresh meal right when you’re hungry.
The File-Based Approach: Cooking Ahead
The file-based conversion method works in a way that is pretty efficient for centers using non-relational databases, like MongoDB. Here, large volumes of patient data are pre-transformed into a Beacon-friendly format, making access quick and straightforward. Picture a chef chopping all the ingredients for a delicious meal the night before. This method is great when researchers need to access data quickly, but it does require some periodic updates to keep things fresh.
To get this process rolling, data is exported from a relational database and turned into JSON format. Once converted, this data can then be neatly stored in a non-relational database where it can be accessed via the Beacon v2 API. Like having your soup ready to go when you want a quick meal!
The On-the-Fly Approach: No Waiting
On the other hand, the on-the-fly method takes a more dynamic route. Instead of preparing data in advance, it connects to the OMOP CDM database directly when someone needs to access information. Think of this as a food truck whipping up gourmet dishes right in front of you.
Whenever a request comes in, the system translates those requests into SQL queries to fetch the required data from the database. This approach is excellent for situations that require the most current information, like when new patient data is pouring in daily. However, it does demand a well-organized database to function smoothly. It’s a balance of speed and efficiency tailored for real-time access.
Testing the Waters: Real-World Data Conversion
To see how well these methods work in the real world, they conducted tests using various health data sources in Spain. They used datasets from things like COVID-19 patient records to understand the effectiveness of their conversion procedures.
File-Based Conversion at the CNAG
For example, at the Centro Nacional de Análisis Genómico (CNAG), they used the EUNOMIA dataset, which contains records for thousands of patients. By transforming this data into the Beacon v2 format, they could get a clear picture of patient health and share this information with other researchers.
File-Based Conversion at IIS La Fe
Next, they tested the file-based method at the Health Research Institute Hospital La Fe. Here, they pulled together clinical information from COVID-19 patients. The aim was to convert this data into Beacon v2 format, enabling researchers to query specific health characteristics easily. And just like baking cookies, the end results were deliciously complete with information!
File-Based Conversion at Hospital del Mar
At Hospital del Mar, they tapped into a massive database that has information on around one million patients. Using the IMASIS database, they managed to convert data to Beacon v2 format while maintaining nearly perfect accuracy. It’s wonderful how much insight can be gleaned from such a substantial collection of data!
A Comparison of Cooking Methods
The study also examines the pros and cons of both conversion methods.
When to Cook Ahead
The file-based approach is best for centers that find value in accessing preformatted data. It’s particularly suited for projects that combine information from multiple sources. A big win here is the quick response times, making it perfect when researchers are ready to dig into their data.
When to Order On-the-Fly
On-the-fly conversion shines when having the most up-to-date information is critical. It prevents the need for periodic updates and sidesteps the hassle of maintaining duplicate data. However, it does rely on having a speedy, well-organized database to make it work best.
Conclusion: A Partnership for Health
Both methods contribute to making health data sharing easier and more efficient. By offering simple solutions for converting health data, they help research communities come together, fostering collaboration and ultimately driving advances in precision medicine.
Humor aside, when it comes to health, sharing data is serious business. With these methods in play, we are closer to a world where health information flows freely, making it easier for healthcare providers to offer the best care possible to their patients. Here's to that future—and maybe a bit of soup on the side!
Original Source
Title: Enhancing Semantic Interoperability in Precision Medicine: Converting OMOP CDM to Beacon v2 in the Spanish IMPaCT- Data Project
Abstract: ObjectiveTo introduce novel methods to convert OMOP CDM data into GA4GH Beacon v2 format, enhancing semantic interoperability within Spains IMPaCT-Data program for personalized medicine. Materials and MethodsWe utilized a file-based approach with the Convert-Pheno tool to transform OMOP CDM exports into Beacon v2 format. Additionally, we developed a direct connection from PostgreSQL OMOP CDM to the Beacon v2 API, enabling real-time data access without intermediary text files. ResultsWe successfully converted OMOP CDM datasets from three research centers (CNAG, IIS La Fe, and HMar) to Beacon v2 format with nearly 100% data completeness. The direct connection approach improved data freshness and adaptability for dynamic environments. Discussion and ConclusionThis study introduces two methodologies for integrating OMOP CDM data with Beacon v2, offering performance optimization or real-time access. These methodologies can be adopted by other centers to enhance interoperability and collaboration in health data sharing.
Authors: Manuel Rueda, Juan Manuel Ramírez-Anguita, Victoria López-Sánchez, Sergi Aguiló-Castillo, Maria Eugenia Gas López, Alberto Labarga, Miguel-Ángel Mayer, Javier Ripoll Esteve, Ivo G. Gut
Last Update: 2024-12-28 00:00:00
Language: English
Source URL: https://www.medrxiv.org/content/10.1101/2024.12.25.24319606
Source PDF: https://www.medrxiv.org/content/10.1101/2024.12.25.24319606.full.pdf
Licence: https://creativecommons.org/licenses/by/4.0/
Changes: This summary was created with assistance from AI and may have inaccuracies. For accurate information, please refer to the original source documents linked here.
Thank you to medrxiv for use of its open access interoperability.
Reference Links
- https://github.com/OHDSI/Genomic-CDM
- https://impact.isciii.es/
- https://impact-data.bsc.es/en/about/impact
- https://3tr-imi.eu/
- https://github.com/CNAG-Biomedical-Informatics/omop-cdm-2-beacon-v2
- https://gitlab.bsc.es/impact-data/impd-beacon_omopcdm
- https://ohdsi.github.io/Eunomia/
- https://www.sjdhospitalbarcelona.org/es/hospital/proyectos-estrategicos/red-unicas-atencion-enfermedades-minoritarias
- https://by-covid.org/
- https://www.gcatbiobank.org/