AI and Medical Reports: A New Tool for Precision Medicine
AI software improves medical report processing for better patient care.
― 5 min read
Table of Contents
Precision medicine aims to tailor medical treatments to individual characteristics of each patient. This requires detailed understanding of a patient's condition, known as phenotyping, and organized Health Records that allow for effective sharing of clinical data. This is particularly important for rare diseases, where finding patients with similar conditions across the globe can make a big difference in treatment outcomes.
The Role of Medical Reports
Medical reports hold valuable information about a patient's health and play a key role in communication between healthcare providers. However, sharing these reports can be tough, especially when providers speak different languages. This adds layers of complexity when reports need to be translated without revealing personal information to protect patient Privacy. Additionally, many medical reports are written in a free-text format, which makes it hard to gather useful information for precision medicine.
Human Phenotype Ontology (HPO)
To tackle these challenges, the medical community has turned to the Human Phenotype Ontology (HPO). HPO standardizes the way diseases and Symptoms are described, creating a common language that can be used by both doctors and computers. Using HPO terms helps in identifying new diseases and is especially helpful when analyzing genetic information. However, doctors often provide HPO terms that vary in quality and completeness, which makes it necessary to standardize how these terms are used for clearer communication.
Challenges in Medical Data Sharing
Artificial Intelligence (AI) technologies, including deep learning and natural language processing, show promise in making medical information sharing easier. Yet, using systems that operate on the cloud may pose risks regarding patient privacy due to strict regulations like the General Data Protection Regulation (GDPR). For this reason, it’s suggested that sensitive patient information should be handled within secure local systems rather than on the cloud.
To make the most of clinical data, it's essential to have well-organized, anonymized, and structured clinical data. Researchers have developed software that can translate non-English medical reports, remove personal identifiers, and summarize symptoms using HPO terms while following the principles of data security and privacy.
The Study on AI and Medical Reports
A study was conducted to evaluate how effective this software is. The study took place at two different hospital centers and focused on gathering data from patients suspected of having genetic disorders. Medical reports from these patients were included in the study only if consent was given. Before using the reports, the researchers anonymized the patients' names.
Study Design: How It Worked
A doctor reviewed 50 medical reports, and the software went through a series of steps. First, names were removed to protect patient privacy. Then, the software handled the text by expanding abbreviations, correcting translations, and removing personal identifiers according to existing privacy rules. The doctor compared the results generated by the software to their own evaluations, documenting any mistakes and noting their significance.
The main goal was to see how many personal identifiers the software could remove compared to the doctor. The effectiveness of the software for summarizing symptoms using HPO terms was also studied. They set specific targets for the software's accuracy for these tasks.
How the Software Processed Reports
The software followed an offline process to handle medical reports. It first expanded any abbreviated terms, then translated the text into English, took out personal identifiers, and finally summarized the information using HPO terms. Each summary was flagged for confidence level, indicating how certain the software was about the information presented.
To improve accuracy and avoid losing important clinical details, the research team combined AI techniques with human review. They created lists of medical names, drug names, and commonly used abbreviations to ensure that critical information wasn't incorrectly removed.
Evaluating the Results
During the study, they identified what worked and what didn't. They categorized mistakes into major and moderate errors based on how much they affected the accuracy of the report.
The researchers considered missed identifiers as errors and also noted excess identifiers that should not have been included. They also looked into how accurately the symptoms were summarized using HPO terms, noting instances where the software over- or under-reported condition descriptions.
The results showed that the software successfully removed a high percentage of personal identifiers and accurately summarized many symptoms. Specific thresholds were set to determine if the software could reliably replace human effort in these tasks.
Findings from the Study
The results showed that the AI software outperformed expectations in many areas. It successfully anonymized personal information at an exceptional rate, surpassing the minimum accuracy goals set prior to the study. In terms of summarizing symptoms, the software also achieved impressive results, with a high percentage of correctly identified symptoms using HPO terms.
However, it was noted that there were still some areas for improvement. Some errors came from translation issues, while others were related to incorrect family member information being included.
The software was designed to be comprehensive and user-friendly, making it suitable for everyday use in clinical settings. However, it was acknowledged that human oversight might still be required to ensure the best quality results.
Limitations and Future Directions
While the study showed great promise, there were limitations to the approach taken. Certain medical terminology and proper names were not adequately addressed by the software, leading to instances where important details were either over- or under-identified. Additionally, the study focused mostly on genetic reports, which may not represent other types of medical documentation equally well.
Future improvements could include better handling of medications and clearer guidelines for multi-patient reports. The performance of similar AI tools could also be benchmarked more thoroughly to compare effectiveness.
Conclusion
In summary, the study emphasized how AI technologies could significantly enhance the efficiency and accuracy of processing medical reports. By improving both the removal of personal identifiers and the summarization of symptoms, these tools could facilitate more effective sharing of patient information, particularly in the context of precision medicine.
As AI continues to evolve, its role in healthcare settings will likely expand, paving the way for faster diagnoses and better patient care.
Title: Assessing feasibility and risk to translate, de-identify and summarize medical reports using deep learning
Abstract: BackgroundPrecision medicine requires accurate phenotyping and data sharing, particularly for rare diseases. However, sharing medical reports across language barriers is challenging. Alternatively, inconsistent and incomplete clinical summary provided by physicians using Human Phenotype Ontology (HPO) can lead to a loss of clinical information. MethodsTo assess feasibility and risk of using deep learning methods to translate, de-identify and summarize medical reports, we developed an open-source deep learning multi-language software in line with health data privacy. We conducted a non-inferiority clinical trial using deep learning methods to de-identify protected health information (PHI) targeting a minimum sensitivity of 90% and specificity of 75%, and summarize non-English medical reports in HPO format, aiming a sensitivity of 75% and specificity of 90%. ResultsFrom March to April 2023, we evaluated 50 non-English medical reports from 8 physicians and 12 different groups of diseases, which included neurodevelopmental disorders, congenital disorders, fetal pathology and oncology. Reports contain in median 15 PHI and 7 HPO terms. Deep learning method achieved a sensitivity of 99% and a specificity of 87% in de-identification, and a sensitivity of 78% and a specificity of 92% in summarizing medical reports, reporting an average number of 6.6 HPO terms per report, which is equivalent to the number of HPO terms provided usually by physicians in databases (6.8 in PhenoDB). ConclusionsDe-identification and summarization of non-English medical reports using deep learning methods reports non-inferior performance, providing insights on AI usage to facilitate precision medicine. Graphical abstract O_FIG O_LINKSMALLFIG WIDTH=145 HEIGHT=200 SRC="FIGDIR/small/23293234v3_ufig1.gif" ALT="Figure 1"> View larger version (44K): [email protected]@bddee9org.highwire.dtl.DTLVardef@175af12org.highwire.dtl.DTLVardef@138fddb_HPS_FORMAT_FIGEXP M_FIG Illustration of the non-inferiority trial for de-identification and summarization of non-english medical reports and main statistical performances. C_FIG
Authors: Kevin Yauy, L. W. Gauthier, M. Willems, N. Chatron, C. Cenni, P. Meyer, V. Ruault, C. Wells, Q. Sabbagh, D. Genevieve
Last Update: 2023-08-31 00:00:00
Language: English
Source URL: https://www.medrxiv.org/content/10.1101/2023.07.27.23293234
Source PDF: https://www.medrxiv.org/content/10.1101/2023.07.27.23293234.full.pdf
Licence: https://creativecommons.org/licenses/by-nc/4.0/
Changes: This summary was created with assistance from AI and may have inaccuracies. For accurate information, please refer to the original source documents linked here.
Thank you to medrxiv for use of its open access interoperability.
Reference Links
- https://marian-nmt.github.io
- https://microsoft.github.io/presidio/
- https://omim.org/
- https://www.insee.fr/fr/information/6051727
- https://abreviationsmedicales.ch/
- https://fr.wikipedia.org/wiki/Liste_d%27abr%C3%A9viations_en_m%C3%A9decine
- https://www.pays-de-la-loire.ars.sante.fr/system/files/2018-06/Aide%20-%20Acronymes.pdf
- https://nexus.phenotips.org/nexus/content/repositories/releases/org/phenotips/vocabulary-hpo-translation-french/1.4-rc-4/
- https://github.com/kyauy/ClinFly
- https://huggingface.co/spaces/kyauy/ClinFly