Improving Disease Prediction with New Tools
A tool enhances survival predictions for diseases like ALS and atrial fibrillation.
― 6 min read
Table of Contents
Predicting how long someone will live or how quickly a disease will progress is very important for almost all health conditions. This is especially true for diseases like amyotrophic lateral sclerosis (ALS). Researchers looked at different ways for predicting Survival and found that while some advanced models are available using machine learning and deep learning techniques, many of these tools are not widely used in real clinical situations. They also noted that there are simpler models that could work just as well or even better, especially when there is limited Data available.
Importance of This Study
Being able to predict how a disease will develop, including when a patient might experience significant events like the onset of symptoms or death, is crucial for guiding treatment decisions. However, there is often a disconnect between doctors who have access to patient data and researchers who create prediction models. To bridge this gap, a team developed a tool called predicTTE, which allows researchers, even those without technical expertise, to create effective prediction models. The tool is accessible through an online platform where healthcare providers can also share additional data to improve the models.
predicTTE is designed to be customizable for any kind of prediction involving time-to-event analysis. It includes advanced models from popular packages for deep learning and parametric approaches, allowing for the creation of combined models. The tool also manages missing data effectively using a specific model called MissForrest, which has been shown to be effective in real-world conditions. The goal of this work is to enhance clinical practices in this field.
Applications of This Study
The researchers demonstrated how their system could be used in three different situations related to neurological diseases. By combining the best prediction tools in a user-friendly way, they aim to facilitate fast and secure data sharing. Their work has the potential to evolve to accommodate new models and datasets in the future.
Understanding what affects survival times is important for all healthcare settings, especially for chronic conditions like neurological diseases. These diseases often lack good predictive markers, making it necessary to get creative with the existing patient data in a valuable way. One common method for time-to-event analysis is Cox regression, which has limitations in certain scenarios because it assumes that the risk of death remains constant over time. Newer models, including deep learning approaches, offer promise but often come with high technical demands that can exclude many clinicians from using them effectively.
To tackle this, the team has created an app that simplifies the use of these advanced models. The online platform allows patients and healthcare workers to access trained models for predictions and also contributes their own data for further improvements.
Use Case 1: Predicting ALS Survival
ALS is a serious and progressive disease, typically leading to death within a few years. Researchers believe that understanding how the disease will progress can help in making better predictions about survival. They identified key clinical characteristics that could predict survival, like a patient's age, specific genetic markers, and the rate of decline as measured by a standard scale.
The team used data from a large group of ALS patients to train a prediction model. They employed an ensemble approach, where predictions are first made using one model and then refined using information from similar patients. This technique significantly improved prediction accuracy and was validated with a separate group of ALS patients.
Missing data is a common issue, and the researchers demonstrated that their method for imputation allows for effective handling of incomplete information. This is crucial in real-world cases where some patient data might not be fully complete, yet accurate predictions can still be made.
Biomarkers in ALS
Use Case 2: EvaluatingIn another example, the researchers applied their model to assess how well certain biological markers in the blood can predict ALS survival. They compared survival predictions made using clinical measurements alone with models that included these new biomarkers.
Although previous predictions relied heavily on clinical data, the inclusion of blood-based biomarkers improved the overall model performance. This finding underscores the potential of combining different types of data for better predictions.
Use Case 3: Individualized Treatment for Atrial Fibrillation
The final example focused on patients suffering from atrial fibrillation, a condition that can lead to strokes. When choosing Treatments, current practices usually group patients based on their age and existing health conditions. However, this may not always provide the best treatment options for every individual.
Using predicTTE, the researchers analyzed a large dataset of patients and created individual predictions for the time until death after starting a specific treatment. They showed that their predictions could suggest more suitable treatment options tailored to each patient's unique situation rather than relying solely on broad categories.
The Online Platform
predicTTE is a comprehensive software package that helps users create time-to-event prediction models. The platform aims to make these advanced tools accessible to those who may not have a technical background. In a recent survey, many ALS patients expressed a desire to access their survival predictions, and the online platform makes this possible.
Patients and healthcare providers can not only retrieve personalized survival predictions but also contribute their own data through secure channels. This approach could lead to better data collection and the improvement of prediction models over time.
Conclusion
Being able to accurately predict how diseases progress is essential, especially for conditions where tissue samples might not be readily available. The development of tools like predicTTE can greatly enhance how predictions are made and utilized in clinical settings. By offering an accessible and secure platform for data sharing and prediction modeling, the researchers hope to create a loop that continuously improves both the datasets and the accuracy of predictions.
The work demonstrated how predicTTE can be applied in various ways, from predicting survival in ALS to assessing treatment options for patients with other conditions. By employing different scientific methods to fill in gaps in data and enhance prediction performance, these efforts could ultimately lead to better patient outcomes and more personalized medicine approaches.
Title: predicTTE: An accessible and optimal tool for time-to-event prediction in neurological diseases
Abstract: Time-to-event prediction is a key task for biological discovery, experimental medicine, and clinical care. This is particularly true for neurological diseases where development of reliable biomarkers is often limited by difficulty visualising and sampling relevant cell and molecular pathobiology. To date, much work has relied on Cox regression because of ease-of-use, despite evidence that this model includes incorrect assumptions. We have implemented a set of deep learning and spline models for time-to-event modelling within a fully customizable app and accompanying online portal, both of which can be used for any time-to-event analysis in any disease by a non-expert user. Our online portal includes capacity for end-users including patients, Neurology clinicians, and researchers, to access and perform predictions using a trained model, and to contribute new data for model improvement, all within a data-secure environment. We demonstrate a pipeline for use of our app with three use-cases including imputation of missing data, hyperparameter tuning, model training and independent validation. We show that predictions are optimal for use in downstream applications such as genetic discovery, biomarker interpretation, and personalised choice of medication. We demonstrate the efficiency of an ensemble configuration, including focused training of a deep learning model. We have optimised a pipeline for imputation of missing data in combination with time-to-event prediction models. Overall, we provide a powerful and accessible tool to develop, access and share time-to-event prediction models; all software and tutorials are available at www.predictte.org.
Authors: Johnathan Cooper-Knock, M. Weinreich, H. McDonough, N. Yacovzada, I. Magen, Y. Cohen, C. Harvey, S. Gornall, S. Boddy, J. Alix, N. Mohseni, J. Kurz, K. Kenna, S. Zhang, A. Iacoangeli, A. Al-Khleifat, M. Snyder, E. Hobson, A. Al-Chalabi, E. Hornstein, E. Elhaik, P. Shaw, C. McDermott
Last Update: 2024-07-23 00:00:00
Language: English
Source URL: https://www.biorxiv.org/content/10.1101/2024.07.20.604416
Source PDF: https://www.biorxiv.org/content/10.1101/2024.07.20.604416.full.pdf
Licence: https://creativecommons.org/licenses/by/4.0/
Changes: This summary was created with assistance from AI and may have inaccuracies. For accurate information, please refer to the original source documents linked here.
Thank you to biorxiv for use of its open access interoperability.