Simple Science

Cutting edge science explained simply

# Health Sciences# Genetic and Genomic Medicine

Insights into Genetic Variants and Disease Risk

Research reveals how genetic differences influence health through gene expression and splicing.

― 5 min read


Genetics and DiseaseGenetics and DiseaseLinksgenetic variants and health outcomes.Study reveals connections between
Table of Contents

Many genetic differences tied to common diseases are found in parts of our DNA that do not code for proteins. This makes it hard to see how these genetic differences affect health. One way to find out more is through a method called molecular quantitative trait locus (QTL) mapping. This method helps us find the genes and molecules that may be involved in how these genetic differences influence disease risk. By doing this, we can create specific tests to see how these genes work.

Applications of Molecular QTL Data

The data gathered from molecular QTL studies can be useful in many areas of medicine. For instance, it can help us find new targets for drugs, understand how drugs work and their potential side effects, identify new uses for existing drugs, and uncover important health markers.

Previous Research on QTL Mapping

Most past research on QTL mapping has focused on one type of molecule, either gene expression or proteins. To fully grasp how genetic differences lead to various traits, we need to look at data from different types of molecules together. Having multiple types of data from the same group of people makes it easier to analyze and validate our findings. For example, using mediation analysis helps us see how different traits may be connected through shared genetic pathways.

The INTERVAL Study

In this study, we used data from the INTERVAL project, a large resource of information from around 50,000 blood donors. This study collected detailed information about various biological molecules. We focused on gene expression and Splicing QTLs using RNA-sequencing data from over 4,700 individuals. We also combined this data with other molecular information collected from the same group, like protein levels and metabolites.

Findings on Gene Expression and Splicing

Through RNA-sequencing of blood samples, we looked at the expression levels of over 19,000 genes and more than 111,000 splicing events in those genes. We identified many genes with significant connections to their expression levels, which we defined as "cis-eGenes." This means that some genetic differences were found to affect the expression of these genes directly.

We discovered a substantial number of genes that had significant connections to splicing events, defined as "cis-sGenes." Some of these genes were new discoveries not found in other studies. This shows how important it is to examine gene expression and splicing events together.

Investigation of Genetic Effects

We further explored how genetic differences influence gene expression and splicing. We found a lot of shared genetic effects between different traits, which we assessed using statistical tools. We also looked for how these genetic differences might affect other traits down the line through mediation analysis.

Genetic Effects on Distal Gene Expression

Next, we investigated the impacts of genetic variants on genes located farther away from the site where the variant occurs, which we call "Trans-regulation." This analysis helps us understand how genetic variants may influence other genes indirectly.

We identified many genes that were significantly associated with trans-regulation, providing insight into how certain variants can impact multiple genes simultaneously. We also discovered that these genes were often linked to common functions, such as regulating how genes are expressed or how the body responds to outside factors.

Shared Genetic Connections

We compared the transcriptional QTLs we identified to other types of molecular QTLs from the same study participants. Through colocalization analyses, we looked for connections between genetic signals across different types of molecules, including proteins and metabolites. This revealed a significant number of shared genetic effects across different molecular traits.

Understanding Splicing's Role

Splicing, which involves removing certain parts of RNA, appears to play a crucial role in how genetic variants impact protein levels in the body. We found that many splicing events affected the production of proteins by changing which parts of the RNA are included or discarded.

For example, the splicing of certain genes affects the production of protein forms that can circulate in the blood. This adds another layer of complexity to how genetic differences can influence health.

Assessing the Impact of Genetic Variants on Health

To further understand how genetic variants contribute to health, we looked at genetic signals associated with various health conditions. We found that many of these signals overlapped with the genes and splicing events we identified in our analyses. By examining these overlaps, we could see potential pathways through which genetic differences might influence disease risk.

For example, we found connections between splicing variations in the interleukin-7 receptor gene and conditions such as eczema. This suggests that understanding splicing could reveal new ways genetics plays a role in disease.

COVID-19 and Genetic Factors

Given the widespread impact of COVID-19, we assessed how our findings related to genetic factors influencing susceptibility and severity of this disease. We found several connections between the genetic variations associated with COVID-19 and the genes and splicing events we studied. This indicates that our work could shed light on the underlying genetics of disease susceptibility.

Importance of Non-Protein-Coding Variants

Most of the genetic differences linked to common traits are found in non-coding regions of DNA. This reinforces the importance of studying these areas to understand how they contribute to disease risk. While we have made significant strides, more comprehensive molecular data is needed to define how these variants operate and interact.

Conclusions and Future Directions

Our findings have broad implications for understanding the genetic basis of complex diseases. By mapping out the genetic links between traits and health outcomes through systematic analysis, we can build a clearer picture of how genetics impacts health.

There is still much work to be done. Collecting more data from diverse populations and utilizing advanced analytical techniques will be crucial in expanding our understanding of genetic variants and their roles in health and disease.

This research serves as a foundation for future explorations into the genetic networks that influence human health, opening up potential avenues for new treatments and interventions. The findings and data presented here are expected to be invaluable for ongoing research efforts in genetics and molecular biology.

Original Source

Title: Genetic determinants of blood gene expression and splicing and their contribution to molecular phenotypes and health outcomes

Abstract: The biological mechanisms through which most non-protein-coding genetic variants affect disease risk are unknown. To investigate the gene-regulatory cascades that ensue from these variants, we mapped blood gene expression and splicing quantitative trait loci (QTLs) through bulk RNA-sequencing in 4,732 participants, and integrated these data with protein, metabolite and lipid QTLs in the same individuals. We identified cis-QTLs for the expression of 17,233 genes and 29,514 splicing events (in 6,853 genes). Using colocalization analysis, we identified 3,430 proteomic and metabolomic traits with a shared association signal with either gene expression or splicing. We quantified the relative contribution of the genetic effects at loci with shared etiology through statistical mediation, observing 222 molecular phenotypes significantly mediated by gene expression or splicing. We uncovered gene-regulatory mechanisms at GWAS disease loci with therapeutic implications, such as WARS1 in hypertension, IL7R in dermatitis and IFNAR2 in COVID-19. Our study provides an open-access and interactive resource of the shared genetic etiology across transcriptional phenotypes, molecular traits and health outcomes in humans (https://IntervalRNA.org.uk).

Authors: Dirk S. Paul, A. Tokolyi, E. Persyn, A. P. Nath, K. L. Burnham, J. Marten, T. Vanderstichele, M. Tardaguila, D. Stacey, B. Farr, V. Iyer, X. Jiang, S. A. Lambert, G. Noell, M. A. Quail, D. Rajan, S. C. Ritchie, B. B. Sun, S. A. J. Thurston, Y. Xu, C. D. Whelan, H. Runz, S. Petrovski, D. J. Gaffney, D. J. Roberts, E. Di Angelantonio, J. E. Peters, N. Soranzo, J. Danesh, A. S. Butterworth, M. Inouye, E. E. Davenport

Last Update: 2023-11-27 00:00:00

Language: English

Source URL: https://www.medrxiv.org/content/10.1101/2023.11.25.23299014

Source PDF: https://www.medrxiv.org/content/10.1101/2023.11.25.23299014.full.pdf

Licence: https://creativecommons.org/licenses/by/4.0/

Changes: This summary was created with assistance from AI and may have inaccuracies. For accurate information, please refer to the original source documents linked here.

Thank you to medrxiv for use of its open access interoperability.

More from authors

Similar Articles