New Methods in Genetic Heritability Research
A look into innovative techniques for measuring heritability in complex traits.
― 6 min read
Table of Contents
Heritability is a way to measure how much of the differences seen in a specific trait, like height or personality, can be explained by genetics. It looks at the genetic differences between individuals and how these differences relate to the traits we can observe. Traditionally, researchers used family studies to estimate heritability, but these studies often involved small groups of related individuals. Now, thanks to advances in genetic research, scientists can analyze much larger datasets, including unrelated individuals, to get a clearer picture of heritability.
Genetic Research Advances
With new methods in genetic research, particularly in Genome-wide Association Studies (GWAS), researchers can gather summary statistics that help estimate heritability without needing to analyze individual genetic data. GWAS examines many genetic variants across different individuals to find associations with specific traits. By looking at how these genetic variants influence traits, researchers can better understand how much of a trait's variation is due to genetics versus other factors.
This new approach means that many traits can be linked to a large number of genetic variants. In fact, it's common for thousands of genetic loci to contribute to the variation seen in a single trait. Researchers have developed various statistical methods to improve how we estimate heritability from the results of GWAS. One of the most popular methods is called linkage disequilibrium (LD) score regression, which helps correct for errors in the data and gives a clearer estimate of heritability.
Limitations of Current Methods
While LD score regression is widely used, it has limitations. The method primarily focuses on specific genetic correlations and may miss important information captured in the data. Furthermore, it only considers certain aspects of the genetic architecture of traits. This means that some of the genetic effects that contribute to traits could be overlooked, thus underestimating heritability.
Recent approaches have started to address these gaps by incorporating additional information. For instance, researchers have begun including more complex genetic Interactions, particularly interactions between genes, when estimating heritability. This is important because many traits do not rely solely on additive genetic effects, but also on how different genes interact with one another.
Introducing Interaction-LD Score Regression
One of the latest developments in this field is a new method called interaction-LD score regression (i-LDSC). This method builds on the original LD score regression by adding a way to account for interactions between genetic variants. The idea is that certain genetic interactions can significantly impact how traits are expressed. By including these interactions, i-LDSC aims to provide a more accurate estimate of heritability.
The i-LDSC method works by identifying specific pairs of genetic variants that interact with each other. This allows researchers to understand not just the individual effects of each variant, but also how they work together. For example, two variants might each have a small effect on a trait, but when combined, they could lead to a much larger effect.
How i-LDSC Works
To use i-LDSC, researchers first gather genetic data and create a model to estimate the contributions from both additive effects and interactions. They still rely on the summary statistics from GWAS, but they also compute a new set of scores that represent these interactions. By doing so, they can recover some of the heritability that might otherwise be overlooked by traditional methods.
In simulation studies, i-LDSC has shown that it can effectively capture the Non-additive genetic variance that previous models may miss. This means that i-LDSC can provide clearer insights into the genetic basis of traits, revealing how much more complex the genetic architecture truly is.
Simulation Studies and Results
To demonstrate the effectiveness of i-LDSC, researchers ran various simulations. These simulations involved creating synthetic traits using real genetic data from a diverse population. Different scenarios were tested, including variations in heritability and the proportions of genetic interaction effects. The results showed that i-LDSC could robustly detect significant non-additive genetic variance across many different setups.
Importantly, when the genetic data was generated solely with additive effects, i-LDSC still performed well. It managed to accurately estimate heritability without falsely identifying interaction effects. In contrast, traditional methods like LD score regression often failed to capture the full complexity of genetic architectures. This highlights i-LDSC's strength in recovering genetic contributions that would otherwise be missed.
Application to Real Data
The researchers also applied the i-LDSC framework to real-world data from large biobanks, examining traits such as height, blood pressure, and cholesterol levels. They found that many significant genetic interactions were indeed present, which previous methods had overlooked. In the UK Biobank and BioBank Japan studies, most traits analyzed showed strong evidence of interaction effects contributing to heritability.
By using i-LDSC, researchers could report higher heritability estimates for these traits compared to what was previously established. This not only underscores the importance of genetic interactions but also suggests that the genetic basis of many traits is far more complex than earlier models suggested.
Comparing i-LDSC and LD Score Regression
When comparing i-LDSC to traditional LD score regression, it became clear that i-LDSC provided a more holistic view of genetic contributions to traits. While LD score regression focuses mainly on additive effects, i-LDSC encompasses both additive and interaction effects. This means that i-LDSC can detect a greater amount of the variance explained by genetics, giving a better understanding of how complex traits develop.
The researchers also highlighted that the estimates obtained through i-LDSC were strongly correlated with those from LD score regression, indicating that both methods are exploring similar information. However, i-LDSC consistently captured additional contributions from interaction effects that were absent in LD score regression.
Future Directions
There are many ways to further develop the i-LDSC method and its applications. One area of interest is exploring how interaction scores relate to overall heritability estimates. Researchers have also noted that while i-LDSC focuses on pairwise interactions, there are opportunities to adapt the framework for other types of genetic interactions. Expanding the method to include different genetic contexts could provide even deeper insights into the genetics of complex traits.
Furthermore, i-LDSC could be combined with other models, like stratified LD score regression, to refine heritability estimates even more. By integrating functional annotation groups, researchers could possibly unlock further information about the genetic basis of traits.
Finally, while i-LDSC was applied to single traits in the studies described, future efforts might benefit from evaluating multiple traits at once. This could enhance the power of genetic analysis and further our understanding of genetic correlations across different traits.
Conclusion
The introduction of the i-LDSC framework marks an important step in genetic research. By considering non-additive effects and interactions between genetic variants, i-LDSC provides a more complete picture of how genetics influence complex traits. Its application to both simulated and real data demonstrates its ability to recover "missing" heritability and deepen our understanding of the genetic architecture underlying various traits.
As research in genetics continues to advance, tools like i-LDSC will be vital for uncovering the complexities of how traits are inherited and expressed, helping to pave the way for more precise medical and health-related applications. With ongoing development and collaboration, the future holds promise for much more detailed insights into the genetics of human traits and diseases.
Title: Discovering non-additive heritability using additive GWAS summary statistics
Abstract: LD score regression (LDSC) is a method to estimate narrow-sense heritability from genome-wide association study (GWAS) summary statistics alone, making it a fast and popular approach. In this work, we present interaction-LD score (i-LDSC) regression: an extension of the original LDSC framework that accounts for interactions between genetic variants. By studying a wide range of generative models in simulations, and by re-analyzing 25 well-studied quantitative phenotypes from 349,468 individuals in the UK Biobank and up to 159,095 individuals in BioBank Japan, we show that the inclusion of a cis-interaction score (i.e., interactions between a focal variant and proximal variants) recovers genetic variance that is not captured by LDSC. For each of the 25 traits analyzed in the UK Biobank and BioBank Japan, i-LDSC detects additional variation contributed by genetic interactions. The i-LDSC software and its application to these biobanks represent a step towards resolving further genetic contributions of sources of non-additive genetic effects to complex trait variation.
Authors: Lorin Crawford, S. P. Smith, G. Darnell, D. Udwin, J. Stamp, A. Harpak, S. Ramachandran
Last Update: 2024-04-15 00:00:00
Language: English
Source URL: https://www.biorxiv.org/content/10.1101/2022.07.21.501001
Source PDF: https://www.biorxiv.org/content/10.1101/2022.07.21.501001.full.pdf
Licence: https://creativecommons.org/licenses/by-nc/4.0/
Changes: This summary was created with assistance from AI and may have inaccuracies. For accurate information, please refer to the original source documents linked here.
Thank you to biorxiv for use of its open access interoperability.
Reference Links
- https://github.com/lcrawlab/i-LDSC
- https://github.com/bulik/ldsc/
- https://www.ukbiobank.ac.uk
- https://jenger.riken.jp/en/result
- https://mathgen.stats.ox.ac.uk/impute/data_download_1000G_phase1_integrated.html
- https://www.ncbi.nlm.nih.gov/gap
- https://www.ebi.ac.uk/gwas/
- https://github.com/arminschoech/GRM-MAF-LD
- https://yanglab.westlake.edu.cn/software/gcta/