A versatile, fast and unbiased method for estimation of gene-by-environment interaction effects on biobank-scale datasets Journal Articles uri icon

  •  
  • Overview
  •  
  • Research
  •  
  • Identity
  •  
  • Additional Document Info
  •  
  • View All
  •  

abstract

  • AbstractIdentification of gene-by-environment interactions (GxE) is crucial to understand the interplay of environmental effects on complex traits. However, current methods evaluating GxE on biobank-scale datasets have limitations. We introduce MonsterLM, a multiple linear regression method that does not rely on model specification and provides unbiased estimates of variance explained by GxE. We demonstrate robustness of MonsterLM through comprehensive genome-wide simulations using real genetic data from 325,989 individuals. We estimate GxE using waist-to-hip-ratio, smoking, and exercise as the environmental variables on 13 outcomes (Nā€‰=ā€‰297,529-325,989) in the UK Biobank. GxE variance is significant for 8 environment-outcome pairs, ranging from 0.009 ā€“ 0.071. The majority of GxE variance involves SNPs without strong marginal or interaction associations. We observe modest improvements in polygenic score prediction when incorporating GxE. Our results imply a significant contribution of GxE to complex trait variance and we show MonsterLM to be well-purposed to handle this with biobank-scale data.

authors

  • DiScipio, Matteo
  • Di Scipio, Matteo
  • Khan, Mohammad
  • Mao, Shihong
  • Chong, Michael
  • Judge, Conor
  • Pathan, Nazia
  • Perrot, Nicolas
  • Nelson, Walter
  • Lali, Ricky
  • Di, Shuang
  • Morton, Robert
  • Petch, Jeremy
  • Pare, Guillaume

publication date

  • August 25, 2023