Commentary | Open | Published:
Candidate gene-environment interactions in breast cancer
BMC Medicinevolume 12, Article number: 195 (2014)
Gene-environment interactions have the potential to shed light on biological processes leading to disease, identify individuals for whom risk factors are most relevant, and improve the accuracy of epidemiological risk models. We review the progress that has been made in investigating gene-environment interactions in the field of breast cancer. Although several large-scale analyses have been carried out, only a few significant interactions have been reported. One of these, an interaction between CASP8-rs1045485 and alcohol consumption has been replicated, but others have not, including LSP1- rs3817198 and parity, and 1p11.2-rs11249433 and ever being parous. False positive interactions may arise if the gene and environment are correlated and the causal variant is less frequent than the tag SNP. We conclude that while much progress has been made in this area it is still too soon to tell whether gene-environment interactions will fulfil their promise. Before we can make this assessment we will need to replicate (or refute) the reported interactions, identify the causal variants that underlie tag-SNP associations and validate the next generation of epidemiological risk models.
Epidemiological studies have provided consistent evidence of associations between environmental (predominantly lifestyle and reproductive) factors and subsequent risk of breast cancer (BC). More recently, genome-wide association studies (GWAS) have identified more than 70 single nucleotide polymorphisms (SNPs) that influence breast cancer risk . Detecting a gene-environment (GxE) interaction between a SNP and an environmental risk factor has the potential to shed light on the biological process leading to disease, identify women for whom these risk factors are most relevant, and improve the accuracy of epidemiological risk models . A comprehensive review summarising the rationale for and the challenges of studying GxE interactions advocated a range of measures including supporting new and larger prospective studies, the reporting of stratified analyses as supplementary material and pre-planned analyses coordinated across multiple studies . In this commentary we review progress in investigating GxE interactions in the field of BC. We define GxE interaction as the modification of the effect of a genetic risk factor by an environmental factor, assessed statistically by testing the effects of gene and environment for departure from additivity, on an appropriate scale (usually the log or logit in disease studies). We focus on GxE interactions between common SNPs and established risk factors for BC (Table 1), discuss the implications of testing marker SNPs rather than the underlying causal variants that they tag and consider whether GxE studies have fulfilled their potential for illuminating disease processes or predicting risk.
GxE interactions between previously reported SNPs and established risk factors for BC
The first large (that is, at least 5,000 cases and 5,000 controls) GxE study of this type was carried out within the Million Women Study . In this analysis of 7,610 cases and 10,196 controls investigating potential GxE interactions between 12 SNPs and 10 established risk factors for BC there were no GxE interactions that were significant after adjusting for multiple testing. The most significant GxE interaction was between CASP8-rs1045485 and alcohol consumption (unadjusted P = 0.003). Since the publication of this report, there have been four further analyses of this type (Table 2), two from the Breast Cancer Association Consortium (BCAC) , and two from the Breast and Prostate Cancer Cohort Consortium (BPC3) ,. Only one of these, the largest (23 SNPs in 34,793 cases and 41,099 controls) , reported statistically significant GxE interactions, namely between LSP1-rs3817198 and parity (number of live births), CASP8-rs1045485 and alcohol consumption (replicating the most significant finding in the Million Women study ) and 1p11.2-rs11249433 and ever being parous. However, none of these interactions was replicated in the largest BPC3 study (39 SNPs in 16,285 BC cases and 19,376 controls ). A meta-analysis of the BCAC and BPC3 data suggested a possible interaction between SLC4A7-rs4973768 and smoking status but replication of this result has not yet been attempted.
The Shanghai Breast Cancer Genetics Study tested for interactions using a risk score formed as the weighted sum of genotypes from 10 SNPs . This would improve the power to detect a risk factor that has interactions with numerous SNPs, when there is insufficient power for the individual interactions. Although this study found no interactions with the risk score, this approach holds promise for identifying interacting risk factors in limited sample sizes.
Identification of novel risk SNPs through GxE interactions
SNPs with strong interaction effects may only be detectable when analysing gene and environment together, so they are missed by studies that consider SNPs in isolation. Methods that model and test the main and interaction effects of gene and environment jointly , or exploit the power of a case-only design while retaining robustness to possible gene environment dependence , have been developed for these purposes. Recently, several of these methods were applied to 71,527 SNPs with suggestive association with BC . Interactions were identified between two SNPs on 21q22.12 (rs10483028 and rs2242714) and adult body mass index (BMI), and one in ARID1B (rs12197388) with age at menarche and with parity. rs12197388 was only significant in the joint test of main and interaction effects, and the interaction term was not significant but the two SNPs on 21q22.12 were detected via their interactions, and further studies of this nature may discover more interactions using these novel methods.
Using tag-SNPs as proxies for an underlying causal variant
The GxE studies described above have relied on using marker SNPs, predominantly identified through GWAS, as proxies for the underlying causal variants. This usually leads to a loss in power to detect interactions . However, if gene and environment are dependent, a marker SNP can show an interaction even if there is no interaction at the causal variant . These `spurious interactions’ tend to arise when the causal variant is rare in comparison to the marker. This may not often be the case, but it nevertheless warrants caution when reporting GxE interactions. We recently studied a marker SNP (rs10235235) associated with a reduction in urinary levels of an estrogen metabolite . In 47,346 cases and 47,569 controls in the Collaborative Oncological Gene-environment Study (COGS) , this SNP showed (1) association with BC risk, (2) association with age at menarche in controls (but not cases) and (3) an interaction in which age at menarche modified the effect of rs10235235 on BC risk. In this example of a GxE interaction, therefore, the genetic risk factor (rs10235235) is dependent on the environmental risk factor (age at menarche), which could lead to a false positive . Of the interactions reported to date, gene-environment dependence has been observed between LSP1-rs3817198 and parity and 21q22.12-rs10483208/rs2242714 and BMI. In cases such as these, an interaction can only be definitively established when all variation in the associated regions has been identified and tested.
Several of the recommendations made by Hunter in 2005  have been pursued: large new prospective studies continue to be supported (for example the Breakthrough Generations study, a long-term cohort study focused on BC has recruited 112,049 women over the period 2003 to 2011 ), consortia of case-control (BCAC) and cohort studies (BPC3) have coordinated their efforts for analyses of data from >70,000 women and the results of stratified analyses have been conscientiously reported in supplementary tables ,. However, one of the lessons of the first generation of BC GWAS - was that the per-allele disease odds ratios (ORs) associated with individual tag-SNPs were much smaller than hypothesised (1.07 to 1.26). Results from the first generation of GxE analyses suggest that the same may be true for interactions, with the reported interaction ORs ranging from 1.06 to 1.59. If marginal ORs of 1.07 to 1.26 require scans of several thousand cases and several thousand controls then, depending on the number of GxE interactions being tested, only GxE studies that include tens of thousands of cases and controls will have the power required to detect interactions. It is hardly a coincidence that the first study to report statistically significant GxE interactions was the first study of this order of magnitude . Of the three significant interactions reported by Nickels and colleagues there is replication only for CASP8-rs1045485 and alcohol consumption. It is currently too soon to tell whether GxE interactions will shed light on disease processes and improve the accuracy of epidemiological risk models. Before we can make this assessment we will need to replicate or refute the reported interactions, identify the causal variants that underlie tag-SNP associations and validate the next generation of epidemiological risk models.
OF and FD wrote this commentary jointly. Both authors approved the final version.
OF is a group leader in genetic epidemiology at the Breakthrough Breast Cancer Research Centre. FD is professor of statistical genetics at the London School of Hygiene and Tropical Medicine.
the Breast Cancer Association Consortium
body mass index
the Breast and Prostate Cancer Cohort Consortium
Collaborative Oncological Gene-environment Study
established risk factor
genome-wide association study
- GxE interaction:
hormone replacement therapy
single nucleotide polymorphism
Michailidou K, Hall P, Gonzalez-Neira A, Ghoussaini M, Dennis J, Milne RL, Schmidt MK, Chang-Claude J, Bojesen SE, Bolla MK, Wang Q, Dicks E, Lee A, Turnbull C, Rahman N, Fletcher O, Peto J, Gibson L, Dos Santos Silva I, Nevanlinna H, Muranen TA, Aittomäki K, Blomqvist C, Czene K, Irwanto A, Liu J, Waisfisz Q, Meijers-Heijboer H, Adank M, et al: Large-scale genotyping identifies 41 new loci associated with breast cancer risk. Nat Genet. 2013, 45: 353-361. 10.1038/ng.2563. 361e351-352
Hunter DJ: Gene-environment interactions in human diseases. Nat Rev Genet. 2005, 6: 287-298. 10.1038/nrg1578.
Travis RC, Reeves GK, Green J, Bull D, Tipper SJ, Baker K, Beral V, Peto R, Bell J, Zelenika D, Lathrop M: Gene-environment interactions in 7610 women with breast cancer: prospective evidence from the Million Women Study. Lancet. 2010, 375: 2143-2151. 10.1016/S0140-6736(10)60636-8.
Milne RL, Gaudet MM, Spurdle AB, Fasching PA, Couch FJ, Benitez J, Arias Perez JI, Zamora MP, Malats N, Dos Santos SI, Gibson LJ, Fletcher O, Johnson N, Anton-Culver H, Ziogas A, Figueroa J, Brinton L, Sherman ME, Lissowska J, Hopper JL, Dite GS, Apicella C, Southey MC, Sigurdson AJ, Linet MS, Schonfeld SJ, Freedman DM, Mannermaa A, Kosma VM, Kataja V, et al: Assessing interactions between the associations of common genetic susceptibility variants, reproductive history and body mass index with breast cancer risk in the breast cancer association consortium: a combined case-control study. Breast Cancer Res. 2010, 12: R110-10.1186/bcr2797.
Nickels S, Truong T, Hein R, Stevens K, Buck K, Behrens S, Eilber U, Schmidt M, Haberle L, Vrieling A, Gaudet M, Figueroa J, Schoof N, Spurdle AB, Rudolph A, Fasching PA, Hopper JL, Makalic E, Schmidt DF, Southey MC, Beckmann MW, Ekici AB, Fletcher O, Gibson L, Silva Idos S, Peto J, Humphreys MK, Wang J, Cordina-Duverger E, Menegaux F, et al: Evidence of gene-environment interactions between common breast cancer susceptibility loci and established environmental risk factors. PLoS Genet. 2013, 9: e1003284-10.1371/journal.pgen.1003284.
Campa D, Kaaks R, Le Marchand L, Haiman CA, Travis RC, Berg CD, Buring JE, Chanock SJ, Diver WR, Dostal L, Fournier A, Hankinson SE, Henderson BE, Hoover RN, Isaacs C, Johansson M, Kolonel LN, Kraft P, Lee IM, McCarty CA, Overvad K, Panico S, Peeters PH, Riboli E, Sanchez MJ, Schumacher FR, Skeie G, Stram DO, Thun MJ, Trichopoulos D, et al: Interactions between genetic variants and breast cancer risk factors in the breast and prostate cancer cohort consortium. J Natl Cancer Inst. 2011, 103: 1252-1263. 10.1093/jnci/djr265.
Barrdahl M, Canzian F, Joshi AD, Travis RC, Chang-Claude J, Auer PL, Gapstur SM, Gaudet M, Diver WR, Henderson BE, Haiman CA, Schumacher FR, Le Marchand L, Berg CD, Chanock SJ, Hoover RN, Rudolph A, Ziegler RG, Giles GG, Baglietto L, Severi G, Hankinson SE, Lindström S, Willet W, Hunter DJ, Buring JE, Lee IM, Zhang S, Dossus L, Cox DG, et al: Post-GWAS gene-environment interplay in breast cancer: results from the Breast and Prostate Cancer Cohort Consortium and a meta-analysis on 79 000 women. Hum Mol Genet. 2014, 23: 5260-5270. 10.1093/hmg/ddu223.
Li H, Beeghly-Fadiel A, Wen W, Lu W, Gao YT, Xiang YB, Cai Q, Long J, Shi J, Chen K, Zheng Y, Shu XO, Zheng W: Gene-environment interactions for breast cancer risk among Chinese women: a report from the Shanghai Breast Cancer Genetics Study. Am J Epidemiol. 2013, 177: 161-170. 10.1093/aje/kws238.
Kraft P, Yen YC, Stram DO, Morrison J, Gauderman WJ: Exploiting gene-environment interaction to detect genetic associations. Hum Hered. 2007, 63: 111-119. 10.1159/000099183.
Gauderman WJ, Zhang P, Morrison JL, Lewinger JP: Finding novel genes by testing G x E interactions in a genome-wide association study. Genet Epidemiol. 2013, 37: 603-613. 10.1002/gepi.21748.
Dai JY, Kooperberg C, Leblanc M, Prentice RL: Two-stage testing procedures with independent filtering for genome-wide gene-environment interaction. Biometrika. 2012, 99: 929-944. 10.1093/biomet/ass044.
Schoeps A, Rudolph A, Seibold P, Dunning AM, Milne RL, Bojesen SE, Swerdlow A, Andrulis I, Brenner H, Behrens S, Orr N, Jones M, Ashworth A, Li J, Cramp H, Connley D, Czene K, Darabi H, Chanock SJ, Lissowska J, Figueroa JD, Knight J, Glendon G, Mulligan AM, Dumont M, Severi G, Baglietto L, Olson J, Vachon C, Purrington K, et al: Identification of new genetic susceptibility loci for breast cancer through consideration of gene-environment interactions. Genet Epidemiol. 2014, 38: 84-93. 10.1002/gepi.21771.
Hein R, Beckmann L, Chang-Claude J: Sample size requirements for indirect association studies of gene-environment interactions (G x E). Genet Epidemiol. 2008, 32: 235-245. 10.1002/gepi.20298.
Dudbridge F, Fletcher O: Gene-environment dependence creates spurious gene-environment interaction. Am J Hum Genet. 2014, 95: 301-307. 10.1016/j.ajhg.2014.07.014.
Johnson N, Walker K, Gibson LJ, Orr N, Folkerd E, Haynes B, Palles C, Coupland B, Schoemaker M, Jones M, Broderick P, Sawyer E, Kerin M, Tomlinson IP, Zvelebil M, Chilcott-Burns S, Tomczyk K, Simpson G, Williamson J, Hillier SG, Ross G, Houlston RS, Swerdlow A, Ashworth A, Dowsett M, Peto J, Dos Santos SI, Fletcher O: CYP3A variation, premenopausal estrone levels, and breast cancer risk. J Natl Cancer Inst. 2012, 104: 657-669. 10.1093/jnci/djs156.
Johnson N, Dudbridge F, Orr N, Gibson L, Jones ME, Schoemaker MJ, Folkerd EJ, Haynes BP, Hopper JL, Southey MC, Dite GS, Apicella C, Schmidt MK, Broeks A, Van T, Veer LJ, Atsma F, Muir K, Lophatananon A, Fasching PA, Beckmann MW, Ekici AB, Renner SP, Sawyer E, Tomlinson I, Kerin M, Miller N, Burwinkel B, Marme F, Schneeweiss A, et al: Genetic variation at CYP3A is associated with age at menarche and breast cancer risk: a case-control study. Breast Cancer Res. 2014, 16: R51-10.1186/bcr3662.
Swerdlow AJ, Jones ME, Schoemaker MJ, Hemming J, Thomas D, Williamson J, Ashworth A: The Breakthrough Generations Study: design of a long-term UK cohort study to investigate breast cancer aetiology. Br J Cancer. 2011, 105: 911-917. 10.1038/bjc.2011.337.
Easton DF, Pooley KA, Dunning AM, Pharoah PD, Thompson D, Ballinger DG, Struewing JP, Morrison J, Field H, Luben R, Wareham N, Ahmed S, Healey CS, Bowman R, Meyer KB, Haiman CA, Kolonel LK, Henderson BE, Le Marchand L, Brennan P, Sangrajrang S, Gaborieau V, Odefrey F, Shen CY, Wu PE, Wang HC, Eccles D, Evans DG, Peto J, et al: Genome-wide association study identifies novel breast cancer susceptibility loci. Nature. 2007, 447: 1087-1093. 10.1038/nature05887.
Hunter DJ, Kraft P, Jacobs KB, Cox DG, Yeager M, Hankinson SE, Wacholder S, Wang Z, Welch R, Hutchinson A, Wang J, Yu K, Chatterjee N, Orr N, Willett WC, Colditz GA, Ziegler RG, Berg CD, Buys SS, McCarty CA, Feigelson HS, Calle EE, Thun MJ, Hayes RB, Tucker M, Gerhard DS, Fraumeni JF, Hoover RN, Thomas G, Chanock SJ: A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer. Nat Genet. 2007, 39: 870-874. 10.1038/ng2075.
Stacey SN, Manolescu A, Sulem P, Rafnar T, Gudmundsson J, Gudjonsson SA, Masson G, Jakobsdottir M, Thorlacius S, Helgason A, Aben KK, Strobbe LJ, Albers-Akkers MT, Swinkels DW, Henderson BE, Kolonel LN, Le Marchand L, Millastre E, Andres R, Godino J, Garcia-Prats MD, Polo E, Tres A, Mouy M, Saemundsdottir J, Backman VM, Gudmundsson L, Kristjansson K, Bergthorsson JT, Kostic J, et al: Common variants on chromosomes 2q35 and 16q12 confer susceptibility to estrogen receptor-positive breast cancer. Nat Genet. 2007, 39: 865-869. 10.1038/ng2064.
We acknowledge support from Breakthrough Breast Cancer (OF) and the MRC (G1000718 and K006215; FD). The funding bodies had no role in the preparation of this manuscript.
The authors declare that they have no competing interests.