Research article | Open | Open Peer Review | Published:
Genomic aberrations in young and elderly breast cancer patients
BMC Medicinevolume 13, Article number: 266 (2015)
Age at breast cancer diagnosis is a known prognostic factor. Previously, several groups including ours have shown that young age at diagnosis is associated with higher prevalence of basal-like tumors and aggressive tumor phenotypes. Yet the impact of age at diagnosis on the genomic landscape of breast cancer remains unclear. In this study, we examined the pattern of somatic mutations, chromosomal copy number variations (CNVs) and transcriptomic profiles in young and elderly breast cancer patients.
Analyses were performed on The Cancer Genome Atlas (TCGA) dataset. Patients with metastatic disease at diagnosis, classified as normal-like by PAM50 or had missing clinical information were excluded. Young patients were defined as ≤45 years of age, while elderly patients were those ≥70 years of age at breast cancer diagnosis. The remaining patients were classified as “intermediate”. We evaluated the association between age at diagnosis and somatic mutations, CNV and gene expression in a logistic regression model adjusting for tumor size, nodal status, histology and breast cancer subtype. All analyses were corrected for multiple testing using the Benjamini–Hochberg approach.
In this study, 125, 486 and 169 patients were ≤45, 46–69 and ≥70 years of age, respectively. Older patients had more somatic mutations (n = 44 versus 35 versus 31; P = 0.0009) and more CNVs, especially in ductal tumors (P = 0.02). Eleven mutations were independently associated with age at diagnosis, of which only GATA3 was associated with young age (15.2 % versus 8.2 % versus 9 %; P = 0.003). Only two CNV events were independently associated with age, with more chr18p losses in older patients and more chr6q27 deletions in younger ones. Younger age at diagnosis was associated with higher expression of gene signatures related to proliferation, stem cell features and endocrine resistance.
Age adds a layer of biological complexity beyond breast cancer molecular subtypes, classic pathological and clinical variables, worthy of further consideration in future drug development as we seek to refine therapeutic strategies in the era of personalized medicine.
Young age at breast cancer diagnosis is a known poor prognostic factor [1, 2]. Previous studies have indicated higher prevalence of poorly differentiated, estrogen receptor (ER)-negative and human epidermal growth factor receptor 2 (HER2)-positive tumors in women diagnosed at a young age [3, 4]. Further genomic characterization has revealed enrichment with basal-like tumors [5, 6]. While these observations could well explain the poorer outcome of young breast cancer patients compared to their older counterparts, younger age remains an independent poor determinant of long-term outcome . This underscores the need to further refine our understanding of the impact of age on cancer biology, which could have relevant implications on patient management.
On the other hand, few data are available with respect to the biological features of tumors arising in the elderly. Currently, around 30–35 % of breast cancer patients are over 70 years of age at the time of diagnosis and this is expected to increase in the coming years . While these patients appear to develop relatively more “indolent” tumors characterized by high endocrine receptor expression , the late onset of these tumors may also suggest accumulation of several genomic aberrations over time, due to the stochastic nature of DNA damage in eukaryotic cells during the replication process. Acknowledging that morbidities other than cancer itself often contribute to mortality of older patients , it is very important to refine our understanding of the biology of these tumors in an attempt to optimize their management.
Previously, our group and others have published on the differences at the transcriptomic level according to age at diagnosis, investigating selected genes or pathways [5, 6, 10]. However, we lack studies that evaluate the differences at the DNA level. In the current study,we investigated for the first time the differences in somatic mutations and copy number variations (CNVs) between young and older breast cancer patients. In addition, we evaluated the expression of thousands of relevant genomic signatures at the RNA level.
All analyses were performed on The Cancer Genome Atlas (TCGA) publicly available dataset. Eligible patients were those with non-metastatic disease who had complete information on age at breast cancer diagnosis, tumor histology, tumor size and lymph node status. For each patient, we determined the breast cancer molecular subtype using PAM50 . PAM50 classes were determined from the TCGA RNA-Seq gene expression data using the genefu package of the R/Bioconductor statistical package. Samples of patients classified as normal-like were excluded, as they often represent an artifact due to limited tumor cellularity and a large background of normal breast cells in the sample .
Young patients were defined as ≤45 years of age, while elderly patients were defined as those ≥70 years of age at breast cancer diagnosis. The remaining patients were classified as “intermediate”. Since the TCGA dataset is publicly available, ethics committee approval was not needed. In addition, neither patient informed consent nor permission to use this data was required to perform this analysis.
We evaluated three parameters: 1) somatic mutations using exome sequencing; 2) somatic CNV; and 3) transcriptomic profiles. We downloaded the data from the TCGA online repository in February 2015.
In the current analysis, all somatic mutations were considered apart from those referred to as “silent” mutations. Somatic CNV was evaluated using array comparative genomic hybridization (CGH) data, available as pre-processed, publicly available information and not validated by any other methodology. Segmented data were used as input for Genomic Identification of Significant Targets in Cancer, version 2.0 (GISTIC 2.0) and version 6.2 on the Broad Institute GenePattern cloud server to obtain somatic focal and broad CNV events . These were then parsed in R. For focal events, only “high-level” focal amplification events, defined as log2 ratio >0.9 were retained, whereas focal losses were retained with log2 ratio >0.3 and with a Q value <0.25. Broad events, defined as arm-level events encompassing 98 % or more of a chromosome arm, were computed using GISTIC as well.
For transcriptomic profiling, we used the RNA sequencing data to evaluate differences in transcriptomic profiles according to age. Data were downloaded from the TCGA online repository and RNA-Seq absolute expression values were log2 transformed before performing the analyses.
The association between age groups, that is, young (≤45 years), intermediate (46–49 years) and elderly patients (≥70 years), with clinicopathological characteristics was evaluated using Pearson’s chi-squared test. The Kruskal–Wallis test was used to compare the number of mutations and CNVs according to age group. For mutations that were represented in at least 5 % in any age group, we evaluated their independent association with age at diagnosis (as a continuous variable) in a logistic regression model adjusting for tumor size (≤2 cm versus >2 cm), nodal status (negative versus positive), tumor histology (ductal versus lobular) and breast cancer subtype (luminal-A versus luminal-B versus HER2 versus basal). A similar model was used to evaluate the independent association between age, CNV and gene expression using the Molecular Signatures Database (MSigDB; PMID: 16199517). All analyses were corrected for multiple testing using the Benjamini–Hochberg approach .
A total of 780 patients from the TCGA dataset where included, of whom 125, 486 and 169 were ≤45, 46–69 and ≥70 years of age, respectively. Transcriptomic data was available for all patients, while 722 (92.5 %) and 713 (91.4 %) had available somatic mutation and CNV data, respectively.
Table 1 summarizes the main characteristics of patients. As expected, young patients had less lobular cancer (7 % versus 24 % versus 29 %; P <0.001), fewer node-negative tumors (38 % versus 49 % versus 49 %; P = 0.05) and a trend of more basal-like tumors (20 % versus 18 % versus 14 %; P = 0.16).
Somatic mutations according to age
We found a significant association between age at diagnosis and the prevalence of somatic mutations. Median number of somatic mutations in the young group was 31, compared to 35 and 44 in the intermediate and older patient groups, respectively (P value = 0.0009). Figure 1 shows the four most prevalent somatic mutations in the different age groups. PIK3CA and TP53 were the most common somatic mutations, constituting around 50–60 % of all mutations across the different age groups. The striking difference between the three age groups was for GATA3, which was the third most common somatic mutation in young patients, constituting 15.2 %, while TTN mutation was the third most frequent mutation in the intermediate (15.1 %) and older patient groups (29 %).
To evaluate the independent effect of age on the prevalence of somatic mutations, we performed a logistic regression analysis adjusted for tumor size, nodal status, histology and breast cancer molecular subtype. We found 11 mutations to be independently associated with age at diagnosis (Table 2). All were associated with older age at diagnosis, except GATA3, which was independently associated with breast cancer arising in young women (15.2 % versus 8.2 % versus 9 %; P = 0.003, false discovery rate (FDR) = 0.033).
Somatic CNV events according to age
We evaluated the prevalence of CNV events according to age. We found a tendency of higher focal and broad CNV in older patients (mean = 15), compared to 13.9 and 13.5 in the intermediate and younger age groups, respectively (P = 0.2). The differences were more apparent when restricting the analysis to patients with ductal carcinoma (mean CNV in older patients = 16.4 versus 14.9 in intermediate versus 13.8 in young patients; P = 0.05). In a logistic regression model, we found 13 CNV events to be independently associated with age (Fig. 2, Additional file 1). However, upon adjusting for multiple testing, only two CNV events maintained a P value <0.05: chr18p loss and chr6q27 deletion; the former was associated with tumors diagnosed in older patients, while the latter was more common in younger patients.
Gene expression differences according to age
We evaluated the association between age at diagnosis and the expression of 10,296 gene expression signatures. In a logistic regression model adjusted for tumor size, nodal status, histology and breast cancer molecular subtype, we found around 1,200 gene signatures to be independently associated with age at diagnosis (FDR <0.05), mainly in younger patients (Additional file 2). The main themes that emerged from this analysis are summarized in Table 3 and indicated higher expression of signatures related to proliferation, stem cell and endocrine resistance in tumors arising at young age.
This is the first analysis to explore the prevalence of somatic mutations and CNV according to age. Our findings indicate that age is associated with unique biological features at the DNA level, independent of tumor stage, histology and breast cancer molecular subtype. In addition, age at diagnosis appears to impact the tumor transcriptome confirming previous observations, but also highlighting novel findings. While previous studies provide ample information on the differences at the pathological level according to age [2, 15], this study provides further insights on differences at the genomic level as well. This is also in line with previous studies that showed changes in the normal breast at both the genomic and epigenetic level between young and older women, including changes in genes that are known to be relevant in breast carcinogenesis [16, 17]. Such evidence may suggest the need to explore treatment strategies in patients diagnosed at extremes of age based on their unique molecular makeup.
Different themes emerged from our analysis. First, older patients have more mutations and CNV events. This is likely a reflection of more genomic errors accumulated in the DNA as women age. We found that several somatic mutations were independently associated with older age at diagnosis. Of particular relevance, the high prevalence of KMT2D mutations. Since this gene was recently shown to be involved in tumor proliferation and cell migration , we speculate that KMT2D mutations may alter breast cancer behavior. Another finding is the high prevalence of FOXA1 mutations. The latter is required for ER-alpha as a cofactor for chromatin binding and constitutes a major proliferative and survival pathway for luminal-A tumors , which are common in older patients . Nevertheless, it is yet to be determined whether these mutations and/or others represent key driver mutations of tumors arising in older patients and the optimal way of targeting them.
On the other hand, GATA3 mutation was the main somatic event that characterized tumors arising at a younger age, which could have relevant clinical implications. GATA3 is an essential component of the ER complex and its mutations are likely to affect ER-regulated transcriptional activity [21, 22]. GATA3 directly upregulates ER-alpha and other proto-oncogenes suggesting that it may promote tumorigenesis in luminal cancer . Preclinical data indicate that mutations in GATA3 also affect ER binding to DNA [22, 24], modulate response of breast cancer cells to estrogen signaling , could promote tumor growth [21, 26] and could be associated with endocrine resistance . This is of extreme relevance, since the poor prognosis associated with younger age at diagnosis has been mainly observed in patients with ER-positive breast cancers [3, 5]. We could speculate that the higher prevalence of GATA3 mutations in these patients may render these patients more resistant to endocrine therapy. Our transcriptomic analyses also highlights the high expression of endocrine resistance signatures in younger patients, thus suggesting that endocrine resistance is an important hallmark of tumors arising in young women, worthy of further exploration. Of note, previous preclinical studies have shown that GATA expression (not mutation) results in reversal of the epithelial-mesenchymal transition (EMT) and induction of differentiation in basal-like tumors [27, 28]. Therefore, it is the loss of GATA3 expression that was suggested to contribute to the aggressiveness of basal-like tumors. Using our dataset, we found that GATA3 expression is higher in patients with GATA3 mutation (data not shown). These mutations were mostly exclusive in patients with ER-positive breast cancer. Thus, based on our findings, we cannot assume that the higher rate of GATA3 mutations observed in younger patients is linked to the known increased incidence of basal-like tumors in these patients.
CNVs are genomic events that are regarded as highly biologically relevant in breast cancer  and we found two events, more chr18p losses and chr6q27 deletions, to be independently associated with age at diagnosis. chr18p loss was more common in older patients and previous data indicated that it is associated with higher risk of recurrence . Of note, chr18 also harbors SMAD4, which is a known tumor suppressor gene and has been shown to be associated with poor prognosis in several tumor types when lost [31–33]. On the other hand, very little is known on its significance in breast cancer. A previous study showed that chromosome 6 is frequently rearranged in breast cancer, particularly at three regions, including 6q27 . In addition, chr6q27 deletion appears to be more prevalent in tumors with aggressive features . This may suggest that this region could harbor relevant tumor suppressor genes that may contribute to the aggressive nature of tumors arising in younger patients.
Another key point emerging from our study is the existence of relevant gene expression differences according to age. Previously, we showed that tumors arising in young women are enriched with stem cell-related genes . In addition, Pirone et al. have shown that pathways implicated in maintaining stem cell dynamics, Wnt/β-catenin and ephrin receptor signaling [35, 36] were differentially expressed in the normal breast between young and older women . The current analysis corroborates this association and suggests that targeting the stem cell component is a strategy that deserves exploration in young breast cancer patients. Currently, there are several drugs in development, such as Notch inhibitors that are known to target the stem cell compartment . Of note, in the current analysis, we found high expression of signatures related to Notch signaling pathways (Table 3) in young breast cancer patients, which may suggest the potential relevance of exploring such strategies in younger patients.
We recently initiated a preoperative window trial evaluating the role of targeting RANKL, a known stem cell regulator  and in which we have previously shown to be highly expressed in tumors arising at a young age [5, 39]. In this trial (D-BEYOND; NCT01864798), all patients are premenopausal and receive the anti-RANKL monoclonal antibody denosumab before surgery. The aim is to evaluate the impact of RANKL inhibition on several biological processes, including proliferation, stem cell markers, immune-related markers, and many others. The trial has recruited >50 % of its target accrual and represents a proof of concept that could open the door for designing future trials in women diagnosed at extremes of age, based on a better understanding of the biology of their tumors.
In conclusion, the present work shows that tumors arising at different ages are biologically distinct, not only at the protein level, as previously shown, but also at the RNA and DNA levels. This includes aberrations in relevant cancer-related genes. While current treatment decision-making is mainly based on tumor stage and breast cancer subtype, our analysis suggests that age adds a layer of biological complexity, worthy of investigating tailored therapeutic strategies in specific age groups. This could further result in refining therapeutic strategies as we embark on an era of personalized medicine.
Comparative genomic hybridization
Copy number variation
False discovery rate
- GISTIC 2.0:
Genomic Identification of Significant Targets in Cancer, version 2.0
Human epidermal growth factor receptor 2
Molecular Signatures Database
The Cancer Genome Atlas
Fredholm H, Eaker S, Frisell J, Holmberg L, Fredriksson I, Lindman H. Breast cancer in young women: poor survival despite intensive treatment. PLoS One. 2009;4, e7695.
Gnerlich JL, Deshpande AD, Jeffe DB, Sweet A, White N, Margenthaler JA. Elevated breast cancer mortality in women younger than age 40 years compared with older women is attributed to poorer survival in early-stage disease. J Am Coll Surg. 2009;208:341–7.
Cancello G, Maisonneuve P, Rotmensz N, Viale G, Mastropasqua MG, Pruneri G, et al. Prognosis and adjuvant treatment effects in selected breast cancer subtypes of very young women (<35 years) with operable breast cancer. Ann Oncol. 2010;21:1974–81.
Azim HA Jr, Partridge AH. Biology of breast cancer in young women. Breast Cancer Res. 2014;16:427.
Azim HA Jr, Michiels S, Bedard PL, Singhal SK, Criscitiello C, Ignatiadis M, et al. Elucidating prognosis and biology of breast cancer arising in young women using gene expression profiling. Clin Cancer Res. 2012;18:1341–51.
Anders CK, Fan C, Parker JS, Carey LA, Blackwell KL, Klauber-Demore N, et al. Breast carcinomas arising at a young age: unique biology or a surrogate for aggressive intrinsic subtypes? J Clin Oncol. 2011;29:e18–20.
Wildiers H. Issues in the adjuvant treatment of common tumors (with a focus on breast cancer) in older adults (age >70). Ann Oncol. 2012;23:x339–341.
Morrison DH, Rahardja D, King E, Peng Y, Sarode VR. Tumour biomarker expression relative to age and molecular subtypes of invasive breast cancer. Br J Cancer. 2012;107:382–7.
van de Water W, Markopoulos C, van de Velde CJ, Seynaeve C, Hasenburg A, Rea D, et al. Association between age at diagnosis and disease-specific mortality among postmenopausal women with hormone receptor-positive breast cancer. JAMA. 2012;307:590–7.
Johnson RH, Hu P, Fan C, Anders CK. Gene expression in “young adult type” breast cancer: a retrospective analysis. Oncotarget. 2015;6:13688–702.
Parker JS, Mullins M, Cheang MC, Leung S, Voduc D, Vickery T, et al. Supervised risk predictor of breast cancer based on intrinsic subtypes. J Clin Oncol. 2009;27:1160–7.
Bastien RR, Rodriguez-Lescure A, Ebbert MT, Prat A, Munarriz B, Rowe L, et al. PAM50 breast cancer subtyping by RT-qPCR and concordance with standard clinical molecular markers. BMC Med Genet. 2012;5:44.
Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc B. 1995;57:289–300.
El Saghir NS, Seoud M, Khalil MK, Charafeddine M, Salem ZK, Geara FB, et al. Effects of young age at presentation on survival in breast cancer. BMC Cancer. 2006;6:194.
Pirone JR, D'Arcy M, Stewart DA, Hines WC, Johnson M, Gould MN, et al. Age-associated gene expression in normal breast tissue mirrors qualitative age-at-incidence patterns for breast cancer. Cancer Epidemiol Biomarkers Prev. 2012;21:1735–44.
Johnson KC, Koestler DC, Cheng C, Christensen BC. Age-related DNA methylation in normal breast tissue and its relationship with invasive breast tumor methylation. Epigenetics. 2014;9:268–75.
Guo C, Chen LH, Huang Y, Chang CC, Wang P, Pirozzi CJ, et al. KMT2D maintains neoplastic cell proliferation and global histone H3 lysine 4 monomethylation. Oncotarget. 2013;4:2144–53.
Carroll JS, Liu XS, Brodsky AS, Li W, Meyer CA, Szary AJ, et al. Chromosome-wide mapping of estrogen receptor binding reveals long-range regulation requiring the forkhead protein FoxA1. Cell. 2005;122:33–43.
Azim HA Jr, Azim H. Breast cancer arising at a young age: do we need to define a cut-off? Breast. 2013;22:1007–8.
Chou J, Provot S, Werb Z. GATA3 in development and cancer differentiation: cells GATA have it! J Cell Physiol. 2010;222:42–9.
Liu Z, Merkurjev D, Yang F, Li W, Oh S, Friedman MJ, et al. Enhancer activation requires trans-recruitment of a mega transcription factor complex. Cell. 2014;159:358–73.
Cohen H, Ben-Hamo R, Gidoni M, Yitzhaki I, Kozol R, Zilberberg A, et al. Shift in GATA3 functions, and GATA3 mutations, control progression and clinical presentation in breast cancer. Breast Cancer Res. 2014;16:464.
Gaynor KU, Grigorieva IV, Allen MD, Esapa CT, Head RA, Gopinath P, et al. GATA3 mutations found in breast cancers may be associated with aberrant nuclear localization, reduced transactivation and cell invasiveness. Horm Cancer. 2013;4:123–39.
Adomas AB, Grimm SA, Malone C, Takaku M, Sims JK, Wade PA. Breast tumor specific mutation in GATA3 affects physiological mechanisms regulating transcription factor turnover. BMC Cancer. 2014;14:278.
Usary J, Llaca V, Karaca G, Presswala S, Karaca M, He X, et al. Mutation of GATA3 in human breast tumors. Oncogene. 2004;23:7669–78.
Kouros-Mehr H, Bechis SK, Slorach EM, Littlepage LE, Egeblad M, Ewald AJ, et al. GATA-3 links tumor differentiation and dissemination in a luminal breast cancer model. Cancer Cell. 2008;13:141–52.
Yan W, Cao QJ, Arenas RB, Bentley B, Shao R. GATA3 inhibits breast cancer metastasis through the reversal of epithelial-mesenchymal transition. J Biol Chem. 2010;285:14042–51.
Ueno T, Emi M, Sato H, Ito N, Muta M, Kuroi K, et al. Genome-wide copy number analysis in primary breast cancer. Expert Opin Ther Targets. 2012;16:S31–35.
Climent J, Martinez-Climent JA, Blesa D, Garcia-Barchino MJ, Saez R, Sanchez-Izquierdo D, et al. Genomic loss of 18p predicts an adverse clinical outcome in patients with high-risk breast cancer. Clin Cancer Res. 2002;8:3863–9.
Singhi AD, Foxwell TJ, Nason K, Cressman KL, McGrath KM, Sun W, et al. Smad4 loss in esophageal adenocarcinoma is associated with an increased propensity for disease recurrence and poor survival. Am J Surg Pathol. 2015;39:487–95.
Kozak MM, von Eyben R, Pai J, Vossler SR, Limaye M, Jayachandran P, et al. Smad4 inactivation predicts for worse prognosis and response to fluorouracil-based treatment in colorectal cancer. J Clin Pathol. 2015;68:341–5.
Liu NN, Xi Y, Callaghan MU, Fribley A, Moore-Smith L, Zimmerman JW, et al. SMAD4 is a potential prognostic marker in human breast carcinomas. Tumour Biol. 2014;35:641–50.
Noviello C, Courjal F, Theillet C. Loss of heterozygosity on the long arm of chromosome 6 in breast cancer: possibly four regions of deletion. Clin Cancer Res. 1996;2:1601–6.
Genander M, Frisen J. Ephrins and Eph receptors in stem cells and cancer. Curr Opin Cell Biol. 2010;22:611–6.
Yang L, Tang H, Kong Y, Xie X, Chen J, Song C, et al. LGR5 promotes breast cancer progression and maintains stem-like cells through activation of Wnt/beta-catenin signaling. Stem Cells. 2015;33:2913–24.
Han J, Hendzel MJ, Allalunis-Turner J. Notch signaling as a therapeutic target for breast cancer treatment? Breast Cancer Res. 2011;13:210.
Asselin-Labat ML, Vaillant F, Sheridan JM, Pal B, Wu D, Simpson ER, et al. Control of mammary stem cell function by steroid hormone signalling. Nature. 2010;465:798–802.
Azim HA Jr, Peccatori FA, Brohee S, Branstetter D, Loi S, Viale G, et al. RANK-ligand (RANKL) expression in young breast cancer patients and during pregnancy. Breast Cancer Res. 2015;17:24.
The authors would like to thank all patients who donated samples for research purposes. This work was partly supported by research grants from Le Fonds de la Recherche Scientifique and the Breast Cancer Research Foundation (BCRF).
None of the authors have any competing interests.
HAA Jr, BN and SB produced the study concept and design. SB and GZ collected and assembled data. All authors performed data analysis and interpretation. All authors contributed to manuscript writing. All authors read and approved the final manuscript.
The independent association between age at diagnosis and chromosomal copy number variation (CNV) events. (DOCX 15 kb)
The independent association between age at diagnosis and gene expression signatures (>10,000) in a logistic regression model adjusted for tumor size, nodal status, tumor histology and breast cancer subtype. (PDF 1780 kb)