- Research article
- Open Access
Identification of metabolites associated with prostate cancer risk: a nested case-control study with long follow-up in the Northern Sweden Health and Disease Study
BMC Medicine volume 18, Article number: 187 (2020)
Prostate cancer is the second most frequently diagnosed cancer in men. Metabolomics can potentially provide new insights into the aetiology of prostate cancer by identifying new metabolic risk factors. This study investigated the prospective association between plasma metabolite concentrations and prostate cancer risk, both overall and by stratifying for disease aggressiveness and baseline age.
In a case-control study nested in the Northern Sweden Health and Disease Study, pre-diagnostic concentrations of 148 plasma metabolites were determined using targeted mass spectrometry- and nuclear magnetic resonance-based metabolomics in 777 prostate cancer cases (follow-up ≥ 5 years) and 777 matched controls. Associations between prostate cancer risk and metabolite concentrations were investigated using conditional logistic regression conditioned on matching factors (body mass index, age and sample storage time). Corrections for multiple testing were performed using false discovery rate (20%) and Bonferroni. Metabolomics analyses generated new hypotheses, which were investigated by leveraging food frequency questionnaires (FFQs) and oral glucose tolerance tests performed at baseline.
After correcting for multiple testing, two lysophosphatidylcholines (LPCs) were positively associated with risk of overall prostate cancer (all ages and in older subjects). The strongest association was for LPC C17:0 in older subjects (OR = 2.08; 95% CI 1.45–2.98; p < 0.0001, significant also after the Bonferroni correction). Observed associations with risk of overall prostate cancer in younger subjects were positive for glycine and inverse for pyruvate. For aggressive prostate cancer, there were positive associations with six glycerophospholipids (LPC C17:0, LPC C20:3, LPC C20:4, PC ae C38:3, PC ae C38:4 and PC ae C40:2), while there was an inverse association with acylcarnitine C18:2. Moreover, plasma LPC C17:0 concentrations positively correlated with estimated dietary intake of fatty acid C17:0 from the FFQs. The associations between glycerophospholipids and prostate cancer were stronger in case-controls with normal glucose tolerance.
Several glycerophospholipids were positively associated with risk of overall and aggressive prostate cancer. The strongest association was observed for LPC C17:0. The associations between glycerophospholipids and prostate cancer risk were stronger in case-controls with normal glucose tolerance, suggesting a link between the glucose metabolism status and risk of prostate cancer.
Prostate cancer is the second most common cancer among men worldwide . The rapid increase in incidence during recent decades may owe partly to extrinsic factors (diet and lifestyle) [2, 3]. Insulin-like growth factor I (IGF-I), overweight and obesity have been consistently identified as risk factors for prostate cancer [4, 5]. In addition, associations with risk of prostate cancer have been observed positively with intake of dairy products and calcium, and inversely with plasma concentrations of alpha-tocopherol and selenium; however, the evidences are limited .
Using metabolomics to measure a large number of low molecular weight compounds (metabolites) in biofluids, reflecting the final read-out of gene-environment interactions [6,7,8], may help to identify novel risk factors for prostate cancer.
To the best of our knowledge, seven previous studies, conducted within three cohorts, have investigated the association between pre-diagnostic levels of plasma and serum metabolites and the risk of prostate cancer incidence [9,10,11,12,13,14,15]. The results of these studies differ somewhat as regards metabolites associated with disease risk. The differences may be partly explained by the use of different metabolomics methodologies, varying characteristics of the study populations and dissimilarities in the experimental designs (e.g. sample size, fasting status, follow-up time, matching factors and disease subtype categorisation). Therefore, the identification of metabolites associated with risk of prostate cancer warrants further investigation.
By carefully monitoring individual metabolites, we and others have previously shown that the concentrations of circulating metabolites are subjected to changes in response to a meal [16,17,18]. For example, concentrations of phospholipids, amino acids and their breakdown products, glycolytic products, acylcarnitines and ketone bodies vary by up to 80% 0–3 h after a meal  and by up to 100% 0–8 h after a meal . The variation in concentration of metabolites because of varying time since last meal is not related to the estimation of risk and may therefore hinder identification of metabolites associated with disease risk. This variation can be reduced by including only case-control sets for which plasma samples are collected after overnight fasting. Moreover, prostate cancer can pass through a period of latency during which the subclinical disease may cause metabolic changes . These changes are not directly related to the aetiology of the disease and should therefore be avoided. The possibility of such changes can be reduced by including only cases with a long minimum follow-up time (e.g. time between sample collection and diagnosis ≥ 5 years). However, the two largest studies previously performed did not include only (1) case-control sets with fasting samples or (2) cases with a long follow-up time in all subgroups analysed [13, 15]. In addition, baseline age-specific association with risk of prostate cancer has been shown for IGF-I . However, to our knowledge, it has not been investigated whether the association between metabolites and prostate cancer risk varies with baseline age [9,10,11,12,13,14,15].
The aim of this study was to investigate the prospective association between metabolite concentrations and risk of prostate cancer in a case-control study nested within the Northern Sweden Health and Disease Study (NSHDS) cohort. Subgroup analyses were performed after stratification by disease subtype (non-aggressive, aggressive) and baseline age (40–50, 60 years). In order to reduce the variation in concentrations of metabolites that are not directly related to estimation of disease risk, i.e. the variation caused by eating a meal or by subclinical prostate cancer, we only included case-control sets with fasting samples and cases with a minimum follow-up time of at least 5 years (≥ 5 years). In addition, for the first time, we applied combined targeted mass spectrometry (MS) and nuclear magnetic resonance (NMR) spectroscopy-based metabolomics approach to quantify a larger number of metabolites. Metabolomics analyses led to the generation of new hypotheses, which were corroborated for the first time by leveraging food frequency questionnaires (FFQs) and a complementary oral glucose tolerance test (OGTT) performed at baseline.
The Northern Sweden Health and Disease Study
The NSHDS is a population-based cohort that started in 1985 . In brief, residents in the Swedish province of Västerbotten were invited to a health assessment at 40, 50 and/or 60 years of age. In the assessment, each participant underwent a health examination, including measurement of height, weight and blood pressure. In addition, an OGTT was performed with a 75-g oral glucose load, according to the World Health Organization (WHO) standards, and all participants were asked to complete a validated FFQ. The FFQ covered various food items from which dietary intake of different nutrients, e.g. different fatty acids (FAs), can be calculated . Nutrient intake was calculated by multiplying the daily intake of different food items calculated from the FFQ by the nutrient content of each food item extracted from a food composition database at the Swedish National Food Administration . Finally, participants were asked to donate blood for future research purposes. The blood sample from each participant was drawn after overnight fasting; separated into buffy coat, erythrocytes and (heparin) plasma aliquots; and then stored at − 80 °C (within 2 h of collection) at the Medicinal Biobank, Umeå University. All participants gave written informed consent to participate in the study. The study was approved by the Research Ethics Committee of Umeå University Hospital and the Regional Ethics Committee in Uppsala (no. 2013-124).
A nested case-control study on prostate cancer was designed within the NSHDS cohort. Among the 38,467 men who participated in the health survey between 1985 and 2007, 1664 were diagnosed with prostate cancer based on national registries to which reporting is mandated by law (e.g. patients’ registry, cancer registry and cause of death registry) during the follow-up period until 2012. A total of 777 cases were selected for the present study after applying some inclusion/exclusion criteria. The criteria for inclusion were as follows: (1) overnight fasting, (2) no previous cancer incidence before prostate cancer diagnosis, (3) ≥ 5 years between the time of sample collection (baseline) and prostate cancer diagnosis and (4) no type 2 diabetes (T2D) diagnosed at baseline (declared during the health assessment).
Each selected case was classified as having either an aggressive or non-aggressive subtype of the disease, based on the International Union Against Cancer classification system [23, 24]. Aggressive prostate cancer was defined as poorly differentiated tumour (Gleason’s score 8–10 or grade 3 in the three-level WHO grading system, with grade 3 indicating the lowest level of differentiation [19, 25, 26]), non-localised tumour (T3–4), lymph node metastasis (N1), bone metastasis (M1), serum prostate-specific antigen (PSA) level above 50 ng/mL at diagnosis or fatal prostate cancer by March 2007 (irrespective of tumour characteristics at diagnosis). Cases not classified as having an aggressive form of the disease were included in the group of non-aggressive cases.
One control was selected for each prostate cancer case among men who were alive, free of any cancer history at the time of diagnosis of the corresponding case and had a plasma sample collected after overnight fasting. Cases and controls were matched according to age, body mass index (BMI) and duration of sample storage in the freezer. The windows used for matching factors to select a control for each case from NSHDS cohort were as follows: age ± 210 days, BMI ± 0.8 kg/m2 and sample storage in freezer ± 220 days. None of the selected controls had a T2D diagnosis at baseline.
During selection of cases and controls from NSHDS for the present study, we set a criteria to exclude cases and controls who declared having T2D in the health assessment at the time of sample collection (baseline). However, the metabolomics analysis led to new hypotheses which were investigated after retrieving the data from an OGTT performed at baseline. Using the OGTT data, we identified some new cases and controls with T2D according to the WHO criteria  at baseline, but whose T2D had not been declared in the health assessment.
In NSHDS, at enrolment, the participants were invited at even decades, i.e. 40, 50 and 60 years of age. Therefore, the present study included participants who were 40 (n = 45 sets), 50 (n = 288 sets) and 60 (n = 444 sets) years old. This allowed the stratification of case-control sets into age-specific subgroups [19, 24]. Because of the lower prevalence of prostate cancer in younger participants, we included 40- and 50-year-olds in the same subgroup (younger, 40–50 years, n = 333 sets), while 60-year-olds were included in another subgroup (older, 60 years, n = 444 sets).
Targeted MS- and NMR-based metabolomics
Plasma samples were analysed using both targeted MS- and NMR-based metabolomics (Additional file 1). The targeted MS-based metabolomics was performed using the AbsoluteIDQ p180 assay (BIOCRATES, Innsbruck, Austria), quantifying 188 metabolites . The targeted NMR-based metabolomics was performed using an automated quantification algorithm (AQuA), which was specifically developed in our laboratory for large epidemiological studies quantifying 67 metabolites from the NMR spectra of human plasma .
Metabolite inclusions and exclusions
Metabolites with occurrence below 50% in samples from the nested cohort or an analytical coefficient of variation (CV) above 15% in quality control (QC) samples were excluded from the statistical analyses. The quality control criteria were fulfilled by 103 (out of 188) metabolites in the targeted MS analysis and by 45 (out of 67) metabolites in the targeted NMR analysis, which were used in statistical analysis (Additional file 2).
Identification of metabolites associated with risk of prostate cancer
The association between each individual metabolite and the risk of prostate cancer was assessed using a conditional logistic regression model conditioned on matching factors (BMI, age and sample storage time). The metabolite concentrations were log2-transformed, and thus, linear estimates were per doubling. This derived the crude odds ratio (ORcrude), the 95% confidence interval (CI) and the corresponding p value (pcrude) for each association. Analyses were repeated after stratification by disease subtype (non-aggressive, aggressive) and baseline age (40–50, 60 years). The conditional logistic regression analyses were conducted using the PHREG procedure in SAS (version 9.3, SAS Institute Inc., Cary, NC), with the case-control sets as strata.
Correction for multiple testing was performed using two approaches of varying stringency/conservancy, in consistence with previous studies (low stringency: ; high stringency: ). For the low stringency approach, false discovery rate (FDR) correction (significance level at 20%) was performed using the Q-value package  in RStudio (version 3.0.3, R Foundation for Statistical Computing Platform, Vienna, Austria). This approach was employed for holistic interpretation of results and hypothesis generation. For the high stringency approach, the Bonferroni correction was employed, using the number of metabolites as number of variables (α = 0.05/148).
For statistically significant metabolites (FDR 20%), each conditional logistic regression model was further adjusted for BMI and exact age (continuously) in one model and additionally for alcohol intake (< 10, 10–19, 20–39, ≥ 40 g/day) and smoking (no, past, current, unknown) in another model . The estimated risk for each metabolite by the adjusted models did not differ from the unadjusted model by more than 10%, and therefore, only results from the unadjusted model are presented.
Possible differential associations by age or disease type were investigated for each of the metabolites that were statistically significant after correction for multiple testing (FDR 20%). This was done by associating individual plasma metabolites with prostate cancer risk allowing for different associations for each of the two categories for disease type and baseline age, respectively (for disease type: binary, non-aggressive, aggressive; for age: binary, 40–50, 60 years). Each comparison was made using a Wald test for the hypothesis of equal regression coefficients. Log-transformed metabolite data were used.
For statistically significant metabolites (FDR 20%), each conditional logistic regression model was also repeated for subgroups by follow-up time (≤ 10, > 10 years). Possible differential associations by follow-up time were performed as described above.
Categorical logistic regression analysis was also performed for each metabolite (FDR 20%) based on quartile values. Each subject was assigned to a quartile based on cut-off values derived from the distribution of concentrations in the control group. The first quartile was used as a referent level.
Spearman’s correlation coefficient was used to assess the correlation between metabolite concentrations (lysophosphatidylcholines, LPCs) and baseline characteristics (glucose values from the OGTT and estimated daily intake of different FAs from the FFQ). Correlation analysis was performed using the CORR procedure in SAS (version 9. 3, SAS Institute Inc., Cary, NC).
Status of glucose metabolism at baseline and risk of prostate cancer
Metabolomics findings led to a hypothesis of a link between baseline glucose metabolism and risk of prostate cancer. The subjects in this study underwent an OGTT at enrolment, so the status of glucose metabolism at baseline, i.e. normal glucose tolerance (NGT), impaired fasting glucose (IFG) and impaired glucose tolerance (IGT) or T2D, could be determined using the WHO criteria . A categorical logistic regression analysis (NGT as the referent) was used to assess the association between the status of glucose metabolism and risk of prostate cancer (overall, and by restricting to aggressive or non-aggressive cases and stratifying by baseline age).
Conditional logistic regression analyses were repeated after excluding case-controls with abnormal glucose metabolism at baseline (i.e. IFG, IGT or T2D) and thereby restricting to case-controls with NGT. These analyses were limited to metabolites found to be associated with prostate cancer risk after correction for multiple testing (FDR 20%). This was performed to examine whether the associations between these metabolites and prostate cancer risk were improved in matched case-controls with NGT.
Table 1 presents baseline characteristics (clinical measures, blood parameters, status of glucose metabolism based on OGTT results and estimated daily intakes of different FAs reported in the FFQ) for the respective case-control sets included in the present study. Values are presented for the entire study population and for the subgroups of younger (40–50 years) and older (60 years) subjects, respectively.
Table 2 displays the characteristics of prostate cancer cases for the entire study population and for the subgroups of younger and older subjects. The median age at prostate cancer diagnosis was 67 years (range 45.9–80.2 years), and the median time between sample collection (baseline) and prostate cancer diagnosis was 10.3 years (range 5.0–19.9 years). For younger cases, the follow-up time was 11.8 (5.1–19.5) years, while for older cases, it was 9.5 (5.0–19.9) years. Overall, 22% of cases were classified as having an aggressive form of the disease (Table 2).
Identification of metabolites associated with risk of prostate cancer
Metabolites with nominal p value < 0.05
The metabolites associated with prostate cancer risk before and after stratification by disease subtype and baseline age (nominal p value < 0.05; Additional file 3) are shown in Fig. 1. Several glycerophospholipids (LPCs and phosphatidylcholines (PCs)) and glycine were frequently associated with risk of prostate cancer (OR > 1). The associations observed for glycerophospholipids were typically stronger for risk of aggressive disease and in older subjects. The association observed for glycine was generally stronger for risk of overall and non-aggressive disease in younger subjects. Pyruvate, arginine, ornithine and acylcarnitine C18:2 were frequently associated with risk of prostate cancer (OR < 1).
Metabolites significant after correction for multiple testing
Metabolites with a significant association with risk of prostate cancer after correction for multiple testing (FDR 20% and Bonferroni) are shown in Table 3 (Additional file 4). Higher plasma concentrations of LPC C17:0 and LPC C18:0 were associated with higher risk of overall prostate cancer, and each respective association was stronger in older subjects. Importantly, the association between LPC C17:0 and overall prostate cancer risk in older subjects was significant after the Bonferroni correction. Higher plasma concentration of glycine and pyruvate was associated with higher and lower risk of overall prostate cancer in younger subjects, respectively. Higher concentrations of six glycerophospholipids (LPC C17:0, LPC C20:3, LPC C20:4, PC ae C38:3, PC ae C38:4 and PC ae C40:2) were associated with higher risk of aggressive prostate cancer, while higher concentrations of acylcarnitine C18:2 were associated with lower risk of aggressive prostate cancer. The association observed for LPC C17:0 was stronger in older subjects, where individuals in the top quartile had 3.9-fold higher odds of developing aggressive disease (Table 3). None of the associations with non-aggressive prostate cancer risk (Fig. 1) was significant after correction for multiple testing. There was heterogeneity between the two age categories (40–50, 60 years) for LPC C17:0, LPC C18:0 and glycine (Additional file 5). Only PC ae C38:0 showed heterogeneity between disease subtype categories (aggressive, non-aggressive; Additional file 5). There was no heterogeneity between follow-up time categories (5–10, > 10 years; Additional file 6).
LPCs attracted particular interest in the analyses, since the majority of metabolites identified belonged to this class of glycerophospholipids (Table 3). Based on Spearman’s correlation coefficients, two clusters of LPCs were identified in the correlation matrix (Fig. 2a). There were significant positive correlations between the LPCs in each cluster. The first cluster included LPCs with ≤ 20 carbons in the FA moiety, and the second cluster included LPCs with > 20 carbons in the FA moiety. The LPCs associated with prostate cancer risk (Table 3) belonged to the first cluster.
The OR values obtained for the individual LPCs in the first cluster (≤ 20 carbons) had a pattern highly similar to that of the total sum of LPCs (Fig. 2b; Additional file 7). After normalisation of each individual LPC to the total sum of LPCs, the associations between LPC C18:0, LPC C20:3 and LPC C20:4 and risk of prostate cancer were no longer significant, while the association between LPC C17:0 and risk of prostate cancer remained significant (Fig. 2b; Additional file 7). This finding clearly distinguished LPC C17:0 from other LPCs in the first cluster.
Lysophosphatidylcholines and dietary fatty acids
The correlations between plasma concentration of LPCs and daily dietary intake of the corresponding FA (estimated from the baseline FFQ) were investigated. The strongest correlation was observed between the plasma concentration of LPC C17:0 and estimated daily intake of FA C17:0 (Additional file 8).
Lysophosphatidylcholines and glucose metabolism
LPCs in the first cluster (≤ 20 carbons) displayed a negative correlation with glucose values from the OGTT, while the correlations between LPCs in the second cluster (with > 20 carbons) and glucose values from the OGTT were generally non-significant (Fig. 3a). The correlations between LPCs in the first cluster and post-load glucose values (2 h) were stronger than the corresponding correlations to pre-load glucose values (0 h; Fig. 3a).
Status of glucose metabolism and prostate cancer risk
LPCs (≤ 20 carbons) were consistently associated with risk of prostate cancer (Fig. 2b) and displayed negative correlations with glucose values from the OGTT (Fig. 3a). Further investigation of whether glucose metabolism status at baseline (i.e. NGT, IFG, IGT or T2D) was associated with prostate cancer risk revealed that IGT was associated with lower risk of prostate cancer (p < 0.05; Fig. 3b; Additional file 9). The associations were often stronger in younger subjects than in older subjects (Fig. 3b). For metabolites found to be associated with risk of prostate cancer after correction for multiple testing (Table 3), we investigated whether the associations were altered after limiting the analyses to case-control sets with NGT at baseline. The results showed that associations for some LPCs with ≤ 20 carbons in the FA moiety and for some PCs appeared stronger (p < 0.05), but the associations for pyruvate and acylcarnitine C18:2 were not changed (Fig. 4; Additional file 10). The associations for glycine became stronger for overall and non-aggressive prostate cancer in younger subjects (p < 0.05).
LPC C17:0 and LPC C18:0 were associated with risk of overall prostate cancer (Table 3; OR > 1), and the association appeared stronger in older subjects (for LPC C17:0, it was still significant after the Bonferroni correction). In younger subjects, glycine and pyruvate were associated with risk of overall prostate cancer (Table 3; glycine: OR > 1; pyruvate: OR < 1). Several glycerophospholipids and acylcarnitine C18:2 were associated with aggressive prostate cancer risk (glycerophospholipids: OR > 1; acylcarnitine C18:2: OR < 1) (Table 3). The strongest association was found in older subjects for LPC C17:0, where individuals in the top quartile had 3.9-fold higher odds of developing aggressive prostate cancer (Table 3).
Strengths and limitations
The combined targeted MS- and NMR-based metabolomics methodology used in the present study provided absolute metabolite quantities with low analytical variation, and the use of two separate techniques increased the metabolite coverage. The validity of the metabolomics findings was tested using a holistic approach, in which metabolomics-based hypotheses were corroborated by data generated at baseline (FFQ and OGTT). The use of plasma samples collected after overnight fasting reduced the variation in concentrations of metabolites resulting from varying time since the last meal. A long minimum follow-up time (≥ 5 years) reduced the possibility of metabolic alterations due to subclinical prostate cancer at baseline. Each case-control set was matched for BMI and age, to account for the known risk factors for prostate cancer. The study was limited by including individuals from only one European country and by employing only targeted metabolomics, which measures a limited number of metabolites compared with an untargeted approach.
Seven previous studies have investigated the association between metabolite levels and the risk of prostate cancer incidence in prospectively collected human blood samples [9,10,11,12,13,14,15]. Three of these studies were nested within the Alpha-Tocopherol, Beta-Carotene Cancer Prevention Study (ATBC) [11, 12, 14]; one within the Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial (PLCO) ; one within the European Prospective Investigation into Cancer and Nutrition study (EPIC) in Heidelberg ; and two within the EPIC-multicentre cohort (Germany, Greece, Italy, the Netherlands, Spain and the UK) [13, 15]. The ABTC and PLCO studies used untargeted metabolomics to non-quantitatively measure many metabolites, many not targeted in the present study. The EPIC studies used the same MS-based methodology, allowing more straightforward comparisons with the present study.
The present study is the first to show positive associations between LPC C17:0 and risk of overall and aggressive prostate cancer (Table 3) [9,10,11,12,13,14,15]. The inverse association between acylcarnitine C18:2 and risk of aggressive prostate cancer in the present study (Table 3) is consistent with the association between higher concentrations of acylcarnitine C18:2 and lower risk of advanced-stage prostate cancer in the EPIC-multicentre studies [13, 15].
The positive association between LPC C18:0 and risk of overall prostate cancer (Table 3) in the present study was not observed in any of the EPIC-multicentre studies (ORs close to one) [13, 15]. In the EPIC-Heidelberg study, an inverse association between LPC C18:0 and overall prostate cancer risk (p < 0.05) was observed, but the association was no longer significant after correction for multiple testing or adjusting for confounding factors in that case-cohort study . The positive associations between glycerophospholipids (LPCs and PCs) and risk of aggressive prostate cancer in our study are in the opposite direction to the inverse associations between glycerophospholipids and risk of advanced-stage prostate cancer reported in the EPIC-multicentre studies [13, 15].
In this study, we only used overnight fasting samples, because varying time since the last meal can generate variations in the concentration of several metabolites that are not related to the estimation of disease risk. This was not considered in the experimental design of the EPIC studies [10, 13, 15]. Instead, the EPIC-multicentre studies matched each case-control set based on categories of time since last meal (< 3, 3–6, > 6 h) [13, 15]. However, the range of variation in the concentration of some metabolites within each of those time categories is still large. For examples, we have previously shown that LPC concentrations change by 25–34% compared with baseline within 3 h after a meal . Therefore, our study population was (at least in theory) subjected to lower pre-analytical variation and this may have assisted in revealing the association between, for example, LPC C18:0 and higher risk of overall prostate cancer.
Moreover, in the present study, as in some previous studies [19, 24], the aggressive disease subtype included both high grade and non-localised tumours (T3–4). In contrast, the EPIC-multicentre studies considered only non-localised tumours (T3–4) in the advanced-stage disease subtype [13, 15]. These differences in disease subtype categorisation may partly account for the different associations observed between our study and the EPIC-multicentre studies. At this stage, it is not possible to determine the exact reason for the opposing association between LPCs and risk of aggressive/advanced-stage prostate cancer observed in the present study and the EPIC-multicentre studies. Further research, for example comparing our nested cohort with a subgroup in the EPIC-multicentre cohort and including only case-controls with samples collected after overnight fasting, may help explain the discrepancies. This suggestion is motivated by our previous observation that the extent of change in the concentrations of metabolites (i.e. phospholipids) after a meal is also associated with the physiological status of individuals (in addition to time since last meal) . For example, the post-meal changes in several metabolites are blunted in individuals with impaired glucose/insulin metabolism . This may indicate that the association of a metabolite with a disease may switch direction or be affected when moving from fasting samples to post-meal samples if the extent of post-meal change in the metabolite differs between cases and controls, as a result of disease-associated physiological status.
Sarcosine has been prospectively associated with risk of prostate cancer in a study performed in the PLCO . No association between sarcosine and risk of prostate cancer was found in the present study (data not shown). One reason may be the shorter follow-up times used in the PLCO study (≥ 1 year) compared with the present study (≥ 5 years). Intriguingly, no association with sarcosine was found in another study performed in the PLCO that used longer follow-up times (≥ 4.4 years) . This might suggest sarcosine is rather a marker of (early) diagnosis [31, 32] or progression [33, 34] in prostate cancer.
Investigating links between prostate cancer and dietary fatty acids
After normalising the concentration of each LPC with ≤ 20 carbons to the total concentration of such LPCs, only LPC C17:0 remained associated with risk of prostate cancer (Fig. 2b; OR > 1; p < 0.05). This may imply an increase at two levels: (1) an overall increase in LPCs and (2) an additional increase in the LPC with 17 carbons in the FA moiety (LPC C17:0). Moreover, the LPC C17:0 concentration in plasma displayed a significant positive correlation with dietary intake of the corresponding FA (FA C17:0; Additional file 8), while for other LPCs, such a correlation was generally weak or absent.
Odd-chain FAs (e.g. C17:0) are found in products of ruminant origin (e.g. dairy products). Previous studies have shown a positive correlation between levels of plasma LPC C17:0 and dairy consumption [35, 36]. Therefore, our findings may suggest a link between consumption of dairy products and risk of prostate cancer. Some previous studies have also suggested a higher prostate cancer risk with higher intake of dairy products [37, 38], although evidence of such an association is still limited , and therefore, it warrants further investigation.
A previous study in the Pan-European EPIC-cohort reported dairy consumption to be higher among males in northern Sweden than in many other European countries . If high dairy consumption has implications for prostate cancer, then metabolic markers of dairy consumption (e.g. LPC C17:0) [35, 36] are more likely to display a stronger association to prostate cancer risk in a population such as northern Sweden.
It should be borne in mind that according to recent findings, biosynthesis can also contribute to circulatory levels of FA C17:0 . Hence, more research is needed to enable sound conclusions to be drawn. Understanding the link between consumption of dairy products and risk of prostate cancer would be of great value in formulating prevention strategies, since consumption of dairy products is a modifiable risk factor.
Investigating the link between prostate cancer and glucose metabolism
Higher concentrations of LPC C17:0, LPC C18:0 and glycine, which were associated with higher risk of overall prostate cancer in the present study (Table 3), were found to be associated with lower risk of IGT and T2D in a previous study . Similarly, FA C17:0 has been linked to glucose intolerance . In addition, pyruvate, which we observed to be associated with lower risk of overall prostate cancer, is an important intermediate in glucose metabolism . These findings suggest a possible link between glucose metabolism and risk of prostate cancer. In addition, aberrant glucose metabolism [24, 43] and T2D [44, 45] have previously been associated with lower risk of prostate cancer. Therefore, we investigated the correlation between metabolites frequently associated with a risk of prostate cancer (LPCs) and glucose concentrations (0 h and 2 h) from an OGTT performed on all subjects at baseline. Intriguingly, we detected negative correlations between LPCs (with ≤ 20 carbons in the FA moiety) and glucose concentrations.
Considering the negative correlation between LPCs and glucose concentrations, we then investigated whether the status of glucose metabolism at baseline (i.e. IGT, IFG and T2D) was associated with risk of prostate cancer. It was observed that IGT was associated with a lower risk of prostate cancer (Fig. 3b). Next, we postulated that the association between metabolites and prostate cancer risk may appear stronger in a population with NGT, where the risk-reducing (masking) effect of IGT does not exist. Therefore, we repeated the statistical analyses among case-controls with NGT for metabolites associated with prostate cancer risk (Table 3). We found that the associations appeared stronger for many of the glycerophospholipids and glycine, supporting our postulated effect (Fig. 4). Overall, these findings show that the reverse cross-association between T2D and prostate cancer risk [44, 45] can be observed at metabolite level.
A number of studies have shown altered metabolic regulation in relation to the development [9,10,11,12,13,14,15] and progression  of prostate cancer. It would be of great interest to acquire insights into the genes involved in controlling these metabolic alterations. The cross-association between T2D and prostate cancer observed in the present and previous studies [44, 45] may warrant further research on genes and signalling pathways at the intercept of prostate cancer and T2D. One such signalling pathway is phosphatase and tensin homologue (PTEN)-PI3K/AKT with known implications in both prostate cancer and diabetes [47, 48].
We observed associations of LPCs, PCs, glycine, acylcarnitine C18:2 and pyruvate with prostate cancer risk, with the associations varying with baseline age and disease aggressiveness. The strongest and most consistent association was observed between LPC C17:0, potentially a marker of dairy consumption, and higher risk of overall and aggressive prostate cancer. Moreover, several LPCs that were associated with risk of prostate cancer were negatively correlated with blood glucose concentrations from an OGTT. The association of glycerophospholipids and glycine with prostate cancer risk was stronger in case-controls with NGT. These findings suggest a possible link between glucose metabolism and the risk of developing prostate cancer.
Availability of data and materials
The data used in the present study are available to the editorial board for evaluating the findings presented. In order to access the data and samples for another study, after receiving ethical approval, a request should be submitted to the Biobank Research Unit at Umeå University, which administers and conveys information about access to data and samples in this study (https://www.umu.se/en/biobank-research-unit/).
Alpha-Tocopherol, Beta-Carotene Cancer Prevention Study
Automated quantification algorithm
Body mass index
Coefficient of variation
Diastolic blood pressure
European Prospective Investigation into Cancer and Nutrition study
False discovery rate
Food frequency questionnaire
Insulin-like growth factor I
Impaired glucose tolerance
Impaired fasting glucose
Normal glucose tolerance
Nuclear magnetic resonance
Northern Sweden Health and Disease Study
Oral glucose tolerance test
Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial
Phosphatase and tensin homologue
Systolic blood pressure
Type 2 diabetes
World Health Organization
Bray F, Ferlay J, Soerjomataram I, Sigel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBACON estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68(6):394–424.
Sim HG, Cheng CW. Changing demography of prostate cancer in Asia. Eur J Cancer. 2005;41(6):834–45.
Khan N, Afaq F, Mukhtar H. Lifestyle as risk factor for cancer: evidence from human studies. Cancer Lett. 2010;293(2):133–43.
Travis RC, Appleby PN, Martin RM, Holly JMP, Albanes D, Black A, et al. A meta-analysis of individual participant data reveals an association between circulating levels of IGF-I and prostate cancer risk. Cancer Res. 2016;76(8):2288–300.
WCRF/AICR. Diet, nutrition, physical activity and prostate cancer: a global perspective. Continuous Update Project Expert Report 2018. https://wcrf.org: Accessed 15 Nov 2019.
Wishart DS, Tzur D, Knox C, Eisner R, Guo AC, Young N, et al. HMDB: the human metabolome database. Nucleic Acid Res. 2007;35(Database issue):D521–6.
Nicholson JK, Holmes E, Elliott P. The metabolome-wide association study: a new look at human disease risk factors. J Proteome Res. 2008;7(9):3637–8.
Nicholson G, Rantalainen M, Maher AD, Li JV, Malmodin D, Ahmadi KR, et al. Human metabolic profiles are stably controlled by genetic and environmental variation. Mol Syst Biol. 2011;7:525.
Huang J, Mondul AM, Weinstein SJ, Koutros S, Derkach A, Karoly E, et al. Serum metabolomic profiling of prostate cancer risk in the prostate, lung, colorectal, and ovarian cancer screening trial. Br J Cancer. 2016;115(9):1087–95.
Kühn T, Floegel A, Sookthai D, Johnson T, Rolle-Kampczyk U, Otto W, et al. Higher plasma levels of lysophosphatidylcholine 18:0 are related to a lower risk of common cancers in a prospective metabolomics study. BMC Med. 2016;14:13.
Mondul AM, Moore SC, Weinstein SJ, Karoly ED, Sampson JN, Albanes D. Metabolomic analysis of prostate cancer risk in a prospective cohort: the alpha-tocopherol, beta-carotene cancer prevention (ATBC) study. Int J Cancer. 2015;137(9):2124–32.
Mondul AM, Moore SC, Weinstein SJ, Männistö S, Sampson JN, Albanes D. 1-Stearoylglycerol is associated with risk of prostate cancer: results from serum metabolomic profiling. Metabolomics. 2014;10(5):1036–41.
Schmidt JA, Fensom GK, Rinaldi S, Scalbert A, Appleby PN, Achaintre D, et al. Pre-diagnostic metabolite concentrations and prostate cancer risk in 1077 cases and 1077 matched controls in the European prospective investigation into cancer and nutrition. BMC Med. 2017;15(1):122.
Huang J, Mondul AM, Weinstein SJ, Karoly ED, Sampson JN, Albanes D. Prospective serum metabolomic profile of prostate cancer by size and extent of primary tumor. Oncotarget. 2017;8(28):45190–9.
Schmidt JA, Fensom GK, Rinaldi S, Scalbert A, Appleby PN, Achaintre D, et al. Patterns in metabolite profile are associated with risk of more aggressive prostate cancer: a prospective study of 3,057 matched case-control sets from EPIC. Int J Cancer. 2020;146(3):720–30.
Shrestha A, Müllner E, Poutanen E, Mykännen H, Moazzami AA. Metabolomic changes in serum metabolome in response to a meal. Eur J Nutr. 2017;56(2):671–81.
Shi L, Brunius C, Lindelöf M, Shameh SA, Wu H, Lee I, et al. Targeted metabolomics reveals differences in the extended postprandial plasma metabolome of healthy subjects after intake of whole-grain rye porridges versus refined wheat bread. Mol Nutr Food Res. 2016;61(7):1600924.
Krug S, Kastenmüller G, Stückler F, Rist MJ, Skurk T, Sailer M, et al. The dynamic range of the human metabolome revealed by challenges. FASEB J. 2012;26(6):2607–19.
Stattin P, Rinaldi S, Biessy C, Stenman UH, Hallmans G, Kaaks R. High levels of circulating insulin-like growth factor-I increase prostate cancer risk: a prospective study in a population-based nonscreened cohort. J Clin Oncol. 2004;22(15):3104–12.
Norberg M, Wall S, Boman K, Weinehall L. The Västerbotten intervention programme: background, design and implications. Glob Health Action. 2010;3.
Johansson I, Hallmans G, Wikman A, Biessy C, Riboli E, Kaaks R. Validation and calibration of food-frequency questionnaire measurements in the northern Sweden health and disease cohort. Public Health Nutr. 2002;5(3):487–96.
Bergström L, Kylberg E, Hagman U, Eriksson H, Bruce Å. The food composition data base system (KOST-systemet) - its use for nutrient values. Vår Föda. 1991;43:439–4.
AJCC. Cancer staging manual. Lippincott-Raven. 1997;5:219–25.
Stocks T, Lukanova A, Rinaldi S, Biessy C, Dossus L, Lindahl B, et al. Insulin resistance is inversely related to prostate cancer: a prospective study in northern Sweden. Int J Cancer. 2007;120(12):2678–86.
Egevad L. Reproducibility of Gleason grading of prostate cancer can be improved by the use of reference images. Urology. 2001;57(2):291–5.
Mostofi FK, Sesterhenn IA, Sobin LH. Histologic typing of prostate tumours. Geneva: World Health Organization; 1980. p. 17–21.
WHO. Definition, diagnosis and classification of diabetes mellitus and its complications. Report of a WHO consultation 1999.
Bogmuil R, Koal T, Weinberger KM, Dammeier S. Targeted metabolomics: fast, standardized mass spectrometric analysis of blood plasma with a kit. Larborwelt. 2008;2:17–23.
Röhnisch HE, Eriksson J, Müllner E, Agback P, Sandström C, Moazzami AA. AQuA: an automated quantification algorithm for high-throughput NMR-based metabolomics and its application in human plasma. Anal Chem. 2018;90(3):2095–102.
Storey JD, Tibshirani R. Statistical significance for genomewide studies. Proc Natl Acad Sci U S A. 2003;100(16):9440–5.
Koutros S, Meyer TE, Fox SD, Issaq HJ, Veenstra TD, Huang WY, et al. Prospective evaluation of serum sarcosine and risk of prostate cancer in the Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial. Carcinogenesis. 2013;34(10):2281–5.
Lucarelli G, Fanelli M, Larocca AM, Germinario CA, Rutigliano M, Vavallo A, et al. Serum sarcosine increases the accuracy of prostate cancer detection in patients with total serum PSA less than 4.0 ng/ml. Prostate. 2012;72(15):1161–21.
Sreekumar A, Poisson LM, Rajendiran TM, Khan AP, Cao Q, Yu J, et al. Metabolomic profiles delineate potential role for sarcosine in prostate cancer progression. Nature. 2009;457(7231):910–4.
Lucarelli G, Ditonno P, Bettocchi C, Spilotros M, Rutigliano M, Vavallo A, et al. Serum sarcosine is a risk factor for progression and survival in patients with metastatic castration-resistant prostate cancer. Future Oncol. 2013;9(6):899–907.
Nestel PJ, Straznicky N, Mellett NA, Wong G, De Souza DP, Tull DL, et al. Specific plasma lipid classes and phospholipid fatty acids indicative of dairy food consumption associate with insulin sensitivity. Am J Clin Nutr. 2014;99(1):46–53.
Floegel A, von Ruesten A, Drogan D, Schulze MB, Prehn C, Adamski J, et al. Variation of serum metabolites related to habitual diet: a targeted metabolomic approach in EPIC-Potsdam. Eur J Clin Nutr. 2013;67(10):1100–8.
Aune D, Navarro Rosenblatt DA, Chan DS, Vieira AR, Vieira R, Greenwood DC, et al. Dairy products, calcium, and prostate cancer risk: a systematic review and meta-analysis of cohort studies. Am J Clin Nutr. 2015;101(1):87–117.
Tat D, Kenfield SA, Cowan JE, Broering JM, Carroll PR, Van Blarigan EL, et al. Milk and other dairy foods in relation to prostate cancer recurrence: data from the cancer of the prostate strategic urologic research endeavor (CaPSURETM). Prostate. 2018;78(1):32–9.
Hjartåker A, Lagiou A, Slimani N, Lund E, Chirlague MD, Vasilopoulou E, et al. Consumption of dairy products in the European prospective investigation into cancer and nutrition (EPIC) cohort: data from 35955 24-hour dietary recalls in 10 European countries. Public Health Nutr. 2002;5(6):1259–71.
Jenkins BJ, Seyssel K, Chiu S, Pan PH, Lin SY, Stanley E, et al. Odd chain fatty acids; new insights of the relationship between the gut microbiota, dietary intake, biosynthesis and glucose intolerance. Sci Rep. 2017;23:44845.
Wang-Sattler R, Yu Z, Herder C, Messias AC, Floegel A, He Y, et al. Novel biomarkers for pre-diabetes identified by metabolomics. Mol Syst Biol. 2012;8(1):615.
Guo X, Li H, Xu H, Woo S, Dong H, Lu F, et al. Glycolysis in the control of blood glucose homeostasis. Acta Pharm Sin B. 2012;2(4):358–67.
Häggström C, Stocks T, Nagel G, Manjer J, Bjørge T, Hallmans G, et al. Prostate cancer, prostate cancer death, and death form other causes, among men with metabolic aberrations. Epidemiology. 2014;25(6):823–8.
Bansal D, Bahnsali A, Kapil G, Undela K, Tiwari P. Type 2 diabetes and risk of prostate cancer: a meta-analysis of observational studies. Prostate Cancer Prostatic Dis. 2013;16(2):151–8.
Tsilidis KK, Allen NE, Appelby PN, Rohrmann S, Nöthlings U, Arriola L, et al. Diabetes mellitus and risk of prostate cancer in the European prospective investigation into cancer and nutrition. Int J Cancer. 2015;136(2):372–81.
Lucarelli G, Loizzo D, Ferro M, Rutigliano M, Vartolomei MD, Cantiello F, et al. Metabolic profiling for the identification of novel diagnostic markers and therapeutic targets in prostate cancer: an update. Expert Rev Mol Diagn. 2019;19(5):337–87.
Huang X, Liu G, Guo J, Su Z. The PI3K/AKT pathway in obesity and type 2 diabetes. Int J Biol Sci. 2018;14(11):1483–93.
Yue S, Li J, Lee SY, Shao T, Song B, Cheng L, et al. Cholesteryl ester accumulation induced by PTEN loss and PI3K/AKT activation underlies human prostate cancer aggressiveness. Cell Metab. 2014;19(3):393–406.
We would like to thank Mr. Robert Johansson, Mrs. Åsa Ågren and Mrs. Anna-Sara Molin for sample and data retrieval from biobanks. We would like to thank Northern Sweden Diet Database (NSDD) for providing us with dietary data and Regionalt Cancercentrum Norr for providing us with cancer data.
This study was funded by the Swedish Research Council Formas, SciLifeLab and strategic funding from the Swedish University of Agricultural Sciences for metabolomics-based studies. Open access funding provided by Swedish University of Agricultural Sciences.
Ethics approval and consent to participate
This study was approved by the Research Ethics Committee of Umeå University Hospital and the Regional Ethical Committee in Uppsala (no. 2013-124).
Consent for publication
The authors declare no potential conflicts of interest.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file 1.
Methods for targeted MS and NMR-based metabolomics.
Additional file 2.
Quality control of metabolites quantified with targeted metabolomics.
Additional file 3.
Associations between individual metabolites and prostate cancer risk.
Additional file 4.
Associations significant after correction for multiple testing.
Additional file 5.
Differential associations by age group and disease type.
Additional file 6.
Associations with overall prostate cancer risk after stratification by follow-up time.
Additional file 7.
Adjusting for the sum of lysophosphatidylcholines with ≤20 carbons.
Additional file 8.
Correlation of lysophosphatidylcholines with dietary fatty acids.
Additional file 9.
Status of glucose metabolism and prostate cancer risk.
Additional file 10.
Prostate cancer risk in case-controls with normal glucose tolerance.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Röhnisch, H.E., Kyrø, C., Olsen, A. et al. Identification of metabolites associated with prostate cancer risk: a nested case-control study with long follow-up in the Northern Sweden Health and Disease Study. BMC Med 18, 187 (2020). https://doi.org/10.1186/s12916-020-01655-1
- Prostate cancer
- Nested case-control study
- Nuclear magnetic resonance spectroscopy
- Mass spectrometry
- Risk biomarkers