Hepatocellular carcinoma (HCC), the most prevalent form of liver cancer, is difficult to diagnose and has limited treatment options with a low survival rate. Aside from a few key risk factors, such as hepatitis, high alcohol consumption, smoking, obesity, and diabetes, there is incomplete etiologic understanding of the disease and little progress in identification of early risk biomarkers.
To address these aspects, an untargeted nuclear magnetic resonance metabolomic approach was applied to pre-diagnostic serum samples obtained from first incident, primary HCC cases (n = 114) and matched controls (n = 222) identified from amongst the participants of a large European prospective cohort.
A metabolic pattern associated with HCC risk comprised of perturbations in fatty acid oxidation and amino acid, lipid, and carbohydrate metabolism was observed. Sixteen metabolites of either endogenous or exogenous origin were found to be significantly associated with HCC risk. The influence of hepatitis infection and potential liver damage was assessed, and further analyses were made to distinguish patterns of early or later diagnosis.
Our results show clear metabolic alterations from early stages of HCC development with application for better etiologic understanding, prevention, and early detection of this increasingly common cancer.
Liver cancer is the sixth most commonly diagnosed cancer and the second leading cause of cancer death worldwide. Hepatocellular carcinoma (HCC), the most frequent type of liver cancer, is primarily associated with chronic hepatitis B (HBV) and C (HCV) infections and aflatoxin exposure, while other major risk factors include obesity, type 2 diabetes, tobacco smoking, and heavy alcohol drinking [3–5]. HCC is highly malignant, usually diagnosed at late stages, and often has a poor prognosis with limited treatment options. The late diagnosis and consequent poor survival associated with the disease are often attributed to its lack of pathognomonic symptoms and limitations of diagnostic modalities. Improving both the understanding of HCC etiology and the early detection of the disease is an important first step towards the design of effective prevention strategies aimed at early diagnosis and reduction of HCC incidence. A valuable tool toward these goals is the analysis of bio-samples from prospective cohort studies, where healthy participants are enrolled and followed over time for the appearance of various diseases. Since HCC development implies alterations in the metabolic functions of the liver and, in a majority of cases, progresses from pre-cancerous lesions through to cirrhosis and cancer, it is conceivable that metabolic changes may be detected from the very early stages of the disease, long prior to clinical diagnosis. Thus, metabolomics may serve as a valuable tool for the identification of biomarkers for early detection of HCC.
Metabolomics is a powerful high-throughput approach that relies on state-of-the-art analytical methods, such as nuclear magnetic resonance (NMR), to identify metabolic signatures or biomarkers associated with homeostasis perturbations. Metabolomic strategies play an increasingly important role in clinical and observational studies, in the hope that they will offer new perspectives not only in understanding the processes of disease development, but also for identification of diagnostic/prognostic markers and targeted healthcare. Indeed, several recent studies have leveraged metabolite profiling to provide new insights into pathological processes pertaining to cancer, heart disease, or diabetes mellitus [9–15]. Although a number of metabolomic-based approaches have been applied to HCC, they have been largely based on traditional case–control designs, high-risk patient groups (e.g. hepatitis infection, cirrhosis, or other chronic liver diseases), non-Western populations where traditional HCC risk factors predominate, or tumor tissues [16–31]. However, there is currently very little information derived from prospective settings where biological samples have been collected prior to disease diagnosis [32–34].
In this study, we investigated whether metabolic differences could be detected between HCC cases and matched controls derived from a prospective cohort study using serum samples collected prior to diagnosis. An NMR-based metabolomic approach was applied to a case–control study nested within a large, multi-center prospective cohort.
The present study is based on a case–control study nested within the European Prospective Investigation into Cancer and Nutrition (EPIC) cohort, a multicenter prospective study designed to investigate the association between diet, lifestyle, and environmental factors and the incidence of various types of cancer and other chronic diseases. The rationale, study design, and methods have been described in detail previously. Briefly, diet and lifestyle data were collected at recruitment from approximately 520,000 men and women aged 35–85 years enrolled between 1992 and 2000 in 23 centers from 10 Western European countries (Denmark, France, Germany, Greece, Italy, Norway, Spain, Sweden, the Netherlands, and the United Kingdom). The study subjects were recruited from the general population, except for France (women who were members of a health insurance scheme for state school employees), Naples and Norway (women only), Utrecht and Florence (women attending breast cancer screening), and subsamples of the Oxford “Health Conscious” sub-cohort (vegetarians) and the Italian and Spanish cohorts (mainly members of blood donor associations).
The EPIC cohort in general, and this study in particular, have received approval from the Ethics Committee of the International Agency for Research on Cancer as well as the ethics review boards of individual EPIC centers. EPIC participants provided written consent for the use of their blood samples and all data.
Blood sample collection
Blood samples were collected using standardized methods at recruitment from most participants and are stored at IARC (Lyon, France) in liquid nitrogen at −196 °C for all countries except Denmark (−150 °C, nitrogen vapor) and Sweden (−80 °C, freezers), where samples are stored locally.
Cancer and vital status assessment
Vital status during follow-up (98.5 % complete) was assessed by record linkage with regional and/or national mortality registries in all countries except Germany and Greece, where follow-up was actively reported by study subjects or their next-of-kin. Cancer incidence was determined through record linkage with population-based regional cancer registries (Denmark, Italy, the Netherlands, Norway, Spain, Sweden, and the United Kingdom) or via a combination of methods, including the use of health insurance records, contacts with cancer and pathology registries, and active follow-up through study subjects and their next-of-kin (France, Germany, Greece). For the present study, the dates of follow-up for cancer incidence and vital status are complete up to end of 2006.
The HCC nested case–control study
Ascertainment of cases
HCC cases were defined as tumor in the liver (C22.0) according to the 10th Revision of the International Statistical Classification of Diseases, Injury and Causes of Death. For each HCC case identified, the histology, methods used to diagnose the cancer, and α-fetoprotein (AFP) levels were reviewed to exclude metastatic cases or other types of primary liver cancers.
The nested case–control study
The design of the nested case–control study has been previously described in detail. Briefly, 125 HCC cases with available blood samples at baseline were identified between participants’ recruitment and 2006. For each case, two controls were selected by incidence density sampling from all cohort members alive and free of cancer (except non-melanoma skin cancer), and matched by age at blood collection (±1 year), sex, study center, date (±2 months) and time of the day at blood collection (±3 h), and fasting status at blood collection (<3/3–6/>6 h). Women were additionally matched by menopausal status (pre-/peri-/postmenopausal) and hormone replacement therapy use at time of blood collection (yes/no). Cases with insufficient remaining blood sample for NMR analyses were excluded (n = 11). For six of the remaining cases, only one eligible control was available. Therefore, the final sample size for the present analysis included 114 HCC cases and 222 matched controls.
Serum sample analysis
Laboratory assays: HBV/HCV infection, biomarkers of liver function and AFP
HBV and HCV seropositivity were detected in serum samples using the ARCHITECT HBsAg and anti-HCV chemiluminescent microparticle immunoassays (CMIAs; Abbott Diagnostics, France): HBsAg-positive when ≥0.05 IU/mL and HCV-positive when the ratio of sample relative light units to cutoff relative light units was ≥1 in two measurements. Biochemical markers of hepatic injury, including albumin, total bilirubin, alanine aminotransferase (ALT), aspartate aminotransferase (AST), gamma-glutamyltransferase (GGT), and liver-specific alkaline phosphatase (AP) were measured on the ARCHITECT c Systems™ (Abbott Diagnostics) using standard protocols. The normal ranges were: albumin, 35–50 g/L; total bilirubin, 3.4–20.5 μmol/L; ALT, <55 U/L; AST, 5–34 U/L; GGT, 12–64 U/L (men) and 9–36 U/L (women); and AP, 40–150 U/L. A liver function score was calculated from concentrations of albumin, total bilirubin, ALT, AST, GGT, and AP, each contributing 1 point when outside of the normal range. The liver score was categorized as no liver damage (liver score 0), probable liver damage (liver score 1–2), and likely liver damage (liver score ≥3). Additionally, the concentration of serum AFP, which is currently a pre-diagnostic biomarker for HCC, was measured in blood using the ARCHITECT AFP kit. The laboratory analyses were performed at the Centre de Biologie République, Lyon, France.
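The scoring rule described above can be sketched in a few lines. This is a minimal illustration, not the code used in the study: the dictionary keys, the function names, and the treatment of values lying exactly on a range boundary (counted as normal, except for ALT where the stated range is a strict upper bound) are assumptions.

```python
# Normal ranges as stated in the text; key names are hypothetical.
# A lower bound of None means only an upper limit is given (ALT, <55 U/L).
NORMAL_RANGES = {
    "albumin": (35.0, 50.0),
    "bilirubin": (3.4, 20.5),   # total bilirubin, same unit as the stated range
    "alt": (None, 55.0),
    "ast": (5.0, 34.0),
    "ap": (40.0, 150.0),
}
GGT_RANGES = {"men": (12.0, 64.0), "women": (9.0, 36.0)}


def liver_score(markers, sex):
    """Count how many of the six markers fall outside their normal range."""
    ranges = dict(NORMAL_RANGES, ggt=GGT_RANGES[sex])
    score = 0
    for name, (low, high) in ranges.items():
        value = markers[name]
        if low is None:
            abnormal = value >= high          # strict upper limit (assumption)
        else:
            abnormal = value < low or value > high
        if abnormal:
            score += 1
    return score


def liver_category(score):
    """Map the score to the three categories used in the text."""
    if score == 0:
        return "no liver damage"
    if score <= 2:
        return "probable liver damage"
    return "likely liver damage"


# Illustrative values only, not taken from the study.
example = {"albumin": 42, "bilirubin": 10, "alt": 80, "ast": 50, "ggt": 100, "ap": 90}
example_score = liver_score(example, "men")
example_category = liver_category(example_score)
```

In this sketch the sex-specific GGT range is the only marker that depends on a participant characteristic; all other ranges are shared.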
NMR metabolomic data acquisition
Serum samples (200 μL) were processed according to standard procedures for NMR metabolomic measurement. One-dimensional 1H Carr-Purcell-Meiboom-Gill (CPMG) and Nuclear Overhauser effect spectroscopy (NOESY) NMR spectra were recorded for each serum sample on a Bruker Avance III spectrometer operating at 800.15 MHz 1H NMR frequency. Additional two-dimensional NMR spectra were recorded on a set of representative samples (one control and one case) to achieve assignment of the NMR signals observed in the 1H one-dimensional fingerprints to metabolites. The measured chemical shifts were compared to reference shifts of pure compounds using the HMDB, MMCB, and ChenomX NMR Suite (Chenomx Inc., Edmonton, Canada) databases. Figure 1 shows the mean CPMG spectrum with metabolite assignments. The detailed list of the 44 annotated metabolites is provided in Additional file 1: Table S1. NMR signals arising from lipids enabled the quantification of unsaturated lipids in the serum (signal at 5.28 ppm, resonance of -CH=CH- from unsaturated lipids) as well as terminal lipid methyls corresponding to several classes of lipoproteins: very-low-density lipoproteins (VLDL; δ 0.86 ppm), low-density lipoproteins (LDL; δ 0.84 ppm), and high-density lipoproteins (HDL; δ 0.82 ppm). After processing and calibration, each 1D NMR spectrum was reduced into bins of 0.001 ppm width over a chemical shift range of 0.5–9 ppm using the AMIX software (Bruker GmbH, Rheinstetten, Germany), giving a total number of 8,500 NMR variables.
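The number of NMR variables follows directly from the binning parameters: (9 − 0.5) / 0.001 = 8,500 bins. The sketch below illustrates this kind of uniform binning with NumPy. It is not the AMIX implementation; the function name, the choice of summing intensities within each bin, and the synthetic spectrum are assumptions made for illustration.

```python
import numpy as np


def bin_spectrum(ppm, intensity, lo=0.5, hi=9.0, width=0.001):
    """Sum spectral intensities into equal-width chemical-shift bins."""
    n_bins = int(round((hi - lo) / width))
    # Bin index of every spectral point; the tiny offset guards against
    # floating-point rounding at bin edges.
    idx = np.floor((ppm - lo) / width + 1e-9).astype(int)
    keep = (idx >= 0) & (idx < n_bins)
    return np.bincount(idx[keep], weights=intensity[keep], minlength=n_bins)


# Synthetic flat spectrum sampled every 0.0001 ppm between 0 and 10 ppm.
ppm = np.linspace(0.0, 10.0, 100001)
intensity = np.ones_like(ppm)
binned = bin_spectrum(ppm, intensity)
n_variables = binned.shape[0]
```

With these parameters `n_variables` equals 8,500, matching the number of NMR variables reported above.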
All NMR analyses were performed blindly with respect to case/control status. Further details on sample preparation, NMR data acquisition, and spectra processing are available in Additional file 1.
Orthogonal partial least-square (O-PLS)
O-PLS analyses were conducted in order to build predictive sample classification models based on whole CPMG or NOESY NMR spectra to discriminate between HCC cases and controls, by relating the 8,500 NMR variables to case/control status. Results were visualized on score plots corresponding to sample projection onto the predictive axis and the first orthogonal component of the model. The metabolic signature discriminating HCC cases from controls was visualized by the corresponding loading plot. The optimal number of orthogonal components for building O-PLS models was selected using a 7-fold cross-validation procedure. The associated R2 and Q2 parameters were calculated as measures of goodness of fit and goodness of prediction, i.e. the explained and predicted variances, respectively. The robustness of O-PLS models was further validated using permutations (1000 times) under the null hypothesis; for each set of permuted case/control labels, R2 and Q2 values were obtained and compared to the original ones, a decrease relative to the original values indicating a good-quality model.
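The logic of this validation (R2 from the fitted model, Q2 from 7-fold cross-validation, and comparison against label permutations) can be sketched as follows. Note the loud assumption: a simple ridge regression is swapped in as a stand-in for O-PLS, which was fitted in SIMCA-P in the study, and the data are synthetic. All function names are hypothetical, and only 200 permutations are run to keep the example fast.

```python
import numpy as np


def ridge_predict(X_train, y_train, X_test, alpha=1.0):
    """Stand-in linear model (NOT O-PLS): ridge regression on centred data."""
    x_mean, y_mean = X_train.mean(axis=0), y_train.mean()
    Xc = X_train - x_mean
    gram = Xc.T @ Xc + alpha * np.eye(Xc.shape[1])
    coef = np.linalg.solve(gram, Xc.T @ (y_train - y_mean))
    return (X_test - x_mean) @ coef + y_mean


def r2_q2(X, y, n_folds=7, seed=0):
    """R2 = explained variance of the fit; Q2 = cross-validated predicted variance."""
    n = len(y)
    folds = np.array_split(np.random.default_rng(seed).permutation(n), n_folds)
    total_ss = np.sum((y - y.mean()) ** 2)
    fitted = ridge_predict(X, y, X)
    cv_pred = np.empty(n)
    for test_idx in folds:
        train = np.ones(n, dtype=bool)
        train[test_idx] = False
        cv_pred[test_idx] = ridge_predict(X[train], y[train], X[test_idx])
    r2 = 1.0 - np.sum((y - fitted) ** 2) / total_ss
    q2 = 1.0 - np.sum((y - cv_pred) ** 2) / total_ss
    return r2, q2


def permutation_test(X, y, n_perm=200, seed=1):
    """Compare the observed Q2 with Q2 values obtained after shuffling labels."""
    rng = np.random.default_rng(seed)
    observed = r2_q2(X, y)[1]
    null_q2 = np.array([r2_q2(X, rng.permutation(y))[1] for _ in range(n_perm)])
    p_value = (np.sum(null_q2 >= observed) + 1) / (n_perm + 1)
    return observed, null_q2, p_value


# Synthetic data: 90 samples, 5 variables, status driven by the first variable.
rng = np.random.default_rng(42)
X = rng.normal(size=(90, 5))
y = (X[:, 0] + 0.5 * rng.normal(size=90) > 0).astype(float)
r2, q2 = r2_q2(X, y)
observed_q2, null_q2, p_value = permutation_test(X, y)
```

A model that captures real structure should show Q2 values that drop markedly once the labels are permuted, which is the behaviour the permutation check is designed to reveal.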
Metabolite paired difference analysis
The statistical recoupling of variables procedure was first applied to reduce the 8,500 NMR variables into 285 intelligent buckets, or clusters of NMR variables, that correspond to reconstructions of peak entities. Mixed-effect ANOVA models were then fitted to each of the 285 clusters of variables, with the matched case–control set included as a random effect to account for the matching design of the study. To correct for multiple testing, q values were determined using the Benjamini-Hochberg procedure to control the false discovery rate with a threshold of 0.05. In this way, 96 clusters of NMR variables were found to be significantly associated with HCC outcome. Significant clusters of variables corresponding to different peaks of the same metabolite (based on the metabolite identification reported above) were combined into a single variable by summing up the bin intensities, taking into consideration the number of homologous protons in the signal resonance. This procedure resulted in a list of 23 combined clusters of variables, 16 of which corresponded to distinct metabolite or lipid classes and were retained for further analyses, while five corresponded to other signals from mixed classes of lipids and two corresponded to the superimposition of signals from different metabolites.
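For readers unfamiliar with the Benjamini-Hochberg adjustment, a minimal NumPy sketch is given below. The function name and the example P values are illustrative assumptions and are not taken from the study.

```python
import numpy as np


def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg q values in the original order of p_values."""
    p = np.asarray(p_values, dtype=float)
    m = p.size
    order = np.argsort(p)
    # Scale each sorted P value by m / rank.
    scaled = p[order] * m / np.arange(1, m + 1)
    # Enforce monotonicity, working down from the largest P value.
    scaled = np.minimum.accumulate(scaled[::-1])[::-1]
    q = np.empty(m)
    q[order] = np.clip(scaled, 0.0, 1.0)
    return q


# Illustrative P values only.
p_example = [0.01, 0.04, 0.03, 0.005]
q_example = benjamini_hochberg(p_example)
significant = q_example <= 0.05
```

Applied to the 285 clusters, a cluster would be retained when its q value does not exceed the 0.05 threshold.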
Conditional logistic regression (CLR)
CLR models were used to quantify the associations between the 16 metabolites selected as described above and HCC risk by computing odds ratios (OR) and 95 % confidence intervals (95 % CIs). The metabolites were modeled as continuous variables with the OR corresponding to one standard deviation increase in metabolic intensity. CLR models were run conditioned on the matching factors (referred to as crude), and after adjustment for potential confounding variables (referred to as multivariable), i.e. body mass index (continuous), smoking status (current smokers, non-smokers, former smokers, unknown), lifetime alcohol drinking pattern (never drinkers, former drinkers, drinkers only at recruitment, lifetime drinkers), level of alcohol consumption at recruitment (g/d; continuous), serum-clot contact time (≤1 d or >1 d; a value that corresponds to the time between blood collection and blood centrifugation), physical activity (inactive, moderately inactive, moderately active, active, missing), educational status (primary school, secondary school, professional school, longer education, unknown; as a proxy variable for socioeconomic status), and waist circumference (cm). The multivariable models for serum ethanol concentration were not adjusted for level of alcohol consumption at recruitment. For all metabolites, an additional CLR model with further adjustment for liver function score was also run.
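The conversion from a fitted regression coefficient to an OR per one standard deviation, with its Wald-type 95 % CI, can be sketched as below. This is an illustration under stated assumptions: the coefficient and standard error are invented numbers, the function name is hypothetical, and the CLR fit itself (performed with the R ‘survival’ package in the study) is not reproduced here.

```python
import math


def odds_ratio_per_sd(beta, se, sd=1.0, z=1.959964):
    """OR and Wald 95 % CI for a one-SD increase in the predictor.

    beta and se are the log-odds coefficient and its standard error per raw
    unit; sd rescales them to one standard deviation (use sd=1.0 if the
    predictor was already standardized before fitting).
    """
    beta_sd, se_sd = beta * sd, se * sd
    return (
        math.exp(beta_sd),
        math.exp(beta_sd - z * se_sd),
        math.exp(beta_sd + z * se_sd),
    )


# Invented example: coefficient ln(2) with standard error 0.2 on a standardized scale.
or_est, ci_low, ci_high = odds_ratio_per_sd(math.log(2.0), 0.2)
```

An interval that excludes 1 corresponds to a statistically significant association at the 5 % level.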
Receiver operating characteristics (ROC)
ROC curves and corresponding area under the curve (AUC) were generated for several models including the AFP concentration, the liver function score, the multivariate metabolic profile using both the score values from the O-PLS classification model (referred as O-PLS score), and the cross-validated predicted-Y values (referred as O-PLS CV status) as well as a combination between the O-PLS CV status and AFP or the liver score. Combinations of the variables were obtained by summing up the O-PLS CV status with either AFP or the liver score after normalization of each variable to one unit variance. The specificity, sensitivity, and accuracy were obtained from the optimal cut-off point that corresponded to the minimal distance to the ideal point.
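The two operations described above, selecting the cut-off closest to the ideal point (sensitivity 1, specificity 1) and combining two markers after scaling each to unit variance, can be sketched with NumPy. The function names and the small data set are assumptions for illustration and do not reproduce the study's MATLAB or R routines.

```python
import numpy as np


def auc(scores, labels):
    """AUC as the probability that a case scores higher than a control (ties count half)."""
    diff = scores[labels == 1][:, None] - scores[labels == 0][None, :]
    return ((diff > 0).sum() + 0.5 * (diff == 0).sum()) / diff.size


def optimal_cutoff(scores, labels):
    """Cut-off minimizing the distance to the ideal ROC point (FPR 0, TPR 1)."""
    thresholds = np.unique(scores)[::-1]
    cases, controls = scores[labels == 1], scores[labels == 0]
    tpr = np.array([(cases >= t).mean() for t in thresholds])
    fpr = np.array([(controls >= t).mean() for t in thresholds])
    best = np.argmin(np.hypot(fpr, 1.0 - tpr))
    return thresholds[best], tpr[best], 1.0 - fpr[best]


def combine(a, b):
    """Sum two markers after scaling each to unit variance."""
    return a / a.std(ddof=1) + b / b.std(ddof=1)


# Invented scores for four controls (0) and four cases (1).
labels = np.array([0, 0, 0, 0, 1, 1, 1, 1])
scores = np.array([0.1, 0.2, 0.3, 0.7, 0.4, 0.6, 0.8, 0.9])
second_marker = np.array([1.0, 3.0, 2.0, 4.0, 5.0, 3.5, 6.0, 8.0])

area = auc(scores, labels)
cutoff, sensitivity, specificity = optimal_cutoff(scores, labels)
combined = combine(scores, second_marker)
combined_area = auc(combined, labels)
```

Dividing by the standard deviation before summing prevents the marker with the larger numerical scale from dominating the combined score.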
Analyses stratified by hepatitis infection status (37 HCC cases Hep+, 77 HCC cases Hep–), by liver function score (34 HCC cases with no liver damage, 80 HCC cases with probable to certain liver damage), by years between blood collection and cancer diagnosis with a cut-off at 2 years (22 HCC cases diagnosed <2 years, 92 HCC cases diagnosed ≥2 years from blood collection) were also conducted. In the grouping of cases diagnosed <2 years, the small sample size prevented model stability upon multivariable adjustment. Thus, only crude CLR models were run for this subgroup.
The analyses were performed using SIMCA-P 12 (Umetrics, Umeå, Sweden), MATLAB (The MathWorks Inc., Natick, MA) routines developed in-house, and R software using the packages ‘splines’ and ‘survival’.
Baseline characteristics of the study participants are summarized in Table 1. The median follow-up time between blood collection and HCC diagnosis (lag time) was 4.8 years. Serum blood samples of HCC cases were more likely to test positive for HBV or HCV infections (32.5 % vs. 3.2 % in the controls), and to have altered liver function as indicated by high liver function score (36.8 % vs. 14.4 % for probable liver damage and 33.3 % vs. 0.9 % for likely liver damage for cases vs. controls, respectively).
The O-PLS analysis presented in Fig. 2a shows a metabolic profile discriminating between HCC cases and the matched controls (R2 = 35 %, Q2 = 21 %). Compared to the control group, the metabolic signature (Fig. 2b) associated with HCC occurrence presented (1) higher levels of the aromatic amino acids (AAA) tyrosine and phenylalanine, glutamate, acetate, citrate, glucose, propylene glycol, and ethanol; and (2) lower levels of unsaturated lipids and VLDL, N-acetyl glycoproteins, choline, glutamine, acetone, mannose, and the branched-chain amino acids (BCAA) valine, leucine, and isoleucine. The corresponding P values, q values, and fold changes of the metabolites are presented in Table 2. The ROC analyses (Fig. 2c) of the metabolic signature (O-PLS score) and of the cross-validated data (O-PLS CV status) presented an AUC of 85 % and 74 %, respectively (Table 3). Combining O-PLS CV status with AFP increased the ROC parameters relative to AFP alone (AFP vs. combination: AUC 73 % vs. 75 %, specificity 65.3 % vs. 80.6 %, sensitivity 71.9 % vs. 75.4 %, accuracy 67.5 % vs. 78.9 %). Multivariable adjusted CLR models (OR per 1-SD increase) showed that the AAA tyrosine (OR = 2.46; 95 % CI, 1.65–3.6) and phenylalanine (OR = 2.07; 95 % CI, 1.40–3.06), as well as glutamate (OR = 2.44; 95 % CI, 1.54–3.87), citrate (OR = 1.76; 95 % CI, 1.22–2.54), glucose (OR = 1.67; 95 % CI, 1.19–2.35), and propylene glycol (OR = 2.20; 95 % CI, 1.06–4.60), were associated with a statistically significantly higher HCC risk. In contrast, the BCAA leucine (OR = 0.60; 95 % CI, 0.43–0.85) and isoleucine (OR = 0.72; 95 % CI, 0.53–0.98), as well as choline (OR = 0.45; 95 % CI, 0.31–0.65), N-acetyl glycoproteins (OR = 0.46; 95 % CI, 0.32–0.67), unsaturated lipids (OR = 0.36; 95 % CI, 0.21–0.63), and VLDL (OR = 0.52; 95 % CI, 0.36–0.74), were inversely associated with HCC risk (Table 4).
The O-PLS analyses stratified by hepatitis infection status of the cases (Fig. 3a,b) presented distinct metabolic signatures from hepatitis-infected HCC cases (R2 = 45 %, Q2 = 34 %) and hepatitis-free HCC cases (R2 = 28 %, Q2 = 12 %). Hepatitis-infected HCC cases presented (1) higher levels of AAA, glucose, and citrate and (2) lower VLDL and unsaturated lipids levels, while on the other hand HCC hepatitis-free cases were characterized by (1) higher levels in ethanol and glutamate and (2) lower levels in glutamine, BCAA, and choline. In hepatitis-free HCC cases, the risk associations of glutamine (OR = 0.56; 95 % CI, 0.34–0.92) and glutamate (OR = 2.06; 95 % CI, 1.18–3.61) were significantly different from matched controls (Table 4).
Figure 3c shows O-PLS subgroup analysis of HCC cases with abnormal liver function (score ≥1). A robust model was obtained (R2 = 58 %, Q2 = 43 %) and the metabolic signature was similar to that including all samples (Fig. 2b). However, no significant model was obtained from HCC cases with a normal liver function (score = 0) only (data not shown). Table 4 shows results of multivariable CLR additionally adjusted for liver function score for which only citrate (OR = 1.88; 95 % CI, 1.14–3.11) and phenylalanine (OR = 1.75; 95 % CI, 1.04–2.94) remained significantly associated with HCC risk.
Figure 4 presents the O-PLS and ROC analyses stratified by lag time between blood collection and diagnosis. The metabolic signature of HCC cases diagnosed within 2 years after blood collection is characterized by (1) higher levels in AAA and glutamate, and (2) lower levels in unsaturated lipids and choline while in addition, the metabolic signature of HCC diagnosed later (≥2 years) presented (1) higher levels in glucose, ethanol, and propylene glycol and (2) lower levels in BCAA and N-acetyl glycoproteins. Among the cases diagnosed <2 years from recruitment, the AUC of ROC curves from the O-PLS metabolic signature and from O-PLS CV data were 93 % and 82 %, respectively (Fig. 4c).
Higher ROC parameters (Table 3) were found for O-PLS CV status compared to AFP and the liver score (O-PLS CV status vs. AFP, liver score: AUC 82 % vs. 81 %, 79 %; specificity 100 % vs. 79 %, 88.4 %; sensitivity 63.6 % vs. 77.3 %, 68.2 %; accuracy 87.7 % vs. 78.5 %, 81.5 %). However, the parameters did not improve after combining O-PLS CV status with AFP while they were slightly improved after combining O-PLS CV status with the liver score (AUC 84 %; specificity 86 %; sensitivity 77.3 %; accuracy 83 %). ROC analysis of the cases diagnosed ≥2 years from recruitment showed an AUC of 79 % for the O-PLS metabolic signature and 71 % for the O-PLS CV status. Combining the O-PLS CV status with AFP improved the ROC parameters in comparison to AFP alone or O-PLS CV alone (O-PLS CV status + AFP vs. AFP, O-PLS CV: AUC: 73 % vs. 71 %, 71 %; specificity: 70.9 % vs. 60.9 %, 68.7 %; sensitivity: 70.6 % vs. 74 %, 67.4 %; accuracy: 70.8 % vs. 65.3 %, 68.3 %). However, the best model was obtained from the liver score (AUC 80 %, specificity 83.8 %, sensitivity 70.6 %, accuracy 79.3 %).
Findings for subgroup CLR analyses of individual metabolites by lag time of diagnosis from recruitment show a significant HCC risk association for citrate in cases diagnosed <2 years from recruitment (OR = 2.19; 95 % CI, 1.10–4.35), while for cases diagnosed ≥2 years from recruitment significant HCC risk associations were observed for glucose (OR = 1.47; 95 % CI, 1.10–1.96), acetate (OR = 1.32; 95 % CI, 1.01–1.75), N-acetyl glycoproteins (OR = 0.43; 95 % CI, 0.31–0.60), BCAA (valine OR = 0.68; 95 % CI, 0.52–0.89); leucine (OR = 0.47; 95 % CI, 0.34–0.66); isoleucine (OR = 0.59; 95 % CI, 0.44–0.80), and glutamine (OR = 0.67; 95 % CI, 0.51–0.88) (Table 4).
This study is, to the best of our knowledge, the first NMR metabolomic analysis based on subjects from a prospective cohort study on Western European populations for epidemiology of liver cancer. We have identified a number of metabolites that differed between HCC cases and corresponding matched controls. As concerns the specificity of these associations, we note that an analogous study was conducted in parallel on extrahepatic/intrahepatic bile duct carcinomas without providing any significant results (data not shown). We also note that the impact of long-term storage of EPIC samples as well as other potential sources of systematic variations of the metabolic profiles has been thoroughly detailed earlier.
O-PLS analysis showed a clear discrimination between cases and controls with somewhat different metabolomic profiles with respect to the length of time from blood collection to diagnosis, hepatitis infection status, and liver function. Importantly, this study showed that consideration of metabolomic profiles can improve HCC diagnosis beyond that provided by AFP and liver enzyme levels, which are currently the most common HCC biomarkers often applied in clinical practice.
The liver is central for the metabolism of carbohydrates, fats and proteins, and also plays key roles in detoxification and hormone production. Thus, a degree of metabolic dysregulation would be expected with liver diseases, particularly HCC. For this reason, the application of metabolomic technologies may be able to provide some insight into the etiology and mechanisms of HCC and, possibly, the identification of early diagnostic biomarkers or biomarker patterns characteristic of cancer at this anatomical site.
To date, three NMR or NMR/mass spectrometry, serum-based metabolomic studies have been conducted looking specifically at HCC [20, 23, 24]. All three case–control studies were based on sera collected from HCC cases post-diagnosis. The comparison group in one of the studies was hepatitis-infected subjects, while that of the others were cirrhotic patients [23, 24]. The studies identified potential (1) impairment of the tricarboxylic acid cycle, increased lipid catabolism, and elevation of essential amino acids, and (2) defects in ammonium detoxification and increased fatty acid beta-oxidation in HCC. The fundamental design differences with the present study are that the latter is based on prospectively identified HCC cases, such that metabolomic profiles are likely indicative of pre-diagnostic changes, and that the matched control subjects were cancer-free cohort participants. The key metabolic alterations observed are related to changes in amino acid, polyunsaturated lipid, acetate, and citrate metabolism, among the 16 individual metabolites highlighted here. Because our study is nested within the prospective EPIC cohort, which has detailed information on dietary and lifestyle factors and measured anthropometry, we were able to make statistical adjustments for many important confounding variables such as smoking status, alcohol consumption and habits, physical activity, educational attainment (as a proxy marker for socioeconomic status), body mass index, and waist circumference.
Of particular note is our observation of a 0.82-fold reduction in choline in HCC cases (Table 2), together with a significant inverse HCC risk association for this compound (Table 4; OR = 0.45; 95 % CI, 0.31–0.65). In animal studies, choline deficiency has been shown to cause liver damage, oxidative stress, and spontaneous liver cancer [47–49]. In human studies, HCC has been associated with a downregulation in choline metabolism.
Also interesting is our identification of circulating ethanol as a strong HCC risk factor, alcohol being a major lifestyle risk factor for this disease. We also observed a shift, in terms of fold difference between HCC cases and their matched controls, from glutamine to glutamate, indicating a possible defect in ammonium detoxification, as also observed in the study by Nahon et al. It is of interest that Nahon et al. observed this shift comparing HCC cases to cirrhotic controls, while our findings indicate that this important change may actually be present for some time prior to diagnosis. In the study by Gao et al., higher levels of AAA were associated with liver cirrhosis and HCC, together with lower levels of BCAA, choline, and unsaturated lipids. The same changes were observed in the present study, suggesting an important alteration of amino acid and lipid metabolism in the progression to HCC.
An interesting observation in the present study was a strong, significant positive HCC risk association for the exogenous metabolite propylene glycol. Identification of propylene glycol in human serum is not uncommon, and it is thought to derive largely from pharmaceutical use since it is widely used as a solvent in many intravenous, oral, and topical pharmaceutical preparations (as well as in other general products including cosmetics, food, and toothpastes). The liver of an adult with normal liver and kidney functions will metabolize propylene glycol into lactate, acetate, and pyruvate within several hours. Therefore, high levels of propylene glycol could be reflective of medication use, possibly in participants with liver damage or due to its simple accumulation resulting from impaired liver function. Despite the prospective nature of our study, it may be speculated that HCC cases may have encountered some symptoms, which may have prompted medical surveillance and/or alteration of dietary/lifestyle habits (e.g. reduced alcohol intake or smoking cessation). Yet, such changes would likely bias risk estimates towards the null or be unrelated to the disease outcome.
In addition to its prospective design and the availability of detailed pre-diagnostic lifestyle/dietary data and anthropometric measures, strengths of our study include the ability to consider liver function parameters based on a score developed from clinically relevant liver enzyme concentrations. The assumption is that decreased liver function is associated with a greater degree of liver damage. From our findings, it is apparent that the metabolic pattern associated with HCC may be reflective of liver dysfunction, as suggested by the stratified analysis on the liver function score. These results support the view that HCC largely arises from a background of increasingly severe liver damage. Indeed, the process ending with HCC is considered to be gradual, involving infection by hepatitis viruses or the development of fatty liver diseases or cirrhosis. Each part of the process may be characterized by alterations in metabolic factors, which may be detectable by metabolomic approaches [54–58]. Due to this gradual process, we note that longer follow-up time would be required in order to thoroughly assess, prior to any liver damage, the specificity of the identified HCC risk associations. Our study was composed of a large number of HCC cases that were not infected with either hepatitis B or C. Thus, we attempted to determine whether metabolomic differences could be observed in the absence of these predominant HCC risk factors. Although exclusion of hepatitis-positive cases attenuated some of our findings and resulted in loss of significance for specific metabolites, strong associations were observed for glutamate and glutamine. This is indicative of a potential defect in ammonium detoxification in non-hepatitis HCC. This observation deserves further in-depth investigation.
In our study, we were also interested in comparing metabolic changes preceding cancer diagnosis by several years. Thus, we conducted stratified analysis by lag time between blood collection and diagnosis, which showed specific metabolic changes according to follow-up time. However, a key limitation of the present study is the lack of any clinical data, assessment of any medication usage, or subgroup analyses based on pathways of HCC development. The metabolite changes associated with the later cases are more likely to be informative on the etiology and/or risk exposure (e.g. dietary components, environmental, lifestyle, and pollutants), while metabolic changes in cases diagnosed <2 years after recruitment likely reflect a direct influence of the tumor.
For the first time, a metabolic pattern based on serum samples was identified to be associated with HCC risk within a large prospective study. Several metabolites associated with either an increased or decreased HCC risk have been highlighted. The majority of associations remained significant after controlling for potential confounders and consideration of correction for multiple testing. The results suggest that metabolic patterns can provide meaningful etiologic insight into HCC development and can potentially be used to detect this cancer in its early stages, even several years prior to clinical diagnosis.
AAA: Aromatic amino acids
AUC: Area under the curve
CLR: Conditional logistic regression
CMIA: Chemiluminescent microparticle immunoassays
EPIC: European Prospective Investigation into Cancer and Nutrition
HBV: Chronic hepatitis B
HCV: Chronic hepatitis C
NMR: Nuclear magnetic resonance
NOESY: Nuclear Overhauser effect spectroscopy
O-PLS: Orthogonal partial least-square
ROC: Receiver operating characteristics
Ferlay J, Soerjomataram I, Ervik M, Dikshit R, Eser S, Mathers C, et al. GLOBOCAN 2012 v1.0, Cancer Incidence and Mortality Worldwide 2013. http://globocan.iarc.fr, accessed on 30/01/2015.
El-Serag HB. Hepatocellular carcinoma. N Engl J Med. 2011;365:1118–27.
Nicholson JK, Lindon JC, Holmes E. ‘Metabonomics’: understanding the metabolic responses of living systems to pathophysiological stimuli via multivariate statistical analysis of biological NMR spectroscopic data. Xenobiotica. 1999;29:1181–9.
Floegel A, Stefan N, Yu Z, Muhlenbruch K, Drogan D, Joost HG, et al. Identification of serum metabolites associated with risk of type 2 diabetes using a targeted metabolomic approach. Diabetes. 2013;62:639–48.
Mayers JR, Wu C, Clish CB, Kraft P, Torrence ME, Fiske BP, et al. Elevation of circulating branched-chain amino acids is an early event in human pancreatic adenocarcinoma development. Nat Med. 2014;20:1193–8.
Shah SH, Sun JL, Stevens RD, Bain JR, Muehlbauer MJ, Pieper KS, et al. Baseline metabolomic profiles predict cardiovascular events in patients at risk for coronary artery disease. Am Heart J. 2012;163:844–50.
Beyoglu D, Imbeaud S, Maurhofer O, Bioulac-Sage P, Zucman-Rossi J, Dufour JF, et al. Tissue metabolomics of hepatocellular carcinoma: tumor energy metabolism and the role of transcriptomic classification. Hepatology. 2013;58:229–38.
Chen J, Wang WZ, Lv S, Yin PY, Zhao XJ, Lu X, et al. Metabonomics study of liver cancer based on ultra performance liquid chromatography coupled to mass spectrometry with HILIC and RPLC separations. Anal Chim Acta. 2009;650:3–9.
da Costa AN, Pontoizeau C, Plymoth A, Santos-Silva D, Mendy M, Sangrajrang S, et al. A multi-marker approach for early detection of HBV-related hepatocellular carcinoma in areas of high incidence. Eur J Cancer. 2012;48:S169–70.
Gao HC, Lu Q, Liu X, Cong H, Zhao LC, Wang HM, et al. Application of H-1 NMR-based metabonomics in the study of metabolic profiling of human hepatocellular carcinoma and liver cirrhosis. Cancer Sci. 2009;100:782–5.
Liu Y, Hong Z, Tan G, Dong X, Yang G, Zhao L, et al. NMR and LC/MS-based global metabolomics to identify serum biomarkers differentiating hepatocellular carcinoma from liver cirrhosis. Int J Cancer. 2014;135:658–68.
Nahon P, Amathieu R, Triba MN, Bouchemal N, Nault JC, Ziol M, et al. Identification of serum proton NMR metabolomic fingerprints associated with hepatocellular carcinoma in patients with alcoholic cirrhosis. Clin Cancer Res. 2012;18:6714–22.
Patterson AD, Maurhofer O, Beyoglu D, Lanz C, Krausz KW, Pabst T, et al. Aberrant lipid metabolism in hepatocellular carcinoma revealed by plasma metabolomics and lipid profiling. Cancer Res. 2011;71:6590–600.
Ressom HW, Xiao JF, Tuli L, Varghese RS, Zhou B, Tsai TH, et al. Utilization of metabolomics to identify serum biomarkers for hepatocellular carcinoma in patients with liver cirrhosis. Anal Chim Acta. 2012;743:90–100.
Soga T, Sugimoto M, Honma M, Mori M, Igarashi K, Kashikura K, et al. Serum metabolomics reveals gamma-glutamyl dipeptides as biomarkers for discrimination among different forms of liver disease. J Hepatology. 2011;55:896–905.
Tan YX, Yin PY, Tang L, Xing WB, Huang Q, Cao D, et al. Metabolomics study of stepwise hepatocarcinogenesis from the model rats to patients: potential biomarkers effective for small hepatocellular carcinoma diagnosis. Mol Cell Proteomics. 2012;11:M111.010694.
Wu H, Xue RY, Dong L, Liu TT, Deng CH, Zeng HZ, et al. Metabolomic profiling of human urine in hepatocellular carcinoma patients using gas chromatography/mass spectrometry. Anal Chim Acta. 2009;648:98–104.
Fedirko V, Duarte-Salles T, Bamia C, Trichopoulou A, Aleksandrova K, Trichopoulos D, et al. Prediagnostic circulating vitamin D levels and risk of hepatocellular carcinoma in European populations: a nested case-control study. Hepatology. 2014;60:1222–30.
Lai GY, Weinstein SJ, Albanes D, Taylor PR, Virtamo J, McGlynn KA, et al. Association of serum alpha-tocopherol, beta-carotene, and retinol with liver cancer incidence and chronic liver disease mortality. Br J Cancer. 2014;111:2163–71.
Lukanova A, Becker S, Husing A, Schock H, Fedirko V, Trepo E, et al. Prediagnostic plasma testosterone, sex hormone-binding globulin, IGF-I and hepatocellular carcinoma: etiological factors or risk markers? Int J Cancer. 2013;134:164–73.
Riboli E, Hunt KJ, Slimani N, Ferrari P, Norat T, Fahey M, et al. European prospective investigation into cancer and nutrition (EPIC): study populations and data collection. Public Health Nutr. 2002;5:1113–24.
Trichopoulos D, Bamia C, Lagiou P, Fedirko V, Trepo E, Jenab M, et al. Hepatocellular carcinoma risk factors and disease burden in a European cohort: a nested case-control study. J Natl Cancer Inst. 2011;103:1686–95.
Fedirko V, Trichopolou A, Bamia C, Duarte-Salles T, Trepo E, Aleksandrova K, et al. Consumption of fish and meats and risk of hepatocellular carcinoma: the European Prospective Investigation into Cancer and Nutrition (EPIC). Ann Oncol. 2013;24:2166–73.
Beckonert O, Keun HC, Ebbels TMD, Bundy JG, Holmes E, Lindon JC, et al. Metabolic profiling, metabolomic and metabonomic procedures for NMR spectroscopy of urine, plasma, serum and tissue extracts. Nat Protoc. 2007;2:2692–703.
Fages A, Ferrari P, Monni S, Dossus L, Floegel A, Mode N, et al. Investigating sources of variability in metabolomic data in the EPIC study: the principal component partial R-square (PC-PR2) method. Metabolomics. 2014;10:1074–83.
Barr J, Vazquez-Chantada M, Alonso C, Perez-Cormenzana M, Mayo R, Galan A, et al. Liquid chromatography-mass spectrometry-based parallel metabolic profiling of human and mouse model serum reveals putative biomarkers associated with the progression of nonalcoholic fatty liver disease. J Proteome Res. 2010;9:4501–12.
Lin X, Zhang Y, Ye G, Li X, Yin P, Ruan Q, et al. Classification and differential metabolite discovery of liver diseases based on plasma metabolic profiling and support vector machines. J Sep Sci. 2011;34:3029–36.
Guarantor of the article: Bénédicte Elena-Herrmann, PhD and Mazda Jenab, PhD.
This work was supported by the French National Cancer Institute (L’Institut National du Cancer; INCA; grant number 2009-139; PI: M. Jenab). AF received financial support (BDI fellowship) from the Centre National de la Recherche Scientifique (CNRS) and Bruker Biospin. The coordination of EPIC is financially supported by the European Commission (DG-SANCO) and the International Agency for Research on Cancer. The national cohorts are supported by Danish Cancer Society (Denmark); Ligue Contre le Cancer, Institut Gustave Roussy, Mutuelle Générale de l’Education Nationale, and Institut National de la Santé et de la Recherche Médicale (INSERM) (France); Deutsche Krebshilfe, Deutsches Krebsforschungszentrum (DKFZ), and Federal Ministry of Education and Research (Germany); Hellenic Health Foundation (Greece); Italian Association for Research on Cancer (AIRC), National Research Council, Associazione Italiana per la Ricerca sul Cancro-AIRC-Italy, and AIRE-ONLUS Ragusa, AVIS Ragusa, Sicilian Government (Italy); Dutch Ministry of Public Health, Welfare and Sports (VWS), Netherlands Cancer Registry (NKR), LK Research Funds, Dutch Prevention Funds, Dutch ZON (Zorg Onderzoek Nederland), World Cancer Research Fund (WCRF), and Statistics Netherlands (the Netherlands); European Research Council (ERC; grant number ERC-2009-AdG 232997) and Nordforsk, and Nordic Center of Excellence Programme on Food, Nutrition and Health (Norway); Health Research Fund (FIS), Regional Governments of Andalucía, Asturias, Basque Country, Murcia (No. 6236) and Navarra, and ISCIII RETIC (RD06/0020) (Spain); Swedish Cancer Society, Swedish Scientific Council, and Regional Government of Skåne and Västerbotten (Sweden); Cancer Research UK, Medical Research Council, Stroke Association, British Heart Foundation, Department of Health, Food Standards Agency, and Wellcome Trust (UK). The funders had no role in the study design, data collection, analysis, and interpretation presented in this article. 
They were neither involved in the writing of the manuscript, nor in the decision to submit it for publication.
Authors and Affiliations
Institut des Sciences Analytiques, Centre de RMN à très hauts champs, CNRS/ENS Lyon/UCB Lyon-1, Université de Lyon, 5 rue de la Doua, 69100, Villeurbanne, France
Anne Fages, Clément Pontoizeau & Benedicte Elena-Herrmann
International Agency for Research on Cancer (IARC-WHO), Lyon, France
The authors declare that they have no competing interests.
The authors’ responsibilities were as follows: ER is the overall PI of the EPIC study, which is jointly coordinated from ICL (ER) and IARC (IR); MJ and BE-H conceptualized, designed, obtained funding for, and implemented/managed the present research; AF, CP, and BE-H performed the laboratory analyses; AF and PF conducted the statistical analyses; AF, MJ, VF, TDS, MS, and BE-H contributed to the writing of the manuscript and data interpretation. Contributing authors from each collaborating centre provided the original data and biological samples, information on the respective populations, advice on study design/analysis, and interpretation of the results. All authors commented on the draft manuscript and approved the final version for publication.
Dimitrios Trichopoulos is deceased.
Benedicte Elena-Herrmann and Mazda Jenab contributed equally to this work.
Supplementary methods for NMR metabolomics data acquisition, additional table (Table S1. Metabolites identified in serum samples) and additional figures (Figure S1. Validation (1000 resampling) of the O-PLS model based on 1H CPMG spectra and O-PLS metabolic signature obtained from the analysis of 1H NOESY NMR spectra. Figure S2. Validation (1000 resampling) of the O-PLS models stratified by hepatitis infection status of the cases, and liver function score. Figure S3. Validation (1000 resampling) of the O-PLS models stratified by lag time between blood collection and diagnosis). (PDF 684 kb)
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.