- Research article
- Open Access
- Open Peer Review
Determinants of the urinary and serum metabolome in children from six European populations
BMC Medicinevolume 16, Article number: 202 (2018)
The Commentary to this article has been published in BMC Medicine 2018 16:222
Environment and diet in early life can affect development and health throughout the life course. Metabolic phenotyping of urine and serum represents a complementary systems-wide approach to elucidate environment–health interactions. However, large-scale metabolome studies in children combining analyses of these biological fluids are lacking. Here, we sought to characterise the major determinants of the child metabolome and to define metabolite associations with age, sex, BMI and dietary habits in European children, by exploiting a unique biobank established as part of the Human Early-Life Exposome project (http://www.projecthelix.eu).
Metabolic phenotypes of matched urine and serum samples from 1192 children (aged 6–11) recruited from birth cohorts in six European countries were measured using high-throughput 1H nuclear magnetic resonance (NMR) spectroscopy and a targeted LC-MS/MS metabolomic assay (Biocrates AbsoluteIDQ p180 kit).
We identified both urinary and serum creatinine to be positively associated with age. Metabolic associations to BMI z-score included a novel association with urinary 4-deoxyerythronic acid in addition to valine, serum carnitine, short-chain acylcarnitines (C3, C5), glutamate, BCAAs, lysophosphatidylcholines (lysoPC a C14:0, lysoPC a C16:1, lysoPC a C18:1, lysoPC a C18:2) and sphingolipids (SM C16:0, SM C16:1, SM C18:1). Dietary-metabolite associations included urinary creatine and serum phosphatidylcholines (4) with meat intake, serum phosphatidylcholines (12) with fish, urinary hippurate with vegetables, and urinary proline betaine and hippurate with fruit intake. Population-specific variance (age, sex, BMI, ethnicity, dietary and country of origin) was better captured in the serum than in the urine profile; these factors explained a median of 9.0% variance amongst serum metabolites versus a median of 5.1% amongst urinary metabolites. Metabolic pathway correlations were identified, and concentrations of corresponding metabolites were significantly correlated (r > 0.18) between urine and serum.
We have established a pan-European reference metabolome for urine and serum of healthy children and gathered critical resources not previously available for future investigations into the influence of the metabolome on child health. The six European cohort populations studied share common metabolic associations with age, sex, BMI z-score and main dietary habits. Furthermore, we have identified a novel metabolic association between threonine catabolism and BMI of children.
Under-nutrition during gestation was first proposed in the early 1990s to explain the association observed between low birth weight in infancy and higher mortality rates from cardiovascular disease in male adults [1, 2]. Since then, it has been hypothesised that the origins of many diseases that manifest later in life may be traced back to fetal development—known as the DOHaD (Developmental Origins of Health and Disease) paradigm . In addition, early-life environmental exposures may have wide-ranging consequences for health. Critical windows in development, such as the prenatal period and infancy, have been shown to be particularly susceptible to environmental risk factors that influence disease burden into adulthood [4,5,6]. For example, prenatal exposure to passive smoke and outdoor air pollutants are acknowledged risk factors for asthma and other allergies including eczema [7, 8], and exposure to endocrine-disrupting and household chemicals have been found to increase obesity risk in children [9, 10]. Moreover, childhood exposure to passive smoke has also been associated with lung cancer risk in adults , whilst prenatal infection and exposure to lead have been linked respectively to schizophrenia  and attention deficit hyperactivity disorder in children . Growing evidence suggests environmental exposure in early life can also alter molecular phenotypes—such as the epigenome—which then persist throughout life [14, 15]. Consequently, the importance of measuring multiple environmental exposures simultaneously (the exposome) and the impact of this on health at different stages of life are increasingly being recognised [16,17,18,19,20]. Population cohort-based exposome research studies could help address the multi-dimensional interplay between various environmental factors and developmental health outcomes . For example, a recent exposome study conducted in Greece has identified that proximity to landfill waste may impact neurodevelopment in children .
Metabolic profiling has been utilised to characterise markers of environmental exposures [23,24,25,26,27] and confer valuable information regarding early life health outcomes; from preterm birth  and fetal growth  to childhood disease [30,31,32]. Age, sex, body morphology, and dietary intakes all play important roles in determining the urine and serum metabolome, and whilst their contributions to metabolic phenotypes are relatively well characterised in the adult population [33,34,35,36,37,38,39,40,41,42], to date there are only a few studies, of relatively small sample size, in children [43,44,45,46]. In addition, epidemiological studies that permit evaluation of the complementarity of urine and serum metabolomics data are also lacking .
To address this knowledge gap, metabolomic analyses of serum and urine were performed as part of the Human Early-Life Exposome (HELIX) project, which seeks to define the environmental exposome from pregnancy to childhood, to associate these with child health outcomes and to define molecular ‘omics’ markers . The project gathered samples and data from six longitudinal birth cohort studies across six European countries—France, Greece, Lithuania, Norway, Spain and the UK. Analyses were conducted on biofluid samples from the HELIX subcohort of children between 6 and 11 years of age to perform molecular phenotyping including metabolomics, proteomics, transcriptomics and genomics and also to measure chemical exposure levels in order to identify molecular markers of exposure . Specifically in this current study, we aim to (a) characterise the major determinants of the child metabolome, (b) define metabolite associations to demographic factors, BMI and main dietary intake habits in European children, and (c) evaluate correlation patterns and complementarity between serum and urine metabolic profiles.
HELIX project multilevel study design
The HELIX study is a collaborative project across six established and longitudinal birth cohorts in Europe. A multilevel study design was employed. Level 1—the entire study population of HELIX consists of 31,472 mother-child pairs which were recruited between 1999 and 2010 during their pregnancies by the six cohorts. Level 2—the HELIX subcohort consists of 1301 mother-child pairs from which exposure data, ‘omics’ molecular profiles, and child health outcomes were measured at 6–11 years of age. Level 3—panel studies with repeated sampling periods from a cohort of 150 children and 150 pregnant women to understand temporal variability of the personal exposure data .
Current study sample population—the HELIX children subcohort
The children in the HELIX subcohort were followed up between December 2013 and February 2016; there were approximately 200 mother–child pairs from each of the six cohorts. Follow-up examinations for the subcohort took place either at local hospitals, primary care centres or the National Institute for Public Health (NIPH) in Oslo, during which mothers were interviewed and children checked and examined by trained nurses according to standardised operating procedures. Biological samples were also collected on the day of the examinations. Metabolic phenotypes of 1201 children’s urine and sera samples from the HELIX subcohort were generated, of which complete matching metadata listed in Table 1 were available for 1192 children as follows: Born in Bradford, UK (BiB, n = 199) ; Study of determinants of pre- and postnatal developmental, France (EDEN, n = 157) ; Infancia y Medio Ambiente, Environment and Childhood, Spain (INMA, n = 207) ; Kaunas Cohort, Lithuania (KANC, n = 201) ; The Norwegian Mother and Child Cohort Study, Norway (MoBa, n = 229) ; Mother-Child Cohort in Crete, Greece (Rhea, n = 199) . Hence, the number of samples carried forward for data analysis was 1192.
Body mass index and food dietary frequency data
During the subcohort follow-up examinations, height and weight were respectively measured with a stadiometer and a digital weight scale both without shoes and with light clothing. Height and weight measurements were converted to body mass index (BMI in kg/m2) for age and sex z-scores using the international World Health Organization (WHO) reference curves in order to allow for comparison with other studies .
Data on the food intake frequency of 44 food items from 11 main food groups were collected through a short food frequency questionnaire and the average number of times per week that each food item was consumed was recorded. The 11 main groups were sweets, which include chocolate (bars, bonbon, spreads, cacao), sugar, honey, jam or other sweets; meat, which includes processed meat, poultry and red meat; fish, which includes canned fish, oily fish, white fish and seafood; beverages, which include both high- and low-sugar soda, other soft and fizzy drinks; potatoes, which include also French fries; vegetables, which include both raw and cooked vegetables; dairy products, which include yogurt, cheese, milk and dairy desserts; cereal, which include bread, breakfast cereal, rice and pasta, rusks, crispy bread, rice and corn cakes; fruits, which include fruits, fresh juice, canned and dry fruits; bakery products which include biscuits, cookies and pastries; and total added lipids which include butter, margarine and vegetable oil.
Biofluid sample collection
Urine and sera samples were collected and processed according to identical pre-defined standardised protocols across all six cohorts. Urine samples were collected by family members at home, kept in a fridge overnight and transported in a temperature controlled environment. Samples were aliquoted and frozen within 3 h of arrival at the clinics. Two urine samples, representing last night-time and first morning voids, were collected on the evening and morning before the clinical examination and were subsequently pooled to generate a more representative sample of the last 24 h for metabolomic analysis (n = 1107) . Either the night-time void (n = 37) or morning void (n = 48) sample was analysed in cases where a pooled sample was missing.
Serum sampling: Blood was collected during the follow-up visit at the end of the clinical examination. Blood samples were drawn using a ‘butterfly’ vacuum clip and local anaesthetic and were collected into 4 mL silica plastic tubes. Samples were inverted gently for 6–7 times and spun down at 2500 g for 15 min at 4 °C. The median serum sample processing time from sample collection to freezing was 1.8 h (IQR: 1.5–2.0), and the median postprandial interval (time between last meal and blood collection) was 3.3 h (IQR: 2.8–4.0, Additional file 1: Figure S1).
Urine metabolite NMR measurements
1H NMR spectroscopy was chosen for urinary analysis for several reasons: it has inherently high reproducibility ; urinary metabolite concentrations are high, making the relatively low sensitivity of NMR spectroscopy less of a hindrance; the data processing workflow is well established . One-dimensional 600 MHz 1H NMR spectra of all 1192 urine samples were acquired on the same Bruker Avance III spectrometer operating at 14.1 Tesla within a period of 1 month. The spectrometer was equipped with a Bruker SampleJet system, and a 5-mm broad-band inverse configuration probe maintained at 300K. Prior to analysis, cohort samples were randomised to mitigate analytical bias, and individual samples were thawed and homogenised using a vortex mixer and centrifuged at 13,000 g for 10 min at 4 °C to remove insoluble material. Five hundred forty microliters of urine sample was mixed with 60 μL of a buffer solution (1.5 M KH2PO4, 2 mM NaN3, 1% deuterated 3-(trimethylsilyl)-[2,2,3,3-d4]-propionic acid sodium salt (TSP) solution, pH 7.4) and was transferred into an NMR tube (5 mm Bruker SampleJet NMR tubes). Ninety-six-sample tube well plates were kept at 6 °C in the cooled Bruker SampleJet unit. Aliquots of the study quality control (QC) sample, made from pooled urine samples from 20 individuals included in this study, were used to monitor analytical performance throughout the run and were analysed at an interval of every 23 samples (i.e. 4 QC samples per well plate). The 1H NMR spectra were acquired using a standard one-dimensional solvent suppression pulse sequence (relaxation delay - 90° pulse - 4 μs delay - 90° pulse - mixing time - 90° pulse - acquire FID). For each sample, 32 transients were collected into 64K data points using a spectral width of 12,000 Hz with a recycle delay of 4 s, a mixing time of 100 ms, and an acquisition time of 2.73 s. A line-broadening function of 0.3 Hz was applied prior to Fourier transformation. All 1H NMR spectra were automatically phased and baseline-corrected using Topspin 3.2 software (BrukerBioSpin, Rheinstetten, Germany). The 1H NMR urine spectra were referenced to the TSP resonance at 0 ppm. NMR spectra were imported into the MATLAB 2014a (MathWorks, Massachusetts, USA) computing environment and aligned using the recursive segment-wise peak alignment method , an algorithm based on cross-correlation. The study QC sample spectrum was used as a reference for spectral alignment. A single representative resonance in the spectrum was selected for each assigned metabolite, based on its presence in a high proportion of the spectra, high signal-to-noise ratio, and limited overlap with other resonances. Metabolite resonance peak areas were estimated using trapezoidal numerical integration and were corrected for local spectral baseline, and 44 metabolites were obtained using this method. Quantification was achieved for 24 metabolites; 20 metabolites were semi-quantified using a method of signal integration and quantification as previously described in Maitre et.al . Probabilistic quotient normalisation  was used to adjust for variable urine sample dilution.
Assignment of endogenous urinary metabolites was made by reference to online databases (HMDB) , statistical total correlation spectroscopy (STOCSY)  and using ChenomxNMRsuite 7.1 profiler (ChenomxInc, Edmonton, Canada) and/or confirmed by 2D NMR experiments on a selected sample including homonuclear 1H-1H correlation spectroscopy (COSY), and 1H-1H total correlation spectroscopy (TOCSY) and 1H-13C heteronuclear single quantum coherence spectroscopy (HSQC). Spike-in experiments using authentic chemical standards were also used to confirm novel metabolite annotations. A summary of signal annotation and assignment is shown in Additional file 1: Table S1.
Serum metabolite measurements
The AbsoluteIDQ p180 kit  was chosen for serum analysis as it is a widely used standardised, targeted LC-MS/MS assay, and its inter-laboratory reproducibility has been demonstrated by several independent laboratories . It is increasingly employed for large-scale epidemiology studies [66,67,68], facilitating comparisons to thousands of metabolome profiles across other studies. Serum samples were quantified using the AbsoluteIDQ p180 kit following the manufacturer’s protocol  using LC-MS/MS—and Agilent HPLC 1100 liquid chromatography coupled to a SCIEX QTRAP 6500 triple quadrupole mass spectrometer. Briefly, the kit allows for the targeted analysis of 188 metabolites in the classes of amino acids, biogenic amines, acylcarnitines, glycerophospholipids, sphingolipids and sum of hexoses, covering a wide range of analytes and metabolic pathways in one targeted assay. The kit consists of a single sample processing procedure, with two separate analytical runs, a combination of liquid chromatography (LC) and flow injection analysis (FIA) coupled to tandem mass spectrometry (MS/MS). Isotopically labelled and chemically homologous internal standards were used for quantification; in total, 56 analytes were fully quantified and validated. Of the total 188 metabolites measured, 42 metabolites were measured by LC-MS/MS and 146 metabolites by FIA-MS/MS. The amino acids and biogenic amines were analysed quantitatively by LC–ESI-MS/MS, with the use of an external seven-point calibration curve based on isotope-labelled internal standards. The quantification method for all amino acids and amines was fully validated. The acylcarnitines (40), glycerophospholipids (90), sphingolipids (15), and sum of hexoses (1) were analysed by FIA-ESI-MS/MS, using a one-point internal standard calibration with representative internal standards. Metabolites were quantified (results shown in micromolar concentration units) according to the manufacturer’s protocol using the MetIDQ™ Version 5.4.8 Boron software for targeted metabolomic data processing and management. Blank PBS (phosphate-buffered saline) samples (three technical replicates) were used for the calculation of the limits of detection (LOD). The median values of all PBS samples on the plate were calculated as approximation of the background noise per metabolite signal, and 3 times this value was calculated as the LOD.
LC-MS/MS data of serum samples were acquired in 18 batches. Every analytical batch, in a 96-well plate format, is included up to 76 randomised cohort samples. Also in every analytical batch, three sets of quality control samples were included, the NIST SRM 1950 plasma reference material (in 4 replicates), a commercial available serum QC material (CQC in 2 replicates, SeraLab, S-123-M-27485) and the QCs provided by the manufacturer in three concentration levels. The NIST SRM 1950 reference was used as the main quality control sample for the LC-MS/MS analysis.
Analytical performance of urinary and serum metabolites
Analytical performance in the urinary NMR and serum LC-MS/MS data was assessed by reference to the QC samples measured at regular intervals during the run, with 4 QC samples analysed in every 96-well plate batch. Coefficients of variation (CVs) for each metabolite were calculated based on the pooled QC for the NMR analysis and the NIST SRM 1950 for the LC-MS/MS. Moreover, for the LC-MS/MS serum analysis, the limits of detection (LODs) were also used to assess the analytical performance of individual metabolites. For the LC-MS/MS serum dataset, metabolite exclusion was based on a variable meeting two conditions: (1) CV of over 30% and (2) over 30% of the data are below LOD. Eleven out of the 188 serum metabolites detected were excluded as a result, leaving 177 serum metabolites to be used for further statistical analysis. Mean coefficients of variations across the 44 NMR detected urinary metabolites, and the 177 LC-MS/MS detected serum metabolites carried forward for data analysis were found to be 11 and 15%, respectively (Additional file 1: Tables S2 and S3).
Metabolite concentrations were log10 transformed to normalise data prior to statistical analyses, and the resultant distribution of the transformed data can be found in Additional files 2 and 3. To avoid log transform of zero values, the lowest non-zero value was added to the variable distribution as a constant before log transformation. All statistical analyses were performed using R (‘The R Project for Statistical Computing’) software environment (v3.3.1) unless specified otherwise. Metabolome-wide association study (MWAS) analyses were performed using multiple linear regression models in the R package ‘base’. Linear regression models were fitted for each metabolite with concentration as the outcome variable. Covariates included in the regression models were batch, run order, sex, age, zBMI and dietary intake habits of the 11 food groups; in addition, urine data models were adjusted for sampling type (night only, morning only or pooled sample) and the serum data models were adjusted for postprandial interval. Regression models were computed separately for each individual cohort and meta-analysis was used to combine the effect size estimates using a fixed-effect inverse variance weighting from the six cohorts with the R package ‘meta’, and I2 statistics were used to assess the heterogeneity in the effect estimates between the cohorts. Bonferroni correction (n = 177 for serum data, n = 44 for urine data) was applied throughout to account for multiple test comparisons (p value threshold = 1.1 × 10−3 for urine and 2.8 × 10−4 for serum metabolites). For variance decomposition, analysis was performed using a partial R2 approach, the variance in the urinary and serum data was partitioned according to the following 5 main categories: pre-analytical, analytical, demographic, dietary and cohort/country. The analysis was performed on each of the 44 urinary metabolites and 177 serum metabolites. In addition to the covariates used in the MWAS analyses—batch (analytical), run order (analytical), time of sampling (urine pre-analytical), postprandial interval (serum pre-analytical), sex (demographic), age (demographic), BMI z-score (demographic) and dietary intake frequencies; ethnicity (demographic), and serum and urine sample processing time variables (pre-analytical) were also included in the respective serum and urine variance decomposition analyses. For principal components analysis, metabolite data were also mean-centred and univariate scaled prior to PCA modelling. For serum and urine metabolic pairwise correlation analyses, data were pre-adjusted for analytical and pre-analytical variables and Pearson’s correlation coefficients were calculated. Serum correlation networks were drawn using Cytoscape (version 3.5) software  and the MetScape plugin application (version 3) . Additionally, we have examined the impact of applying alternative data transformation and imputation strategies on the MWAS analysis results. To this end, Box-Cox transformation  was used in conjunction with QRILC imputation (quantile regression approach for left-censored missing)  and the modelled results are shown in Additional file 1: Tables S12–S15. Box-Cox transformation and QRILC imputation were performed respectively using R packages ‘MASS’ and ‘imputeLCMD’.
Characteristics of the study population included in this analysis (n = 1192) are shown in Table 1. Around 200 children from each of the six cohorts participated in this study (54.6% male, 45.4% female), and the vast majority of the sample population were of White-European background with the notable exception of BiB (UK) where many were of Black and Asian Minority Ethnic group, mainly of South Asian origin. There were also significant age differences between the cohorts, with the children from the EDEN cohort being the oldest (median age in EDEN was 10.8 years whilst median ages in KANC, BiB, and Rhea were 6.4–6.6 years). In addition, there were substantial differences between the cohorts in BMI z-score and across dietary intake habits in the 11 food groups (Tables 1 and 2).
In our study, 1H NMR spectroscopy and targeted LC-MS/MS were respectively used to perform metabolic profiling of the urine and serum samples. Estimates of the concentrations for urinary metabolites using NMR spectroscopy (μmol/mmol of creatinine) are provided in Additional file 1: Table S4 and for serum metabolite measurements using the LC-MS/MS AbsoluteIDQ p180 assay (μmol/L) in Additional file 1: Table S5.
Metabolic differences between cohorts
Differences in metabolite concentrations between cohorts were assessed by ANOVA after pre-adjusting for covariates through linear regression models. Metabolites with p values below the significance threshold after Bonferroni correction (p value threshold = 1.1 × 10−3 for urine and 2.8 × 10−4 for serum metabolites) are shown in Fig. 1. A large number of metabolites, 104 out of 177 serum metabolites and 10 of the 44 urine metabolites measured, were found to be significantly different between cohorts. In particular, serum amino acid levels were frequently found highest in the Rhea cohort, whilst a disproportionally high number of serum glycerophospholipid species were found to be most abundant in the MoBa cohort samples. Given the stark differences in the metabolic phenotypes between cohorts, we decided to perform stratified analyses followed by meta-analysis to combine the effect estimates from the six individual cohorts in many of the subsequent analyses.
None of the 177 serum metabolites were identified from meta-analysis to be significantly affected by serum sample processing time after adjusting for covariates and stratifying by country. Similarly, none of the 44 urinary metabolites were found to be associated with sample processing time. Thus, in subsequent analyses, urine and serum processing time were not included as covariates.
The majority of serum samples were collected 3 to 4 h postprandial (median was 3.3 h with IQR: 2.8–4.0), and there were no major differences in postprandial interval between the cohorts (Additional file 1: Figure S1). Postprandial effects could be observed in 21 out of 177 metabolites: 11 amino acids, one biogenic amine, two short-chain acylcarnitines, four long-chain acylcarnitines and three lysophosphatidylcholine species were found to be associated with postprandial interval (Fig. 2a). The 11 amino acids were negatively associated whilst the four long-chain acylcarnitines were positively associated with postprandial interval.
Comparing the urinary metabolite levels of night-time void (n = 38) and morning void (n = 48) samples, we found alanine and citrate concentrations to be elevated in the night-time void samples and N-methyl nicotinamide, N-acetyl-neuraminic acid and 4-deoxythreonic acid to be higher in the morning void samples (Fig. 2b).
Demographic factors and BMI for the HELIX children
Both urinary and serum creatinine levels (Additional file 1: Figures S3, S4 and Table S6) were found to be significantly associated with age after adjusting for multiple testing using Bonferroni correction. An increase of 1 year in the child’s age was associated with increases of 0.39 standard deviation (SD) in urinary creatinine level (95% CI 0.26 to 0.53) and 0.30 SD in serum creatinine level (95% CI 0.17 to 0.43). A positive association between creatinine concentration and age was identified as a common phenotype amongst our six different study cohorts (Additional file 1: Figures S3 and S4); effect sizes between urine creatinine level and age were 0.40 SD/year for BiB, 0.27 SD/year for EDEN, 0.35 SD/year for KANC, 0.33 SD/year for MoBa, 0.84 SD/year for Rhea and 0.45 SD/year for INMA. No other urine or serum metabolites measured were associated with age.
Metabolic associations with sex, adjusted for covariates and multiple testing, are shown in Fig. 3. Variation in effect size between cohorts was assessed using I2 statistic, which measures the percentage of variation across cohorts that is due to heterogeneity rather than chance. Fifteen out of 18 urine or serum metabolites identified as associated with sex have I2 < 50% (Additional file 1: Table S7). Urinary isoleucine was found at lower concentrations (− 0.24 SD lower; 95% CI − 0.37 to − 0.12) while 5-oxoproline (0.23 SD higher; CI 0.11 to 0.36) and tyrosine (0.43 SD higher; CI 0.31 to 0.55) were higher in males. Amongst the serum metabolites, the neurotransmitter serotonin (0.32 SD higher; CI 0.20 to 0.44) was found to be higher in males while serine (− 0.26; CI − 0.39 to − 0.14), lysine (− 0.24; CI − 0.35 to − 0.12), ornithine (− 0.35; CI − 0.47 to − 0.23), putrescine (− 0.21; CI − 0.33 to − 0.10), six median-to-long chain acylcarnitines (C10, C12, C14:1, C14:1–OH, C14:2 and C16:1) and three sphingolipids (SM C16:1, SM C18:0, SM C18:1) were found higher in females.
Based on regression models adjusted for covariates, we found 45 urine or serum metabolites to be associated with BMI z-score and 44 of the 45 associations have I2 < 50% (Fig. 4 and Additional file 1: Table S8). Urinary 4-deoxyerythronic acid (metabolite SD per unit zBMI: 0.21; 95% CI 0.16 to 0.26) and valine (BCAA, metabolite SD/zBMI: 0.09; CI 0.04 to 0.15) were positively associated with BMI z-score, and urinary p-cresol sulphate (a microbial metabolite and uremic toxicant , metabolite SD/zBMI: − 0.10; CI − 0.16 to − 0.05) and pantothenate (vitamin B5—required for synthesis of coenzyme A, metabolite SD/zBMI: − 0.12; CI − 0.17 to − 0.07) were negatively associated with BMI z-score. Positive associations between urine 4-deoxyerythronic acid and valine levels and zBMI could be observed consistently in five of the six different study cohorts with the exception of MoBa (Additional file 1: Figures S5 and S6); effect sizes between urine 4-deoxyerythronic acid level and zBMI were 0.25 SD/unit score for BiB, 0.25 SD/unit score for EDEN, 0.25 SD/unit score for KANC, 0.00 SD/unit score for MoBa (not significant), 0.22 SD/unit score for Rhea and 0.19 SD/unit score for INMA. Interestingly, children from MoBa have the lowest BMI z-score amongst the six cohorts (Table 1).
Amongst serum metabolites, significant positive associations with BMI z-score included free carnitine, (metabolite SD/zBMI: 0.18; CI 0.13 to 0.24), short-chain acylcarnitines (C3, C5), seven amino acids including glutamate, BCAAs valine and leucine and sphingolipids (SM C16:0, SM C16:1, SM C18:1). A large number of phosphatidylcholine species (20) and four lysophosphatidylcholines (lysoPC a C14:0, lysoPC a C16:1, lysoPC a C18:1, lysoPC a C18:2) were also found to be strongly associated with BMI z-score in the study (Fig. 4 and Additional file 1: Table S8). Again, associations between serum metabolites and zBMI could be observed consistently in our study cohorts, for example both serum glutamate (Additional file 1: Figure S7) and carnitine (Additional file 1: Figure S8) levels were positively associated with zBMI in all six cohorts.
Figure 5 and Additional file 1: Table S9 summarise the significant urine and serum metabolite associations with the 11 dietary food group intake after adjusting for multiple testing (p value threshold = 1.1 × 10−3 for urine and 2.8 × 10−4 for serum metabolites) and covariates including analytical batch and run order, age, sex, BMI z-score and postprandial interval for serum and urine sampling type for urine models. We identified 57 diet-metabolite associations and 40 of the 57 associations have I2 < 50%.
For urinary metabolites, we identified creatine to be positively associated with meat intake (SD per portion per week: 0.025; 95% CI 0.012 to 0.039). Hippurate was positively associated with both fruit (SD per portion per week: 0.026; 95% CI 0.018 to 0.034) and vegetable consumption (SD per portion per week: 0.021; 95% CI 0.011 to 0.031). Proline betaine, N-methylnicotinic acid and scyllo-inositol were positively associated with fruit intake, whilst glutamine, alanine and leucine were negatively associated with fruit intake. In addition, pantothenate and acetate were respectively found positively associated with dairy and potato intake.
For serum metabolites, we found 12 glycerophosphatidylcholine species to be associated with fish consumption (Fig. 5), 4 glycerophosphatidylcholine species (PC ae 36:3, PC ae 36:4, PC ae 36:5 and PC ae 38:5) to be positively associated with meat consumption and 5 glycerophosphatidylcholine species (PC aa C38:0, PC aa C38:6, PC ae C38:5, PC ae C38:6, PC ae C40:6) to be negatively associated with sweet consumption. In addition, we found acetylornithine to be positively associated with fruit intake, and two acylcarnitines (C5:1, C6:1) and one sphingolipid (SM (OH) C16:1) to be negatively associated with beverages (soft and fizzy drinks).
Variance decomposition analysis of LC-MS/MS serum and NMR urine metabolic profiles
Using principal components analysis, we found that metabolites in LC-MS/MS serum metabolic profiles were inherently more collinear when compared to NMR urine profiles; only 6 principal components were required to describe half of the variance in the 177 serum metabolites as opposed to 12 principal components required to describe the same proportion of the variance in the 44 urinary metabolites (Additional file 1: Figure S9). Secondly, as metabolic profiles often capture information derivable from various sources that may be analysis-specific or individual-specific, we performed variance decomposition analysis to discover and compare the volume of information contained in the two metabolic datasets that were attributable to the various factors. Using a partial R2 approach, we partitioned the variance in the urinary and serum data according to the following 5 main categories: pre-analytical, analytical, demographic, dietary and cohort/country. The analysis was performed on each of the 44 urinary metabolites and on each of the 177 serum metabolites, and Fig. 6 illustrates the distributions of the percentages of variance explained by the 5 categories. Our data indicate that whilst analytical biases accounted for only a small fraction (median of 1.5%) of the explained variance in the NMR urinary profile, they accounted for a much larger portion of the explained variance (median of 9.1%) in the LC-MS/MS serum profile. Dietary information accounted for the largest proportion of the explained variance in the urinary metabolic profile (median of 1.6%), and overall, we found that demographic, dietary and information about country of origin are better reflected in the serum dataset, as these factors together explain a median of 9.0% amongst serum metabolites versus a median of 5.1% amongst urine metabolites (breakdown by individual metabolite can be found in Additional file 1: Tables S10 and S11).
Serum and urine metabolic pairwise correlations
Metabolite inter-correlations often convey biological pathway information; thus, metabolite pairwise correlation analyses were performed separately for serum and urine datasets. Significant correlations were observed between serum metabolites which belong to the same compound classes (Fig. 7), and in particular, strong correlation clusters are found for glycerophospholipids species (maximum Pearson’s correlation coefficient r = 0.94), amino acids (maximum r = 0.97) and acylcarnitines (maximum r = 0.88). Other notable correlations included positive correlations between valine, leucine and isoleucine (all BCAA, r > 0.92), alpha-AAA (α-aminoadipic acid) with BCAA and lysine, positive correlations between valine and short chain acylcarnitines (C5, C3, C4, r = 0.65 between valine and C5) and negative correlations between alanine and acetylcarnitine (C2, r = − 0.54). Significant positive correlations between urine metabolites are shown as a heatmap in Fig. 8 (p value threshold of 5.3 × 10−5). Positive correlations included leucine with valine (r = 0.56), acetate with succinate (r = 0.32), formate with acetate (r = 0.17), trimethylamine oxide and dimethylamine (r = 0.44), 3-indoxylsulfate and p-cresol sulphate (r = 0.43), alanine and glycine and threonine/lactate (r = 0.52–0.65), 4-deoxyerythronic acid with alanine (r = 0.17) and threonine/lactate (r = 0.21), and creatine with carnitine/choline (r = 0.30). Significant negative correlations included 4-deoxythreonic acid with the following amino acids: threonine/lactate, alanine, tyrosine, glutamine and glycine (r = − 0.17 to − 0.42). Pairwise correlation between metabolite concentrations across the two biological fluid types were also examined (Additional file 1: Figure S10, p value threshold of 6.4 × 10−6). Significant correlations were found in 391/7788 serum-urine metabolite pairs. Significant positive correlations were found in the cases when a metabolite has been measured in both urine and serum. Specifically creatinine (r = 0.39), glycine (r = 0.35), alanine (r = 0.29), valine (r = 0.18), serum carnitine and urine carnitine/choline (r = 0.23), and serum threonine and urinary threonine/lactate (r = 0.26) are all individually strongly correlated across the two biological fluid matrices. Other notable correlations include serum threonine with urine 4-deoxyerythronic acid (r = 0.31), which is consistent with the proposition that threonine is the main source of 4-deoxyerythronic acid . Urine N-methylnicotinic acid was correlated (r = 0.23) with serum Ac-Orn (acetylornithine), and additionally, we also found urine acetone and 4-deoxythreonic acid to be positively associated with multiple serum acylcarnitines, while urine alanine was negatively associated with multiple serum acylcarnitines (Additional file 1: Figure S10). Amongst the 391 significant serum-urine metabolite pairs, the median correlation r2 was 2.7% whilst across all 7788 serum-urine metabolite pairs the median correlation r2 was only 0.15% indicating that, even if a subset of serum-urine metabolic correlations are significant, information contained in our urine and serum profiles was largely orthogonal to one another.
Utilising two reproducible and well-characterised metabolic profiling platforms, 1H NMR spectroscopy and LC-MS/MS, we have characterised the urine and serum metabolic phenotypes in European children from six cohort populations representing different demographic and sample characteristics. Little is known regarding the normal concentration ranges of urinary and serum metabolites in healthy European children at present, and in this study, we have used a sample size of approximately 1200 individuals spread across six European countries and embedded the work in a population with rich metadata on diet, anthropometry and environmental exposure. 1H NMR spectroscopy and targeted LC-MS/MS (the AbsoluteIDQ p180 kit) were chosen for the analysis of urine, and serum samples correspondingly in this study, as they offer good sensitivity, broad dynamic range and metabolite coverage, are widely applied and have been used previously for epidemiological studies in the respective bio fluids .
Sample handling and pre-analytical effects
Sample handling in such a large population and across six different centres would be expected to impact on metabolite levels. Stability of serum metabolites are considered lower when compared to those found in urine, and it has been reported that concentrations of many blood metabolites are altered by 12 h pre-storage delay at room temperature . Thus, great care was taken when the study sample collection protocol was developed to help ensure that the sample processing time was kept short (< 2 h). Two separate studies have previously found that urine or serum samples stored at 4 °C for up to 24 h before being frozen were comparable to those frozen immediately [77, 78], and in our study, we have confirmed that neither urine nor serum sample processing time appear to bias our subsequent data analysis. Also, the design of the urine sample collection benefited from our previous pilot work  and we took advantage of a pooled sample design, combining the last sample before bedtime with the first morning void sample in the following day, to reduce diurnal variations. Morning or night void samples were only included in analyses as replacements for the pooled samples when pooled samples were missing (7% of the total). Levels of several metabolites, including citrate and N-methyl nicotinamide, were found to be significantly different between morning or night-time void samples; these are consistent with findings from our earlier pilot panel study which examined the diurnal and day-to-day variability of urine sampling . Whilst fasting-state samples reduce temporal within-day sampling variability compared to non-fasted samples , such sample collections are not always feasible, as was the case for the HELIX project. Thus, most of the serum samples analysed were from non-fasting states with a median postprandial period of 3.3 h, and we have found large number of amino acids and acylcarnitines to be associated with postprandial intervals. Similarly, in a previous study of healthy female volunteers , using the AbsoluteIDQ p180 kit, significantly altered postprandial concentrations of amino acids and acylcarnitines were reported, likely as a result of changes in fatty acid oxidation and ketogenesis.
Demographic factors, BMI and the child metabolome
Overall, we found the serum metabolite concentrations from the HELIX children population to be remarkably similar to reference values obtained in a study of healthy French adults . However, there are some notable differences; for example, the serum creatinine level is lower in the HELIX children compared with adult populations, probably reflecting differences in lean muscle mass between adults and children —a well-studied phenomenon  that was replicated in our study. Likewise, the urinary creatinine level was lower in the HELIX children population compared to reference values for adult populations [82, 83], and our cohort-stratified regression models also identified both urinary and serum creatinine to be positively associated with a child’s age, reaffirming creatinine as a valid indicator of muscle development in children [81, 82, 84].
Body anthropometry is an important predictor of molecular profiles and is of intense interest for disease risk stratification in epidemiological studies. The standardised BMI z-score calculated for a given age and sex has been established as a reliable measure in accessing obesity burden in child populations . We observed positive associations between urinary and serum BCAAs and standardised BMI z-score, which have previously been reported in other children or young adult populations [43, 86, 87]. BCAAs are important nutrient signals , and increased circulating BCAAs levels have been suggested to predict future insulin resistance  as well as increased cardio-metabolic risk independent to adiposity in young adults . Also, we identified two sphingolipids (SM C16:1 and SM C18:1) to be both higher in females and positively associated with BMI z-score, possibly reflecting differences in body fat composition and physical development between boys and girls. Also, two of the lysophosphatidylcholines (lysoPC a C16:1, lysoPC a C18:1) associated with BMI z-score in this study have recently been shown to be correlated to infant birth weight . Moreover, out of the 41 serum metabolites found to be associated with BMI z-score in our HELIX children cohort, 14 metabolites (including kynurenine, glutamate, lysoPC a C18:1, lysoPC a C18:2) have also previously been reported in the EPIC study in an adult population, where the AbsoluteIDQ p180 kit was also used , demonstrating that many serum metabolic associations with BMI observed in adulthood can also be found in childhood.
A key finding of our study was novel evidence for a positive association between urinary 4-deoxyerythronic acid and child BMI z-score, a threonine catabolite [91, 92] found elevated during pregnancy . Whilst very little is currently known about the biology of 4-deoxyerythronic acid, it is present and has been found to be inversely associated with age in adults [27, 74, 94], and higher levels of this and related metabolites have been observed in children with early onset type I diabetes . Threonine is an essential amino acid, and threonine dehydrogenase has been reported as a relatively minor (~ 10%) contributor to threonine oxidation in humans when compared to other species (up to 80%) —indicating that exogenous sources or symbiotic microbial metabolism may be playing an important role in 4-deoxyerythronic acid exposure. Interestingly, it has been reported that formula-fed infants have a lower capacity to oxidise threonine than do infants fed breast milk  and that catabolism of threonine can lead to methylglyoxal production which contributes to the pathophysiology of obesity and diabetes  and can reduce health span in model systems . Urinary 4-deoxyerythronic acid was found positively associated with child BMI in five of the six participating cohorts, with the exception of MoBa which has the lowest BMI z-score distribution amongst the six cohorts. It is possible that 4-deoxyerythronic acid association to BMI is more discernable in overweight populations. We report herein a correlation between serum threonine and urinary 4-deoxyerythronic acid which supports the hypothesis that endogenous catabolism of threonine is a source of this metabolite. However, further work is required to understand the relationship between 4-deoxyerythronic acid and metabolic health.
Habitual dietary intake and the child metabolome
We have confirmed in children a number of known diet-metabolite associations in adults, including meat (which has high creatine content) with urine creatine , vegetables and fruits with urine hippurate [100, 101], fruits with proline betaine and scyllo-inositol [47, 102]. It is also of note that all 12 metabolites associated with fish intake in the study were serum glycerophosphatidylcholine metabolites; oily fish in the diet alters glycerophospholipid composition and is an important nutrient source for polyunsaturated fatty acids [103, 104]. The extent to which metabolic phenotypes mediate the impact of dietary behaviour on childhood adiposity and cardiovascular indicators will be a focus of our future work. We anticipate that the metabolic phenotyping dataset acquired on the HELIX study population will provide a useful molecular resource to help elucidate the complex interactions between childhood environmental and dietary exposures and adverse health outcomes.
Complementarity between the serum and urine metabolome
In the HELIX study, matched urine and serum samples across six European cohorts were collected according to well-defined protocols, providing a valuable resource for uncovering metabolic relationships across the two most accessible biological fluid types. Whilst NMR spectroscopy and LC-MS/MS-based metabolic profiling have been widely applied in epidemiological studies [34, 68, 83, 105, 106], our study is one of very few that allows comparison of the effects of pre-analytical, analytical, demographic, dietary and geographic variation between the two biofluid types from the same sample population. It has previously been reported that biological variations are more robustly captured in a blood metabolic profile compared to urine . In our study, we confirm that the combined information from demographics, diet and cohort accounts for greater variance in the LC-MS/MS serum profile compared to NMR urine profile, even if the LC-MS/MS serum profile is more susceptible to analytical batch effects. However, with respect to dietary habits specifically, these are better reflected in the urinary metabolome presumably due to high metabolite turnover, and it has previously been reported in a colon adenoma case-control study (n = 253) that more metabolites in urine were uniquely associated with diet than in serum . Our pairwise metabolite correlation analyses also potentially confer information about metabolic pathway activities: urinary acetate with formate and succinate (TCA cycle activity and gut bacterial metabolism); urinary creatine with choline/carnitine (meat diet); 3-indoxylsulfate and p-cresol sulphate (both sulphated uremic solutes produced by gut bacteria); urinary dimethylamine with trimethylamine, and trimethylamine oxide (amine derivatives), and urinary and serum valine with leucine (branched-chain amino acid metabolism). Our correlation analysis between metabolite concentrations across the two biological fluid types confirmed that for many compounds, metabolite concentrations between urine and serum are positively correlated and also confirmed metabolic pathway associations with serum threonine and urinary 4-deoxyerythronic acid (threonine catabolism) .
Our study had a number of important limitations. Firstly, the sample size from each of the six individual cohorts was relatively small (n~200) for observational studies, limiting the statistical power available to uncover novel metabolic associations, particularly when effect sizes were generally small. There were also notable differences in sample characteristics between the cohorts, particularly in age, with the median cohort child age varying from 6 to 11 years old, making it difficult to disentangle cohort level differences from other covariates in our variance decomposition analyses, as those confounders were heavily correlated. Our study also lacks 24-h dietary recall data, and serum samples were collected from non-fasting states. In addition, we acknowledge the inherent limitations in the use of food frequency questionnaire which include the potential for dietary intake misclassifications and that categorising distinct food sources into groups may be imperfect. For example, cocoa could be considered as a vegetable but was classified as sweets in this study. We intend to follow up the metabolite—diet associations identified in this study with detailed food subgroup analyses as part of a future publication.
To make certain the timely completion of an annotated metabolome resource, we have decided to acquire and to process the serum and urine metabolic data using analytical methods which quantify omnipresent metabolites that were typically detected well in this study. Whilst this approach had the advantage of improving the sensitivity and specificity of the quantitation and provide explicit metabolite identification, it limited the number of metabolites that were measured and resulted in only partial coverage of the serum and urine metabolome. Also, the serum metabolic assay only provided partial specificity in the assignment of lipid species as the locations of double bonds or the length of the fatty acid chains remain ambiguous. Supplementing the current study with other complementary metabolomic approaches such as untargeted LC-MS and GC-MS analyses in future would help enhance metabolite coverage and greatly augment the metabolome resource of healthy children available at present.
We have characterised the major components of the urine and serum metabolome in the HELIX subcohort. Typically but not universally, metabolic associations with age, sex, BMI z-score and dietary habits were common to the six populations studied. Also, a novel metabolic association between threonine catabolism and BMI of children was identified. Inter-metabolite correlation analyses for both urine and serum metabolic phenotypes revealed potential pathway associations, and population-specific variance (demographic, dietary and country of origin) was better captured in the serum than in the urine metabolic profile. This study establishes a reference metabolome resource in multiple European populations for urine and serum from healthy children. This provides a critical foundation for future work to define the utility of metabolic profiles to monitor or predict the impact of environmental and other exposures on human biology and child health.
Branched-chain amino acids
Born in Bradford, UK
Below the limit of detection
Body mass index
Study of determinants of pre- and postnatal developmental, France
The Human Early-Life Exposome project
High-performance liquid chromatography
Infancia y Medio Ambiente, Environment and Childhood, Spain
Kaunas Cohort, Lithuania
Liquid chromatography tandem mass spectrometry
Limit of detection
The Norwegian Mother and Child Cohort Study, Norway
Nuclear magnetic resonance
Principal component analysis
Quantile regression approach for left-censored missing
Mother-Child Cohort in Crete, Greece
Leon DA, Lithell HO, Vagero D, Koupilova I, Mohsen R, Berglund L, Lithell UB, McKeigue PM. Reduced fetal growth rate and increased risk of death from ischaemic heart disease: cohort study of 15 000 Swedish men and women born 1915-29. Br Med J. 1998;317(7153):241–5.
Barker DJP, Gluckman PD, Godfrey KM, Harding JE, Owens JA, Robinson JS. Fetal nutrition and cardiovascular-disease in adult life. Lancet. 1993;341(8850):938–41.
Wadhwa PD, Buss C, Entringer S, Swanson JM. Developmental origins of health and disease: brief history of the approach and current focus on epigenetic mechanisms. Semin Reprod Med. 2009;27(5):358–68.
Thacher JD, Gruzieva O, Pershagen G, Neuman A, Wickman M, Kull I, Melen E, Bergstrom A. Pre- and postnatal exposure to parental smoking and allergic disease through adolescence. Pediatrics. 2014;134(3):428–34.
Berends LM, Ozanne SE. Early determinants of type-2 diabetes. Best Pract Res Clin Endocrinol Metab. 2012;26(5):569–80.
Burbank AJ, Sood AK, Kesic MJ, Peden DB, Hernandez ML. Environmental determinants of allergy and asthma in early life. J Allergy Clin Immunol. 2017;140(1):1–12.
Burke H, Leonardi-Bee J, Hashim A, Pine-Abata H, Chen YL, Cook DG, Britton JR, McKeever TM. Prenatal and passive smoke exposure and incidence of asthma and wheeze: systematic review and meta-analysis. Pediatrics. 2012;129(4):735–44.
Deng QH, Lu C, Li YG, Sundell J, Norback D. Exposure to outdoor air pollution during trimesters of pregnancy and childhood asthma, allergic rhinitis, and eczema. Environ Res. 2016;150:119–27.
Vafeiadi M, Roumeliotaki T, Myridakis A, Chalkiadaki G, Fthenou E, Dermitzaki E, Karachaliou M, Sarri K, Vassilaki M, Stephanou EG, et al. Association of early life exposure to bisphenol A with obesity and cardiometabolic traits in childhood. Environ Res. 2016;146:379–87.
Agay-Shay K, Martinez D, Valvi D, Garcia-Esteban R, Basagana X, Robinson O, Casas M, Sunyer J, Vrijheid M. Exposure to endocrine-disrupting chemicals during pregnancy and weight at 7 years of age: a multi-pollutant approach. Environ Health Perspect. 2015;123(10):1030–7.
Boffetta P, Tredaniel J, Greco A. Risk of childhood cancer and adult lung cancer after childhood exposure to passive smoke: a meta-analysis. Environ Health Perspect. 2000;108(1):73–82.
Gilmore JH, Jarskog LF, Vadlamudi S, Lauder J. Prenatal infection and risk for schizophrenia: IL-I beta, IL-6, and TNF alpha inhibit cortical neuron dendrite development. Neuropsychopharmacology. 2004;29(7):1221–9.
Braun JM, Kahn RS, Froehlich T, Auinger P, Lanphear BP. Exposures to environmental toxicants and attention deficit hyperactivity disorder in US children. Environ Health Perspect. 2006;114(12):1904–9.
Heijmans BT, Tobi EW, Stein AD, Putter H, Blauw GJ, Susser ES, Slagboom PE, Lumey LH. Persistent epigenetic differences associated with prenatal exposure to famine in humans. Proc Natl Acad Sci U S A. 2008;105(44):17046–9.
Ghantous A, Hernandez-Vargas H, Byrnes G, Dwyer T, Herceg Z. Characterising the epigenome as a key component of the fetal exposome in evaluating in utero exposures and childhood cancer risk. Mutagenesis. 2015;30(6):733–42.
Lioy PJ, Rappaport SM. Exposure science and the exposome: an opportunity for coherence in the environmental health sciences. Environ Health Perspect. 2011;119(11):A466–7.
Wild CP. The exposome: from concept to utility. Int J Epidemiol. 2012;41(1):24–32.
Vineis P, van Veldhoven K, Chadeau-Hyam M, Athersuch TJ. Advancing the application of omics-based biomarkers in environmental epidemiology. Environ Mol Mutagen. 2013;54(7):461–7.
Robinson O, Martinez D, Aurrekoetxea JJ, Estarlich M, Somoano AF, Iniguez C, Santa-Marina L, Tardon A, Torrent M, Sunyer J, et al. The association between passive and active tobacco smoke exposure and child weight status among Spanish children. Obesity. 2016;24(8):1767–77.
Robinson O, Basagana X, Agier L, de Castro M, Hernandez-Ferrer C, Gonzalez JR, Grimalt JO, Nieuwenhuijsen M, Sunyer J, Slama R, et al. The pregnancy exposome: multiple environmental exposures in the INMA-Sabadell birth cohort. Environ Sci Technol. 2015;49(17):10632–41.
Bisgaard H, Vissing NH, Carson CG, Bischoff AL, Folsgaard NV, Kreiner-Moller E, Chawes BLK, Stokholm J, Pedersen L, Bjarnadottir E, et al. Deep phenotyping of the unselected COPSAC2010 birth cohort study. Clin Exp Allergy. 2013;43(12):1384–94.
Sarigiannis DA. Assessing the impact of hazardous waste on children’s health: the exposome paradigm. Environ Res. 2017;158:531–41.
Athersuch TJ, Keun HC. Metabolic profiling in human exposome studies. Mutagenesis. 2015;30(6):755–62.
Athersuch TJ. The role of metabolomics in characterizing the human exposome. Bioanalysis. 2012;4(18):2207–12.
Baker MG, Simpson CD, Lin YS, Shireman LM, Seixas N. The use of metabolomics to identify biological signatures of manganese exposure. Ann Work Expo Health. 2017;61(4):406–15.
Ladva CN, Golan R, Greenwald R, Yu TW, Sarnat SE, Flanders WD, Uppal K, Walker DI, Tran V, Liang DH, et al. Metabolomic profiles of plasma, exhaled breath condensate, and saliva are correlated with potential for air toxics detection. J Breath Res. 2018;12(1):016008.
Ellis JK, Athersuch TJ, Thomas LDK, Teichert F, Perez-Trujillo M, Svendsen C, Spurgeon DJ, Singh R, Jarup L, Bundy JG, et al. Metabolic profiling detects early effects of environmental and lifestyle exposure to cadmium in a human population. BMC Med. 2012;10:61.
Wilson K, Hawken S, Ducharme R, Potter BK, Little J, Thebaud B, Chakraborty P. Metabolomics of prematurity: analysis of patterns of amino acids, enzymes, and endocrine markers by categories of gestational age. Pediatr Res. 2014;75(2):367–73.
Maitre L, Villanueva CM, Lewis MR, Ibarluzea J, Santa-Marina L, Vrijheid M, Sunyer J, Coen M, Toledano MB. Maternal urinary metabolic signatures of fetal growth and associated clinical and environmental factors in the INMA study. BMC Med. 2016;14:177.
Overgaard AJ, Kaur S, Pociot F. Metabolomic biomarkers in the progression to type 1 diabetes. Curr Diab Rep. 2016;16(12):127.
Smolinska A, Klaassen EMM, Dallinga JW, van de Kant KDG, Jobsis Q, Moonen EJC, van Schayck OCP, Dompeling E, van Schooten FJ. Profiling of volatile organic compounds in exhaled breath as a strategy to find early predictive signatures of asthma in children. PLoS One. 2014;9(4):e95668.
James SJ, Cutler P, Melnyk S, Jernigan S, Janak L, Gaylor DW, Neubrander JA. Metabolic biomarkers of increased oxidative stress and impaired methylation capacity in children with autism. Am J Clin Nutr. 2004;80(6):1611–7.
Yu ZH, Zhai GJ, Singmann P, He Y, Xu T, Prehn C, Roemisch-Margl W, Lattka E, Gieger C, Soranzo N, et al. Human serum metabolic profiles are age dependent. Aging Cell. 2012;11(6):960–7.
Elliott P, Posma JM, Chan Q, Garcia-Perez I, Wijeyesekera A, Bictash M, Ebbels TMD, Ueshima H, Zhao LC, van Horn L, et al. Urinary metabolic signatures of human adiposity. Sci Transl Med. 2015;7(285):285ra62.
Guertin KA, Moore SC, Sampson JN, Huang WY, Xiao Q, Stolzenberg-Solomon RZ, Sinha R, Cross AJ. Metabolomics in nutritional epidemiology: identifying metabolites associated with diet and quantifying their potential to uncover diet-disease relations in populations. Am J Clin Nutr. 2014;100(1):208–17.
Jourdan C, Petersen AK, Gieger C, Doring A, Illig T, Wang-Sattler R, Meisinger C, Peters A, Adamski J, Prehn C, et al. Body fat free mass is associated with the serum metabolite profile in a population-based study. PLoS One. 2012;7(6):e40009.
Kochhar S, Jacobs DM, Ramadan Z, Berruex F, Fuerhoz A, Fay LB. Probing gender-specific metabolism differences in humans by nuclear magnetic resonance-based metabonomics. Anal Biochem. 2006;352(2):274–81.
Moore SC, Matthews CE, Sampson JN, Stolzenberg-Solomon RZ, Zheng W, Cai QY, Tan YT, Chow WH, Ji BT, Liu DK, et al. Human metabolic correlates of body mass index. Metabolomics. 2014;10(2):259–69.
Stella C, Beckwith-Hall B, Cloarec O, Holmes E, Lindon JC, Powell J, van der Ouderaa F, Bingham S, Cross AJ, Nicholson JK. Susceptibility of human metabolic phenotypes to dietary modulation. J Proteome Res. 2006;5(10):2780–8.
Holmes E, Loo RL, Stamler J, Bictash M, Yap IKS, Chan Q, Ebbels T, De Iorio M, Brown IJ, Veselkov KA, et al. Human metabolic phenotype diversity and its association with diet and blood pressure. Nature. 2008;453(7193):396–U350.
Wurtz P, Wang Q, Kangas AJ, Richmond RC, Skarp J, Tiainen M, Tynkkynen T, Soininen P, Havulinna AS, Kaakinen M, et al. Metabolic signatures of adiposity in young adults: Mendelian randomization analysis and effects of weight change. PLoS Med. 2014;11(12):e1001765.
Dunn WB, Lin WC, Broadhurst D, Begley P, Brown M, Zelena E, Vaughan AA, Halsall A, Harding N, Knowles JD, et al. Molecular phenotyping of a UK population: defining the human serum metabolome. Metabolomics. 2015;11(1):9–26.
McCormack SE, Shaham O, McCarthy MA, Deik AA, Wang TJ, Gerszten RE, Clish CB, Mootha VK, Grinspoon SK, Fleischman A. Circulating branched-chain amino acid concentrations are associated with obesity and future insulin resistance in children and adolescents. Pediatric Obesity. 2013;8(1):52–61.
Knip M, Virtanen SM, Akerblom HK. Infant feeding and the risk of type 1 diabetes. Am J Clin Nutr. 2010;91(5):1506S–13S.
Freemark M. Metabolomics in nutrition research: biomarkers predicting mortality in children with severe acute malnutrition. Food Nutr Bull. 2015;36:S88–92.
Chiu CY, Yeh KW, Lin G, Chiang MH, Yang SC, Chao WJ, Yao TC, Tsai MH, Hua MC, Liao SL, et al. Metabolomics reveals dynamic metabolic changes associated with age in early childhood. PLoS One. 2016;11(2):e0149823.
Playdon MC, Sampson JN, Cross AJ, Sinha R, Guertin KA, Moy KA, Rothman N, Irwin ML, Mayne ST, Stolzenberg-Solomon R, et al. Comparing metabolite profiles of habitual diet in serum and urine. Am J Clin Nutr. 2016;104(3):776–89.
Vrijheid M, Slama R, Robinson O, Chatzi L, Coen M, van den Hazel P, Thomsen C, Wright J, Athersuch TJ, Avellana N, et al. The human early-life exposome (HELIX): project rationale and design. Environ Health Perspect. 2014;122(6):535–44.
Maitre L, de Bont J, Casas M, Robinson O, Aasvang GM, Agier L, Andrušaitytė S, Ballester F, Basagaña X, Borràs E, et al. Human Early Life Exposome (HELIX) study: a European population-based exposome cohort. BMJ Open. 2018;8(9):e021311.
Wright J, Small N, Raynor P, Tuffnell D, Bhopal R, Cameron N, Fairley L, Lawlor DA, Parslow R, Petherick ES, et al. Cohort profile: the Born in Bradford multi-ethnic family cohort study. Int J Epidemiol. 2013;42(4):978–91.
Heude B, Forhan A, Slama R, Douhaud L, Bedel S, Saurel-Cubizolles MJ, Hankard R, Thiebaugeorges O, De Agostini M, Annesi-Maesano I, et al. Cohort profile: the EDEN mother-child cohort on the prenatal and early postnatal determinants of child health and development. Int J Epidemiol. 2016;45(2):353–63.
Guxens M, Ballester F, Espada M, Fernandez MF, Grimalt JO, Ibarluzea J, Olea N, Rebagliato M, Tardon A, Torrent M, et al. Cohort profile: the INMA-INfancia y Medio Ambiente-(environment and childhood) project. Int J Epidemiol. 2012;41(4):930–40.
Grazuleviciene R, Nieuwenhuijsen MJ, Vencloviene J, Kostopoulou-Karadanelli M, Krasner SW, Danileviciute A, Balcius G, Kapustinskiene V. Individual exposures to drinking water trihalomethanes, low birth weight and small for gestational age risk: a prospective Kaunas cohort study. Environ Health. 2011;10:32.
Magnus P, Birke C, Vejrup K, Haugan A, Alsaker E, Daltveit AK, Handal M, Haugen M, Hoiseth G, Knudsen GP, et al. Cohort profile update: the Norwegian mother and child cohort study (MoBa). Int J Epidemiol. 2016;45(2):382–8.
Chatzi L, Leventakou V, Vafeiadi M, Koutra K, Roumeliotaki T, Chalkiadaki G, Karachaliou M, Daraki V, Kyriklaki A, Kampouri M, et al. Cohort profile: the mother-child cohort in Crete, Greece (Rhea study). Int J Epidemiol. 2017;46(5):1392–1393k.
de Onis M, Onyango AW, Borghi E, Siyam A, Nishida C, Siekmann J. Development of a WHO growth reference for school-aged children and adolescents. Bull World Health Organ. 2007;85(9):660–7.
Maitre L, Lau CE, Vizcaino E, Robinson O, Casas M, Siskos AP, Want EJ, Athersuch T, Slama R, Vrijheid M, et al. Assessment of metabolic phenotypic variability in children’s urine using 1H NMR spectroscopy. Sci Rep. 2017;7:46082.
Dona AC, Jimenez B, Schafer H, Humpfer E, Spraul M, Lewis MR, Pearce JTM, Holmes E, Lindon JC, Nicholson JK. Precision high-throughput proton NMR spectroscopy of human urine, serum, and plasma for large-scale metabolic phenotyping. Anal Chem. 2014;86(19):9887–94.
Karaman I, Ferreira DLS, Boulange CL, Kaluarachchi MR, Herrington D, Dona AC, Castagne R, Moayyeri A, Lehne B, Loh M, et al. Workflow for integrated processing of multicohort untargeted H-1 NMR metabolomics data in large-scale metabolic epidemiology. J Proteome Res. 2016;15(12):4188–94.
Veselkov KA, Lindon JC, Ebbels TMD, Crockford D, Volynkin VV, Holmes E, Davies DB, Nicholson JK. Recursive segment-wise peak alignment of biological H-1 NMR spectra for improved metabolic biomarker recovery. Anal Chem. 2009;81(1):56–66.
Dieterle F, Ross A, Schlotterbeck G, Senn H. Probabilistic quotient normalization as robust method to account for dilution of complex biological mixtures. Application in H-1 NMR metabonomics. Anal Chem. 2006;78(13):4281–90.
Wishart DS, Tzur D, Knox C, Eisner R, Guo AC, Young N, Cheng D, Jewell K, Arndt D, Sawhney S. HMDB: the human metabolome database. Nucleic Acids Res. 2007;35(Database):D521–6.
Cloarec O, Dumas ME, Craig A, Barton RH, Trygg J, Hudson J, Blancher C, Gauguier D, Lindon JC, Holmes E, et al. Statistical total correlation spectroscopy: an exploratory approach for latent biomarker identification from metabolic H-1 NMR data sets. Anal Chem. 2005;77(5):1282–9.
User Manual UM_p180_AB SCIEX_9. Biocrates Life Sciences AG. Innsbruck; 2014.
Siskos AP, Jain P, Romisch-Margl W, Bennet M, Achaintre D, Asad Y, Marney L, Richardson L, Koulman A, Griffin JL, et al. Interlaboratory reproducibility of a targeted metabolomics platform for analysis of human serum and plasma. Anal Chem. 2017;89(1):656–65.
Trabado S, Al-Salameh A, Croixmarie V, Masson P, Corruble E, Feve B, Colle R, Ripoll L, Walther B, Boursier-Neyret C, et al. The human plasma-metabolome: reference values in 800 French healthy volunteers; impact of cholesterol, gender and age. PLoS One. 2017;12(3):e0173615.
Merz B, Nothlings U, Wahl S, Haftenberger M, Schienkiewitz A, Adamski J, Suhre K, Wang-Sattler R, Grallert H, Thorand B, et al. Specific metabolic markers are associated with future waist-gaining phenotype in women. PLoS One. 2016;11(6):e0157733.
Yet I, Menni C, Shin SY, Mangino M, Soranzo N, Adamski J, Suhre K, Spector TD, Kastenmuller G, Bell JT. Genetic influences on metabolite levels: a comparison across metabolomic platforms. PLoS One. 2016;11(4):e0153672.
Smoot ME, Ono K, Ruscheinski J, Wang PL, Ideker T. Cytoscape 2.8: new features for data integration and network visualization. Bioinformatics. 2011;27(3):431–2.
Basu S, Duren W, Evans CR, Burant CF, Michailidis G, Karnovsky A. Sparse network modeling and MetScape-based visualization methods for the analysis of large-scale metabolomics data. Bioinformatics. 2017;33(10):1545–53.
Sakia RM. The Box-Cox transformation technique - a review. J R Stat Soc Ser D. 1992;41(2):169–78.
Wei RM, Wang JY, Su MM, Jia E, Chen SQ, Chen TL, Ni Y. Missing value imputation approach for mass spectrometry-based metabolomics data. Sci Rep. 2018;8:663.
Vanholder R, Schepers E, Pletinck A, Nagler EV, Glorieux G. The uremic toxicity of Indoxyl sulfate and p-Cresyl sulfate: a systematic review. J Am Soc Nephrol. 2014;25(9):1897–907.
Darling PB, Grunow J, Rafii M, Brookes S, Ball RO, Pencharz PB. Threonine dehydrogenase is a minor degradative pathway of threonine catabolism in adult humans. Am J Physiol Endocrinol Metab. 2000;278(5):E877–84.
Carayol M, Licaj I, Achaintre D, Sacerdote C, Vineis P, Key TJ, Moret NCO, Scalbert A, Rinaldi S, Ferrari P. Reliability of serum metabolites over a two-year period: a targeted metabolomic approach in fasting and non-fasting samples from EPIC. PLoS One. 2015;10(8):e0135437.
Anton G, Wilson R, Yu ZH, Prehn C, Zukunft S, Adamski J, Heier M, Meisinger C, Romisch-Margl W, Wang-Sattler R, et al. Pre-analytical sample quality: metabolite ratios as an intrinsic marker for prolonged room temperature exposure of serum samples. PLoS One. 2015;10(3):e0121495.
Dunn WB, Broadhurst D, Ellis DI, Brown M, Halsall A, O'Hagan S, Spasic I, Tseng A, Kell DB. A GC-TOF-MS study of the stability of serum and urine metabolomes during the UK Biobank sample collection and preparation protocols. Int J Epidemiol. 2008;37:23–30.
Barton RH, Nicholson JK, Elliott P, Holmes E. High-throughput H-1 NMR-based metabolic analysis of human serum and urine for large-scale epidemiological studies: validation study. Int J Epidemiol. 2008;37:31–40.
Shrestha A, Mullner E, Poutanen K, Mykkanen H, Moazzami AA. Metabolic changes in serum metabolome in response to a meal. Eur J Nutr. 2017;56(2):671–81.
Baxmann AC, Ahmed MS, Marques NC, Menon VB, Pereira AB, Kirsztajn GM, Heilberg IP. Influence of muscle mass and physical activity on serum and urinary creatinine and serum cystatin C. Clin J Am Soc Nephrol. 2008;3(2):348–54.
Savory DJ. Reference ranges for serum creatinine in infants, children and adolescents. Ann Clin Biochem. 1990;27:99–101.
Sugita O, Uchiyama K, Yamada T, Sato T, Okada M, Takeuchi K. Reference values of serum and urine creatinine, and of creatinine clearance by a new enzymatic method. Ann Clin Biochem. 1992;29:523–8.
Bouatra S, Aziat F, Mandal R, Guo AC, Wilson MR, Knox C, Bjorndahl TC, Krishnamurthy R, Saleem F, Liu P, et al. The human urine metabolome. PLoS One. 2013;8(9):e73076.
Richmond W, Colgan G, Simon S, Stuart-Hilgenfeld M, Wilson N, Alon US. Random urine calcium/osmolality in the assessment of calciuria in children with decreased muscle mass. Clin Nephrol. 2005;64(4):264–70.
Must A, Anderson SE. Body mass index in children and adolescents: considerations for population-based applications. Int J Obes. 2006;30(4):590–4.
Mangge H, Zelzer S, Pruller F, Schnedl WJ, Weghuber D, Enko D, Bergsten P, Haybaeck J, Meinitzer A. Branched-chain amino acids are associated with cardiometabolic risk profiles found already in lean, overweight and obese young. J Nutr Biochem. 2016;32:123–7.
Perng W, Gillman MW, Fleisch AF, Michalek RD, Watkins SM, Isganaitis E, Patti ME, Oken E. Metabolomic profiles and childhood obesity. Obesity. 2014;22(12):2570–8.
Lynch CJ, Adams SH. Branched-chain amino acids in metabolic signalling and insulin resistance. Nat Rev Endocrinol. 2014;10(12):723–36.
Robinson O, Keski-Rahkonen P, Chatzi L, Kogevinas M, Nawrot T, Pizzi C, Plusquin M, Richiardi L, Robinot N, Sunyer J, et al. Cord blood metabolic signatures of birth weight: a population-based study. J Proteome Res. 2018;17(3):1235–47.
Carayol M, Leitzmann MF, Ferrari P, Zamora-Ros R, Achaintre D, Stepien M, Schmidt JA, Travis RC, Overvad K, Tjonneland A, et al. Blood metabolic signatures of body mass index: a targeted metabolomics study in the EPIC cohort. J Proteome Res. 2017;16(9):3137–46.
Edgar AJ. The human L-threonine 3-dehydrogenase gene is an expressed pseudogene. BMC Genet. 2002;3:18.
Van Winkle LJ, Galat V, Iannaccone PM. Threonine appears to be essential for proliferation of human as well as mouse embryonic stem cells. Front Cell Dev Biol. 2014;2:18.
Diaz SO, Barros AS, Goodfellow BJ, Duarte IF, Carreira IM, Galhano E, Pita C, Almeida MD, Gil AM. Following healthy pregnancy by nuclear magnetic resonance (NMR) metabolic profiling of human urine. J Proteome Res. 2013;12(2):969–79.
Thompson JA, Markey SP, Fennessey PV. Gas-chromatographic-mass-spectrometric identification and quantitation of tetronic and deoxytetronic acids in urine from normal adults and neonates. Clin Chem. 1975;21(13):1892–8.
Kassel DB, Martin M, Schall W, Sweeley CC. Urinary metabolites of L-threonine in type-1 diabetes determined by combined gas-chromatography chemical ionization mass-spectrometry. Biomed Environ Mass Spectrom. 1986;13(10):535–40.
Darling PB, Dunn M, Sarwar G, Brookes S, Ball RO, Pencharz PB. Threonine kinetics in preterm infants fed their mothers’ milk or formula with various ratios of whey to casein. Am J Clin Nutr. 1999;69(1):105–14.
Matafome P, Sena C, Seica R. Methylglyoxal, obesity, and diabetes. Endocrine. 2013;43(3):472–84.
Ravichandran M, Priebe S, Grigolon G, Rozanov L, Groth M, Laube B, Guthke R, Platzer M, Zarse K, Ristow M. Impairing L-threonine catabolism promotes healthspan through methylglyoxal-mediated proteohormesis. Cell Metab. 2018;27(4):914–25 e915.
Nair S, O'Brien SV, Hayden K, Pandya B, Lisboa PJG, Hardy KJ, Wilding JPH. Effect of a cooked meat meal on serum creatinine and estimated glomerular filtration rate in diabetes-related kidney disease. Diabetes Care. 2014;37(2):483–7.
Krupp D, Doberstein N, Shi LJ, Remer T. Hippuric acid in 24-hour urine collections is a potential biomarker for fruit and vegetable consumption in healthy children and adolescents. J Nutr. 2012;142(7):1314–20.
Edmands WMB, Beckonert OP, Stella C, Campbell A, Lake BG, Lindon JC, Holmes E, Gooderham NJ. Identification of human urinary biomarkers of cruciferous vegetable consumption by metabonomic profiling. J Proteome Res. 2011;10(10):4513–21.
Heinzmann SS, Brown IJ, Chan Q, Bictash M, Dumas ME, Kochhar S, Stamler J, Holmes E, Elliott P, Nicholson JK. Metabolic profiling strategy for discovery of nutritional biomarkers: proline betaine as a marker of citrus consumption. Am J Clin Nutr. 2010;92(2):436–43.
Glaser C, Demmelmair H, Koletzko B. High-throughput analysis of fatty acid composition of plasma glycerophospholipids. J Lipid Res. 2010;51(1):216–21.
Careagahouck M, Sprecher H. Effect of a fish oil diet on the composition of rat neutrophil lipids and the molecular-species of choline and ethanolamine glycerophospholipids. J Lipid Res. 1989;30(1):77–87.
Psychogios N, Hau DD, Peng J, Guo AC, Mandal R, Bouatra S, Sinelnikov I, Krishnamurthy R, Eisner R, Gautam B, et al. The human serum metabolome. PLoS One. 2011;6(2):e16957.
Vogt S, Wahl S, Kettunen J, Breitner S, Kastenmuller G, Gieger C, Suhre K, Waldenberger M, Kratzsch J, Perola M, et al. Characterization of the metabolic profile associated with serum 25-hydroxyvitamin D: a cross-sectional analysis in population-based data. Int J Epidemiol. 2016;45(5):1469–81.
Nicholson G, Rantalainen M, Maher AD, Li JV, Malmodin D, Ahmadi KR, Faber JH, Hallgrimsdottir IB, Barrett A, Toft H, et al. Human metabolic profiles are stably controlled by genetic and environmental variation. Mol Syst Biol. 2011;7:525.
This research received funding from the EU European Commission’s Seventh Framework Programme (FP7/2007–2013) under grant agreement no. 308333—the HELIX project. We would like to thank all the study participants and the full list of the HELIX Project Investigators that can be found at http://www.projecthelix.eu. The Norwegian Mother and Child Cohort Study is supported by the Norwegian Ministry of Health and Care Services and the Ministry of Education and Research, NIH/NIEHS (contract no. N01-ES-75558), NIH/NINDS (grant no. 1 UO1 NS 047537-01 and grant no. 2 UO1 NS 047537-06A1). Dr. Maribel Casas received funding from Instituto de Salud Carlos III (Ministry of Economy and Competitiveness) (MS16/00128).
We are grateful to all the participating families in Norway who take part in this on-going cohort study, but especially those families who came in for a clinical examination of their child, who in addition donated blood and urine to this specific study.
Born in Bradford is only possible because of the enthusiasm and commitment of the children and parents in BiB. We are grateful to all the participants, health professionals and researchers who have made Born in Bradford happen.
The research leading to these results has received funding from the European Community’s Seventh Framework Programme (FP7/2007-2013) under grant agreement no. 308333—the HELIX project.
Availability of data and materials
A data warehouse has been established in a format that allows future use beyond the project lifespan (2013–2017) as an accessible resource for collaborative research involving researchers external to the project. The content of the data warehouse and procedures for external access is described on the project website (http://www.projecthelix.eu). Access to HELIX data is based on approval by the HELIX Project Executive Committee and by the individual cohorts, who will evaluate potential overlap with on-going work, adequacy of data protection plans, logistic and financial consequences, and adequacy of authorship and acknowledgement plans. We encourage interested researchers to contact us to set up collaborations.
Ethics approval and consent to participate
Research has been carried out according to the international and national guidelines and regulations (including the Declaration of Helsinki). The work was covered by ethics approvals from each cohort, and participants were asked to sign an informed consent form for the specific HELIX work including clinical examination and biospecimen collection and analysis. Informed consent was obtained from all subjects. Ethics approvals were obtained from the following committees:
BiB: Bradford Teaching Hospitals NHS Foundation Trust - The Research Management & Support Office, Bradford Institute for Health Research (REC no: 14/YH/0013). EDEN: Agence nationale de sécurité du médicament et des produits de santé (ANSM Ref: 140567-B81), Commission Nationale de l’Informatique et des Libertés (CNIL-CCTIRS Décision DR-2014-593/Demande d’autorisation no. 914630), Direction générale de la recherche et de l’innovation - Comité consultatif sur le traitement de l’information en matière de recherche dans le domaine de la santé (n° 14.574), Comité de protection des personnes (CPP) - CHU Poitiers (2013-A01839-36). INMA -Sabadell: Comité Ético de Inverticación Clínica Parc de Salut MAR (N°2005/2106/l). KANC: LIETUVOS BIOETIKOS KOMITETAS (projekto Nr. 282996). MoBa: Regional komité for medisinsk og helsefaglig forskningsetikk (REK sør-øst, Ref: 2013/1628b). Rhea: ETHICAL COMMITTEE OF THE GENERAL UNIVERSITY HOSPITAL OF HERAKLION, CRETE (no. 14694/5-11-2013).
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
- Exposome metabolomics
- Metabolic phenotyping
- Birth cohorts
- NMR spectroscopy
- European children
- Metabolic profile