- Research article
- Open Access
In utero and childhood exposure to tobacco smoke and multi-layer molecular signatures in children
BMC Medicine volume 18, Article number: 243 (2020)
The adverse health effects of early life exposure to tobacco smoking have been widely reported. In spite of this, the underlying molecular mechanisms of in utero and postnatal exposure to tobacco smoke are only partially understood. Here, we aimed to identify multi-layer molecular signatures associated with exposure to tobacco smoke in these two exposure windows.
We investigated the associations of maternal smoking during pregnancy and childhood secondhand smoke (SHS) exposure with molecular features measured in 1203 European children (mean age 8.1 years) from the Human Early Life Exposome (HELIX) project. Molecular features, covering 4 layers, included blood DNA methylation and gene and miRNA transcription, plasma proteins, and sera and urinary metabolites.
Maternal smoking during pregnancy was associated with DNA methylation changes at 18 loci in child blood. DNA methylation at 5 of these loci was related to expression of the nearby genes. However, the expression of these genes themselves was only weakly associated with maternal smoking. Conversely, childhood SHS was not associated with blood DNA methylation or transcription patterns, but with reduced levels of several serum metabolites and with increased plasma PAI1 (plasminogen activator inhibitor-1), a protein that inhibits fibrinolysis. Some of the in utero and childhood smoking-related molecular marks showed dose-response trends, with stronger effects with higher dose or longer duration of the exposure.
In this first study covering multi-layer molecular features, pregnancy and childhood exposure to tobacco smoke were associated with distinct molecular phenotypes in children. The persistent and dose-dependent changes in the methylome make CpGs good candidates to develop biomarkers of past exposure. Moreover, compared to methylation, the weak association of maternal smoking in pregnancy with gene expression suggests different reversal rates and a methylation-based memory to past exposures. Finally, certain metabolites and protein markers evidenced potential early biological effects of postnatal SHS, such as fibrinolysis.
The in utero period and the first years of human life are crucial for the development and maturation of organs . Insults during these periods may result in later adverse health consequences, which might persist during the whole lifespan. This is known as the Developmental Origins of Health and Disease (DOHaD) concept .
Maternal smoking during pregnancy represents one of the most important avoidable risk factors, and its short- and long-term adverse effects on offspring, including prematurity, lower birth weight, increased risk of asthma and obesity, and impaired neurodevelopment, have been widely reported [3, 4]. In 15 European countries, the prevalence of maternal smoking at any time during pregnancy ranged between 4.2 and 18.9% in 2011–2012 . Secondhand smoke (SHS) is one of the main contributors to the indoor air pollution, with 40% of children exposed worldwide in 2004 . In Europe between 1999 and 2008, among never-smoking adolescents, around 50%, 70%, and 45% were exposed to SHS inside home, outside home, and both, respectively . SHS has been related to increased risk of asthma, lower respiratory infections, and sudden infant death syndrome [4, 6].
The molecular alterations resulting from tobacco smoke exposure are only partially understood. Their study can facilitate the development of biomarkers of exposure that surpass the limitations of existing ones, such as questionnaires and urinary cotinine, which only informs about recent exposure [8, 9]. For instance, the first epigenetic biomarker of maternal smoking allowed the discrimination between exposed and unexposed children with an accuracy > 90% . They may also provide knowledge on the molecular mechanisms that could mediate the effects of tobacco smoking on health. For instance, it has been described that epigenetic deregulation of JNK2 gene by maternal smoking during pregnancy is associated with impaired lung function in early childhood . Also, methylation levels of maternal smoking-related CpGs in adolescents/adults have been causally linked through Mendelian randomization to inflammatory bowel disease and schizophrenia . Moreover, molecular responses might be more sensitive and earlier markers of a biological effect than clinical outcomes.
In this line, several studies have shown that maternal smoking in pregnancy is associated with altered patterns of DNA methylation at birth, in placenta  and in cord blood [11, 14,15,16]. Interestingly, some of the maternal smoking-related blood loci show persistent dysregulation until childhood [15, 16], adolescence [16, 17], or even adulthood . However, not much is known about the transcriptional consequences of these persistent DNA methylation changes . Furthermore, while the alterations at multiple molecular layers, from epigenetics to metabolomics, have been investigated in relation to current smoking in adults [19,20,21,22], there is lack of information about the multi-layer molecular changes associated with in utero exposure or with the exposure to SHS in children. Regarding adult SHS exposure, only DNA methylation candidate studies are available .
Here, we aimed to identify multi-layer molecular signatures associated with exposure to tobacco smoke in two early life susceptibility windows, in utero due to maternal smoking and in childhood through exposure to SHS. For this, we used molecular data from 1203 children of the Human Early Life Exposome (HELIX) study, including child blood DNA methylation and transcription, plasma proteins, and sera and urinary metabolites.
The Human Early Life Exposome (HELIX) study is a collaborative project across 6 established and ongoing longitudinal population-based birth cohort studies in Europe : the Born in Bradford (BiB) study in the UK ; the Étude des DÉterminants prÉ et postnatals du dÉveloppement et de la santÉ de l’Enfant (EDEN) study in France ; the INfancia y Medio Ambiente (INMA) cohort in Spain ; the Kaunas cohort (KANC) in Lithuania ; the Norwegian Mother, Father and Child Cohort Study (MoBa) ; and the RHEA Mother Child Cohort study in Crete, Greece . Around age 8 years, the HELIX follow-up visit of the offspring took place using harmonized questionnaires and sampling protocols (n = 1301). HELIX children with complete data on prenatal and postnatal variables of exposure to tobacco smoking and with data on at least one omics platform were selected for the present study (n = 1203). Years of enrollment, years of HELIX visit, and years of smoking prohibition in each cohort are shown in Additional file 1: Table S1.
At the HELIX follow-up visit, child peripheral blood was collected and processed into a variety of sample matrices: buffy coat, serum, plasma, and whole blood. Median fasting time was 3.5 h (SD = 1.1 h). Two spot urine samples (one before bedtime and one first morning void) were collected at the participants’ home and transported at 4 °C to the cohort center, where they were combined at equal volumes to create a daily urine pool. In the HELIX follow-up, the daily urine pool was available for 92.90% of the children, while the first morning and bedtime urines were available for 3.14% and 3.96% of the children, respectively. All sample types were stored at − 80 °C until processed.
Exposure to tobacco smoking
Definitions of exposure to tobacco smoking in pregnancy
Maternal smoking habits during pregnancy were obtained from existing data collected in each cohort using non-harmonized questionnaires administered to the mothers in at least the first and third trimester of pregnancy. Two variables for active maternal smoking during pregnancy were generated: (i) any maternal smoking during pregnancy if the mother had smoked at any time during pregnancy (“yes/no”), and (ii) sustained maternal smoking, if the mother had smoked, at least, in the 1st and in the 3rd trimester (“yes/no”). The mean number of cigarettes smoked per day during pregnancy was estimated averaging the mean number of cigarettes per day in the first and third trimesters.
Maternal exposure to SHS in pregnancy (mother-SHS: “yes/no”) was assessed through questionnaire and was slightly different between cohorts: (i) exposure at home, work, or leisure places (INMA); (ii) exposure at home or work (MoBa, RHEA, BiB); (iii) exposure without specifying location (EDEN); and (iv) partner smoking (KANC).
We combined all previous definitions to create a variable that captured both dose and duration of exposure to smoking during pregnancy with the following categories: “unexposed,” “SHS,” “non-sustained smoker,” “sustained smoker at low dose (≤ 9 cigarettes per day),” and “sustained smoker at high dose (> 9 cigarettes per day).”
Definitions of childhood exposure to secondhand smoke
Childhood exposure to secondhand smoke (SHS) was assessed through a harmonized questionnaire administered to the parents as part of the HELIX project. The questionnaire included questions about (i) smoking at home by the mother, mother’s partner, or other people, and (ii) attendance to indoor places where people smoke. From this information, we created a variable of SHS with four levels: “unexposed,” “exposed to SHS only outside home,” “exposed to SHS only inside home,” and “exposed to SHS inside and outside home.” This variable was reclassified in larger groups to assess global exposure to SHS (global-SHS: “yes/no”; home-SHS: “yes/no”).
Urinary cotinine levels were measured in the children using the Immulite2000 Nicotine Metabolite (Cotinine) 600 Test on an Immulite 2000 XPi from Siemens Healthineers at Fürst Medisinsk Laboratorium in Norway for which the LOD was 3.03 μg/L. Due to low concentrations in children, a categorical variable was created (cotinine: “detected/undetected”).
Correlation between exposure variables
The correlation among the proportion of exposed children using different smoking definitions and different windows of exposure was calculated using the tetrachoric correlation test, with the psych R package . The tetrachoric correlation estimates what the correlation would be if measured on a continuous scale.
Detailed information on molecular characterization of the 4 molecular layers (6 omics datasets) can be found in Additional file 2: Additional Methods [32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49].
Blood DNA methylation
Briefly, methylation was measured in DNA extracted from buffy coat (EDTA tube) using the Infinium HumanMethylation450 beadchip (Illumina, USA) at the Spanish National Genotyping Center (CeGen, Spain). Samples were randomized and balanced by sex and cohort within each batch. Samples with low call rate (< 98%) were excluded. Also, probes with low call rate (< 95%) , probes in sexual chromosomes, cross-hybridizing probes, and probes containing single nucleotide polymorphisms (SNPs) were filtered out . Methylation levels were normalized using the functional normalization method with prior background correction with Noob , and slide batch effect was controlled with the ComBat method . Beta values, going from 0 (un-methylated) to 1 (fully methylated), were used in the analyses. CpGs were annotated with the IlluminaHumanMethylation450kanno.ilmn12.hg19 R package . Blood methylation quantitative trait loci (mQTL) were retrieved from mQTLdb (http://www.mqtldb.org/) (database: MatrixEQTL; timepoint: childhood; distance: 1 Mb).
Blood gene expression
RNA was extracted from whole blood collected in Tempus tubes. Gene expression was assessed using the GeneChip® Human Transcriptome Array 2.0 (HTA 2.0) (Affymetrix, USA) at the University of Santiago de Compostela (USC, Spain). Samples were randomized and balanced by sex and cohort within each batch. Data was normalized at the gene level with the GCCN (SST-RMA) algorithm, and batch effects and blood cell type composition were controlled with two surrogate variable analysis (SVA) methods, isva  and SmartSVA , during the differential expression analyses. Gene expression values were log2 transformed, and annotation of transcript clusters (TCs) to genes was done with the Affymetrix Expression Console software using the HTA-2_0 Transcript Cluster Annotations Release na36 (hg19).
Blood miRNA expression
miRNA expression was quantified using the SurePrint Human miRNA Microarray rel.21 (Agilent Technologies, USA) , at the Genomics Core Facility at the Centre for Genomic Regulation (CRG, Spain). Samples were randomized and balanced by sex and cohort within each batch. miRNA expression levels were normalized with the least variant set (LVS) method  with background correction with the Normexp method . Normalized miRNA levels were log2 transformed and annotated using a combination of information from Agilent annotation (“Annotation_7056”) and miRbase v21 (GRCh38 and mapped back to hg19) released in January 2017. Additional control of batch effect and blood cell composition during the differential expression analyses was done with the SVA standard method .
Plasma protein levels were assessed using the antibody-based multiplexed platform from Luminex at the Proteomics Unit (Centre for Genomic Regulation (CRG)/University Pompeu Fabra (UPF), Spain), using 3 commercial kits (Thermo Fisher Scientific, USA): Cytokines 30-plex (catalog number (CN): LHC6003M), Apoliprotein 5-plex (CN: LHP0001M), and Adipokine 15-plex (CN: LHC0017M) (Additional file 1: Table S2). All samples were randomized and blocked by cohort. Raw intensities were converted to nanograms per milliliter (5-plex kit) and to picograms per milliliter (15- and 30-plex kits) using 8-point calibration curves added in each plate. Only 36 proteins out of 43 with > 30% of measurements in the linear range of quantification were kept for the analysis. Protein levels were log2 transformed, and plate batch effect was corrected. Values below the lower limit of quantification (LOQ1) and above the upper limit of quantification (LOQ2) were imputed using the truncdist R package .
Serum metabolites were quantified using the targeted metabolomics Absolute-IDQTM p180 Kit (Biocrates Life Sciences AG, USA). Serum metabolic profiles were acquired following the manufacturer’s protocol using LC-MS/MS on a Sciex QTrap 6500 equipped with an Agilent 1100 series HPLC (Agilent Technologies, USA), at Imperial College London (ICL, UK); a full description of the HELIX metabolomics methods and data can be found elsewhere . Samples were fully randomized. Metabolites were quantified (mM) following the manufacturer’s protocol (appendix), and then log2 transformed. Metabolite exclusion was based on a metabolite variable meeting two conditions: (1) CV of over 30% and (2) over 30% of the data are below LOD. Eleven out of the 188 serum metabolites detected were excluded as a result, leaving 177 serum metabolites to be used for further statistical analysis. The mean coefficient of variation across the 177 LC-MS/MS detected serum metabolites was 16%. Analytical performance was in line with expectations from and inter-laboratory ring trial of this platform .
Urinary metabolic profiles were analyzed on a 14.1-T (600 MHz 1H) NMR spectrometer (Bruker, USA) at Imperial College London (ICL, UK) . Urine samples were fully randomized. In addition, 60 identical study quality control urine samples were included at regular intervals during the runs. While data acquisitions were untargeted, data processing workflow followed a targeted strategy to identify and quantify the 44 most abundant metabolites in urine [47, 49]. Sample concentration of a given metabolite was estimated from the signal of the internal standard trimethylsilylpropanoic (TSP). Data was normalized using median fold change normalization method which takes into account the distribution of relative levels from all 44 metabolites compared to the reference sample in determining the most probable dilution factor. Twenty-six metabolites were absolutely quantified, and 18 semi-quantified. Concentration levels were expressed as log2, and before that, we used an off-set of ½ the minimal value for each metabolite.
The steps to analyze the HELIX data were as follows: (1) We systematically analyzed the association between period-specific exposure to tobacco smoke and child molecular marks from the different omics layers. (2) For statistically significant molecular marks, we tested the effect of dose and duration of the exposure. (3) For CpGs only, we searched for cis expression quantitative trait methylation (eQTM). (4) We contrasted our findings with the previous literature, either through direct comparison or through enrichment analyses. (5) We conducted sensitivity analyses to test the robustness of the findings to exposure definitions, period-specific effects, and ancestry.
Association between exposure to smoking and molecular phenotypes
To test the relationship between exposure to tobacco smoke versus child molecular marks, we fitted linear regressions between each tobacco smoking variable (predictor) and each molecular mark (outcome) adjusting for covariates, using limma  within the implementation of omicRexposome R package . Children unexposed to tobacco smoke, in each definition and period, were considered the reference group. Since we wanted to assess period-specific smoking effects, models were mutually adjusted: any and sustained maternal smoking in pregnancy were adjusted for childhood global-SHS, while childhood global-SHS and urinary cotinine detection were adjusted for sustained maternal smoking in pregnancy.
Other covariates were selected through directed acyclic graphs (DAGs) with the DAGitty tool  (Additional file 3: Fig. S1; and Additional file 3: Fig. S2). The following variables were selected for both periods: (i) cohort, (ii) self-reported ancestry (European ancestry, Pakistani or Asian, and others), (iii) maternal age, and (iv) self-reported maternal education (low (primary school), medium (secondary school), and high (university degree or higher)). In addition, pregnancy models were adjusted for maternal obesity status, and childhood models for child body mass index (zBMI). Maternal BMI was calculated from pre- or early-pregnancy weight and height and divided in four categories derived from World Health Organization (WHO) definitions. Child zBMI is a sex and age z-score calculated according to WHO reference curves [51, 52]. Single imputation of missing data for covariates was done using a chained equations method  with the mice R package . The proportion of missing data in the selected covariates was minimal: 0.17% in maternal age, 0.75% in maternal BMI, and 2.16% in maternal education. Apart from covariates resulting from DAG diagrams, models were also adjusted for sex and child age, as well as for specific covariates in models of each of the omics. Methylation models were adjusted for blood cell type composition (CD4+ and CD8+ T cells, natural killer cells (NK), monocytes, granulocytes, and B cells), which was estimated from raw methylation data [55, 56]. Models of plasma proteins and serum metabolites were adjusted for time to last meal and hour of blood collection, and models for urine were adjusted for sample type (morning, night, or the pool of both). Serum and urinary metabolites were additionally corrected for technical batch. Technical batch effects of the other omics datasets were eliminated during the quality control process, as described in previous sections.
Multiple-testing correction was performed within each molecular layer following different approaches. For omics with more than 1000 features (methylation, gene expression, and miRNAs), we used false discovery rate (FDR)–Benjamini-Hochberg correction . For the remaining omics, we divided the nominal p value by the effective number of tests  (proteins—18.4, p value = 2.72E−03; serum metabolites—60.3, p value = 8.29E−04; urinary metabolites—33.3, p value = 1.50E−03).
The effect of smoking on DNA methylation is reported as a difference in methylation levels between exposed and unexposed children, while for the other omics, the effect is reported as a log2 fold change (log2FC). To examine whether different definitions of exposure (i.e., any vs. sustained) or different adjustments yielded increased or decreased magnitudes of association, we calculated the percent change in the coefficients (β) between the two models (i.e., sign(max(βSust.;βAny) × (βSust. − βAny)/|βAny| × 100)).
Dose and duration of the effect
To study the effect of dose and duration during pregnancy, we fitted linear regressions between the levels of the molecular biomarker (outcome) and the exposure to tobacco smoking (predictor) defined in the following categories: “unexposed,” “SHS,” “non-sustained smoker,” “sustained smoker at low dose (≤ 9 cigarettes per day),” and “sustained smoker at high dose (> 9 cigarettes per day),” where “unexposed” was the reference category. Similarly, for the childhood exposure, we fitted linear regressions between the levels of the molecular biomarker (outcome) versus the exposure to tobacco smoking (predictor) defined in the following categories: “unexposed,” “exposed to SHS only outside home,” “exposed to SHS only inside home,” and “exposed to SHS inside and outside home,” where “unexposed” was considered the reference category.
Expression quantitative trait methylation
For CpGs significantly associated with exposure to tobacco smoke, we searched for cis eQTMs using methylation and gene expression data from 874 HELIX children. Cis effects were defined in a window of 1 Mb from the transcription start site (TSS) of each TC (each gene). The association between DNA methylation and gene expression levels was assessed via 1420 linear regressions. Models were adjusted for cohort, child’s age, sex, zBMI, ancestry, and blood cell type composition. To control for gene expression batch effect, we estimated surrogate variables (SVs) with the SVA R package , protecting covariates (cohort, child’s age, sex, zBMI, ancestry, and blood cell type composition), and the effect of these SVs was eliminated from the gene expression matrix by calculating the residuals.
Direct comparison with the literature and enrichment analyses
We compared findings in our study with previous literature on the molecular effects of exposure to tobacco smoke in pregnancy (DNA methylation ) and of own smoking in adults (DNA methylation , gene expression , miRNA expression , serum metabolites ). When possible, we did a detailed comparison with previous findings; for genome-wide omics without significant marks in our study, we conducted enrichment analyses. To explore enrichment of our results for molecular marks (CpGs/genes) identified previously for current smoking in adults, we tested whether the distribution of p values at these marks in our data deviates from a null distribution using one-sided Kolmogorov-Smirnov tests, and compared the direction of effect estimate to what was reported previously.
On the molecular marks that survived multiple-testing correction, we performed sensitivity analyses to test the robustness of results under the following scenarios: (i) for models of any and sustained maternal smoking in pregnancy, we tested the effect of adjusting for home-SHS, instead of global-SHS; (ii) we compared main models, which are mutually adjusted for the two exposure periods (i.e., maternal smoking in pregnancy adjusted for childhood SHS), versus unadjusted models (i.e., maternal smoking in pregnancy without adjustment for childhood SHS); and (iii) finally, we restricted the analysis to children of European ancestry (N = 1083).
All the analyses were done in R environment. The R packages MultiDataSet , rexposome, and omicRexposome  were used to manage and analyze the omics and exposure data. ggplot2 , qqman , calibrate , sjPlot , OmicCircos , and coMET  R packages were used to visualize the results.
The study included 1203 children, aged 6 to 11 years, from the Human Early Life Exposome (HELIX) project that had complete information on pregnancy and childhood exposure to tobacco smoking and data on at least one omics platform . These children were from longitudinal cohorts in 6 European countries, 90.1% were of European ancestry, 54.5% were males, and 51.5% were born from highly educated mothers (Table 1). Molecular features measured at an average age of 8.1 years included blood DNA methylation, blood gene and miRNA expression, plasma proteins, and sera and urinary metabolites (Table 2). For 93% of the children, omics data from at least 4 platforms was available (Table 1).
Exposure to tobacco smoking
Out of the 1203 mothers, 14.6% reported having smoked at some point during pregnancy (any smoking), and 9.5% reported having smoked throughout whole pregnancy (sustained smokers) (Table 1). When considering information on dose and duration during pregnancy (n = 1193), 55.6% of the mothers were unexposed, 30.2% were exposed to SHS, 5.2% were non-sustained smokers, 7.8% were sustained smokers at low dose (≤ 9 cigarettes per day), and 1.3% were sustained heavy smokers (Fig. 1a).
The frequency of children exposed to global-SHS, meaning exposure at home or in other places, was 35.4% (Table 1): 17.2% were exposed only outside home, 10.9% only at home, and 7% in both places (0.4% did not had enough information to be classified in any of these categories) (Fig. 1b). Moreover, 17.5% of the children had urinary cotinine levels over the limit of detection (LOD) (Table 1). The correlations of cotinine measures with global-SHS and home-SHS were 0.7 and 0.8, respectively (Table 3).
The correlations between in utero and childhood exposures are shown in Table 3. The highest correlation of maternal smoking in pregnancy was with child cotinine levels (0.7), then with home-SHS (0.6), and finally with global-SHS (0.4). Considering a combination of any maternal smoking in pregnancy and global-SHS exposure during childhood, 59% of the children were not exposed to any smoking, 5.6% were exposed only in pregnancy, 26.4% only after birth, and 9.1% in both periods (Fig. 1c).
The proportion of exposed children was highly dependent on the cohort (Fig. 1, Additional file 3: Fig. S3). Southern European cohorts (RHEA-Greece, INMA-Spain, and EDEN-France) had the highest percentage of smoking mothers, and maternal or child exposure to SHS was highest in RHEA, in INMA, and also in KAUNAS-Lithuania. In MoBa-Norway, only 20.8% of the children were exposed to prenatal or childhood tobacco smoke, while in RHEA, this percentage rose to 73.6%.
Association between maternal smoking in pregnancy and child DNA methylation
Screening and comparison with the literature
After controlling for childhood global-SHS, any and sustained maternal smoking in pregnancy were associated with altered child blood DNA methylation. Lambda inflation factors ranged from 0.951 to 1.003 (Additional file 3: Fig. S4). At 5% false discovery rate (FDR), a total of 41 unique CpGs were differently methylated, when comparing any or sustained maternal smoking in pregnancy to non-smoking: 24 were associated with both smoking definitions, 3 with any, and 14 with sustained, although all of them were at least nominally significant in both models (Additional file 1: Table S3). These 41 CpGs were located in 18 loci, defined as regions of < 2 Mb, and were distributed along the genome (Fig. 2). Around 30% of the CpG sites were hypo-methylated (lower methylation in exposed children) (Additional file 3: Fig. S4), and CpGs located in the same locus were affected in the same direction, except for the AHRR locus (Fig. 2). Differential DNA methylation at 17 out of the 18 loci had previously been reported in relation to sustained maternal smoking in pregnancy in cord blood  (Additional file 1: Table S3). Moreover, persistent effects until childhood were described for 16 of them . The unique locus not previously related to maternal smoking in pregnancy was FMN1 (Formin 1), but other CpGs in that locus were found differently methylated in current smokers in the opposite direction . Thirteen out of the 18 loci (27 out of the 41 CpGs) had at least one genetic variant (mQTL), in cis and/or trans, associated with methylation levels (Additional file 1: Table S3).
Effects of dose and duration
As expected, stronger effects were observed for sustained maternal smoking in pregnancy compared with any maternal smoking in pregnancy in 39 out of the 41 CpGs (Additional file 3: Fig. S5). The mean absolute percentage change of methylation from any to sustained maternal smoking in pregnancy models was 32.8% (Additional file 1: Table S3). The effect of dose and/or duration of maternal smoking in pregnancy on DNA methylation was investigated. At visual inspection, we identified 4 different illustrative patterns: (i) CpGs exhibiting an increased or decreased DNA methylation tendency with increasing maternal smoking in pregnancy dose and/or duration (Fig. 3a, b); (ii) CpGs with an increased or decreased DNA methylation tendency only with increasing dose of sustained maternal smoking in pregnancy, but without response to any maternal smoking in pregnancy (Fig. 3c, d); (iii) CpGs with a saturated pattern at any maternal smoking in pregnancy (Fig. 3e); and (iv) CpGs with a saturated pattern at sustained maternal smoking in pregnancy (Fig. 3f). The plots for all 41 CpGs are shown in Additional file 4: Fig. S6 (10 showing a linear trend, 4 with a dose response in sustained smokers, 3 saturated with any smoking, and 8 saturated with sustained smoking). Some CpGs did not show a clear pattern, and others located at the same locus showed different patterns. Maternal exposure to SHS was only nominally associated (p value < 0.05) with 2 of these 41 CpGs (cg11902777 and cg17454592).
Search for gene expression quantitative trait methylation
We, then, examined whether the 41 CpGs might be eQTMs. A total of 480 unique transcript clusters (TCs, equivalent to known or putative genes) with their transcription start site (TSS) within ± 500 kb of the 41 CpGs were identified. At 5% FDR, 15 methylation to expression relationships were found, which included 12 unique CpGs in 5 loci and 7 unique TCs (Fig. 2, Additional file 1: Table S4). All eQTMs, except cg21161138 (AHRR)-TC05002792.hg.1, occurred between CpGs and genes located at < 160 kb (Additional file 3: Fig. S7). None of the eQTMs genes corresponded to the most proximal gene to the CpG site. Two out of the 15 eQTM relationships were inverse, meaning higher methylation-lower gene expression, while the others were positive. The inverse associations were between PNOC and a CpG (cg17199018) located downstream the gene, and between EXOC3 and a CpG (cg11902777) located upstream.
With a nominal p value < 0.05, 65 other eQTMs were detected, in total, involving 33 unique CpGs in 14 out of the 18 loci (Additional file 1: Table S4). Only the methylation levels of CpGs at the gene body of GFI1 and AHRR were associated with the expression of the same genes. AHRR, associated with maternal smoking in pregnancy  and current smoking in adults , is an interesting example. Five CpGs in the AHRR locus were associated with maternal smoking in pregnancy: 2 hyper-methylated located at intron 1 (cg17924476, cg23067299), and 3 hypo-methylated at other introns (cg11902777, cg05575921, cg21161138) (Fig. 4). Hyper-methylated CpGs in relation to maternal smoking in pregnancy were positive eQTMs for AHRR, PDCD6, and EXOC3 genes, while hypo-methylated CpGs were inverse eQTMs for the same genes (in both cases implying higher expression of the genes).
Association between maternal smoking in pregnancy and other child molecular phenotypes
Besides child DNA methylation, we also screened the association between maternal smoking in pregnancy and the other molecular layers: gene and miRNA transcription, plasma proteins, and serum and urinary metabolites.
After multiple-testing correction, only 2 associations between maternal smoking in pregnancy and urinary metabolites were detected. Urinary alanine and lactate were increased in children of mothers classified as sustained smokers during pregnancy compared to children of non-smoking mothers (effect = 0.189 and 0.174, p value = 3.93E−04 and 5.19E−04, respectively) (Additional file 1: Table S5).
No associations were found between maternal smoking in pregnancy and child serum metabolites or blood gene/miRNA expression. However, given the effect of maternal smoking on child methylation and the eQTM analyses, we took a closer look at gene expression (Fig. 2). Top 10 associations can be seen in Additional file 1: Table S6, and among the 15 eQTM genes, only EXOC3 was nominally downregulated in children of smoker mothers (p value < 0.01) (Additional file 1: Table S7).
Association between childhood exposure to SHS and child molecular phenotypes
In postnatal life, childhood SHS exposure was related to child plasma protein and serum metabolite levels, but not to any other of the molecular layers (blood DNA methylation, gene/miRNA transcription, or urinary metabolites).
In particular, higher levels of PAI1 (plasminogen activator inhibitor-1) protein were found in children with urinary cotinine levels above the LOD compared to those below the LOD (effect = 0.379, p value = 1.66E−04) (Additional file 1: Table S8). PAI1 levels increased with increased frequency of exposure, with children exposed both inside and outside home having the highest PAI1 levels (Fig. 5). This association was attenuated when testing global-SHS instead of urinary cotinine levels.
On the other hand, children classified as exposed to SHS under different definitions had lower levels of several serum metabolites compared to those classified as not exposed [global-SHS: sphingomyelin (OH) C16:1 (effect = − 0.073, p value = 7.97E−05), carnitine C9 (effect = − 0.052, p value = 8.03E−04), and cotinine: PC ae C38.0 (effect = − 0.110, p value = 7.88E−04)] (Additional file 1: Table S8). We only detected a clear pattern of a dose-effect response for carnitine C9 (Fig. 5).
Comparison and enrichment for signals identified in current smokers
We, then, compared our results of exposure to tobacco smoke in children, both in pregnancy and in postnatal life, with the molecular marks identified for current smoking in adults. For serum metabolites  and miRNAs , molecular layers with a limited number of marks assessed in the omics platforms, we did a direct comparison of the findings. For genome-wide omics, we analyzed whether our findings showed any enrichment for the signals identified in studies of current smoking and DNA methylation (N = 16,223 CpGs at 5% FDR) , and gene expression (N = 1270 genes at 10% FDR) .
As in current smokers , children exposed to postnatal SHS had lower levels (p value < 0.05) of diacyl (aa)-phosphatidylcholines (PC aa C36:0, PC aa C38:0), acyl-alkyl (ae)-phosphatidylcholines (PC ae C38:0, PC ae C38:6, PC ae C40:6), and sphingomyelin (OH) C22:2 (Additional file 1: Table S9) compared to non-exposed children. None of the amino acids reported to be increased in current smokers were affected in SHS-exposed children. In contrast to this, maternal smoking in pregnancy had no effect on serum metabolites related to current smoking (Additional file 1: Table S10).
None of the miRNAs for current smoking were nominally significant in our study, either with exposure pregnancy or in childhood (Additional file 1: Table S11 and Table S12). Similarly, no enrichment for current smoking sensitive genes was observed among our results of exposure during pregnancy (enrichment p values for any and sustained maternal smoking, 0.903 and 0.842, Additional file 3: Fig. S8) or our results of exposure to childhood SHS (enrichment p values for global-SHS and child cotinine, 0.579 and 0.746, Additional file 3: Fig. S9).
Regarding DNA methylation, we also found that our results for childhood SHS were not enriched for CpGs associated with current smoking (enrichment p values for global-SHS and child cotinine, 0.034 and 0.998) (Additional file 3: Fig. S10). In contrast, we observed enrichment among our list of CpGs associated with maternal smoking in pregnancy (enrichment p values for any and sustained maternal smoking in pregnancy, 2.834E−07 and < 2.2E−16, respectively) (Additional file 3: Fig. S11). In particular, the DNA methylation at 1279 of the 16,223 current smoking sensitive CpGs  was nominally significant in children of sustained smoker mothers, 73.3% of them with consistent direction of the effect, and with a lambda inflation factor of 1.16.
Sensitivity analyses for pregnancy and childhood exposure to smoking
We conducted further sensitivity analyses for the molecular marks that survived multiple-testing correction. First, we ran additional pregnancy models adjusting for home-SHS instead of global-SHS. In general, the strength of the association was reduced, but for DNA methylation, the change in effect size between models was low (mean absolute percentage change < 5%) (Additional file 3: Fig. S12, Additional file 1: Table S13-S14).
Second, we compared adjusted and unadjusted models for exposure to smoking in the reciprocal period. For maternal smoking in pregnancy, mainly associated with DNA methylation levels, results were practically the same between models, again, with mean absolute percentage change < 5% (Additional file 3: Fig. S13, Additional file 1: Table S15-S16). For postnatal SHS, adjustment for sustained maternal smoking in pregnancy, in general, attenuated the effect sizes to a maximum of around 20% change (Additional file 1: Table S17).
Third, we repeated the analysis restricting the sample to children of European ancestry (N = 1083) (Additional file 1: Table S18-S20). In general, p values were attenuated, likely due to a smaller samples size, but effect sizes remained of similar magnitude (mean absolute percentage change < 5%) (Additional file 3: Fig. S14, Additional file 1: Table S18). The other biomarkers showed more heterogeneous patterns when evaluated in children of European ancestry only, with some of them showing an increase of the effect (Additional file 1: Table S19-S20).
Despite the efforts of public health campaigns, maternal smoking in pregnancy and childhood SHS are still main adverse avoidable risk factors for child health. This study is the first to examine the association of exposure to tobacco smoking at different windows of exposure, in utero and in childhood, with multi-layer molecular phenotypes.
Exposure to maternal smoking in pregnancy was associated with DNA methylation of 41 CpG sites located in 18 different loci. All loci had previously been related to maternal smoking during pregnancy [14,15,16] or current smoking . However, none of the previous studies had incorporated substantial transcriptomics data from the same subjects to interpret the functional consequences of these epigenetic changes. Furthermore, few studies investigated duration and intensity of maternal smoking in pregnancy, which might be relevant for public health advice [14, 16].
As in cord blood [14, 16], sustained maternal smoking in pregnancy produced larger effects on childhood blood DNA methylation than any maternal smoking, with the latter group including sustained smoker mothers as well as mothers that usually only smoked during the 1st trimester. Moreover, when considering duration and intensity of maternal smoking in pregnancy, some CpGs showed a dose-response trend, whereas others got saturated with any maternal smoking in pregnancy or did not have any meaningful response. These heterogeneous patterns, even in the same locus, can be explained by CpG-specific responses, but also by less accuracy in the measurement of some CpGs (low biological response to high technical noise) . In any case, the persistence and the linear trend response of CpGs in MYO1G, GNG12, AHRR, FRMD4A, RADIL, and 7q11.22 make them interesting candidates for the development of an epigenetic biomarker for in utero exposure to smoking . In general, maternal SHS during pregnancy had mostly negligible effects on offspring blood DNA methylation at the 41 significant CpGs, or at least their effects were diluted over time and not detected in childhood with the actual sample size.
We also observed that DNA methylation in 5 of these 18 loci was related to gene expression of nearby genes, but usually not of the closest annotated gene. However, these effects were weak as we did not detect significant associations between maternal smoking in pregnancy and expression of these genes. In other words, DNA methylation response to maternal smoking in pregnancy was not mirrored at the transcriptional level of nearby genes. Similarly, previous studies in former smokers have shown that smoking has a longer-lasting influence on the methylome compared to the transcriptome . The reversal rate of gene expression at 1 year after smoking cessation has been calculated in > 50% and reaches > 85% after 10 years, whereas for methylation, it ranges from 17 to 33% with some effects still visible 40 years after smoking cessation. The different reversal rates between methylation and transcription could be explained by the complex transcriptional regulation that involves mechanisms other than DNA methylation. Whether persistent epigenetic marks act as a memory of the cell to previous exposures, to trigger rapid or amplified transcriptional activation in certain contexts (i.e., after a second exposure event), is unknown. Also, the weak association between exposure to tobacco smoke and transcription, in comparison to methylation, might be explained by the highest instability of the RNA compared to DNA, which might have introduced noise into the transcriptional data.
AHRR (Aryl-Hydrocarbon Receptor Repressor), which is involved in xenobiotic detoxification, cell growth, and differentiation, has widely been reported in relation to smoking. In particular, cg05575921 has been found to be hypo-methylated in cord blood , adult blood , placenta , and adipose tissue . AHRR is an interesting example to discuss the complexity of epigenetic regulation. First, in our study, AHRR exhibited both hyper-methylation (intron 1) and hypo-methylation (other introns) in response to smoking. Through the eQTM analyses, we found that both hyper- and hypo-methylation were related to increased expression of the gene in blood. This finding evidences that epigenetic regulation of transcription is gene context-specific (i.e., intron 1 behaves different from other introns) and that methylation-expression correlations are fundamental to understand final transcriptional consequences. Second, besides AHRR gene, methylation at CpGs of this locus was also associated with the expression of two other nearby genes: PDCD6 and EXOC3. PDCD6 (Programmed Cell Death 6) is a calcium sensor involved in endoplasmic reticulum (ER)-Golgi vesicular transport, endosomal biogenesis, or membrane repair, and EXOC3 (Exocyst Complex Component 3) is a component of the exocyst complex involved in the docking of exocytic vesicles with fusion sites on the plasma membrane. Further research might clarify the potential role of these genes, if any, in relation to tobacco smoking.
Two metabolites (lactate and alanine), known to be increased with glycemic dysregulation , were found at higher levels in urine of children born from sustained smoker mothers compared with non-smokers. Although there are some studies reporting an association between maternal smoking and type 2 diabetes and metabolic syndrome, the evidences are still inconclusive according to a recent meta-analysis .
In contrast to in utero exposure, exposure to childhood SHS, assessed through either questionnaire or cotinine, was associated with child serum metabolites and plasma proteins. In particular, we found that SHS, defined as urinary cotinine above the LOD, increased plasma PAI1 (plasminogen activator inhibitor-1 SERPINE1 gene) protein levels. PAI1 is the principal inhibitor of tissue plasminogen activator (tPA) and urokinase (uPA), enzymes that convert plasminogen into plasmin (fibrinolysis) (Additional file 3: Fig. S15). Therefore, higher levels of PAI1 are indicative of a thrombotic state, and they have been found in active smokers [70, 71]. Although the increase of plasma PAI1 levels in SHS-exposed children was substantially smaller than the increase detected in active smokers , our findings evidence that SHS was sufficient to produce a pro-thrombotic state in children. The long-term consequences of this pro-thrombotic state in children, if prolonged over time, are unknown, but in adults, it is linked to age-related subclinical (i.e., inflammation or insulin resistance) or clinical (i.e., myocardial infarction, obesity) conditions .
We also found several serum metabolites altered in SHS-exposed children. Reduced levels of diacyl (aa)- and acyl-alkyl (ae)-phosphatidylcholines and of sphingomyelin (OH) C22:2 were in agreement with findings in active adult smokers . Is it worth noting that these diacyl (aa)- and acyl-alkyl (ae)-phosphatidylcholines overlap with those positively associated with adherence to Mediterranean diet and with protective risk factors for cardiovascular disease . We considered the reported location of childhood SHS exposure (inside home, outside, and in both places) as a surrogate of intensity of exposure to smoking. While PAI1 plasma protein levels and carnitine C9 were higher in children exposed in both locations, no clear dose-response patterns were observed for other metabolites.
Given that SHS effects might be subtler than those of active smoking, we also examined whether molecular features (CpG methylation or gene/miRNA transcription) described in current smokers overlapped with our findings of postnatal SHS with a less stringent p value cutoff. We did not detect any enrichment, suggesting that if postnatal SHS has an effect on these molecular layers, our sample size is too limited to detect it. Conversely, as an indication of the long-term and strong effects of active smoking, we did find enrichment for child CpGs associated with maternal smoking in pregnancy among CpGs described for current smoking in other studies. Indeed, it has been described a remarkable overlap between the blood methylation signatures detected in adult smokers and in newborns of smoker mothers .
Globally, our findings together with previous literature suggest that offspring blood DNA methylation captures strong and permanent effects associated with active maternal smoking during pregnancy. In our study, the associations between maternal smoking during pregnancy and DNA methylation were not attenuated after adjustment for childhood SHS, likely due the weaker effects of passive compared to active smoking. In contrast, the potential biological effects of SHS were best captured by dynamic molecules with fast responses, such as metabolites and proteins. Time-course studies will be needed to dissect this acute response in more detail. Adjustment for maternal smoking during pregnancy attenuated the effects of childhood SHS on these markers, highlighting the importance of mutually adjusted models in order to identify period-specific effects.
Findings should be considered within the context of the study’s limitations. First, exposure assessment to tobacco smoking, through either questionnaire or cotinine, has some intrinsic limitations. Maternal smoking in pregnancy and child exposure to SHS were self- or parental-reported, and they can be subject to misreporting . Urinary cotinine, although more objective, only provides information about the most recent exposure (half-life in urine ~ 20 h) . In our study, cotinine detection correlated strongly with the childhood SHS classification though, giving us reasonable confidence in the questionnaire reports. Second, we aimed to dissect pregnancy from childhood exposure associations using mutually adjusted models. However, misclassification and weaker effects of SHS compared to effects of maternal smoking in pregnancy (i.e., PAI1 or AHRR ), as well as the high level of overlap between maternal and childhood smoking exposure, might have limited our ability to distinguish between these two time periods. Larger samples of children exposed to SHS, without in utero exposure, might be needed to investigate SHS, especially for DNA methylation. Third, although the statistical models were adjusted for an exhaustive list of confounders, including child zBMI, we cannot completely rule out residual confounding. For example, plasma PAI1, which can be released by fat cells , is related to BMI and percent body fat . Fourth, although the study has been designed as a comprehensive screening using high-throughput omics platforms, these platforms do not have complete coverage of the molecular layers, and consequently, we might have missed some biological signals. Also, cell type-specific responses might have been diluted within the context of whole blood analyses. Fifth, some of the smoking sensitive CpGs detected in our study are known to be regulated by mQTLs. The role of genetic variation in modifying the effects of the exposure to smoking deserves future research. Finally, our study predominantly consisted of European ancestry children, and thus, additional studies involving diverse ethnic backgrounds are needed in order to improve the generalizability of the findings. Potential confounding by ancestry was controlled by adjusting the models for self-reported ethnic origin. For the top signals, we also performed a sensitivity analysis restricted to European ancestry children and results did not change substantially.
Our study investigated the in utero and postnatal effects of exposure to tobacco smoke on 4 molecular layers, assessed through a harmonized protocol, in 1203 children across Europe. Our results confirmed previous findings of persistent associations between maternal smoking in pregnancy and childhood blood DNA methylation and showed dose-response trends at some CpG sites. These might be informative for the development of an epigenetic risk score of in utero exposure to tobacco smoke. The persistent methylation signature related to in utero exposure to smoking was not mirrored at the transcriptional level. The meaning of the gap between methylation and transcriptional signals requires further investigation, but might mean a methylation-based memory of the cell. Childhood SHS was not related to blood methylation, indicating much weaker effects of recent SHS with respect to active maternal smoking in pregnancy. In contrast, childhood SHS was related to higher plasma levels of PAI1, a protein that inhibits fibrinolysis, and to certain metabolites. The final clinical impact of sustained increased levels of PAI1 in children is unknown, but this finding highlights the importance of the analysis of molecular traits to capture subtler effects at earlier timepoints.
Availability of data and materials
Summarized results of each model (exposure, marker, effect, SE, p value) will be uploaded on the HELIX webpage (https://helixomics.isglobal.org/). Raw data can be obtained under request, after signature of a Data Transfer Agreement (DTA).
Body mass index
Coefficient of variation
Directed acyclic graph
Expression quantitative trait methylation
Limit of detection
Limit of quantification
Methylation quantitative trait locus
Maternal smoking during pregnancy
Surrogate variable analysis
Transcription start site
Gollwitzer ES, Marsland BJ. Impact of early-life exposures on immune maturation and susceptibility to disease. Trends Immunol. 2015;36(11):684–96. https://doi.org/10.1016/j.it.2015.09.009.
Hanson MA, Gluckman PD. Early developmental conditioning of later health and disease: physiology or pathophysiology? Physiol Rev. 2014;94:1027–76. https://doi.org/10.1152/physrev.00029.2013.
Mund M, Louwen F, Klingelhoefer D, Gerber A. Smoking and pregnancy--a review on the first major environmental risk factor of the unborn. Int J Environ Res Public Health. 2013;10:6485–99. https://doi.org/10.3390/ijerph10126485.
CDC. The health consequences of smoking—50 years of progress a report of the surgeon general. A Rep Surg Gen. 2014; (this is a report downloaded from: https://www.cdc.gov/tobacco/data_statistics/sgr/50th-anniversary/index.htm#report).
Smedberg J, Lupattelli A, Mårdby A-C, Nordeng H. Characteristics of women who continue smoking during pregnancy: a cross-sectional study of pregnant women and new mothers in 15 European countries. BMC Pregnancy Childbirth. 2014;14:213. https://doi.org/10.1186/1471-2393-14-213.
Oberg M, Jaakkola MS, Woodward A, Peruga A, Prüss-Ustün A. Worldwide burden of disease from exposure to second-hand smoke: a retrospective analysis of data from 192 countries. Lancet (London, England). 2011;377:139–46. https://doi.org/10.1016/S0140-6736(10)61388-8.
Veeranki SP, Mamudu HM, Zheng S, John RM, Cao Y, Kioko D, et al. Secondhand smoke exposure among never-smoking youth in 168 countries. J Adolesc Health. 2015;56(2):167–73. https://doi.org/10.1016/j.jadohealth.2014.09.014.
Llaquet H, Pichini S, Joya X, Papaseit E, Vall O, Klein J, et al. Biological matrices for the evaluation of exposure to environmental tobacco smoke during prenatal life and childhood. Anal Bioanal Chem. 2010;396(1):379–99. https://doi.org/10.1007/s00216-009-2831-8.
Mattes W, Yang X, Orr MS, Richter P, Mendrick DL. Biomarkers of tobacco smoke exposure. Adv Clin Chem. 2014;2014(67):1–45. https://doi.org/10.1016/bs.acc.2014.09.001.
Reese SE, Zhao S, Wu MC, Joubert BR, Parr CL, Håberg SE, et al. DNA methylation score as a biomarker in newborns for sustained maternal smoking during pregnancy. Environ Health Perspect. 2016; https://doi.org/10.1289/EHP333.
Bauer T, Trump S, Ishaque N, Thürmann L, Gu L, Bauer M, et al. Environment-induced epigenetic reprogramming in genomic regulatory elements in smoking mothers and their children. Mol Syst Biol. 2016;12:861. https://doi.org/10.15252/msb.20156520.
Wiklund P, Karhunen V, Richmond RC, Parmar P, Rodriguez A, De Silva M, et al. DNA methylation links prenatal smoking exposure to later life health outcomes in offspring. Clin Epigenetics. 2019;11:97. https://doi.org/10.1186/s13148-019-0683-4.
Morales E, Vilahur N, Salas LA, Motta V, Fernandez MF, Murcia M, et al. Genome-wide DNA methylation study in human placenta identifies novel loci associated with maternal smoking during pregnancy. Int J Epidemiol. 2016:1644–55. https://doi.org/10.1093/ije/dyw196.
Joubert BR, Håberg SE, Nilsen RM, Wang X, Vollset SE, Murphy SK, et al. Research | Children’s health 450K epigenome-wide scan identifies differential DNA methyla in newborns related to maternal smoking during pregnancy. Environ Health Perspect. 2012;120:1425–32.
Joubert BR, Felix JF, Yousefi P, Bakulski KM, Just AC, Breton C, et al. DNA methylation in newborns and maternal smoking in pregnancy: genome-wide consortium meta-analysis. Am J Hum Genet. 2016;98:680–96.
Richmond RC, Simpkin AJ, Woodward G, Gaunt TR, Lyttleton O, McArdle WL, et al. Prenatal exposure to maternal smoking and offspring DNA methylation across the lifecourse: findings from the Avon Longitudinal Study of Parents and Children (ALSPAC). Hum Mol Genet. 2015;24:2201–17.
Lee KWK, Richmond R, Hu P, French L, Shin J, Bourdon C, et al. Prenatal exposure to maternal cigarette smoking and DNA methylation: epigenome-wide association in a discovery sample of adolescents and replication in an independent cohort at birth through 17 years of age. Environ Health Perspect. 2015;123(2):193–9. https://doi.org/10.1289/ehp.1408614 Epub 2014 Oct 17.
Tehranifar P, Wu H-C, McDonald JA, Jasmine F, Santella RM, Gurvich I, et al. Maternal cigarette smoking during pregnancy and offspring DNA methylation in midlife. Epigenetics. 2018;13:129–34. https://doi.org/10.1080/15592294.2017.1325065.
Xu T, Holzapfel C, Dong X, Bader E, Yu Z, Prehn C, et al. Effects of smoking and smoking cessation on human serum metabolite profile: results from the KORA cohort study. BMC Med. 2013;11:60. https://doi.org/10.1186/1741-7015-11-60.
Huan T, Joehanes R, Schurmann C, Schramm K, Pilling LC, Peters MJ, et al. A Whole-blood transcriptome meta-analysis identifies gene expression signatures of cigarette smoking. Hum Mol Genet. 2016;25:ddw288. https://doi.org/10.1093/hmg/ddw288.
Joehanes R, Just AC, Marioni RE, Pilling LC, Reynolds LM, Mandaviya PR, et al. Epigenetic signatures of cigarette smoking. Circ Cardiovasc Genet. 2016;9:436–47.
Willinger CM, Rong J, Tanriverdi K, Courchesne PL, Huan T, Wasserman GA, et al. MicroRNA signature of cigarette smoking and evidence for a putative causal role of microRNAs in smoking-related inflammation and target organ damage clinical perspective. Circ Cardiovasc Genet. 2017;10:e001678. https://doi.org/10.1161/CIRCGENETICS.116.001678.
Reynolds LM, Magid HS, Chi GC, Lohman K, Barr RG, Kaufman JD, et al. Secondhand tobacco smoke exposure associations with DNA methylation of the aryl hydrocarbon receptor repressor. Nicotine Tob Res. 2017;19(4):442–51. https://doi.org/10.1093/ntr/ntw219.
Maitre L, de Bont J, Casas M, Robinson O, Aasvang GM, Agier L, et al. Human Early Life Exposome (HELIX) study: a European population-based exposome cohort. BMJ Open. 2018;8:e021311. https://doi.org/10.1136/bmjopen-2017-021311.
Wright J, Small N, Raynor P, Tuffnell D, Bhopal R, Cameron N, et al. Cohort profile: the born in Bradford multi-ethnic family cohort study. Int J Epidemiol. 2013;42(4):978–91. https://doi.org/10.1093/ije/dys112 Epub 2012 Oct 12.
Heude B, Forhan A, Slama R, Douhaud L, Bedel S, Saurel-Cubizolles M-J, et al. Cohort profile: the EDEN mother-child cohort on the prenatal and early postnatal determinants of child health and development. Int J Epidemiol. 2016;45:353–63.
Guxens M, Ballester F, Espada M, Fernández MF, Grimalt JO, Ibarluzea J, et al. Cohort profile: the INMA—INfancia y Medio Ambiente—(environment and childhood) project. Int J Epidemiol. 2012;41:930–40.
Grazuleviciene R, Danileviciute A, Nadisauskiene R, Vencloviene J. Maternal smoking, GSTM1 and GSTT1 polymorphism and susceptibility to adverse pregnancy outcomes. Int J Environ Res Public Health. 2009;6:1282–97.
Magnus P, Birke C, Vejrup K, Haugan A, Alsaker E, Daltveit AK, et al. Cohort profile update: the Norwegian mother and child cohort study (MoBa). Int J Epidemiol. 2016;45:382–8. http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=27063603.
Chatzi L, Leventakou V, Vafeiadi M, Koutra K, Roumeliotaki T, Chalkiadaki G, et al. Cohort profile: the mother-child cohort in Crete, Greece (Rhea Study). Int J Epidemiol. 2017;46:1392–1393k.
Revelle MW. psych: procedures for personality and psychological research (R package); 2017.
Lehne B, Drong AW, Loh M, Zhang W, Scott WR, Tan S-T, et al. A coherent approach for analysis of the Illumina HumanMethylation450 BeadChip improves data quality and performance in epigenome-wide association studies. Genome Biol. 2015;16:37.
Fortin J-P, Labbe A, Lemire M, Zanke BW, Hudson TJ, Fertig EJ, et al. Functional normalization of 450k methylation array data improves replication in large cancer studies. Genome Biol. 2014;15:503.
Chen Y, Lemire M, Choufani S, Butcher DT, Grafodatskaya D, Zanke BW, et al. Discovery of cross-reactive probes and polymorphic CpGs in the Illumina Infinium HumanMethylation450 microarray. Epigenetics. 2013;8:203–9.
Hansen KD. IlluminaHumanMethylation450kmanifest: annotation for Illumina’s 450k methylation arrays. R. Package Version 0.4.0; 2012.
Johnson WE, Li C, Rabinovic A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2007;8:118–27. https://doi.org/10.1093/biostatistics/kxj037.
Buckberry S, Bent SJ, Bianco-Miotto T, Roberts CT. MassiR: a method for predicting the sex of samples in gene expression microarray datasets. Bioinformatics. 2014;30(14):2084–5. https://doi.org/10.1093/bioinformatics/btu161.
Teschendorff AE, Zhuang J, Widschwendter M. Independent surrogate variable analysis to deconvolve confounding factors in large-scale microarray profiling studies. Bioinformatics. 2011;27(11):1496–505. https://doi.org/10.1093/bioinformatics/btr171.
Chen J, Behnam E, Huang J, Moffatt MF, Schaid DJ, Liang L, et al. Fast and robust adjustment of cell mixtures in epigenome-wide association studies with SmartSVA. BMC Genomics. 2017;18:413. https://doi.org/10.1186/s12864-017-3808-1.
Package “SmartSVA”. 2017.
Hernandez-Ferrer C, Wellenius GA, Tamayo I, Basagaña X, Sunyer J, Vrijheid M, et al. Comprehensive study of the exposome and omic data using rexposome Bioconductor packages. Bioinformatics. 2019;35(24):5344–5. https://doi.org/10.1093/bioinformatics/btz526.
Mestdagh P, Hartmann N, Baeriswyl L, Andreasen D, Bernard N, Chen C, et al. Evaluation of quantitative miRNA expression platforms in the microRNA quality control (miRQC) study. Nat Methods. 2014;11:809–15. https://doi.org/10.1038/nmeth.3014.
Suo C, Salim A, Chia K-S, Pawitan Y, Calza S. Modified least-variant set normalization for miRNA microarray. RNA. 2010;16:2293–303. https://doi.org/10.1261/rna.2345710.
Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al. Limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43:e47. https://doi.org/10.1093/nar/gkv007.
Leek JT, Johnson WE, Parker HS, Jaffe AE, Storey JD. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012;28:882–3.
Nadarajah S, Kotz S. The exponentiated type distributions. Acta Appl Math. 2006;92:97–111.
Lau CHE, Siskos AP, Maitre L, Robinson O, Athersuch TJ, Want EJ, et al. Determinants of the urinary and serum metabolome in children from six European populations. BMC Med. 2018;16:202. https://doi.org/10.1186/s12916-018-1190-8.
Siskos AP, Jain P, Römisch-Margl W, Bennett M, Achaintre D, Asad Y, et al. Interlaboratory reproducibility of a targeted metabolomics platform for analysis of human serum and plasma. Anal Chem. 2017;89(1):656–65. https://doi.org/10.1021/acs.analchem.6b02930.
Veselkov KA, Lindon JC, Ebbels TMD, Crockford D, Volynkin VV, Holmes E, et al. Recursive segment-wise peak alignment of biological (1) h NMR spectra for improved metabolic biomarker recovery. Anal Chem. 2009;81:56–66.
Textor J, van der Zander B, Gilthorpe MS, Liśkiewicz M, Ellison GTH. Robust causal inference using directed acyclic graphs: the R package ‘dagitty’. Int J Epidemiol. 2017:dyw341. https://doi.org/10.1093/ije/dyw341.
WHO | BMI-for-age (5-19 years). WHO. 2015. http://www.who.int/growthref/who2007_bmi_for_age/en/.
de Onis M, Onyango AW, Borghi E, Siyam A, Nishida C, Siekmann J. Development of a WHO growth reference for school-aged children and adolescents. Bull World Health Organ. 2007;85:660–7.
White IR, Royston P, Wood AM. Multiple imputation using chained equations: issues and guidance for practice. Stat Med. 2011;30:377–99.
van Buuren S, Groothuis-Oudshoorn K. Mice: multivariate imputation by chained equations in R. J Stat Softw. 2011;45:1–67.
Reinius LE, Acevedo N, Joerink M, Pershagen G, Dahlén S-E, Greco D, et al. Differential DNA methylation in purified human blood cells: implications for cell lineage and studies on disease susceptibility. PLoS One. 2012;7:e41361.
Houseman EA, Kile ML, Christiani DC, Ince TA, Kelsey KT, Marsit CJ. Reference-free deconvolution of DNA methylation data and mediation by cell composition effects. BMC Bioinformatics. 2016;17:259. https://doi.org/10.1186/s12859-016-1140-4.
Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc B (Methodol). 1995;57:289–300.
Li M-X, Yeung JMY, Cherny SS, Sham PC. Evaluating the effective numbers of independent tests and significant p-value thresholds in commercial genotyping arrays and public imputation reference datasets. Hum Genet. 2012;131:747–56.
Hernandez-Ferrer C, Ruiz-Arenas C, Beltran-Gomila A, González JR. MultiDataSet: an R package for encapsulating multiple data sets with application to omic data integration. BMC Bioinformatics. 2017;18:36.
Wickham H. ggplot2: elegant graphics for data analysis. J Stat Softw. 2017.
Stephen Turner A, Stephen TM. Package “qqman” title Q-Q and Manhattan plots for GWAS data; 2017.
Jan Graffelman A, Jan GM. Package “calibrate” title calibration of scatterplot and biplot axes; 2015.
Lüdecke D. sjPlot: data visualization for statistics in social science, R package version 2.4.0. R package version 2.4.0; 2017.
Hu Y, Yan C, Hsu CH, Chen QR, Niu K, Komatsoulis GA, et al. Omiccircos: a simple-to-use R package for the circular visualization of multidimensional omics data. Cancer Informat. 2014;13:13–20. https://doi.org/10.4137/CIN.S13495 eCollection 2014.
Martin TC, Yet I, Tsai PC, Bell JT. coMET: visualisation of regional epigenome-wide association scan results and DNA co-methylation patterns. BMC Bioinformatics. 2015;16:131. https://doi.org/10.1186/s12859-015-0568-2.
Bose M, Wu C, Pankow JS, Demerath EW, Bressler J, Fornage M, et al. Evaluation of microarray-based DNA methylation measurement using technical replicates: the atherosclerosis risk in communities (ARIC) study. BMC Bioinformatics. 2014;15:312. http://www.biomedcentral.com/1471-2105/15/312.
Tsai P-C, Glastonbury CA, Eliot MN, Bollepalli S, Yet I, Castillo-Fernandez JE, et al. Smoking induces coordinated DNA methylation and gene expression changes in adipose tissue with consequences for metabolic health. bioRxiv. 2018:353581. https://doi.org/10.1101/353581.
Yousri NA, Mook-Kanamori DO, Selim MMED, Takiddin AH, Al-Homsi H, Al-Mahmoud KAS, et al. A systems view of type 2 diabetes-associated metabolic perturbations in saliva, blood and urine at different timescales of glycaemic control. Diabetologia. 2015;58(8):1855–67. https://doi.org/10.1007/s00125-015-3636-2.
Behl M, Rao D, Aagaard K, Davidson TL, Levin ED, Slotkin TA, et al. Evaluation of the association between maternal smoking, childhood obesity, and metabolic disorders: a national toxicology program workshop review. Environ Health Perspect. 2012;121:170–80. https://doi.org/10.1289/ehp.1205404.
Ozaki K, Hori T, Ishibashi T, Nishio M, Aizawa Y. Effects of chronic cigarette smoking on endothelial function in young men. J Cardiol. 2010;56:307–13. https://doi.org/10.1016/j.jjcc.2010.07.003.
Ma Q, Ozel AB, Ramdas S, McGee B, Khoriaty R, Siemieniak D, et al. Genetic variants in PLG, LPA, and SIGLEC 14 as well as smoking contribute to plasma plasminogen levels. Blood. 2014;124:3155–64. https://doi.org/10.1182/blood-2014-03-560086.
Cesari M, Pahor M, Incalzi RA. Plasminogen activator inhibitor-1 (PAI-1): a key factor linking fibrinolysis and age-related subclinical and clinical conditions. Cardiovasc Ther. 2010;28(5):e72–91. https://doi.org/10.1111/j.1755-5922.2010.00171.x.
Tong TYN, Koulman A, Griffin JL, Wareham NJ, Forouhi NG, Imamura F. A combination of metabolites predicts adherence to the Mediterranean diet pattern and its associations with insulin sensitivity and lipid homeostasis in the general population: the Fenland study, United Kingdom. J Nutr. 2019:1–11. https://doi.org/10.1093/jn/nxz263.
Sikdar S, Joehanes R, Joubert BR, Xu CJ, Vives-Usano M, Rezwan FI, et al. Comparison of smoking-related DNA methylation between newborns from prenatal exposure and adults from personal smoking. Epigenomics. 2019;11(13):1487–500. https://doi.org/10.2217/epi-2019-0066.
Aurrekoetxea JJ, Murcia M, Rebagliato M, López MJ, Castilla AM, Santa-Marina L. Determinants of self-reported smoking and misclassification during pregnancy, and analysis of optimal cut-off points for urinary cotinine: a cross-sectional study. https://doi.org/10.1136/bmjopen-2012-002034.
Fain JN. Release of inflammatory mediators by human adipose tissue is enhanced in obesity and primarily by the nonfat cells: a review. Mediat Inflamm. 2010;2010:513948. https://doi.org/10.1155/2010/513948.
Sasaki A, Kurisu A, Ohno M, Ikeda Y. Overweight/obesity, smoking, and heavy alcohol consumption are important determinants of plasma PAI-1 levels in healthy men. Am J Med Sci. 2001;322:19–23. http://www.ncbi.nlm.nih.gov/pubmed/11465242.
We would like to thank all the children and their families for their generous contribution. MoBa are grateful to all the participating families in Norway who take part in this ongoing cohort study. The authors would also like to thank Ingvild Essén, Jorunn Evandt for thorough field work, Heidi Marie Nordheim for biological sample management, and the MoBa-administrative unit.
The study has received funding from the European Community’s Seventh Framework Programme (FP7/2007-206) under grant agreement no. 308333 (the HELIX project); and the H2020-EU.3.1.2. - Preventing Disease Programme under grant agreement no 874583 (the ATHLETE project). BiB received core infrastructure funding from the Wellcome Trust (WT101597MA) and a joint grant from the UK Medical Research Council (MRC) and Economic and Social Science Research Council (ESRC) (MR/N024397/1). INMA data collections were supported by grants from the Instituto de Salud Carlos III, CIBERESP, and the Generalitat de Catalunya-CIRIT. KANC was funded by the grant of the Lithuanian Agency for Science Innovation and Technology (6-04-2014_31V-66). The Norwegian Mother, Father and Child Cohort Study is supported by the Norwegian Ministry of Health and Care Services and the Ministry of Education and Research. The RHEA project was financially supported by European projects and the Greek Ministry of Health (Program of Prevention of Obesity and Neurodevelopmental Disorders in Preschool Children, in Heraklion district, Crete, Greece: 2011–2014; “Rhea Plus”: Primary Prevention Program of Environmental Risk Factors for Reproductive Health, and Child Health: 2012–2015). The work was also supported by MICINN (MTM2015-68140-R) and Centro Nacional de Genotipado-CEGEN-PRB2-ISCIII. The CRG/UPF Proteomics Unit is part of the Spanish Infrastructure for Omics Technologies (ICTS OmicsTech), and it is a member of the ProteoRed PRB3 consortium which is supported by grant PT17/0019 of the PE I+D+i 2013-2016 from the Instituto de Salud Carlos III (ISCIII) and ERDF. We acknowledge support from the Spanish Ministry of Science and Innovation through the “Centro de Excelencia Severo Ochoa 2019-2023” Program (CEX2018-000806-S), and support from the Generalitat de Catalunya through the CERCA Program.
MV-U and CR-A were supported by a FI fellowship from the Catalan Government (FI-DGR 2015 and #016FI_B 00272). MC received funding from Instituto Carlos III (Ministry of Economy and Competitiveness) (CD12/00563 and MS16/00128).
Ethics approval and consent to participate
The six HELIX cohorts have the required permissions by national ethics committees for their cohort recruitment and follow-up visits and for secondary use of pre-existing samples and data. The work in HELIX was covered by new ethics approvals in each country. At enrolment in the HELIX project, families were asked to sign an informed consent form for the specific HELIX work including clinical examination and biospecimen collection and analysis. An Ethics Task Force was established to support the HELIX project on ethical issues, for advice on the project’s ethical compliance, identification, and alerting to changes in legislation where applicable. Specific procedures are in place within HELIX to safeguard the privacy of study subjects and confidentiality of data .
Consent for publication
No personal data is published. Families signed a consent form to participate in the study (see above).
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file 1: Table S1.
Years of enrollment in the cohort, years of HELIX vist and years when smoking was banned in each cohort (country). Table S2. Proteins targeted in each of the three Magnetic Human Luminex kits from Life Technologies. Table S3. Association of maternal smoking in pregnancy and child blood DNA methylation levels adjusted for global-SHS, ordered by chromosome position. Table S4. eQTM analyses: Association between child blood DNA methylation levels at 41 CpG sites vs. child blood expression levels of genes within ±500 kb, ordered by p-value. Table S5. Association between maternal smoking in pregnancy and child blood expression levels of top genes, ordered by p-value of sustained maternal smoking in pregnancy. Table S6. Association between maternal smoking in pregnancy and child blood expression levels of genes within ±500 kb, ordered by p-value of sustained- maternal smoking in pregnancy. Table S7. Association of maternal smoking in pregnancy and child molecular phenotypes adjusted for global-SHS. Table S8. Association of childhood SHS and child molecular phenotpyes adjusted for sustained maternal smoking in pregnancy. Table S9. Comparison of the association of own smoking in adults and SHS in children with serum metabolites, ordered by metabolite name. Table S10. Comparison of the association of own smoking in adults and SHS in children with blood miRNA expression, ordered by miRNA name. Table S11. Comparison of the association of own smoking in adults and SHS in children with blood gene expression, ordered by p-value in children (sustained maternal smoking in pregnancy). Table S12. Comparison of the association of own smoking in adults and SHS in children with blood DNA methylation, ordered by p-value in children (sustained maternal smoking in pregnancy). Table S13. Association of maternal smoking in pregnancy and child DNA methylation levels adjusted for home-SHS. Table S14. Association of maternal smoking in pregnancy and child molecular phenotypes adjusted for home-SHS. Table S15. Association of maternal smoking in pregnancy and child DNA methylation levels unadjusted for global-SHS. Table S16. Association of maternal smoking in pregnancy and child molecular phenotypes unadjusted for global-SHS. Table S17. Association of childhood SHS and child molecular phenotypes unadjusted for sustained maternal smoking in pregnancy. Table S18. Association of maternal smoking in pregnancy and child DNA methylation levels in European ancestry children adjusted for global-SHS. Table S19. Association of maternal smoking in pregnancy and child molecular phenotypes in European ancestry children adjusted for global-SHS. Table S20. Association of childhood SHS and child molecular phenotypes in European ancestry children adjusted for sustained maternal smoking in pregnancy
Additional file 3: Additional Fig. S1-S5 and Fig. S7-S15. Fig. S1.
Directed acyclic graph (DAG) with causal assumptions from a priori knowledge between maternal smoking during pregnancy (MSDP) and child molecular features. In blue, the causal path assessed in the model. Fig. S2. Directed acyclic graph (DAG) with causal assumptions from a priori knowledge between childhood SHS and child molecular features. Fig. S3. Percentage of children exposed to tobacco smoking in the study population and by cohort: any (A) and sustained (B) maternal smoking during pregnancy (MSDP), childhood global-SHS (C), and child urinary cotinine measurements (D). Fig. S4. QQ-plot and Volcano-plot of the associations between child DNA methylation and any (A and B) and sustained (C and D) maternal smoking during pregnancy (MSDP), adjusted for global-SHS. Fig. S5. Comparison of effects on child blood DNA methylation between any and sustained maternal smoking during pregnancy (MSDP), adjusted for global-SHS. Fig. S7. Plot showing significance of methylation to expression relationships (−log10(p-value)) in relation to the distance between TC-TSS and CpG. Fig. S8. QQ-plot of the associations between child blood gene expression and global-SHS (A) and urinary cotinine (B), adjusted for sustained maternal smoking during pregnancy (MSDP), among 1270 genes identified in current smokers at 10% FDR (Huan et al. 2016). Fig. S9. QQ-plot of the associations between child blood DNA methylation and global-SHS (A) and urinary cotinine (B), adjusted for sustained maternal smoking during pregnancy (MSDP), among 18,763 CpGs identified in current smoking at 5% FDR (Joehanes et al. 2016). Fig. S10. QQ-plot of the associations between child blood DNA methylation and any (A) and sustained (B) maternal smoking during pregnancy (MSDP), adjusted for global-SHS, among 18,763 CpGs identified in current smoking at 5% FDR (Joehanes et al. 2016). Fig. S11. Interaction between any maternal smoking during pregnancy (MSDP) and global-SHS. The y-axis shows predicted methylation levels at cg01664727 (at RUNX1 locus). Fig. S12. Comparison of effects of maternal smoking during pregnancy (MSDP) on child blood DNA methylation between models adjusted for global-SHS and for home-SHS. Fig. S13. Comparison of effects of maternal smoking during pregnancy (MSDP) on child blood DNA methylation between models adjusted for global-SHS and unadjusted model. Fig. S14. Comparison of effects of maternal smoking during pregnancy (MSDP) on child blood DNA methylation between datasets including all children and including only European ancestry children. Fig. S15. Schematic representation of PAI1 cascade. In red inhibition steps and in green activation steps.
Additional file 4: Fig. S6. Fig. S6.
Box plots showing the change of child blood DNA methylation compared to unexposed mothers at 41 CpGs (y-axis) by categories of dose and/or duration of exposure to tobacco smoking in pregnancy (x-axis), adjusted for global-SHS. Horizontal line in the middle of the boxes shows the mean difference in DNA methylation with respect to the reference category of unexposed mothers. Boxes represents the DNA methylation change ± standard error (SE), and vertical lines indicate extreme changes defined as ±3xSE. Legend: Mat-SHS (mothers exposed to SHS), Non-sust (non-sustained smoker mothers), Sust (= < 9) (Sustained smoker mothers at low dose – less than or equal to 9 cigarettes per day), Sust (> 9) (Sustained smoker mothers at high dose – more than 9 cigarettes per day). Other categories are self-explanatory.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Vives-Usano, M., Hernandez-Ferrer, C., Maitre, L. et al. In utero and childhood exposure to tobacco smoke and multi-layer molecular signatures in children. BMC Med 18, 243 (2020). https://doi.org/10.1186/s12916-020-01686-8
- Tobacco smoking
- Secondhand smoke
- Molecular phenotypes
- DNA methylation