- Research article
- Open Access
Body mass index and waist circumference in relation to the risk of 26 types of cancer: a prospective cohort study of 3.5 million adults in Spain
BMC Medicine volume 19, Article number: 10 (2021)
A high body mass index (BMI) has been associated with increased risk of several cancers; however, whether BMI is related to a larger number of cancers than currently recognized is unclear. Moreover, whether waist circumference (WC) is more strongly associated with specific cancers than BMI is not well established. We aimed to investigate the associations between BMI and 26 cancers accounting for non-linearity and residual confounding by smoking status as well as to compare cancer risk estimates between BMI and WC.
Prospective cohort study with population-based electronic health records from Catalonia, Spain. We included 3,658,417 adults aged ≥ 18 years and free of cancer at baseline between 2006 and 2017. Our main outcome measures were cause-specific hazard ratios (HRs) with 99% confidence intervals (CIs) for incident cancer at 26 anatomical sites.
After a median follow-up time of 8.3 years, 202,837 participants were diagnosed with cancer. A higher BMI was positively associated with risk of nine cancers (corpus uteri, kidney, gallbladder, thyroid, colorectal, breast post-menopausal, multiple myeloma, leukemia, non-Hodgkin lymphoma) and was positively associated with three additional cancers among never smokers (head and neck, brain and central nervous system, Hodgkin lymphoma). The respective HRs (per 5 kg/m2 increment) ranged from 1.04 (99%CI 1.01 to 1.08) for non-Hodgkin lymphoma to 1.49 (1.45 to 1.53) for corpus uteri cancer. While BMI was negatively associated to five cancer types in the linear analyses of the overall population, accounting for non-linearity revealed that BMI was associated to prostate cancer in a U-shaped manner and to head and neck, esophagus, larynx, and trachea, bronchus and lung cancers in an L-shaped fashion, suggesting that low BMIs are an approximation of heavy smoking. Of the 291,305 participants with a WC measurement, 27,837 were diagnosed with cancer. The 99%CIs of the BMI and WC point estimates (per 1 standard deviation increment) overlapped for all cancers.
In this large Southern European study, a higher BMI was associated with increased risk of twelve cancers, including four hematological and head and neck (only among never smokers) cancers. Furthermore, BMI and WC showed comparable estimates of cancer risk associated with adiposity.
The prevalence of obesity worldwide has nearly tripled over the past three decades, reaching 650 million adults in 2016 . Body mass index (BMI), the most common indicator of general adiposity, has been convincingly associated with at least 12 cancer types . Results from previous large cohort studies suggest that BMI is associated with a larger number of cancer types than currently recognized and that some of those associations may be non-linear [3, 4]. However, the main limitations of available studies include limited adjustment for potential confounding, reliance on self-reported weight and height, and lack of generalizability to different populations. Furthermore, although conducting analyses stratified by smoking status is critical to provide unbiased estimates of the impact of obesity on cancer risk [4, 5], many studies failed to present results stratified by smoking status, in part due to insufficient statistical power .
In addition, whether BMI as a sole indicator of general adiposity fully captures the complex association between adiposity and cancer risk is still in dispute. Central adiposity, typically assessed using waist circumference (WC), has been suggested to increase the risk of several cancer types and to better discriminate risk associated with obesity for colon and breast post-menopausal cancers [6,7,8]. However, only few studies have systematically compared the effect estimates of BMI and WC for multiple site-specific cancers, and none have studied less frequently occurring cancer types [9, 10].
The primary objective of the current study was to investigate associations between BMI and the risk of 26 types of cancer accounting for non-linearity and residual confounding by smoking status. Our secondary objective was to compare risk estimates for general (BMI) and central (WC) adiposity in relation to the risk of 26 cancer types.
Study design, setting, and data sources
We performed a cohort study with prospectively collected data from the Information System for Research in Primary Care (SIDIAP; www.sidiap.org), from January 1, 2006, until December 31, 2018. SIDIAP includes routinely recorded information by health professionals from 287 primary care centers in Catalonia, a region in Northeastern Spain [11, 12]. SIDIAP contains anonymized records for approximately six million people (80% of the Catalan population) and is representative of the Catalan population in terms of age, sex, and geographic distribution . It includes high-quality data on anthropometric measurements, disease diagnoses (International Classification for Diseases, 10th revision [ICD-10]), prescription and dispensation of drugs, laboratory tests, and demographic and lifestyle information. Further, SIDIAP is linked to the Minimum Basic Dataset (CMBD in Spanish), a population-based registry that includes hospital discharge information in Spain .
For the primary objective, we included all participants aged ≥ 18 years with a valid BMI (weight (kg)/height (m)2 between 15 and 60 kg/m2) recorded between January 1, 2006, and December 31, 2017, and subsequent eligible follow-up time (minimum of 1 year). The study’s index date was the date of the first BMI assessment during this period. We followed participants from the study index date until first incident (primary) cancer diagnosis, death, transferal out of the SIDIAP, or until the end of the study period (December 31, 2018). We excluded individuals who were older than 100 years of age at index date, had a BMI assessment only available during pregnancy (from the 3rd month of pregnancy until 2 months after delivery), had any record of cancer before the study index date, or complied with any of the end-of-follow-up criteria described above before attaining 12 months of follow-up to avoid reverse causality (Fig. 1, dataset 1). For our secondary objective, we included an additional eligibility criterion, which was to have a valid WC assessment (WC values ≥ 40 cm and ≤ 160 cm) no more than 5 years previous to or 1 year later than the index date (first BMI measurement recorded) (Fig. 1, dataset 2). If a participant had more than one WC measurement available, we selected the closest one to the index date. Figure 1 shows the flow chart of inclusion and exclusion criteria for each study objective.
Assessment of anthropometric indicators and covariates
For our primary objective, the exposure of interest was BMI as a continuous variable (in kg/m2). BMI was automatically calculated through a computer program (“Estació clínica d’atenció primària”) after general practitioners (GPs) or nurses entered the weight (kg) and height (cm) of patients they directly assessed in a standardized manner . For participants without information from that computer program, we calculated the BMI using weight and height data available in their health records (if height was not available on the same date as the weight measurement, we calculated the individuals’ mean height using all available measurements in their health records during adulthood (≥ 18 years) and we chose the closest real height value to the mean). For our secondary objective, we additionally considered WC as an exposure; this indicator was routinely measured by trained health professionals (GPs and nurses) who follow a measurement protocol . WC was measured at the umbilical level, midway between the anterior superior iliac spine and the inferior border of the rib while participants were standing.
We also extracted information on sex (women, men), age (in years), and geographic region of nationality (Spain, European [non-Spanish], Africa, America, and Asia). We assessed socioeconomic status in urban areas using the “Mortalidad en áreas pequeñas Españolas y Desigualdades Socioeconómicas y Ambientales” (MEDEA) deprivation index, which is calculated at the census tract level and was categorized into quintiles by the SIDIAP for anonymization purposes . The first and the fifth quintiles represent the least and most deprived groups of the population living in urban areas of Catalonia, respectively. We included a rural category since the MEDEA index was not available for participants living in those areas. We also extracted information on smoking status (never, former, or current smoker) and alcohol intake (none, low or high). If a participant had more than one record of smoking status and alcohol intake available, we selected the one closest to the index date within a 6-year period (5 years before and 1 year after the first BMI measurement). For type 2 diabetes, we considered any registry of a GP diagnosis (ICD-10 code E11) before the index date. For women, we included information on menopausal status and hormonal replacement therapy (HRT) use, the definitions of which can be consulted in Additional file 1: Appendix S1.
Ascertainment of cancer cases
We considered first incident cancer diagnoses as the outcomes of interest. We identified outcomes using ICD-10 codes in the SIDIAP database and ICD-9 codes in the CMBD from January 1, 2007, to December 31, 2018. We mapped ICD-9 diagnosis codes to ICD-10 using available conversion codes (eCIEMaps v3.1.9) which are provided in Additional file 1: Table S1. We used the following cancer types as outcomes: head and neck; esophagus; stomach; colorectal; liver; gallbladder and biliary tract; pancreas; larynx; trachea, bronchus, and lung; bone and articular cartilage; malignant melanoma of skin; breast (which we categorized into pre- and post-menopausal due to well-established evidence indicating different BMI relations) ; cervix uteri; corpus uteri; ovary; prostate; testis; kidney; bladder; brain and central nervous system (CNS); thyroid; Hodgkin lymphoma; non-Hodgkin lymphoma; multiple myeloma; and leukemia. All cancer diagnoses in the SIDIAP including the CMBD have been previously validated .
We described the number of excluded individuals in each step of the creation of the main dataset. We presented the overall baseline characteristics of the study participants and by the World Health Organization (WHO) BMI categories: underweight or normal weight (BMI < 18.5 kg/m2 and between ≥ 18.5 and < 25 kg/m2), overweight (BMI ≥ 25 and < 30 kg/m2), and obesity (BMI ≥ 30 kg/m2).
We fitted Cox proportional hazard models with age as the time metric to estimate cause-specific hazard ratios (HR) and 99% confidence intervals (CI) for the relation between BMI and risk of each cancer type. We stratified all models by age (5-year categories) and sex to reduce the sensitivity to violations of the proportional hazards assumption. The first (basic) model included BMI only (model 1) and the second (multivariable-adjusted) model further adjusted for smoking status, alcohol intake, type 2 diabetes, socioeconomic status, and nationality (model 2). A directed acyclic graph was used to guide decisions on the control for confounding (Additional file 1: Fig. S1) . We used a missing category for variables with missing data.
Firstly, we investigated potential non-linear associations between BMI and risk of each cancer. We considered non-linearity in BMI by fitting models using restricted cubic splines for BMI with 3 knots (placed at the 10th, 50th, and 90th percentiles) or 5 knots (placed at the 5th, 27.5th, 50th, 72.5th, and 95th percentiles). We evaluated linearity by comparing the Akaike information criterion of models with restricted splines to the model with BMI as a linear term in combination with a Wald test linearity hypothesis [20, 21]. To assess residual confounding by smoking, we re-run the multivariable-adjusted (adjusted for alcohol intake, type 2 diabetes, socioeconomic status, and nationality) models among never smokers for cancers for which we found evidence of non-linearity.
Secondly, we fitted model 2 with BMI as a linear term to estimate HRs of the relation between BMI (per 5 kg/m2 increment) and risk of each cancer type. Again, we re-run the multivariable-adjusted models (adjusted for alcohol intake, type 2 diabetes, socioeconomic status, and nationality) only among participants who reported having never smoked to explore residual confounding by smoking.
In the subsample of participants who had information on both BMI and WC (Fig. 1, dataset 2), we compared risk estimates for general (BMI) and central (WC) adiposity in relation to the risk of 26 cancers by fitting Cox proportional hazard models (one for each adiposity indicator) with age as the time metric. We estimated HRs and 99% CIs per 1 standard deviation (SD) increment of adiposity indicators (BMI and WC) to allow comparability between both estimates . We considered estimates different if the 99% CIs of the point estimates of each adiposity indicator did not overlap. We adjusted the statistical models for the same variables as in model 2, and we used the same end of follow-up definition. We only analyzed cancer types for which we ascertained at least 100 cancer cases.
Model-checking and sensitivity analyses
For all models, we checked the proportional hazard assumption by using the Schoenfeld test of proportionality and by visual inspection of the scaled Schoenfeld residuals .
We assessed the robustness of our primary objective findings by performing six sensitivity analyses. First, we accounted for residual selection bias by additionally adjusting model 2 for the number of GP consultations in the year of the index date because participants who see their GP more often may have different health behaviors than those who see their GP less often. Second, we explored potential outcome misclassification by restricting the analyses to specific regions of Catalonia where we had access to population-based or hospital cancer registries. We considered as cancer cases only those who had the same diagnosis in the SIDIAP and a cancer registry. Third, we addressed potential reverse causality (i.e., undiagnosed cancer affecting BMI) by extending the minimum follow-up time (of 1 year in the main analyses) to 2 and 4 years. Fourth, we strengthened the validity of our results by performing multiple imputations (using the fully conditional specification approach, with 10 imputed data sets created) to deal with missing values of model 2 covariates [23, 24]. Fifth, we avoided confounding in the analyses of BMI and specific cancer types by re-running model 2, additionally adjusting for HRT use in post-menopausal women [women-only cancers] and excluding participants with a diagnosis of chronic hepatitis B/C [liver cancer risk factor] or a helicobacter pylori infection [stomach cancer risk factor]). Finally, to investigate to which extent the relationships between BMI and risk of each cancer type represents an effect of weight, height, or both weight and height, we re-ran the multivariable-adjusted models (model 2) with height and weight as the main exposures, mutually adjusted for each other.
To assess the robustness of our secondary findings, we performed two sensitivity analyses. We re-ran the analyses that compared BMI and WC in relation to cancer risk with mutual adjustment for both adiposity indicators using residuals of WC and BMI (e.g., we regressed WC on BMI, and we included the residuals from this analysis in the model using BMI as an indicator of general adiposity) to assess if this added valuable information to fully capture adiposity . Finally, we added height as an adjustment variable to the analyses that compared BMI and WC in relation to cancer risk.
The a priori level of statistical significance was set at a 2-sided P value of 0.01 for all analyses. We used STATA version 15.1 (College Station, TX, USA) for data analysis and R version 3.5.0 for data visualization.
We obtained approval from the Clinical Research Ethics Committee of the IDIAPJGol (project code: P14/074) to perform this study.
Of the 6,447,722 individuals aged between ≥ 18 and ≤ 100 years in the SIDIAP population, 2,459,462 were excluded due to the unavailability of a valid BMI, 131,167 due to personal history of cancer, and 198,676 due to less than 12 months of follow-up (Fig. 1). A total of 3,658,417 participants constituted the primary dataset of this study for whom follow-up ended at a median of 8.3 years (interquartile range [IQR] 5–11) after study entry. In total, 202,828 [5.6%] individuals were diagnosed with cancer over the study period (Table 1). Among all participants, 55% were women, the median age at inclusion was 46 years (IQR 32–61), and the median BMI was 26.3 kg/m2 (IQR 23–30). When stratifying participants by WHO categories of BMI, the median follow-up and age increased with increasing categories of BMI. There were fewer participants from deprived areas and more current smokers in the underweight and normal weight category compared to those in the obesity category (Table 1). Compared to the overall SIDIAP adult population, the individuals included in this study were more likely to be women and older, as well as to have more comorbidities and complete information on lifestyle factors (the characteristics of the included and excluded individuals can be consulted in Additional file 1: Table S2).
Non-linear BMI associations and analyses restricted to never smokers
BMI was non-linearly associated with ten of twenty-six cancer types (p for non-linearity < 0.01) (Fig. 2). For cancers of the head and neck, esophagus, stomach, larynx, trachea, bronchus, and lung, low BMI values were associated with a higher risk of these cancers. The risk stabilized above values of 22 kg/m2 (with HRs either at or below one). These non-linear relations disappeared when we restricted the analyses to never smokers (Fig. 3).
The curves for the associations between BMI and risk of cancers of the liver, breast post-menopausal, corpus uteri, prostate, and Hodgkin lymphoma were non-linear and were similarly shaped in the overall cohort and among never smokers (Figs. 2 and 3). Liver cancer showed an attenuated U-shaped curve, with a higher risk among participants with very low or very high BMI values. The risk of breast post-menopausal cancer seemed to increase linearly up to a BMI of 30 kg/m2, at which point the increase in risk diminished. For prostate cancer, the risk curve displayed an attenuated inverse U-shape, with a lower risk of cancer among those with low, normal, and very high BMIs, but an increased risk for those in the overweight range. For corpus uteri cancer, the risk increased faster than linear at higher BMI values. Finally, the association between BMI and Hodgkin lymphoma was J-shaped, with a modest higher risk of this lymphoma in people with low BMIs and a more markedly higher risk for those with high BMIs.
Linear BMI associations and analyses restricted to never smokers
A BMI increment of 5 kg/m2 (in multivariable analyses) was positively associated with risk of cancers of the corpus uteri (HR 1.49, 99%CI 1.45–1.53), kidney (1.16, 1.12–1.20), gallbladder and biliary tract (1.10, 1.03–1.19), multiple myeloma (1.09, 1.04–1.15), thyroid (1.08, 1.03–1.13), leukemia (1.07, 1.04–1.11), colorectal (1.06, 1.04–1.08), breast post-menopausal (1.07, 1.05–1.08), and non-Hodgkin lymphoma (1.04, 1.01–1.08) (Fig. 4). Results from the basic model are presented in Additional file 1: Table S3. Results for corpus uteri and breast-postmenopausal cancers should be interpreted in combination with the splines of Fig. 3 due to the evidence of non-linearity. For the five cancer types (trachea, bronchus and lung, larynx, esophagus, head and neck, and prostate) for which we observed an inverse association between BMI and cancer risk, there was evidence of non-linearity as shown in Fig. 3. After restricting the analyses to never smokers, BMI remained inversely associated only with risk of prostate cancer (0.95, 0.92–0.98), but became positively associated with risk of Hodgkin lymphoma (1.16, 1.01–1.35), and cancers of the head and neck (1.09, 1.03–1.16), and brain and CNS (1.07, 1.00–1.10).
BMI and WC comparison in relation to cancer risk
Of the 291,305 participants who also had a WC assessment available, 27,837 were diagnosed with cancer from 2007 to 2018 (Table 1). Among eligible participants, the median follow-up time was 9.9 (IQR 7–12) years and the median age was 59 (IQR 46–71) years. The median WC was 100 (IQR 91–108) cm and the median BMI was 29 (IQR 26–33) kg/m2. Compared to the overall BMI cohort, these participants were older and had a higher median BMI and a higher prevalence of type 2 diabetes (Table 1).
We ascertained more than 100 cases for all cancers of interest except cancers of the bone and articular cartilage (64 cases), Hodgkin lymphoma (63), testis (52), and breast pre-menopausal (44) (Fig. 5). For all cancer sites, the 99% CIs of the HRs for WC (per 1 SD increase) and BMI overlapped. We observed the largest differences between the WC and BMI effect estimates for cancers of the bladder (HR for BMI 0.97, 99%CI 0.91–1.03; WC 1.04, 0.98–1.10), larynx (HR for BMI 0.77, 99%CI 0.65–0.91; WC 0.91, 0.78–1.06), and trachea, bronchus, and lung (HR for BMI 0.85, 99%CI 0.79–0.91; WC 0.97, 0.90–1.03), although the 99%CIs overlapped. Nonetheless, these results should be interpreted with caution due to evidence of non-linearity in the association between WC and risk of bladder and trachea, bronchus, and lung cancers (Additional file 1: Table S4).
We assessed the robustness of our results by comparing the HRs of our main analyses to those from sensitivity analyses. We found that the HRs from our primary model (model 2) were similar to those from the sensitivity analyses. The CIs of the sensitivity analyses consistently included the main point estimate with only two exceptions (Additional file 1: Tables S5-S8). In the analysis in which we extended the minimum follow-up time from 1 to 4 years, the HRs from the main model for stomach and trachea, bronchus, and lung cancers (1-year follow-up) were not included in the CIs from the models with a 4-year minimum follow-up (stomach cancer with 1-year follow-up HR 0.99, 99%CI 0.99–1.00, vs. 4-year follow-up HR 1.01, 99%CI 1.00–1.01; trachea, bronchus, and lung cancer with 1-year follow-up HR 0.96, 99%CI 0.96–0.97, vs. 4-year follow-up HR 0.97, 99%CI 0.97–0.97; all HRs are per 1 kg/m2 increment in BMI) (Additional file 1: Table S5). We also re-ran the multivariable-adjusted models (model 2) using height on one hand and weight on the other as the main exposures (Additional file 1: Table S9). The nine cancer types that were positively associated with BMI were also all positively associated with weight, while six were so with height (colorectal, breast post-menopausal, kidney, thyroid, non-Hodgkin lymphoma, and leukemia). Corpus uteri cancer was negatively associated with height. The five cancer types for which we found a negative association with BMI were also negatively associated with weight while two of these were positively associated with height (trachea, bronchus, and lung and prostate cancers).
Furthermore, in the analysis comparing WC and BMI in relation to cancer risk, we assessed whether adding the residuals of the complementary adiposity indicator added valuable information to fully capture adiposity. This was not the case as the 99%CIs of the models comprising residuals always included the HRs from the main models (Additional file 1: Fig. S2). For example, for corpus uteri cancer, the model that only included BMI (HR 1.60, 99%CI 1.47–1.74) was similar to the one that included BMI and the residuals of WC (HR 1.61, 99%CI 1.48–1.76); the same was observed for the model that only included WC (HR 1.52, 99%CI 1.39–1.67) and the one that included WC and the residuals of BMI (HR 1.53, 99%CI 1.39–1.68). The CIs of the sensitivity analysis that further adjusted for height also consistently included the main point estimate of the main analyses comparing WC and BMI in relation to cancer risk (Additional file 1: Table S10).
In this prospective study that included 3,658,417 participants and 202,837 cancer cases, we found that a higher BMI was associated with risk of 18 of 26 cancer types, although these relations differed in terms of direction, shape, and smoking status at baseline. BMI was positively associated with risk of cancers of the corpus uteri, kidney, gallbladder and biliary tract, thyroid, colorectum, breast post-menopausal, multiple myeloma, leukemia, and non-Hodgkin lymphoma (in descending order of linear effect sizes). After restricting the analyses to never smokers to account for incomplete adjustment for smoking, BMI was also positively associated with Hodgkin lymphoma and cancers of the head and neck, and brain and CNS. BMI was associated in an inverse U-shaped manner with the risk of prostate cancer and in an L-shaped fashion with the risk of four cancers (head and neck, esophagus, larynx, and trachea, bronchus, and lung) in the overall cohort likely indicating residual confounding by smoking since the shape of these associations drastically changed among never smokers, except for prostate cancer.
In a subsample of 291,305 participants with a WC measurement and 27,837 cancer cases, we compared cancer risk estimates of WC and BMI. The 99% CIs of the WC and BMI effect estimates consistently overlapped, indicating that WC provides risk associations similar to BMI across a wide range of cancer types in our population.
Strengths and limitations of this study
This study has several strengths. Firstly, to our knowledge, this is the first study to systematically compare both BMI and WC indicators in relation to the risk of a wide variety of cancers, including less frequently occurring ones. Secondly, owing to the large scale of the SIDIAP database, we were able to investigate the association between BMI and numerous cancer types in a Southern European region, increasing the external validity of results previously reported in Northwestern European countries [3, 4]. Lastly, we previously demonstrated the high quality of cancer diagnoses in the SIDIAP data and we conducted sensitivity analyses in regions where we could include cancer cases confirmed by population-based cancer registries (Additional file 1: Table S6) .
This study also has limitations. Firstly, the inclusion of individuals with a BMI measurement (62% of the SIDIAP adult population) could result in selection bias. However, the study participants were not substantially different from the overall SIDIAP population (Additional file 1: Table S2). Secondly, although we cannot exclude the possibility of exposure misclassification, we were empirically reassured that this was not a serious bias. The distribution of BMI in the SIDIAP was similar to population-based survey data and representative studies of the Spanish population (Additional file 1: Table S11). Thirdly, outcome misclassification could have biased our results towards the null because modest positive predictive values have been reported in a validation study of SIDIAP cancer diagnoses . Fourth, residual confounding is an inherent limitation of observational studies; an example in our study was residual confounding for smoking status at baseline. Fifth, we did not have data on factors in the possible causal path between obesity and cancer, such as specific reproductive variables (e.g., parity, breastfeeding history), physical activity, and diet. Neither did we have information on cancer subtype or stage at diagnosis, which could have helped sharpen the analyses for certain cancers (e.g., prostate cancer). Fifth, while the magnitude of this study’s sample size has its advantages, some of the significant findings of this study could have been related to the large sample size. Another limitation was the missing covariate data which ranged from 10% (for the MEDEA deprivation index) to 39% (for alcohol intake risk). However, the results from our main analysis did not differ when we performed multiple imputations of these data (Additional file 1: Table S5). Finally, we had information for both BMI and WC for only 10% of the study participants. This limited our interpretation of the comparison of adiposity measures associated with cancer risk to individuals with both indicators and does not enable us to extrapolate the WC effect estimates to the general population.
Interpretation and comparison with previous studies
The observed positive associations between BMI and different cancer types are in line with previous studies. The increased risk of breast post-menopausal and corpus uteri cancers has been consistently reported in the literature [25, 26]. Furthermore, our non-linear analyses showed that the higher the BMI, the greater the magnitude of risk of corpus uteri cancer which concurs with previous studies [4, 27]. The positive association between BMI and cancers of the colorectum, kidney, thyroid, and gallbladder and biliary tract is well recognized in the literature; however, nuances by subtype (kidney) [2, 28], histology (thyroid) , and sex (colorectal and gallbladder and biliary tract) have been reported [25, 30, 31]. In our data, we observed a stronger effect of BMI for gallbladder and biliary tract cancer in women and colorectal cancer in men, which is in line with previous studies (Additional file 1: Table S12) [25, 31]. Further, our results showed a clear pattern in the association between BMI and hematological cancers. The association observed between BMI and higher risk of leukemia and multiple myeloma has been consistently reported in the literature [25, 32,33,34], but the association between BMI and the lymphomas is less well established. Although our results for non-Hodgkin lymphoma are supported by two meta-analyses [25, 35], other studies have only reported a link with the subtype of diffuse large B cell lymphoma . For Hodgkin lymphoma, we observed a J-shaped association with BMI, which concurs with a large study from the United Kingdom (UK) . The positive association observed between BMI and cancers of the brain and CNS might have been driven by the inclusion of meningioma in this broad cancer group .
We also observed that the associations between BMI and respiratory tract cancers (head and neck, esophagus, larynx, and trachea, bronchus and lung) were L-shaped, suggesting that low BMIs are an approximation of heavy smoking. In the linear analyses restricted to never smokers, the associations between BMI and cancers of the larynx and esophagus became null, likely due to the opposite effects of BMI in adenoma and squamous cell carcinoma . Also, among never smokers, BMI became positively associated with cancer of the head and neck and remained negatively associated with cancer of the trachea, bronchus, and lung, which concurs with other meta-analyses [25, 38,39,40]. For prostate cancer, we found an attenuated inverse U-shaped association which coincided with a large UK study . The shape of this association could be explained by the dual effect of BMI on prostate cancer (inversely and positively associated with localized and advanced prostate cancer, respectively) . Unfortunately, we did not have data on prostate cancer subtypes to test this hypothesis.
There were also differences between our results and those of previous studies. Despite the evidence supporting the inverse association between BMI and risk of breast pre-menopausal cancer , we observed a negative trend only with BMI values greater than 27 kg/m2. In addition, some studies described a positive association between BMI and cancers of the liver and stomach [42, 43]. Our results suggest these associations are non-linear and similarly shaped to a large UK study (U- and L-shaped for liver and stomach cancers, respectively) . We noted that the non-linear association for stomach resembled the one for respiratory tract cancers, suggesting residual confounding by smoking status for this cancer as well.
In a post hoc analysis, modeling height and weight in mutually adjusted models, we found that the nine and five cancer types that were positively and negatively, respectively, associated with BMI (in linear models) were also all associated with weight in the same directions. On the other hand, height was positively associated with 14 cancer types (and only negatively associated with corpus uteri cancer) (Additional file 1: Table S9). This suggests that the associations observed for BMI (our main analysis) were driven by excess body weight rather than height. Height is a complex exposure and likely reflects the fact that more stem cells are at risk of acquiring driver mutations during cell division over time. A second possible explanation is that a common factor (such as insulin-like growth factor (IGF) 1) directly affects cancer risk as well as increasing height .
Finally, our results indicate that BMI and WC have a comparable relationship with cancer risk. The effect estimates of BMI and WC were similar although we observed moderate differences for cancers of the bladder, larynx, and trachea, bronchus, and lung. Contrarily to BMI, WC was not negatively associated with the risk of cancers of the larynx and trachea, bronchus, and lung. We hypothesized that this could be explained by smoking since smokers tend to have a higher WC, more visceral adipose tissue, and leaner body mass .
In this large Southern European study, we found that a higher BMI was associated with higher risk of twelve cancer types. We provide novel evidence that higher BMI increases the risk of four hematological and head and neck (only among never smokers) cancers, and we confirmed associations reported in previous studies. Moreover, this study showed that BMI and WC result in comparable estimates of cancer risk associated with adiposity at a population level.
While the observational nature of this study prevents us from making policy and clinical recommendations, our findings reinforce the need for public health strategies focusing on the reduction of obesity for cancer prevention and indicate that assessing obesity-related cancer risk in primary care using BMI may be sufficient.
Availability of data and materials
In accordance with current European and national law, the data used in this study is only available for the researchers participating in this project. Thus, we are not allowed to distribute or make publicly available the data to other parties. However, researchers from public institutions can request data from the SIDIAP and other sources (e.g., Cancer Registries) if they comply with certain requirements. Further information is available online (https://www.sidiap.org/index.php/menu-solicitudes-en/application-proccedure) or by contacting Anna Moleras (firstname.lastname@example.org).
Body mass index
Minimum Basic DataSet
Central nervous system
Hormonal replacement therapy
International Classification for Diseases, 9th revision
International Classification for Diseases, 10th revision
- MEDEA (deprivation index):
“Mortalidad en áreas pequeñas Españolas y Desigualdades Socioeconómicas y Ambientales”
Information System for Research in Primary Care
World Health Organization
World Health Organization. Overweight and obesity. 2016 [cited 2018 Nov 5]. Available from: http://www.who.int/news-room/fact-sheets/detail/obesity-and-overweight.
Secretan BL, Ph D, Scoccianti C, Ph D, Loomis D, Ph D. Body Fatness and Cancer - Viewpoint of the IARC Working Group. Vol. 375, The New England Journal of Medicine. 2016.
Reeves GK, Pirie K, Beral V, Green J, Spencer E, Bull D. Cancer incidence and mortality in relation to body mass index in the Million Women Study: cohort study. Br Med J. 2007;335(7630):1134–9.
Bhaskaran K, Douglas I, Forbes H, Dos-Santos-Silva I, Leon DA, Smeeth L. Body-mass index and risk of 22 specific cancers: a population-based cohort study of 5·24 million UK adults. Lancet. 2014;384(9945):755–65.
Song M, Giovannucci E. Estimating the influence of obesity on cancer risk: stratification by smoking is critical. J Clin Oncol. 2016;34(27):3237–9.
De Ridder J, Julián-Almárcegui C, Mullee A, Rinaldi S, Van Herck K, Vicente-Rodríguez G, et al. Comparison of anthropometric measurements of adiposity in relation to cancer risk: a systematic review of prospective studies. Cancer Causes Control. 2016;27:291–300.
White AJ, Nichols HB, Bradshaw PT, Sandler DP. Overall and central adiposity and breast cancer risk in the sister study. Cancer. 2015;121(20):3700–8.
Pischon T, Lahmann PH, Boeing H, Friedenreich C, Norat T, Tjønneland A, et al. Body size and risk of colon and rectal cancer in the European Prospective Investigation Into Cancer and Nutrition (EPIC). JNCI J Natl Cancer Inst. 2006;98(13):920–31.
Freisling H, Arnold M, Soerjomataram I, O’Doherty MG, Ordóñez-Mena JM, Bamia C, et al. Comparison of general obesity and measures of body fat distribution in older adults in relation to cancer risk: meta-analysis of individual participant data of seven prospective cohorts in Europe. Br J Cancer. 2017;116(11):1486–97.
Barberio AM, Alareeki A, Viner B, Pader J, Vena JE, Arora P, et al. Central body fatness is a stronger predictor of cancer risk than overall body size. Nat Commun. 2019;10(1):383.
García-Gil MDM, Hermosilla E, Prieto-Alhambra D, Fina F, Rosell M, Ramos R, et al. Construction and validation of a scoring system for selection of high quality data in a Spanish population primary care database (SIDIAP). Inf Prim Care. 2012;20(2):1–1.
Bolíbar B, Fina Avilés F, Morros R, Del Mar G-GM, Hermosilla E, Ramos R, et al. Base de datos SIDIAP: La historia clínica informatizada de Atención Primaria como fuente de información para la investigación epidemiológica. Med Clin (Barc). 2012;138(14):617–21.
Generalitat de Catalunya. Conjunt mínim bàsic de dades (CMBD). 2017 [cited 2019 Mar 5]. Available from: https://catsalut.gencat.cat/ca/proveidors-professionals/registres-catalegs/registres/cmbd/index.html#googtrans(ca%7Ces).
Lecube A, Monereo S, Rubio MÁ, Martínez-de-Icaya P, Martí A, Salvador J, et al. Prevención, diagnóstico y tratamiento de la obesidad. Posicionamiento de la Sociedad Española para el Estudio de la Obesidad de 2016. Endocrinol Diabetes y Nutr. 2017;64:15–22.
Institut Català de la Salut. Guies de pràctica cliń ica: Abordatge de la diabetis mellitus tipus 2. 2015. [cited 2019 Mar 5]. Available from: http://ics.gencat.cat/web/.content/documents/assistencia/gpc/GuiaDiabetis2015.pdf.
Domínguez-Berjón MF, Borrell C, Cano-Serral G, Esnaola S, Nolasco A, Pasarín MI, et al. Construcción de un índice de privación a partir de datos censales en grandes ciudades españolas (Proyecto MEDEA). Gac Sanit. 2008;22(3):179–87.
Calle EE, Kaaks R. Overweight, obesity and cancer: epidemiological evidence and proposed mechanisms. Nat Rev Cancer. 2004;4(8):579–91.
Recalde M, Manzano-Salgado C, Díaz Y, Puente D, Garcia-Gil M del M, Marcos-Gragera R, et al. Validation of cancer diagnoses in electronic health records: results from The Information System For Research In Primary Care (SIDIAP) in Northeast Spain. Clin Epidemiol 2019;11:1015–1024.
Greenland S, Pearl J, Robins JM. Causal Diagrams for Epidemiologic Research. Epidemiology. 1999;10(1):37–48.
Harrell FEJ. Regression modeling strategies: with applications to linear models, logistic regression, and survival analysis. New York: Springer; 2001.
Orsini N, Greenland S. A procedure to tabulate and plot results after flexible modeling of a quantitative covariate. Stata J 2011;11(1):1–29.
Grambsch P, Therneau T. Proportional hazards tests and diagnostics based on weighted residuals. Biometrika. 1994;81(3):515–26.
Pedersen AB, Mikkelsen EM, Cronin-Fenton D, Kristensen NR, Pham TM, Pedersen L, et al. Missing data and multiple imputation in clinical epidemiological research. Clin Epidemiol. 2017;9:157–66.
Lee KJ, Carlin JB. Multiple imputation for missing data: fully conditional specification versus multivariate normal imputation. Am J Epidemiol. 2010;171(5):624–32.
Renehan AG, Tyson M, Egger M, Heller RF, Zwahlen M. Body-mass index and incidence of cancer: a systematic review and meta-analysis of prospective observational studies. Lancet. 2008;371(November):569–78.
Aune D, Navarro Rosenblatt DA, Chan DSM, Vingeliene S, Abar L, Vieira AR, et al. Anthropometric factors and endometrial cancer risk: a systematic review and dose–response meta-analysis of prospective studies. Ann Oncol. 2015;26(8):1635–48.
Crosbie EJ, Zwahlen M, Kitchener HC, Egger M, Renehan AG. Body mass index, hormone replacement therapy, and endometrial cancer risk: a meta-analysis. Cancer Epidemiol Biomarkers Prev. 2010;19(12):3119 LP – 3130.
Callahan CL, Hofmann JN, Corley DA, Zhao WK, Shuch B, Chow W-H, et al. Obesity and renal cell carcinoma risk by histologic subtype: a nested case-control study and meta-analysis. Cancer Epidemiol. 2018;56:31–7.
Schmid D, Ricci C, Behrens G, Leitzmann MF. Adiposity and risk of thyroid cancer: a systematic review and meta-analysis. Obes Rev. 2015;16(12):1042–54.
Campbell PT, Newton CC, Kitahara CM, Patel AV, Hartge P, Koshiol J, et al. Body size indicators and risk of gallbladder cancer: pooled analysis of individual-level data from 19 prospective cohort studies. Cancer Epidemiol Biomarkers Prev. 2017;26(4):597 LP – 606.
Abar L, Vieira AR, Aune D, Sobiecki JG, Vingeliene S, Polemiti E, et al. Height and body fatness and colorectal cancer risk: an update of the WCRF-AICR systematic review of published prospective studies. Eur J Nutr. 2018;57(5):1701–20.
Abar L, Sobiecki JG, Cariolou M, Nanu N, Vieira AR, Stevens C, et al. Body size and obesity during adulthood, and risk of lympho-haematopoietic cancers: an update of the WCRF-AICR systematic review of published prospective studies. Ann Oncol Off J Eur Soc Med Oncol. 2019;30(4):528–41.
Wallin A, Larsson SC. Body mass index and risk of multiple myeloma: a meta-analysis of prospective studies. Eur J Cancer. 2011;47(11):1606–15.
Larsson SC, Wolk A. Overweight and obesity and incidence of leukemia: a meta-analysis of cohort studies. Int J Cancer. 2008;122(6):1418–21.
Larsson SC, Wolk A. Body mass index and risk of non-Hodgkin’s and Hodgkin’s lymphoma: a meta-analysis of prospective studies. Eur J Cancer. 2011;47(16):2422–30.
Willett EV, Morton LM, Hartge P, Becker N, Bernstein L, Boffetta P, et al. Non-Hodgkin lymphoma and obesity: a pooled analysis from the InterLymph Consortium. Int J Cancer. 2008;122(9):2062–70.
Strongman H, Brown A, Smeeth L, Bhaskaran K. Body mass index and Hodgkin’s lymphoma: UK population-based cohort study of 5.8 million individuals. Br J Cancer. 2019;120(7):768–70.
Gaudet MM, Kitahara CM, Newton CC, Bernstein L, Reynolds P, Weiderpass E, et al. Anthropometry and head and neck cancer:a pooled analysis of cohort data. Int J Epidemiol. 2015;44(2):673–81.
Yang Y, Dong J, Sun K, Zhao L, Zhao F, Wang L, et al. Obesity and incidence of lung cancer: a meta-analysis. Int J Cancer. 2013;132(5):1162–9.
Duan P, Hu C, Quan C, Yi X, Zhou W, Yuan M, et al. Body mass index and risk of lung cancer: systematic review and dose-response meta-analysis. Sci Rep. 2015;5:16938.
Discacciati A, Orsini N, Wolk A. Body mass index and incidence of localized and advanced prostate cancer—a dose–response meta-analysis of prospective studies. Ann Oncol. 2012;23(7):1665–71.
Chen Y, Liu L, Wang X, Wang J, Yan Z, Cheng J, et al. Body mass index and risk of gastric cancer: a meta-analysis of a population with more than ten million from 24 prospective studies. Cancer Epidemiol Prev Biomarkers. 2013;22(8):1395–408.
Chen Y, Wang X, Wang J, Yan Z, Luo J. Excess body weight and the risk of primary liver cancer: an updated meta-analysis of prospective studies. Eur J Cancer. 2012;48(14):2137–45.
Giovannucci E. A growing link—what is the role of height in cancer risk? Br J Cancer. 2019;120(6):575–6.
We would like to thank all primary care health professionals in Catalonia who routinely collected the information needed for this study in electronic health records of the population. We also thank the cancer registries of Girona, Tarragona, and the Hospital del Mar for providing data for one of the sensitivity analyses.
Where authors are identified as personnel of the International Agency for Research on Cancer/World Health Organization, the authors alone are responsible for the views expressed in this article and they do not necessarily represent the decisions, policy, or views of the International Agency for Research on Cancer/World Health Organization.
Funding [for grant number: 2017/1630] was obtained from Wereld Kanker Onderzoek Fonds (WKOF), as part of the World Cancer Research Fund International grant program. TDS is funded by the Department of Health of the Generalitat de Catalunya, awarded on the 2016 call under the Strategic Plan for Research and Innovation in Health (PERIS) 2016–2020, modality incorporation of scientists and technologists, with reference SLT002/16/00308. The funders had no role in study design, data collection, analysis, decision to publish, or preparation of the manuscript.
Ethics approval and consent to participate
The Clinical Research Ethics Committee of the IDIAPJGol (project code: P14/074) approved this study.
Consent for publication
All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf and declare: no support from any organization for the submitted work, no financial relationships with any organizations that might have an interest in the submitted work in the previous 3 years, and no other relationships or activities that could appear to have influenced the submitted work.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file 1: Appendix 1.
definition of menopause and use of hormonal replacement therapy variables. Appendix 2. STROBE Statement-Checklist. Table S1. diagnostic codes used to define cancer cases. Table S2. characteristics of individuals with and without a BMI recorded. Table S3. BMI-cancer risk associations: results of the basic adjustment models. Table S4. P for non-linearity in WC-cancer risk associations. Table S5. A wide range of sensitivity analyses of BMI-cancer risk associations. Table S6. Sensitivity analyses of BMI-cancer risk associations using cancer registry data to confirm SIDIAP cases. Table S7. Sensitivity analyses of BMI-cancer risk associations excluding subgroups of participants. Table S8. Sensitivity analyses of BMI-cancer risk associations for women only cancers. Table S9. Sensitivity analysis including results of BMI/height/weight-cancer risk associations. Table S10. Sensitivity analysis of BMI/WC-cancer risk associations including additional adjustment for height. Table S11. Comparison of BMI information recorded in the SIDIAP and other studies’ data. Table S12. BMI-cancer risk associations stratified by sex. Figure 1. Directed Acyclic Graph that guided our decisions in the control for confounding. Figure 2. Sensitivity analysis of BMI/WC-cancer risk associations, including mutual adjustment using residuals of BMI and WC.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Recalde, M., Davila-Batista, V., Díaz, Y. et al. Body mass index and waist circumference in relation to the risk of 26 types of cancer: a prospective cohort study of 3.5 million adults in Spain. BMC Med 19, 10 (2021). https://doi.org/10.1186/s12916-020-01877-3
- Body mass index
- Waist circumference
- Body size
- Body fat distribution
- Electronic health records