- Research article
- Open Access
Your height affects your health: genetic determinants and health-related outcomes in Taiwan
BMC Medicine volume 20, Article number: 250 (2022)
Height is an important anthropometric measurement and is associated with many health-related outcomes. Genome-wide association studies (GWASs) have identified hundreds of genetic loci associated with height, mainly in individuals of European ancestry.
We performed genome-wide association analyses and replicated previously reported GWAS-determined single nucleotide polymorphisms (SNPs) in the Taiwanese Han population (Taiwan Biobank; n = 67,452). A genetic instrument composed of 251 SNPs was selected from our GWAS, based on height and replication results as the best-fit polygenic risk score (PRS), in accordance with the clumping and p-value threshold method. We also examined the association between genetically determined height (PRS251) and measured height (phenotype). We performed observational (phenotype) and genetic PRS251 association analyses of height and health-related outcomes.
GWAS identified 6843 SNPs in 89 genomic regions with genome-wide significance, including 18 novel loci. These were the most strongly associated genetic loci (EFEMP1, DIS3L2, ZBTB38, LCORL, HMGA1, CS, and GDF5) previously reported to play a role in height. There was a positive association between PRS251 and measured height (p < 0.001). Of the 14 traits and 49 diseases analyzed, we observed significant associations of measured and genetically determined height with only eight traits (p < 0.05/[14 + 49]). Height was positively associated with body weight, waist circumference, and hip circumference but negatively associated with body mass index, waist-hip ratio, body fat, total cholesterol, and low-density lipoprotein cholesterol (p < 0.05/[14 + 49]).
This study contributes to the understanding of the genetic features of height and health-related outcomes in individuals of Han Chinese ancestry in Taiwan.
Height is the growth phenotype during the entire developmental period from infancy to adulthood and becomes relatively stable in adulthood [1,2,3]. Previous studies have reported that social and environmental factors can influence height. Some of these factors include educational attainment, smoking, alcohol consumption, and regular exercise [3,4,5,6,7,8,9]. Higher levels of education [4, 5] and regular exercise  are associated with increased growth and height. In contrast, exposure to smoking or drinking may cause bone mass loss, reduced growth, and reduced height [10,11,12]. Height is determined by polygenic inheritance under complex and multi-locus genetic regulation [13,14,15]. Genome-wide association studies (GWAS) of height have identified hundreds of genetic loci (or single nucleotide polymorphisms [SNPs]) with genome-wide significance, especially in individuals of European ancestry [15,16,17,18,19,20,21,22,23,24,25,26]. These identified genetic loci are associated with proteins of the tyrosine phosphatase family, insulin-like growth factors, proteins involved in skeletal development and mitosis, fibroblast growth factors, the Wnt/β-catenin pathway, Hedgehog signaling, and cancer-associated pathways. These findings highlight the polygenic, complex, and multilocus genetic regulation of height.
Height is associated with several health-related outcomes later in life [27,28,29,30,31,32,33,34]. For instance, taller people tend to have a higher risk of cancer  and cancer-related mortality  but have a reduced risk of CVD [27, 29], CVD-related mortality [27, 29], type 2 diabetes , better retention of cognitive function [30,31,32], and healthy aging . Height can be measured as a genetic component using a polygenic risk score (PRS). PRS is the sum of the weighted risk alleles from a combination of independent SNPs, usually with genome-wide significance, derived from GWAS results [13, 35, 36]. PRS serves as a genetic instrument variable and can be used to assess associations with health-related outcomes without confounding [37, 38]. Genetically determined taller height (in those with European ancestry) is also associated with an increased risk of cancers [39,40,41,42,43,44] and cancer-related mortality [45, 46] but a reduced risk of CVD [42, 47,48,49,50]. The precise shared genetic loci between height and health-related outcomes are yet to be elucidated, especially in individuals of Han Chinese ancestry. In addition, the mechanisms on how shared genetic loci contribute to both height and health-related outcomes remain unclear.
Therefore, this study aimed to identify the genetic architecture for height in individuals from the Taiwan Biobank—a community-based biobank in Taiwan. We also performed observational and genetic PRS analyses of height and health-related outcomes.
The Taiwan Biobank is a database for phenotypic and genomic measurements of the Taiwanese population that was established in 2012. The study recruited volunteers aged 30–70 years with no history of malignancy at enrollment (Twbiobank; https://www.twbiobank.org.tw/new_web/) [51, 52]. All volunteers were residents of Taiwan and provided informed consent. Participants completed questionnaires and underwent interviews, anthropometric measurements, and blood and urine tests to collect demographic, lifestyle, and genomic data.
Taiwan Biobank phenotypes
Anthropometric measurements, including height, body weight, waist circumference, hip circumference, and body fat percentage, were obtained from participants in the Taiwan Biobank (Additional file 1: Table S1). Body mass index (BMI) was calculated as BMI = body weight/body height2. The waist-hip ratio (WHR) was calculated as WHR = waist circumference/ hip circumference. Anthropometric measurements were stratified by sex and analyzed using the mean and standard deviation (SD), where data were normalized to one SD before further analysis.
Blood pressure and lipid and glucose levels were quantitatively measured in participants in the Taiwan Biobank. Systolic blood pressure (SBP), diastolic blood pressure (DBP), total cholesterol (TC), triglyceride (TG), low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol (HDL-C), fasting glucose, and hemoglobin (Hb) A1c levels were obtained (Additional file 1: Table S1).
Participants were asked to report their health status using questionnaires and interviews. According to participants’ self-reported health status (comorbidities) in the Taiwan Biobank, 10 broad categories of 49 diseases were investigated in our study as follows (Additional file 1: Table S1): (1) orthopedic or joint disorders: osteoporosis, arthritis, rheumatoid arthritis, osteoarthritis, and gout; (2) lung and respiratory diseases: asthma and emphysema or chronic bronchitis; (3) cardiovascular diseases: valvular heart disease, coronary artery disease, heart arrhythmia, cardiomyopathy, congenital heart defect, other type of heart disease, hyperlipidemia, hypertension, and stroke; (4) diabetes: type 1 diabetes and type 2 diabetes; (5) digestive diseases: peptic ulcer disease, gastroesophageal reflux disease, and irritable bowel syndrome; (6) Mental or emotional disorders: depression, bipolar disorder, postpartum depression, obsessive-compulsive disorder, alcohol addiction or drug abuse, and schizophrenia; (7) nervous system disorders: epilepsy, migraine, multiple sclerosis, Parkinson’s disorder, and dementia; (8) other types of disease: gallstones, kidney stones, kidney failure, and vertigo; (9) eye diseases: cataract, glaucoma, dry eye syndrome, retinal detachment, floaters, blindness, color blindness, and others; and (10) female diseases: severe menstrual cramps, uterine fibroids, ovarian cysts, endometriosis, and uterine/cervical polyps.
A total of 132,720 participants were selected from the Taiwan Biobank (Fig. 1). The exclusion criteria were as follows: (1) individuals who did not have GWAS data (N = 16,654), (2) individuals who did not pass the quality control (QC) and principal component analysis (PCA) of GWAS data (N = 19,555), (3) individuals who did not have height information (N = 25), (4) individuals with their height more than ±4 SD (N = 23), (5) individuals without drinking information (N = 52), (6) individuals without smoking information (N = 12), and (7) individuals without regular exercise information (N = 38). The criteria for drinking included current drinkers for at least 6 months; smoking criteria included current smokers for at least 6 months; finally, regular exercise criteria included participants performing regular exercise currently, for at least 6 months.
Finally, 96,361 participants of Han Chinese ancestry were included in this study (Fig. 1) and assigned to the training, testing, and validation groups using a simple random sampling method (7:1.5:1.5 ratio). The training group (N = 67,452 participants) comprised 70% of the total study population and underwent GWAS based on height (Additional file 2: Table S2; Figs. 2 and 3). Before the height GWAS analysis, the measured height (phenotype) was stratified by sex and subsequently mean-centered and normalized to one SD. GWAS for height was then performed using a linear regression model with the assumption of additive allelic effects of SNP dosages, with adjusted covariates including age, sex, education, drinking, smoking, regular exercise, and the first 10 PCAs (Additional file 4: Fig. S2), using the PLINK software (version 1.9, 2.0) [3,4,5,6,7,8,9, 20, 53,54,55,56]. A genome-wide significance value was used (p < 5.00E−8 for the additive test).
We also ensured that our GWAS findings of the training group replicated previously identified height-associated genetic variants (https://www.ebi.ac.uk/gwas/efotraits/EFO_0004339). The reported GWAS height-associated genetic variants were mainly from individuals of European ancestry (excluding the genetic variants of infant or child height traits) and were downloaded from the GWAS catalog website. After removal of the repeated SNPs, 1722 reported GWAS body height-related SNPs were obtained from the GWAS catalog (Additional file 3: Table S3). These SNPs were replicated in our cohort, and we further identified 313 GWAS SNPs associated with height in our cohort (p < 0.05/1722 SNPs) (Additional file 3: Table S3).
The testing group (N = 14,454 participants) comprised 15% of the total study population and was used to select the best-fit PRS, to investigate the association between genetically determined height (PRS251) and measured height (phenotype) using linear regression analysis (Fig. 4). The validation group (n = 14,455 participants) comprised 15% of the total population and was used to determine the association between genetically determined height (PRS251) and measured height (phenotype) using linear regression analysis (Fig. 4). This study was approved by the Human Studies Committee of the China Medical University Hospital, Taichung, Taiwan (approval number: CMUH107-REC3-074).
QC of the original data
Genomic DNA from the Taiwan Biobank was genotyped using Axiom genome-wide TWB1 (653,291 SNPs) or TWB2 (752,921 SNPs) array plates based on the Axiom genome-wide array plate system, according to the manufacturer’s instructions (Affymetrix Inc., Santa Clara, CA, USA). Genotyping was performed at the National Genotyping Center of Academia Sinica, Taipei, Taiwan (http://ncgm.sinica.edu.tw/affymetrix_tech_01.html) (https://www.biobank.org.tw/fd.php).
Genotypic data were then subjected to QC procedures (individual QC and SNP QC) in the Taiwan Biobank (https://www.biobank.org.tw/fd.php). The exclusion criteria for individual QC were as follows: (1) individuals with a missing call rate of > 5%, (2) a heterozygosity rate of >±5 SD, (3) individual identity by descent (IBD) score of ≥ 0.125, and (4) individuals who did not fit the East Asia Summit ancestry PCA. Similar to the results of a previous study , most individuals from the Taiwan Biobank were of Han Chinese ancestry. The exclusion criteria for SNP QC were as follows: (1) SNPs with a missing call rate of >5%, (2) SNPs with Hardy–Weinberg equilibrium (HWE) p-value of <1 × 10−5, and (3) SNPs with a minor allele frequency (MAF) of <5%.
Qualified genotype data were then subjected to an imputation procedure to maximize the number of SNPs in the Taiwan Biobank (https://www.biobank.org.tw/fd.php). First, the SNPs of the qualified genotype data were excluded based on the following criteria: (1) SNPs with MAF of <1%, (2) SNPs with a HWE p-value of <1 × 10−5, and (3) SNP with a missing call rate > 5% using the PLINK software (versions 1.9 and 2.0, http://zzz.bwh.harvard.edu/plink/). SHAPEIT2 (v2.r790) was used to phase the genotypes into full haplotypes (https://mathgen.stats.ox.ac.uk/genetics_software/shapeit/shapeit.html). Third, imputation was performed using IMPUTE2 (v2.3.1, https://mathgen.stats.ox.ac.uk/impute/impute_v2.html), according to the pooled reference panel [Taiwan Biobank (TWB) + East Asian (EAS)]. The pooled reference panel comprised 973 phased individuals with the TWB panel from the Taiwan Biobank  and 504 phased individuals from the EAS panel [57, 58] (The Phase 3 1000 Genomes Project reference panel; The 1000 Genomes Project Consortium, 2010). The pooled reference panel with TWB and EAS ancestry groups was used to improve imputation accuracy [57, 58]. The following imputed SNPs were excluded: (1) SNP with a missing call rate of > 5%, (2) SNPs with MAF < 0.01%, and (3) SNPs with IMPUTE2 information score of < 0.3.
QC for this study
Imputed GWAS data were obtained from the Taiwan Biobank. In our study, SNP QC and individual QC procedures were applied before GWAS of height (Fig. 1). SNPs were excluded from the SNP QC based on the following criteria: (1) SNP with a missing call rate of > 5%, (2) SNPs with HWE p-value of < 1 × 10−6; and (3) SNPs with MAF < 0.01%. After SNP QC, the remaining SNPs were used to perform ancestry PCA for the population structure analysis. The exclusion criteria for individual QC were as follows: (1) individual with a missing call rate > 5%, (2) heterozygosity rate > ±5 SD, (3) individual IBD score ≥0.125, and (4) individuals who did not fit the East Asia Summit ancestry PCA. Participants of non-Chinese ancestry, with evidence of relatedness, or with DNA contamination were excluded.
The PRS was calculated in the testing group using PLINK software (versions 1.9 and 2.0) [35, 53, 59], based on the statistical results of the 6,941 SNPs in the training group (Fig. 1). The 6941 SNPs comprised SNPs with genome-wide significance (p < 5 × 10−8) and SNPs that were replicated from previously reported body-height GWAS SNPs (p < 0.05/1722 SNPs; Fig. 1).
The 6941 SNPs were then subjected to the clumping procedure (within the range of 250,000 base pairs of the index SNP, where SNPs were removed when r2 > 0.1), according to the estimated linkage disequilibrium (LD) among the SNPs in the testing group (Additional file 4: Fig. S1A). After clumping, 251 SNPs were obtained. These 251 SNPs were used to select the “best-fit” PRS according to a series of cutoff values for height-associated p-value thresholds (including 5 × 10−15, 5 × 10−14, 5 × 10−13, 5 × 10−12, 5 × 10−11, 5 × 10−10, 5 × 10−9, 5 × 10−8, 5 × 10−7, 5 × 10−6, and 5 × 10−5) in the testing group. The p-value cutoff (5 × 10−5) was adopted by the “best-fit” PRS with the largest explicable phenotype r2 using only the PRS (PRS r2 = 0.0712, SNP number = 251; Additional file 4: Fig. S1A). In total, 251 SNPs were obtained for the best-fit PRS calculations for all participants. For each participant, the genetically determined height (PRS value) was calculated [35, 53, 59] using 251 SNPs obtained after the clumping protocol. Data centering and standardization were also performed for the PRS height data.
Genotype and imputed genotype data were used for GWAS analysis, as previously described. The HWE for the SNPs in the controls was evaluated using chi-square (χ2) tests. Lewontin’s D and R2 values were used to evaluate the inter-marker coefficient of LD for haplotype block analysis . The confidence interval (CI) for LD was used to construct haplotype blocks by resampling [61, 62]. LocusZoom was used to plot the resulting significant locus .
Measured height (phenotype) served as the exposure variables. Sixty-three health-related outcomes, including 14 traits and 49 diseases, were used as outcome variables. A multivariate linear regression model was made for continuous outcome variables (14 traits), with adjustments for age, sex, education, drinking, smoking, regular exercise, and 10 PCAs [3,4,5,6,7,8,9, 20]. Multivariate logistic regression analysis was performed for binary outcome variables (49 diseases), with adjustments for age, sex, education, drinking, smoking, regular exercise, and 10 PCAs [3,4,5,6,7,8,9, 20].
The genetically determined height (PRS251) also served as the exposure variable. Sixty-three health-related outcomes, including 14 traits and 49 diseases, were used as outcome variables. A multivariate linear regression model was performed for continuous outcome variables (14 traits), with adjustments for age, sex, education, drinking, smoking, regular exercise, and 10 PCAs [3,4,5,6,7,8,9, 20]. Multivariate logistic regression analysis was performed for binary outcome variables (49 diseases), with adjustments for age, sex, education, drinking, smoking, regular exercise, and 10 PCAs [3,4,5,6,7,8,9, 20]. PLINK software (versions 1.9 and 2.0) and R packages for Windows were used for all statistical analyses.
GWAS of the quantitative trait of height in Han Chinese in Taiwan
The Manhattan and QQ plots for the adult-height GWAS results are shown in Fig. 2. GWAS association analysis identified 6843 SNPs in 89 genomic regions with genome-wide significance (p < 5.00E−08 [5 × 10−8], not shown). The top lead SNPs were selected in 89 genomic regions with significant associations (p < 5 × 10−8) using an LD of < 0.2 (Additional file 2: Table S2). Among these, 18 novel lead SNPs within 18 novel regions, 48 novel lead SNPs within 48 reported regions, and 23 lead SNPs within 23 reported regions were found (Additional file 2: Table S2). Moreover, among these 89 genomic regions, the seven lead SNPs were located within seven genetic loci (Fig. 2). These seven genetic loci were located near the following genes: EGF-containing fibulin extracellular matrix protein 1 (EFEMP1), DIS3 like 3′-5′ exoribonuclease 2 (DIS3L2), zinc finger and BTB domain containing 38 (ZBTB38), ligand-dependent nuclear receptor corepressor like (LCORL), high-mobility group AT-hook 1 (HMGA1), citrate synthase (CS), and growth differentiation factor 5 (GDF5). The seven lead SNPs from these seven genetic loci are shown in Additional file 2: Table S2.
Regional plots of the lead SNPs and their neighboring SNPs from these seven genetic loci are shown in Fig. 3. Among them, two lead SNPs were novel (Fig. 3B, C; chromosome 2, rs76803230 in DIS3L2; chromosome 3, rs57345461 in ZBTB38), whereas the remaining five lead SNPs were previously reported (Fig. 3A, D–G). On chromosome 2, the lead SNP rs76803230 was located in the intronic region of DIS3L2 (risk allele: T, beta = 0.0681, [95% CI:0.0583–0.0780], p = 7.47E−42 [7.47 × 10−42]; Additional file 2: Table S2; Fig. 3B). On chromosome 3, the lead SNP rs57345461 was located in the intronic region of ZBTB38 (risk allele: T, beta = 0.0723, [95% CI: 0.0619–0.0827], p = 2.54E−42 [2.54 × 10−42]; Additional file 2: Table S2; Fig. 3C).
In the previously reported SNPs on chromosome 2, only a handful reached genome-wide significance associated with height, where the lead SNP rs3791675 was located in the intronic region of EFEMP1 (risk allele: C; training group: beta = 0.0753, [95% CI: 0.0638–0.0869], p = 2.78E-37 [2.78 × 10−37]; Additional file 2: Table S2; Fig. 3A). This SNP has been associated with body height, BMI-adjusted waist circumference, pelvic organ prolapse, and BMI-adjusted WHR [23, 64,65,66]. On chromosome 4, the lead SNP rs16895971 was located in the 3-untranslated region (3UTR) of LCORL (risk allele: T, beta = 0.1018, [95% CI: 0.0911–0.1125], p = 3.69E−77 [3.69 × 10−77]; Additional file 2: Table S2; Fig. 3D). This SNP has been associated with body height in East Asians . On chromosome 6, the lead SNP rs2780226 was located in the 5 untranslated region (UTR) of HMGA1 (risk allele: C, beta = 0.0948, [95% CI: 0.0791–0.1104], p = 1.75E−32 [1.75 × 10−32]; Additional file 2: Table S2; Fig. 3E). This SNP has been associated with body height, BMI-adjusted waist circumference, and birth weight [15, 68, 69]. On chromosome 12, the lead SNP rs3816804 was located in the intronic region of CS (risk allele: C, beta = 0.1124, [95% CI: 0.0993–0.1255], p = 6.35E−63 [6.35 × 10−63]; Additional file 2: Table S2; Fig. 3F). This SNP has also been associated with body height in East Asians . On chromosome 20, the lead SNP rs143384 was located in the 5-UTR of GDF5 (risk allele: G, beta = 0.0738, [95% CI: 0.0629–0.0847], p = 3.61E−40 [3.61 × 10−40]; Additional file 2: Table S2; Fig. 3G). This SNP has been associated with body height, BMI-adjusted hip circumference, BMI-adjusted WHR, and body fat [15, 64, 71, 72].
Replication of previously reported GWAS-determined SNPs in the Taiwan Han population
The previously reported GWAS-determined SNPs for height were obtained from the GWAS catalog (https://www.ebi.ac.uk/gwas/efotraits/EFO_0004339) and used to replicate the reported SNPs in the training group using the linear regression model, as described previously. In this study, an association analysis identified 313 SNPs that were significantly associated with height (p < 0.05/1722 SNPs; Additional file 3: Table S3).
In this study, GWAS-identified 6843 SNPs, and 313 of the reported SNPs were combined. After removing duplicate SNPs, 6941 SNPs were associated with height (Fig. 1). These 6941 SNPs were then applied to exclude SNPs with strong LD and to select the best SNP combination for the best-fit PRS calculation in the testing group, using PLINK software (versions 1.9 and 2.0) . This resulted in the identification of independent genetic signals for the best-fit PRS with 251 SNPs. These 251 SNPs included 168 GWAS-identified SNPs (Table 1) and 83 previously reported GWAS-determined SNPs (Table 2). These results show that 168 novel GWAS-identified SNPs and 83 reported SNPs were associated with height in individuals of Han Chinese ancestry in Taiwan.
Association between the genetically determined height (PRS251) and the measured height (phenotype)
The association between genetically determined height (PRS251) and measured height (phenotype) was investigated in the testing and validation groups, where height was stratified by sex (male: N = 10,919; female: N = 17,990; Fig. 4). For males, the regression line indicated that a 1-SD increase in PRS251 was associated with a 0.257-SD increase in normalized measured height (slope = 0.257; p < 0.001; green line). For females, the regression line indicated that a 1-SD increase in PRS251 was associated with a 0.274-SD increase in normalized measured height (slope = 0.274; p < 0.001; red line).
Furthermore, to assess the validity of our findings, we replicated the association between genetically determined height (PRS237) and measured heights (phenotype) in another cohort, kindly provided by the Big Data Center in China Medical University Hospital (CMUH), Taichung, Taiwan (Additional file 4: Fig. S3). As shown, only 237 of the 251 SNPs were available from the independent cohort of the Big Data Center at CMUH (Additional file 5: Table S4). The measured height (phenotype) and genetically determined height (PRS237) were normalized (standardized) by sex. For males, the regression line indicated that a 1-SD increase in PRS237 was associated with a 0.0972-SD increase in normalized measured height (slope = 0.0972; p < 0.001; green line; Additional file 4: Fig. S3). For females, the regression line indicated that a 1-SD increase in (PRS237) was associated with a 0.104-SD increase in normalized measured height (slope = 0.104; p < 0.001; red line; Additional file 4: Fig. S3).
Associations of height with health-related outcomes
In this study, we performed both observational (phenotype) and genetic PRS association analyses of height with 63 health-related outcomes using the Taiwan Biobank (Fig. 5). We examined the association between observational (phenotype) height with 63 health-related outcomes, including 14 traits and 49 diseases, in 67,452 individuals of Han Chinese ancestry (Fig. 5). Similar analyses of the association between genetic PRS of height and height were performed (Fig. 5). The genetically determined height of PRS251 (251 SNPs) applied in this analysis was calculated from our GWAS results, consisting of 168 GWAS-identified SNPs (Table 1) and 83 previously reported GWAS-determined SNPs (Table 2). The estimated beta values (95% CI) for the 14 traits are shown in Fig. 5A–C. The estimated odds ratios (95% CI) for the 49 diseases are also shown in Fig. 5D–M. After adjusting for age, sex, education, drinking, smoking, regular exercise, and 10 PCA results, our analyses showed that observational (phenotype) height was associated with eight of the 14 traits (p < 0.05/[14 + 49]; Table 3). Further analyses confirmed that genetic (PRS251) height was also associated with these eight traits (Table 3). No significant associations were observed between the measured and genetic PRS height with the 49 diseases (p > 0.05/[14 + 49]; Fig. 5D–M). Among anthropometric traits, observational height was positively associated with body weight, waist circumference, and hip circumference but negatively associated with BMI, WHR, and body fat (Table 3). Genetic PRS height was associated with increased body weight (beta = 1.2182, 95% CI = 1.1405–1.2959), waist circumference (beta = 0.4462, 95% CI = 0.3754–0.5171), and hip circumference (beta = 0.6006, 95% CI = 0.5488–0.6523), and a decreased BMI (beta = −0.0837, 95% CI = (−0.1110)–(−0.0563)), WHR (beta = −0.0008, 95% CI = (−0.0012)–(−0.0003)), and body fat (beta = −0.1401, 95% CI = (−0.1856)–(−0.0946)).
Regarding blood pressure, and lipid and glucose levels, observational height was negatively associated with TC and LDL-C (Table 3). Genetic PRS height was associated with decreased TC (beta = −0.5869, 95% CI = (−0.8530)–(−0.3207)) and LDL-C levels (beta = −0.6291, 95% CI = (−0.8672)–(−0.3910)).
We reported a genetic profile for height in the Han Chinese population using genome-wide SNP analysis and a replication study in the Taiwan Biobank—a community-based database in Taiwan. This is the first large-scale finding on the genetic basis for height and health-related outcomes in individuals of Han Chinese ancestry in Taiwan. Our study results are consistent with the genetic profile of height observed mainly in individuals of European ancestry [15,16,17,18,19,20,21,22,23,24,25,26]. Accordingly, our findings support the validity of height in observational (phenotype) studies and are consistent with the health-related outcomes of this phenotype [73,74,75,76,77,78,79,80,81].
In this study, we identified 6843 SNPs with genome-wide significance in 89 genomic regions, including 18 novel loci. Among these, we identified seven independent lead SNPs at seven genetic loci (two of these lead SNPs were novel: chromosome 2, rs76803230 in DIS3L2; chromosome 3, rs57345461 in ZBTB38) with genome-wide significance. DIS3L2 encodes one of the subunits of the RNA exosome, and its genetic variants are associated with height in individuals of European ancestry [21, 82] and East Asian ancestry [83, 84]. ZBTB38 is a zinc finger transcriptional activator that binds methylated DNA and is associated with apoptosis. ZBTB38 genetic variants are associated with height in individuals of European ancestry [17, 18] and East Asian ancestry [20, 23]. The remaining five lead SNPs were reported previously [15, 23, 64,65,66,67,68,69,70,71,72]. The lead SNP rs3791675 in EFEMP1 encodes an extracellular matrix glycoprotein of the fibulin family and has been associated with body height, BMI-adjusted waist circumference, pelvic organ prolapse, and BMI-adjusted WHR [23, 64,65,66]. The lead SNP rs16895971 in LCORL, a transcription factor involved in spermatogenesis, has been associated with height in East Asians . The lead SNP rs2780226 in HMGA1 encodes a chromatin-associated protein that regulates gene transcription and metastatic progression of cancer cells and has been associated with body height, BMI-adjusted waist circumference, and birth weight [15, 68, 69]. The lead SNP rs3816804 in CS has also been associated with height in East Asians . The lead SNP rs143384 in GDF5, which encodes a secreted ligand of the transforming growth factor-beta superfamily, regulates the development of numerous tissue and cell types and has been associated with body height, BMI-adjusted hip circumference, BMI-adjusted WHR, and body fat [15, 64, 71, 72]. Our observations report novel lead SNPs in the Han Chinese population that share an overlapping genetic architecture for height, mainly discovered in individuals of European ancestry.
Our observational (phenotype) analyses showed that height was associated with eight traits. Furthermore, our PRS251 analyses confirmed that genetic height was also associated with these eight traits. Taller height was associated with decreased BMI, WHR, body fat, TC, and LDL-C, but with increased body weight, waist circumference, and hip circumference. For anthropometric traits, we observed that both observational (phenotype) and genetically determined height (PRS251) were associated with BMI, WHR, body fat, body weight, waist circumference, and hip circumference. Taller height was associated with decreased BMI, WHR, and body fat, but increased body weight, waist circumference, and hip circumference. Our findings are consistent with previous observational (phenotype) studies that reported an inverse association between height and obesity-related traits, including BMI, WHR, and body fat [73, 74]. As expected, taller adults had lower rates of obesity [73, 74]. Our findings also support previous observational (phenotype) studies that reported positive associations between height and body weight [73,74,75], waist circumference [74, 75], and hip circumference . Furthermore, taller adults had increased body weight and waist and hip circumferences. The results of our genetic PRS of height association were also in agreement with previous genetic correlation studies, mainly conducted in individuals of European ancestry [78,79,80,81]. There was a negative correlation between genetically determined height and BMI [78,79,80], and positive correlations between genetically determined height with waist and hip circumference [80, 81]. Our findings may reflect a partial genetic overlap between height and anthropometric traits including BMI, WHR, body fat, body weight, waist circumference, and hip circumference. However, genetic correlations in individuals of Han Chinese ancestry remain to be elucidated.
Regarding blood pressure and lipids and glucose levels, both observational (phenotype) and genetically determined height (PRS251) were inversely associated with TC and LDL-C. Our findings are consistent with those of previous observational (phenotype) studies that reported an inverse association between height and blood lipid levels [76, 77, 85]. Taller adults have lower levels of TC and LDL-C [76, 77, 85]. The results of our genetic PRS of height are also in agreement with previous studies, mainly conducted in individuals of European ancestry [50, 80, 81]. Negative genetic correlations between height and TC were found [80, 81]. A taller genetic PRS was associated with lower LDL-C levels in individuals of European ancestry . Our results also suggest that genetically taller individuals of Han Chinese ancestry have lower levels of TC and LDL-C.
This large-scale assessment of the genetic architecture of height in the Han Chinese population of Taiwan quantified the extent of the shared genetic basis with individuals of European ancestry. Our observational and genetic study supports the relevance of height to the etiology of various health-related outcomes, especially those regarding anthropometric traits and blood lipids.
Availability of data and materials
Individual-level Taiwan Biobank data are available upon application to the Taiwan Biobank (https://www.twbiobank.org.tw/new_web/).
Body mass index
Diastolic blood pressure
DIS3 like 3′-5′ exoribonuclease 2
EGF-containing fibulin extracellular matrix protein 1
Growth differentiation factor 5
Genome-wide association study
High-density lipoprotein cholesterol
High-mobility group AT-hook 1
Individual identity by descent
Ligand-dependent nuclear receptor corepressor like
Low-density lipoprotein cholesterol
Linear regression model
Minor allele frequency
Principal component analysis
Polygenic risk score
Systolic blood pressure
Single nucleotide polymorphism
Zinc finger and BTB domain containing 38
Dubois L, Ohm Kyvik K, Girard M, Tatone-Tokuda F, Perusse D, Hjelmborg J, et al. Genetic and environmental contributions to weight, height, and BMI from birth to 19 years of age: an international study of over 12,000 twin pairs. PLoS One. 2012;7(2):e30153.
Jelenkovic A, Hur YM, Sund R, Yokoyama Y, Siribaddana SH, Hotopf M, et al. Genetic and environmental influences on adult human height across birth cohorts from 1886 to 1994. Elife. 2016;5:e20320.
Jelenkovic A, Sund R, Hur YM, Yokoyama Y, Hjelmborg JV, Moller S, et al. Genetic and environmental influences on height from infancy to early adulthood: an individual-based pooled analysis of 45 twin cohorts. Sci Rep. 2016;6:28496.
Tyrrell J, Jones SE, Beaumont R, Astley CM, Lovell R, Yaghootkar H, et al. Height, body mass index, and socioeconomic status: mendelian randomisation study in UK Biobank. BMJ. 2016;352:i582.
Magnusson PK, Rasmussen F, Gyllensten UB. Height at age 18 years is a strong predictor of attained education later in life: cohort study of over 950,000 Swedish men. Int J Epidemiol. 2006;35(3):658–63.
German A, Mesch G, Hochberg Z. People are taller in countries with better environmental conditions. Front Endocrinol (Lausanne). 2020;11:106.
Ravnborg TL, Jensen TK, Andersson AM, Toppari J, Skakkebaek NE, Jorgensen N. Prenatal and adult exposures to smoking are associated with adverse effects on reproductive hormones, semen quality, final height and body mass index. Hum Reprod. 2011;26(5):1000–11.
Carter RC, Jacobson JL, Sokol RJ, Avison MJ, Jacobson SW. Fetal alcohol-related growth restriction from birth through young adulthood and moderating effects of maternal prepregnancy weight. Alcohol Clin Exp Res. 2013;37(3):452–62.
Alves JGB, Alves GV. Effects of physical activity on children's growth. J Pediatr (Rio J). 2019;95(Suppl 1):72–8.
Lucas R, Fraga S, Ramos E, Barros H. Early initiation of smoking and alcohol drinking as a predictor of lower forearm bone mineral density in late adolescence: a cohort study in girls. PLoS One. 2012;7(10):e46940.
O'Loughlin J, Karp I, Henderson M, Gray-Donald K. Does cigarette use influence adiposity or height in adolescence? Ann Epidemiol. 2008;18(5):395–402.
Huang W, Lei X, Ridder G, Strauss J, Zhao Y. Health, height, height shrinkage, and SES at older ages: evidence from China. Am Econ J Appl Econ. 2013;5(2):86–121.
Crouch DJM, Bodmer WF. Polygenic inheritance, GWAS, polygenic risk scores, and the search for functional variants. Proc Natl Acad Sci U S A. 2020;117(32):18924–33.
Chan Y, Holmen OL, Dauber A, Vatten L, Havulinna AS, Skorpen F, et al. Common variants show predicted polygenic effects on height in the tails of the distribution, except in extremely short individuals. PLoS Genet. 2011;7(12):e1002439.
Lango Allen H, Estrada K, Lettre G, Berndt SI, Weedon MN, Rivadeneira F, et al. Hundreds of variants clustered in genomic loci and biological pathways affect human height. Nature. 2010;467(7317):832–8.
Wood AR, Esko T, Yang J, Vedantam S, Pers TH, Gustafsson S, et al. Defining the role of common variation in the genomic and biological architecture of adult human height. Nat Genet. 2014;46(11):1173–86.
Lettre G, Jackson AU, Gieger C, Schumacher FR, Berndt SI, Sanna S, et al. Identification of ten loci associated with height highlights new biological pathways in human growth. Nat Genet. 2008;40(5):584–91.
Weedon MN, Lango H, Lindgren CM, Wallace C, Evans DM, Mangino M, et al. Genome-wide association analysis identifies 20 loci that influence adult height. Nat Genet. 2008;40(5):575–83.
Kim JJ, Lee HI, Park T, Kim K, Lee JE, Cho NH, et al. Identification of 15 loci influencing height in a Korean population. J Hum Genet. 2010;55(1):27–31.
Okada Y, Kamatani Y, Takahashi A, Matsuda K, Hosono N, Ohmiya H, et al. A genome-wide association study in 19 633 Japanese subjects identified LHX3-QSOX2 and IGF1 as adult height loci. Hum Mol Genet. 2010;19(11):2303–12.
Gudbjartsson DF, Walters GB, Thorleifsson G, Stefansson H, Halldorsson BV, Zusmanovich P, et al. Many sequence variants affecting diversity of adult human height. Nat Genet. 2008;40(5):609–15.
Yang J, Ferreira T, Morris AP, Medland SE, Genetic Investigation of ATC, Replication DIG, Meta-analysis C, Madden PA, et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat Genet. 2012;44(4):369–75 S361-363.
Cho YS, Go MJ, Kim YJ, Heo JY, Oh JH, Ban HJ, et al. A large-scale genome-wide association study of Asian populations uncovers genetic factors influencing eight quantitative traits. Nat Genet. 2009;41(5):527–34.
Chan Y, Salem RM, Hsu YH, McMahon G, Pers TH, Vedantam S, et al. Genome-wide analysis of body proportion classifies height-associated variants by mechanism of action and implicates genes important for skeletal development. Am J Hum Genet. 2015;96(5):695–708.
Soranzo N, Rivadeneira F, Chinappen-Horsley U, Malkina I, Richards JB, Hammond N, et al. Meta-analysis of genome-wide scans for human adult stature identifies novel Loci and associations with measures of skeletal frame size. PLoS Genet. 2009;5(4):e1000445.
Lin YJ, Cheng CF, Wang CH, Liang WM, Tang CH, Tsai LP, et al. Genetic architecture associated with familial short stature. J Clin Endocrinol Metab. 2020;105(6):dgaa131.
Emerging Risk Factors C. Adult height and the risk of cause-specific death and vascular morbidity in 1 million people: individual participant meta-analysis. Int J Epidemiol. 2012;41(5):1419–33.
Green J, Cairns BJ, Casabonne D, Wright FL, Reeves G, Beral V, et al. Height and cancer incidence in the Million Women Study: prospective cohort, and meta-analysis of prospective studies of height and total cancer risk. Lancet Oncol. 2011;12(8):785–94.
Paajanen TA, Oksala NK, Kuukasjarvi P, Karhunen PJ. Short stature is associated with coronary heart disease: a systematic review of the literature and a meta-analysis. Eur Heart J. 2010;31(14):1802–9.
Case A, Paxson C. Height, health, and cognitive function at older ages. Am Econ Rev. 2008;98(2):463–7.
Guven C, Lee WS. Height and cognitive function at older ages: is height a useful summary measure of early childhood experiences? Health Econ. 2013;22(2):224–33.
Guven C, Lee WS. Height, aging and cognitive abilities across Europe. Econ Hum Biol. 2015;16:16–29.
Ma W, Hagan KA, Heianza Y, Sun Q, Rimm EB, Qi L. Adult height, dietary patterns, and healthy aging. Am J Clin Nutr. 2017;106(2):589–96.
Shrestha S, Rasmussen SH, Pottegard A, Angquist LH, Jess T, Allin KH, et al. Associations between adult height and type 2 diabetes mellitus: a systematic review and meta-analysis of observational studies. J Epidemiol Community Health. 2019;73(7):681–8.
Choi SW, Mak TS, O'Reilly PF. Tutorial: a guide to performing polygenic risk score analyses. Nat Protoc. 2020;15(9):2759–72.
Janssens A. Validity of polygenic risk scores: are we measuring what we think we are? Hum Mol Genet. 2019;28(R2):R143–50.
Ye J, Wu C, Chu X, Wen Y, Li P, Cheng B, et al. Evaluating the effect of birth weight on brain volumes and depression: an observational and genetic study using UK Biobank cohort. Eur Psychiatry. 2020;63(1):e73.
Teumer A. Common methods for performing Mendelian randomization. Front Cardiovasc Med. 2018;5:51.
Thrift AP, Gong J, Peters U, Chang-Claude J, Rudolph A, Slattery ML, et al. Mendelian randomization study of height and risk of colorectal cancer. Int J Epidemiol. 2015;44(2):662–72.
Cornish AJ, Law PJ, Timofeeva M, Palin K, Farrington SM, Palles C, et al. Modifiable pathways for colorectal cancer: a mendelian randomisation analysis. Lancet Gastroenterol Hepatol. 2020;5(1):55–62.
Zhang B, Shu XO, Delahanty RJ, Zeng C, Michailidou K, Bolla MK, et al. Height and breast cancer risk: evidence from prospective studies and Mendelian randomization. J Natl Cancer Inst. 2015;107(11):djv219.
Lai FY, Nath M, Hamby SE, Thompson JR, Nelson CP, Samani NJ. Adult height and risk of 50 diseases: a combined epidemiological and genetic analysis. BMC Med. 2018;16(1):187.
Dixon-Suen SC, Nagle CM, Thrift AP, Pharoah PDP, Ewing A, Pearce CL, et al. Adult height is associated with increased risk of ovarian cancer: a Mendelian randomisation study. Br J Cancer. 2018;118(8):1123–9.
Dusingize JC, Olsen CM, An J, Pandeya N, Law MH, Thompson BS, et al. Body mass index and height and risk of cutaneous melanoma: Mendelian randomization analyses. Int J Epidemiol. 2020;49(4):1236–45.
Davies NM, Gaunt TR, Lewis SJ, Holly J, Donovan JL, Hamdy FC, et al. The effects of height and BMI on prostate cancer incidence and mortality: a Mendelian randomization study in 20,848 cases and 20,214 controls from the PRACTICAL consortium. Cancer Causes Control. 2015;26(11):1603–16.
Ong JS, An J, Law MH, Whiteman DC, Neale RE, Gharahkhani P, et al. Height and overall cancer risk and mortality: evidence from a Mendelian randomisation study on 310,000 UK Biobank participants. Br J Cancer. 2018;118(9):1262–7.
Nuesch E, Dale C, Palmer TM, White J, Keating BJ, van Iperen EP, et al. Adult height, coronary heart disease and stroke: a multi-locus Mendelian randomization meta-analysis. Int J Epidemiol. 2016;45(6):1927–37.
Stefan N, Haring HU, Hu FB, Schulze MB. Divergent associations of height with cardiometabolic disease and cancer: epidemiology, pathophysiology, and global implications. Lancet Diabetes Endocrinol. 2016;4(5):457–67.
Marouli E, Del Greco MF, Astley CM, Yang J, Ahmad S, Berndt SI, et al. Mendelian randomisation analyses find pulmonary factors mediate the effect of height on coronary artery disease. Commun Biol. 2019;2:119.
Nelson CP, Hamby SE, Saleheen D, Hopewell JC, Zeng L, Assimes TL, et al. Genetically determined height and coronary artery disease. N Engl J Med. 2015;372(17):1608–18.
Chen CH, Yang JH, Chiang CWK, Hsiung CN, Wu PE, Chang LC, et al. Population structure of Han Chinese in the modern Taiwanese population based on 10,000 participants in the Taiwan Biobank project. Hum Mol Genet. 2016;25(24):5321–31.
Lin WY, Chan CC, Liu YL, Yang AC, Tsai SJ, Kuo PH. Performing different kinds of physical exercise differentially attenuates the genetic effects on obesity measures: Evidence from 18,424 Taiwan Biobank participants. PLoS Genet. 2019;15(8):e1008277.
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75.
Marees AT, de Kluiver H, Stringer S, Vorspan F, Curis E, Marie-Claire C, et al. A tutorial on conducting genome-wide association studies: quality control and statistical analysis. Int J Methods Psychiatr Res. 2018;27(2):e1608.
Hsiao YJ, Chuang HK, Chi SC, Wang YY, Chiang PH, Teng PC, et al. Genome-wide polygenic risk score for predicting high risk glaucoma individuals of Han Chinese ancestry. J Pers Med. 2021;11(11):1169.
Lin YJ, Ho TJ, Lin TH, Hsu WY, Huang SM, Liao CC, et al. P-coumaric acid regulates exon 12 splicing of the ATP7B gene by modulating hnRNP A1 protein expressions. Biomedicine (Taipei). 2015;5(2):10.
Wei CY, Yang JH, Yeh EC, Tsai MF, Kao HJ, Lo CZ, et al. Genetic profiles of 103,106 individuals in the Taiwan Biobank provide insights into the health and history of Han Chinese. NPJ Genom Med. 2021;6(1):10.
Shi S, Yuan N, Yang M, Du Z, Wang J, Sheng X, et al. Comprehensive Assessment of Genotype Imputation Performance. Hum Hered. 2018;83(3):107–16.
Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015;4:7.
Barrett JC, Fry B, Maller J, Daly MJ. Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics. 2005;21(2):263–5.
Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel B, et al. The structure of haplotype blocks in the human genome. Science. 2002;296(5576):2225–9.
Lin YJ, Wan L, Wu JY, Sheu JJ, Lin CW, Lan YC, et al. HLA-E gene polymorphism associated with susceptibility to Kawasaki disease and formation of coronary artery aneurysms. Arthritis Rheum. 2009;60(2):604–10.
Pruim RJ, Welch RP, Sanna S, Teslovich TM, Chines PS, Gliedt TP, et al. LocusZoom: regional visualization of genome-wide association scan results. Bioinformatics. 2010;26(18):2336–7.
Tachmazidou I, Suveges D, Min JL, Ritchie GRS, Steinberg J, Walter K, et al. Whole-genome sequencing coupled to imputation discovers genetic signals for anthropometric traits. Am J Hum Genet. 2017;100(6):865–84.
Olafsdottir T, Thorleifsson G, Sulem P, Stefansson OA, Medek H, Olafsson K, et al. Genome-wide association identifies seven loci for pelvic organ prolapse in Iceland and the UK Biobank. Commun Biol. 2020;3(1):129.
Christakoudi S, Evangelou E, Riboli E, Tsilidis KK. GWAS of allometric body-shape indices in UK Biobank identifies loci suggesting associations with morphogenesis, organogenesis, adrenal cell renewal and cancer. Sci Rep. 2021;11(1):10688.
Sakaue S, Kanai M, Tanigawa Y, Karjalainen J, Kurki M, Koshiba S, et al. A cross-population atlas of genetic associations for 220 human phenotypes. Nat Genet. 2021;53(10):1415–24.
Plotnikov D, Williams C, Guggenheim JA. Association between birth weight and refractive error in adulthood: a Mendelian randomisation study. Br J Ophthalmol. 2020;104(2):214–9.
Graff M, Scott RA, Justice AE, Young KL, Feitosa MF, Barata L, et al. Genome-wide physical activity interactions in adiposity - a meta-analysis of 200,452 adults. PLoS Genet. 2017;13(4):e1006528.
Hao Y, Liu X, Lu X, Yang X, Wang L, Chen S, et al. Genome-wide association study in Han Chinese identifies three novel loci for human height. Hum Genet. 2013;132(6):681–9.
Rask-Andersen M, Karlsson T, Ek WE, Johansson A. Genome-wide association study of body fat distribution identifies adiposity loci and sex-specific genetic effects. Nat Commun. 2019;10(1):339.
Wojcik GL, Graff M, Nishimura KK, Tao R, Haessler J, Gignoux CR, et al. Genetic analyses of diverse populations improves discovery for complex traits. Nature. 2019;570(7762):514–8.
Bosy-Westphal A, Plachta-Danielzik S, Dorhofer RP, Muller MJ. Short stature and obesity: positive association in adults but inverse association in children and adolescents. Br J Nutr. 2009;102(3):453–61.
Han TS, Seidell JC, Currall JE, Morrison CE, Deurenberg P, Lean ME. The influences of height and age on waist circumference as an index of adiposity in adults. Int J Obes Relat Metab Disord. 1997;21(1):83–9.
Heymsfield SB, Heo M, Pietrobelli A. Are adult body circumferences associated with height? Relevance to normative ranges and circumferential indexes. Am J Clin Nutr. 2011;93(2):302–7.
Lee MY, Nam GE, Han K, Kim DH, Kim YH, Cho KH, et al. Association between height and hypercholesterolemia in adults: a nationwide population-based study in Korea. Lipids Health Dis. 2019;18(1):198.
Oh NK, Song YM, Kim SH, Park MJ. Short stature is associated with increased risk of dyslipidemia in Korean adolescents and adults. Sci Rep. 2019;9(1):14090.
Vogelezang S, Bradfield JP, Ahluwalia TS, Curtin JA, Lakka TA, Grarup N, et al. Novel loci for childhood body mass index and shared heritability with adult cardiometabolic traits. PLoS Genet. 2020;16(10):e1008718.
Bulik-Sullivan B, Finucane HK, Anttila V, Gusev A, Day FR, Loh PR, et al. An atlas of genetic correlations across human diseases and traits. Nat Genet. 2015;47(11):1236–41.
Shi H, Mancuso N, Spendlove S, Pasaniuc B. Local genetic correlation gives insights into the shared genetic architecture of complex traits. Am J Hum Genet. 2017;101(5):737–51.
Guo MH, Hirschhorn JN, Dauber A. Insights and implications of genome-wide association studies of height. J Clin Endocrinol Metab. 2018;103(9):3155–68.
Estrada K, Krawczak M, Schreiber S, van Duijn K, Stolk L, van Meurs JB, et al. A genome-wide association study of northwestern Europeans involves the C-type natriuretic peptide signaling pathway in the etiology of human height variation. Hum Mol Genet. 2009;18(18):3516–24.
He M, Xu M, Zhang B, Liang J, Chen P, Lee JY, et al. Meta-analysis of genome-wide association studies of adult height in East Asians identifies 17 novel loci. Hum Mol Genet. 2015;24(6):1791–800.
Akiyama M, Ishigaki K, Sakaue S, Momozawa Y, Horikoshi M, Hirata M, et al. Characterizing rare and low-frequency height-associated variants in the Japanese population. Nat Commun. 2019;10(1):4393.
Henriksson KM, Lindblad U, Agren B, Nilsson-Ehle P, Rastam L. Associations between body height, body composition and cholesterol levels in middle-aged men. the coronary risk factor study in southern Sweden (CRISS). Eur J Epidemiol. 2001;17(6):521–6.
We are grateful to the Health Data Science Center at the China Medical University Hospital, the iHi Clinical Research and iHi Genomics Platform from the Big Data Center of China Medical University Hospital, Taiwan Biobank (TWBR10812-06), and the National Center for Genome Medicine of the National Core Facility Program for Biotechnology for data exploration; administrative, technical, and statistical analysis; and funding support.
We thank the China Medical University, Taiwan (CMU-110-S-17, CMU-110-S-24, CMU110-MF-115, and CMU110-MF-49); the China Medical University Hospital, Taiwan (DMR-111-062 and DMR-111-153); and the Ministry of Science and Technology, Taiwan (MOST 108-2314-B-039-044-MY3, MOST 109-2320-B-039-035-MY3, MOST 109-2410-H-039-002, and MOST 110-2410-H-039-002) for supporting this publication. The funding entities had no role in the study design, data collection, data analysis, interpretation, or authorship of the manuscript.
Ethics approval and consent to participate
The study procedures were approved by the Human Studies Committee of the China Medical University Hospital, Taichung, Taiwan (approval number: CMUH107-REC3-074). All recruited patients provided written informed consent upon enrollment.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
. Basic characteristics of the study participants at enrollment.
. The top lead SNPs in 89 genomic regions that were significantly associated with height (p < 5 × 10−8), using LD (r2 < 0.2), in individuals of Han Chinese ancestry.
. Replication of a previous GWAS of body height in the SNPs of the training group (313 of 1722 SNPs).
. The clumping and p-value threshold method identifies the “best-fit” SNP number for the polygenic risk score (PRS) calculation, according to the largest explainable phenotype correlation r2 using only PRS (PRS r2 and SNP number). The x-axis shows the p-value thresholds from the height of GWAS results. The y-axis represents the explainable phenotypic correlation r2 using only the PRS (PRS r2). The p-values above the bars show the statistical significance of the associations between genetically determined height (PRS) and measured height (phenotype). (A) 251 SNPs were obtained from 6,941 SNPs (novel and reported SNPs; PRS r2 = 0.0712, SNP number = 251). (B) 194 SNPs were obtained from 6,843 SNPs (novel SNPs; PRS r2 = 0.0622, SNP number = 194). (C) 154 SNPs were obtained from 313 SNPs (reported SNPs; PRS r2 = 0.0706, SNP number = 154). Figure S2. Scree plot identifying the number of principal component analyses (PCA) needed for the correction of population structure in the height GWAS study, using pcadapt (an R package used to determine the number of principal components). Figure S3. Association between genetically determined height (PRS237) and measured height (phenotype) in an independent cohort of the Big Data Center in China Medical University Hospital in Taiwan. The measured height (cm) and calculated polygenic risk score (PRS) for height were stratified by sex, mean-centered, and normalized to one standard deviation (SD; males, N = 46,310; females, N = 54,728). The normalized measured height is represented on the y-axis, and normalized genetically determined height (PRS237) is represented on the x-axis.
. Characteristics of 237 out of the 251 SNPs associated with height in the independent cohort of the Big Data Center at China Medical University Hospital in Taiwan.
About this article
Cite this article
Chiou, JS., Cheng, CF., Liang, WM. et al. Your height affects your health: genetic determinants and health-related outcomes in Taiwan. BMC Med 20, 250 (2022). https://doi.org/10.1186/s12916-022-02450-w
- Genome-wide association studies
- Genetic single nucleotide polymorphisms
- Polygenic risk score
- Health-related outcomes