Skip to main content

Early onset of immune-mediated diseases in minority ethnic groups in the UK



The prevalence of some immune-mediated diseases (IMDs) shows distinct differences between populations of different ethnicities. The aim of this study was to determine if the age at diagnosis of common IMDs also differed between different ethnic groups in the UK, suggestive of distinct influences of ethnicity on disease pathogenesis.


This was a population-based retrospective primary care study. Linear regression provided unadjusted and adjusted estimates of age at diagnosis for common IMDs within the following ethnic groups: White, South Asian, African-Caribbean and Mixed-race/Other. Potential disease risk confounders in the association between ethnicity and diagnosis age including sex, smoking, body mass index and social deprivation (Townsend quintiles) were adjusted for. The analysis was replicated using data from UK Biobank (UKB).


After adjusting for risk confounders, we observed that individuals from South Asian, African-Caribbean and Mixed-race/Other ethnicities were diagnosed with IMDs at a significantly younger age than their White counterparts for almost all IMDs. The difference in the diagnosis age (ranging from 2 to 30 years earlier) varied for each disease and by ethnicity. For example, rheumatoid arthritis was diagnosed at age 49, 48 and 47 years in individuals of African-Caribbean, South Asian and Mixed-race/Other ethnicities respectively, compared to 56 years in White ethnicities. The earlier diagnosis of most IMDs observed was validated in UKB although with a smaller effect size.


Individuals from non-White ethnic groups in the UK had an earlier age at diagnosis for several IMDs than White adults.

Peer Review reports


The prevalence of immune mediated diseases (IMDs) is increasing worldwide together with associated mortality and morbidity rates. IMDs are a group of diseases that cause damage to tissues and organs in response to self-antigens [1]. Studies into individual diseases such as systemic lupus erythematosus (SLE) and rheumatoid arthritis (RA) have shown that their incidence and prevalence differ between ethnic groups [2, 3]. Recent studies report the highest incidence of SLE to be in the African-Caribbean population [4, 5] and the highest incidence of RA in the South Asian population [3]. Other studies have reported the incidence of IMDs such as vitiligo and autoimmune thyroid disease (AIT) to be higher in these ethnic groups [3, 6].

Importantly, several studies indicate that severe disease involving major organs occurs at a younger age in different ethnic populations, which may indicate an earlier age of onset [7,8,9,10]. If the onset of IMDs is earlier in certain ethnic groups, this would likely result in longer disease duration and increase the risk of long-term disease complications which has implications for healthcare utilisation. Additionally, if individuals from these ethnic groups develop IMDs at an earlier age, this may suggest that ethnicity influences immune responses relevant to disease pathogenesis as well as potentially more broadly. However, we are not aware of any large-scale epidemiological studies that assess differences in diagnosis ages between different ethnic groups.

We therefore initiated a study to compare diagnosis ages of IMDs between different ethnic groups within the UK.


Our study applied data from IQVIA Medical Research Data (IMRD-UK) [Scientific Review Committee Reference Number: 18THIN064], which is an electronic health records (EHR) database of primary care patients in the UK. IMRD-UK is representative of the UK population in terms of demographic structure and common morbidity prevalence [11]. The ethnicity of patients was recorded by general practices through self-reporting by patients. Information relating to symptoms, diagnoses and referrals are recorded within IMRD-UK using Read Codes, a clinical hierarchy coding system [12].

This study was a population-based retrospective study with a cohort of ~4.5 million. The study period was set between 1st January 2006 (ethnicity recording at optimal completeness) and 31st December 2020, and patients of all ages were considered for the study. General practices in this study were included 12 months after their instalment of EHR or 12 months from the practice’s acceptable mortality recording dates to reduce under-recording of events [13, 14]. In addition, patients were only allowed to enter the study after 12 months of registration with an eligible general practitioner. Furthermore, a patient with more than one autoimmune condition could contribute to more than one disease group.

Ethnicity in this study was categorised broadly into the four most common groups based on the 2011 UK census classification [15]: (1) White (British, Irish, other White); (2) South Asian (Bangladeshi, Pakistani, Indian, Sri Lankan, British Asian or other South Asian); (3) African-Caribbean (Black African, Black Caribbean, Black British or other Black people); and (4) Mixed-race/Other ethnic groups (including Chinese, Vietnamese, and other South-East Asian). The most recent record of ethnicity was utilised in this study.

Outcomes of IMDs, as well as fibromyalgia (FM), as a comparator chronic but non-autoimmune disease, were identified by relevant Read Codes (Additional file 1). IMDs included in the analysis were: AIT; coeliac disease; inflammatory bowel disease (IBD); myasthenia gravis (MG); multiple sclerosis (MS); psoriasis; pernicious anaemia (PA); RA; SLE; Sjogren’s syndrome; and vitiligo. Read Code lists for all IMDs considered are detailed in a previous study [16]. Patients were followed up from index date (study entry) until the earliest of the following end points which marked the exit date: diagnosis date, death date, study end date, date patient transferred from GP, or last date of data collection from a given GP.

UK Biobank (UKB) [application number: 31224] is a cross-sectional study of approximately 500,000 participants, aged between 40 and 69 years, that were recruited between 2006 and 2010 [17]. The data collected include detailed information on health and lifestyle as well as genetic data.

The dates of the First Occurrence of Health Outcomes data were extracted using the ICD-10 codes from category 1712 (Table S1). Ethnic background was determined from field 21000. The data sources included ICD-10 codes obtained from hospital inpatient data, death Register records, self-reported and from read codes in the primary care data.

Statistical analysis

Baseline covariates were summarised using appropriate descriptive statistics, stratified by ethnicity. Median (interquartile range, IQR) for continuous variables due to their skewed distribution and frequency (%) for categorical variables were used to describe the baseline characteristics. Linear regression was used to provide unadjusted and adjusted estimates of age at diagnosis by ethnicity. Checks were performed to ensure that the assumptions of linear regression were not violated, and no evidence of heteroscedasticity was found. Additional tests confirmed that the residuals (errors) of the regression line were approximately normally distributed. Baseline record of sex, smoking, body mass index (BMI) and social deprivation (Townsend quintiles) were considered as potential confounders in the association between ethnicity and age at diagnosis of the outcome conditions. Patients in which the date of diagnosis preceded the baseline date were excluded from the analysis. In addition, missing data from each covariate were treated as a separate missing category and included in the regression analysis to enable the same number of patients to be compared across the different analyses. All analyses were performed using Stata 16 SE, and a p-value <0.05 was considered statistically significant.

Comparisons of the ages at IMDs diagnosis were between the three main groups: South Asian, African-Caribbean, and Mixed-race/Other and the White ethnic group, both with and without adjustment for relevant potential confounding variables.


Study cohort characteristics

The baseline study characteristics include the highest proportion of adults aged over 50 years in White (18.5%) group followed by South Asian (7.7%), African-Caribbean (6.8%) and Mixed-race/Other (5.5%) groups (Table 1). The African-Caribbean group contain the highest proportion (13.8%) of obese participants (BMI ≥30) followed by White (12.3%), South Asian (7.8%) and Mixed-race/Other (6.3%) groups. However, BMI data were missing or considered implausible (BMI values < 14 or > 75) from a considerable proportion of the cohort (Table 1). The proportion of participants who were most socially deprived was higher amongst the three non-White ethnic groups in comparison to White group although information was missing for a considerable proportion of participants (Table 1). The highest proportion of smokers was from the White group and the highest proportion of non-smokers were from the South Asian group (Table 1).

Table 1 IMRD-UK study characteristics of the age at diagnoses of IMDs in different ethnic groups

Age at diagnosis by ethnicity

There were significant differences in the ages at diagnosis of IMDs between the four ethnic groups after adjusting for sex and potential lifestyle confounders (smoking, BMI and levels of social deprivation) (Table 2). Age at disease diagnosis for all IMD cases combined, adjusted for risk factors, revealed that patients were diagnosed on average 4 years earlier in the African-Caribbean group and 6 years earlier in both the South Asian and Mixed-race/Other groups than those from the White group (Table 2 and Fig. 1). Age at diagnosis of almost all IMDs was earlier in patients of South Asian ethnicity (Fig. 1). The Mixed-race/Other group followed a similar trend to South Asians except for Sjogren's syndrome where the diagnosis age was not statistically different to the White group. Although the African-Caribbean group also followed a similar trend to the other two non-White groups, they differed for MS and AIT where the diagnosis age was not significantly different to the White group (Table 2). In contrast, there was no difference between the White and African-Caribbean groups in the diagnosis age for FM. The difference in FM diagnosis age was small at 1.6 years and 1.8 years in the South Asian, Mixed-race/Other groups, respectively, in comparison to the White group (Table 2, Fig. 1).

Table 2 Unadjusted and adjusted differences in age at disease diagnoses in different ethnic groups using White ethnicity as the reference group in the IMRD-UK cohort
Fig. 1
figure 1

Coefficient plot demonstrating the earlier diagnoses age of IMDs (Y-axis) in the different ethnic minority groups with comparison to the reference White group (indicated by dashed line to the right). The figure was generated using the age at diagnosis after adjustments for confounding variables: BMI, smoking status, sex and social deprivation

Rheumatic diseases

Rheumatoid arthritis

RA was diagnosed 7.1 years (95% CI=8.54 to −5.73, p=<0.001) earlier in the South Asian group than the White group, after adjustments for sex, BMI, smoking and deprivation (Table 2). RA was diagnosed 7.5 years (95% CI=5.29–9.63, p=<0.001) earlier in the Mixed-race/Other group relative to the White group, after statistical adjustments. The mean age at diagnosis was also significantly earlier by 5.8 years (95% CI= 3.5–8.14, p=<0.001) in the African-Caribbean group relative to the White group (Table 2).

Systemic lupus erythematosus and Sjogren’s syndrome

The total number of cases of SLE was low (n=95, n=99, n=78 in African-Caribbean, South Asian and Mixed-race/Other, respectively) in three of the ethnic groups compared to the White group (n=1213). Our data show that whilst the African-Caribbean group were diagnosed 4.7 years earlier (95% CI=1.76–7.69, p=<0.001), the South Asian and Mixed-race/Other groups had an even earlier age at diagnosis by 6.0 years (95% CI=3.06–8.93, p=<0.001) and 6.1 years (95% CI=3.28–8.87, p=<0.001), respectively (Table 2). Similarly, the number of cases of Sjogren’s syndrome was low in three of the ethnic groups (Table 2). We did not detect a significant difference in the diagnosis age of Sjogren’s between Mixed-race/Other and the White group, but the South Asian group were diagnosed 11.2 years (95% CI=7.70–14.77, p=<0.001) earlier.

Gastrointestinal diseases

Inflammatory bowel disease and coeliac disease

There were significant differences in diagnosis ages of IBD between the different ethnic groups (Table 2). The age of IBD diagnosis was earlier, after adjustments, by 7.1 (95% CI=4.98–9.26, p=<0.001), 4.8 (95% CI=2.72–3.42, p=<0.001) and 3.1 (95% CI=0.15–5.13, p=<0.001) years in the Mixed-race/Other, South Asian and African-Caribbean groups, respectively. Our data indicated coeliac disease diagnosis age, after adjustments, to be significantly earlier by 6.4 years (95% CI=4.78–8.0, p=<0.001) in the South Asian group. The diagnosis age in the African-Caribbean group varied greatly as reflected by the large confidence interval and was not significantly different to the White group (p =0.26). In the Mixed-race/Other group, coeliac disease was diagnosed earlier by up to 9.5 years (95% CI=6.52–12.47, P=<0.001).

Pernicious anaemia

The age at diagnosis of PA was significantly earlier by 10.7 (95% CI=6.52–14.49, P=<0.001) years in the African-Caribbean group than the White group and earlier by 9 years (95% CI=7.02–10.97, P=<0.001) in the South Asian group compared with the White group. The diagnosis was earlier by 8.4 years (CI=3.32-13.54, p=<0.01) in the Mixed-race/Other group (Table 2).

Neurological diseases

Multiple sclerosis and myasthenia gravis

There was no significant difference in MS diagnosis age between the African-Caribbean and White groups. Although the diagnosis was significantly earlier by 8.4 and 6.0 years in Mixed-race/Other and South Asian groups, respectively (Table 2). MG was diagnosed 27.0 years earlier in the Mixed-race/Other (n=8) group than the White group, although the number of cases was small. Similarly, a difference of 19.2 years was observed in the age at MG diagnosis in the African-Caribbean (n=13) group, but the age disparity was not as great (4.8 years) in the South Asian group (n=21).

Immune-mediated skin conditions


The diagnosis ages, after adjustments, were 5.1 (95% CI=4.18–6.09, p=<0.001), 4.8 (95% CI=4.15–5.43, p=<0.001) and 4.1 (95% CI=2.82–5.43, p=<0.001) years in Mixed-race/Other, South Asian and African-Caribbean groups, respectively, in comparison to the White group. The unadjusted diagnosis age showed a higher difference in diagnosis ages between the different ethnic groups (Table 3).

Table 3 Unadjusted and adjusted differences in age at disease diagnoses in different ethnic groups using White ethnicity as the reference group, data from UK Biobank


The differences in the ages at diagnosis, without adjustments for covariates, were 9.3, 8.1 and 4.5 years earlier in the South Asian, Mixed-race/Other and African-Caribbean groups, respectively. In the adjusted models, the age disparity between the three ethnic groups was reduced to 2.6 (95% CI=1.60–3.68, p=<0.001), 2.3 (95% CI=0.80–3.88, p=<0.001) and 1.6 years (95% CI=0.20–3.34, p=0.08) earlier in South Asians, Mixed-race/Other and African-Caribbean groups, respectively, but was still statistically significant (Table 3).

Autoimmune thyroid disease

There was no significant difference in the diagnosis age of AIT between the African-Caribbean and White groups. The diagnosis age difference was small, but significant, in Mixed-race/Other group (Table 3). Of the three non-White ethnic groups, the South Asian group differed the most in diagnosis age of AIT by 6.1 years (95% CI=4.51–7.63, p=<0.001).

Non-IMD comparator disease

Fibromyalgia: We used FM as a comparator disease which is a common disorder of pain regulation but is not an autoimmune or inflammatory disease. The diagnosis of FM was slightly earlier by 1.6 (95% CI=0.64–2.54, p=<0.001) and 1.8 (95% CI=0.36–3.28, p=0.001) years in South Asian and Mixed-race/Other groups, respectively, in comparison to the White group (Table 2). However, there was no statistically significant difference in the ages at diagnosis between the African-Caribbean and White groups. Furthermore, removing cases of concomitant IMDs did not alter these results.

Validation in UK Biobank

We used UKB to validate our observations and were able to confirm the earlier diagnosis in the three non-White ethnic groups for most IMDs although with a lower effect size (Table 3). However, we did not detect a difference in the diagnosis ages between the four ethnic groups when comparing all IMDs cases summed together (Table 3). Comparisons of diagnosis age could not be made for AIT and FM due to the lack of diagnosis dates in UKB. The IBD cases included ulcerative colitis and Crohn’s disease combined.

Rheumatic diseases

Rheumatoid arthritis

RA was diagnosed earlier by 1.3, 0.6 and 1.3 years, after adjustments, in South Asian, African-Caribbean, and Mixed-race/Other groups, respectively, in comparison to the White group, although only reaching near significance for the South Asian group (Table 3).

SLE and Sjogren’s syndrome

The diagnosis ages of SLE were earlier by 5.5 (p=0.01) and 6.3 (p=<0.01) years in African-Caribbean and Mixed-race/Other groups, respectively. In contrast, SLE diagnosis age in the South Asian group, after adjustments, was 3.3 years earlier than the White group, though not significant (p=0.17). Interestingly for Sjogren’s syndrome, the diagnosis age was significantly earlier for the African-Caribbean group in comparison to the White group by 5.8 years (95% CI=1.99–9.54, p=<0.01). In contrast, there were no significant differences, in diagnosis ages, between the other three groups.

Gastrointestinal diseases

Inflammatory bowel disease, coeliac disease

There were no significant differences in diagnosis ages of IBD between the four ethnic groups (Table 3). For Coeliac disease, the diagnosis age was earlier in all three non-White ethnic groups compared to the White group, which was statistically significant after adjustments for covariates (Table 3), with the African-Caribbean group diagnosed the earliest by 7.1 years (95% CI=1.47–12.66, p=0.01).

Pernicious anaemia

The diagnosis ages of PA were earlier in all three non-White groups in comparison to the White group although this difference was small and not significant (Table 3).

Neurological diseases

Multiple sclerosis and myasthenia gravis

MS was diagnosed earlier by 2.3 years (95% CI= 1.34–5.98, p=0.21) in the Mixed-race/Other group but was 7.0 years later (95% CI=− 3.55–17.55, p=0.19), in the South Asian group than in the White group, though neither reached significance (Table 3). MG was diagnosed earlier by 3.5 and 3.6 years in the African-Caribbean and Mixed-race/Other groups, respectively, in comparison to the White group although not significant. The diagnosis age in the South Asian group was later by 5.9 years than in the White group, although again not significant (Table 3). The number of cases of MS and MG was small in all three ethnic groups making it difficult to draw conclusions (Table 3).

Immune-mediated skin conditions


Psoriasis was diagnosed later by 3.3 years (95% CI= 1.38–5.26, p=<0.01) in the South Asian group compared with the White group but the later diagnosis by 3.2 (95% CI=− 0.77–7.23, p=0.11) and 1.3 (95% CI=− 1.03–3.54, p=0.28) years in African Caribbean and Mixed-race/Other groups, respectively, were not significant.


Vitiligo was diagnosed earlier by 2.9 years (95% CI= − 6.04–0.24, p=0.07) in South Asians compared to the White group; however, there were no statistical differences in the ages at diagnosis between the other ethnic groups.


Although it is known that the prevalence of IMDs varies by ethnicity, we did not find any publication that investigated the disparities in the ages at diagnoses between different ethnic groups. For example, an earlier age of onset of MS with higher disease severity scores was reported in the Black ethnic group in a US study, although the authors did not specify the degree of difference in the age of onset [18]. Two early studies of PA identified a much younger age of onset in Black and Latin American ethnic groups [19, 20] but without adjustments for confounding variables. This was also the case for SLE where the age at diagnosis was determined to be younger in the Black ethnic group (39.4 years, SD 15.9) in comparison to the White ethnic group (45.4 years, SD 17.7) but again with no adjustments for lifestyle factors [8].

Here we examined the ages at diagnosis of all common IMDs in four different ethnic groups in a UK cohort of ~4.5 million. We report for the first time that even after adjustment for potential lifestyle risk factors such as smoking, BMI and social deprivation, that South Asian, African-Caribbean, and Mixed-race/Other ethnicities have an earlier diagnosis of most IMDs in comparison to the White ethnic group. The number of years that diagnosis age is earlier by differs with the specific IMD and by ethnic group and ranges from 2 to 27 years.

We used UKB data to validate our findings and the trend was similar between the two data sets for most IMDs although the effect size was lower in the UKB cohort and for several IMDs did not reach significance. There are several possible explanations for this. Firstly, the number of participants in the UKB diagnosed with IMDs was lower (n=44,743) than in the IMRD-UK (n=80,776). Secondly, UKB is known to have a low representation of non-White ethnicities and is not representative of the UK population [21]. In UKB, the three non-White ethnic groups make up 4.7% (n=2079) of all IMDs cases whereas in the IMRD-UK dataset they represent 9% (n=7339) of all diagnosed IMDs. Thirdly, UKB participants are generally older and less likely to be from socioeconomically deprived areas [21]. Finally, the disease classification is slightly different in UKB from that in IMRD-UK. Read Codes were used to identify the specific IMDs within the IMRD-UK whilst ICD-10 codes were used to identify the different IMDs in UKB from either the hospital admissions records (HAR) or self-reports. Each of these factors could affect the determination of mean age at diagnosis.

However, there were also some inconsistencies between the IMRD-UK and UKB data. This includes psoriasis, which was actually diagnosed at a later age in the South Asian group in comparison to the White group and there were no differences in the ages at diagnosis between White, African-Caribbean and Mixed-race/Other ethnic groups. A potential explanation could be that of the 14,966 cases in UKB with diagnosis dates more than half were self-reports or from HAR records. It is possible that diagnosis dates obtained from HAR are secondary to another condition and therefore not reliable for determining the true age at diagnosis of psoriasis. Similarly, the self-reported diagnosis dates are less reliable as it is only the diagnosed condition that is verified by a healthcare professional and not the date. Removing cases with self-reported dates and diagnosis dates obtained from HAR reduced the number of cases to <10 in each of the three non-White ethnic groups, precluding statistical analysis. Similarly, for MG and MS, there was a disagreement between the two datasets for the South Asian group, but the number of cases was <10 in UKB making it difficult to draw conclusions.

A recent study from the US demonstrated that the incidence of IBD has increased by 39% and 134% in White and non-White individuals, respectively, between 1970 and 2010 [22]. Although we did not distinguish between the subtypes of IBD, we detected that the Mixed-race/Other group were diagnosed with IBD the earliest, by 7.1 years, in comparison to all other ethnic groups. This is also in agreement with a previous study which indicated that the median age at diagnosis for Crohn’s disease was the lowest in the Other ethnic group and highest in the White group [23] albeit not statistically significant possibly due to the small cohort size of the Other ethnic group (n=37).

A possible explanation for the observed earlier diagnosis age of IMDs in non-White groups, even after adjustments for BMI, could be the presence of a systemic pro-inflammatory state at an earlier age [24,25,26,27]. This hypothesis is supported by findings of only a slight difference in FM diagnosis age, a non-IMD, between the four ethnic groups. Previous large-cohort population studies have demonstrated that, irrespective of BMI, South Asians have higher levels of visceral, intermuscular and hepatic fat and significantly less total lean abdominal/back muscle mass than other ethnicities [28]. This could be a cause of the higher levels of pro-inflammatory adipokines such as resistin and lower levels of anti-inflammatory adiponectin detected in healthy individuals of this population [28, 29]. The combination of these factors likely contributes to a pro-inflammatory state as indicated by high hsCRP levels in South Asians [29]. However, exactly how increased inflammation could lead to earlier onset of IMDs is unclear. Inflammation has been shown to be one of the core processes driving ageing [30] and an intriguing possibility is that the biological ageing process occurs at different rates in different ethnicities. There are now methods to assess an individual’s biological as opposed to their chronological age based upon assessment of DNA methylation at specific sites, termed epigenetic clocks [31]. Such analyses have shown differences in biological ageing between ethnicities, for example Americans of Hispanic ethnicity had a higher extrinsic biological age compared to Whites and African Americans had a lower extrinsic biological age [32]. Ageing of the immune system, immunesenescence, is well documented and includes a predisposition to autoimmunity [33]. Immunesenescence both contributes to systemic inflammation and is increased by it and has been shown to occur at younger ages in IMDs such as RA [34]. The raised inflammation at an early age in non-White populations could accelerate immunesenescence and increase the risk of autoimmunity.

An additional explanation for earlier onset of IMDs in different ethnic minority groups includes the role of diet and the composition of the gut microbiome. Khine et al. compared the gut microbiome of pre-adolescents in Malaysia and China and reported a major influence of diet in the two ethnic groups [35], whilst other studies report that ethnicity per se influences the gut microbiome [36, 37]. The onset of IMD may be initiated subsequent to systemic inflammation which is highly influenced by the microbiome and is altered in several IMDs, including IBD and RA. RA patients have been shown to have a reduced diversity in the gut microbiota than healthy individuals [38]. Whilst certain bacterial species are more abundant in RA patients, others are scarce. For example, there is a higher composition of the bacterial species Colinsella sp., in RA patients and this has been shown to promote an increase in gut permeability and inflammation initially at a local level which then spreads to joints [38, 39]. Assessing microbiota diversity in different ethnic groups with IMDs would identify associations with disease prevalence and could suggest a role in the earlier age of onset.

Furthermore, other environmental factors such as psychosocial stress and socio-economic status may also contribute to the earlier onset of IMDs, though socio-economic status was taken into consideration in our analysis. This is supported by a recent Swedish population level sibling study that detected an association between stress-related disorders and an increased risk of IMDs [40]. Similar associations between post-traumatic stress disorders and an increased risk of IMDs have also been reported in large cohort studies [40, 41]. Lastly, the latest mendelian randomisation studies have reported a higher educational attainment with a protective benefit towards risk of RA [42]. The level of educational attainment is known to be affected by socio-economic inequalities. However, further studies are required to fully assess the risk of IMDs attributed to individual socio-economic factors and the earlier onset of disease.

Finally, a further potential explanation could be that patients from ethnic minority groups may have a different mode of onset of disease (e.g., a more abrupt and severe onset, as opposed to an insidious and mild onset). It is also likely that the measures of the ‘mode of disease onset’, ‘disease severity’ or ‘disease activity’ will be different across the different IMDs and future studies should use approaches to capture these constructs that are specific for each of the IMDs.

Limitations of study

The study limitations include missing ethnicity data however, this should not compromise the observations in this study as we assume the missing data to be absent from all ethnic groups and not disproportionately from a specific ethnic group. Furthermore, the proportional representation of the non-White groups reflects the UK population as per 2011 census data. Although, self-reported ethnicity could be a potential confounder.

Some IMDs such as SLE and Sjogren’s syndrome take several years to be diagnosed due to initial non-specific symptoms [43]. It is therefore likely that the onset of disease is substantially earlier than the age at diagnosis. An additional issue, particularly in the South Asian group, is patient’s reluctance to seek medical help early, which inevitably means a later diagnosis after disease onset [44,45,46,47]. Together these factors imply that the actual age of onset of IMDs is likely to be even earlier, by perhaps several years, than the diagnosis age in certain ethnic groups.

Another limitation is that our ethnicity grouping is broad, and it is likely that there is variation in diagnosis ages within each ethnic grouping. This is especially true for the Mixed-race/Other group, which includes several diverse ethnic groups and potentially dilutes the overall effect.

Using Read Codes for case definitions may also be imperfect and there may be a small amount of misclassification of IMDs, for example by including some patients with a non-autoimmune aetiology. To mitigate this as much as possible we used Read Codes which had been validated previously [3, 16]. Lastly, the number of cases of the rarer IMDs such as SLE, MG and Sjogren’s syndrome were very small, although the earlier age at SLE diagnosis was detected in all three ethnic groups in comparison to the White group, consistent with previous studies [48].


In conclusion, we observed an earlier age at diagnosis for almost all IMDs in non-White ethnic groups in comparison to the White group. Earlier onset of disease may suggest differing pathogenesis with ethnicity. The healthcare implication would be to screen patients from non-White ethnic groups at an earlier age to allow novel treatment interventions such as reversal of the ageing processes if accelerated biological ageing was identified as the cause of the earlier onset of IMDs. Earlier intervention may delay disease progression to severe disease and disability due to IMDs.

Availability of data and materials

Presented within the manuscript and as supplementary table and Additional file.



Immune-mediated diseases (autoimmune inflammatory diseases)


Autoimmune thyroid disease


Body mass index


Electronic health records




Hospital admissions records


Inflammatory bowel disease


IQVIA Medical Research Data


Interquartile range


Myasthenia gravis


Multiple sclerosis


Pernicious anaemia


Rheumatoid arthritis


Systemic lupus erythematosus


  1. Copper GS, Stroelha BC. The epidemiology of autoimmune diseases. Autoimmun Rev. 2003;2(3):119–25.

    Article  Google Scholar 

  2. Maningding E, Dall'Era M, Trupin L, Murphy LB and Yazdany J. Racial and Ethnic Differences in the Prevalence and Time to Onset of Manifestations of Systemic Lupus Erythematosus: The California Lupus Surveillance Project.

  3. Subramanian A, Adderley NJ, Gkoutos GV, Margadhamane K, Gokhale KM, Nirantharakumar K, et al. Ethnicity-based differences in the incident risk of allergic diseases and autoimmune disorders: A UK-based retrospective cohort study of 4.4 million participants. Clin Exp Allergy. 2021;51(1):144–7.

    Article  PubMed  Google Scholar 

  4. Rees F, Doherty M, Grainge MJ, Lanyon P, Zhang W. The worldwide incidence and prevalence of systemic lupus erythematosus: a systematic review of epidemiological studies. Rheumatology. 2017;56(11):1945–61.

    Article  PubMed  Google Scholar 

  5. Kabani N, Ginzler EM. Is ethnicity linked to the severity of SLE manifestations? Nat Rev Rheumatol. 2019;15(9):515–6.

    Article  CAS  PubMed  Google Scholar 

  6. Zhang Y, Cai Y, Shi M, Jiang S, Cui S, Wu Y. The prevalence of vitiligo: A meta-analysis. PLoS One. 2016;11(9):e0163806.

    Article  PubMed  PubMed Central  Google Scholar 

  7. McCarty DJ, Manzi S, Medsger TA Jr, Ramsey-Goldman R, LaPorte RE, Kwoh CK. Incidence of systemic lupus erythematosus. Race and gender differences. Arthritis Rheum. 1995;38(9):1260–70.

    Article  CAS  PubMed  Google Scholar 

  8. Lim SS, Bayakly AR, Helmick CG, Gordon C, Easley KA, Drenkard C. The incidence and prevalence of systemic lupus erythematosus, 2002-2004: The Georgia Lupus Registry. Arthritis Rheum. 2014;66(2):357–68.

    Article  Google Scholar 

  9. Nicholas RS, Kostadima V, Hanspal M, et al. MS in South Asians in England: early disease onset and novel pattern of myelin autoimmunity. BMC Neurol. 2015;15:72.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Brito-Zerón P, et al. Influence of geolocation and ethnicity on the phenotypic expression of primary Sjögren's syndrome at diagnosis in 8310 patients: a cross-sectional study from the Big Data Sjögren Project Consortium. Ann Rheum Dis. 2017;76(6):1042–50.

    Article  PubMed  Google Scholar 

  11. Blak BT, Thompson M, Dattani H, Bourke A. Generalisability of The Health Improvement Network (THIN) database: demographics, chronic disease prevalence and mortality rates. Inform Prim Care. 2011;19(4):251–5. PMID: 22828580.

    Article  PubMed  Google Scholar 

  12. Booth N. What are the Read Codes? Health Libr Rev. 1994;11(3):177–82.

    Article  CAS  PubMed  Google Scholar 

  13. Maguire A, Blak BT, Thompson M. The importance of defining periods of complete mortality reporting for research using automated data from primary care. Pharmacoepidemiol Drug Saf. 2009;18(1):76–83.

    Article  PubMed  Google Scholar 

  14. Horsfall L, Walters K, Petersen I. Identifying periods of acceptable computer usage in primary care research databases. Pharmacoepidemiol Drug Saf. 2013;22(1):64–9.

    Article  PubMed  Google Scholar 

  15. Hull SA, Mathur R, Badrick E, Robson J, Boomla K. Recording ethnicity in primary care: assessing the methods and impact. Br J Gen Pract. 2011;61(586):e290–4.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Krishna MT, Subramanian A, Adderley NJ, Zemedikun DT, Gkoutos GV, Nirantharakumar K. Allergic diseases and long-term risk of autoimmune disorders: longitudinal cohort study and cluster analysis. Eur Respir J. 2019;54(5):1900476.

  17. Sudlow C, Gallacher J, Allen N, et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12(3):e1001779.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Amezcua L, McCauley JL. Race and ethnicity on MS presentation and disease course. Mult Scler. 2020;26(5):561–7.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Carmel R, Johnson CS. Racial patterns in pernicious anemia. Early age at onset and increased frequency of intrinsic-factor antibody in black women. N Engl J Med. 1978;298(12):647–50.

    Article  CAS  PubMed  Google Scholar 

  20. Carmel R, Johnson CS, Weiner JM. Pernicious Anemia in Latin Americans Is Not a Disease of the Elderly. Arch Intern Med. 1987;147(11):1995–6.

    Article  CAS  PubMed  Google Scholar 

  21. Fry A, Littlejohns TJ, Sudlow C, Doherty N. LAdamska L, Sprosen T, Collins R, Allen NE, Comparison of Sociodemographic and Health-Related Characteristics of UK Biobank Participants With Those of the General Population. Am J Epidemiol. 2017;186(9):1026–34.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Aniwan S, Harmsen WS, Tremaine WJ, Loftus EV Jr. Incidence of inflammatory bowel disease by race and ethnicity in a population-based inception cohort from 1970 through 2010. Ther Adv Gastroenterol. 2019;12:1756284819827692.

    Article  Google Scholar 

  23. Misra R, Limdi J, Cooney R, Sakuma S, Brookes M, Fogden E, et al. Ethnic differences in inflammatory bowel disease: Results from the United Kingdom inception cohort epidemiology study. World J Gastroenterol. 2019;25(40):6145–57.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Ouchi N, Parker JL, Lugus JJ, Walsh K. Adipokines in inflammation and metabolic disease. Nat Rev Immunol. 2011;11(2):85–97.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Ruiz-Narváez EA, Palmer JR, Gerlovin H, Wise LA, Vimalananda VG, Rosenzweig JL, et al. Birth Weight and Risk of Type 2 Diabetes in the Black Women’s Health Study: Does Adult BMI Play a Mediating Role? Diabetes Care. 2014;37(9):2572–8.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Shah AD, Kanaya AM, et al. Less Favorable Body Composition and Adipokines in South Asians Compared to Other U.S. Ethnic Groups: Results from the MASALA and MESA Studies. Int J Obes. 2016;40(4):639–45.

    Article  CAS  Google Scholar 

  27. Muilwijk M, Nieuwdorp M, Snijder MB, et al. The high risk for type 2 diabetes among ethnic minority populations is not explained by low-grade inflammation. Sci. 2019;9(1):19871.

    CAS  Google Scholar 

  28. Shah A, Kanaya AM. Diabetes and Associated Complications in the South Asian Population. Curr Cardiol Rep. 2014;16(5):476.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Forouhi NG, Sattar N, McKeigue PM. Relation of C-reactive protein to body fat distribution and features of the metabolic syndrome in Europeans and South Asians. Int J Obes Relat Metab Disord. 2001;25(9):1327–31.

    Article  CAS  PubMed  Google Scholar 

  30. López-Otín C, Blasco MA, Partridge L, Serrano M, Kroemer G. The hallmarks of aging. Cell. 2013;153(6):1194–217.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Horvath S. DNA methylation age of human tissues and cell types. Genome Biol. 2013;14(10):R115.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Horvath S, et al. An epigenetic clock analysis of race/ethnicity, sex, and coronary heart disease. Genome Biol. 2016;17:R171.

    Article  Google Scholar 

  33. Freund A, Orjalo AV, Desprez P-Y, Campisi J. Inflammatory networks during cellular senescence: causes and consequences. Trends Mol Med. 2010;16(5):238–46.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Goronzy JJ, Weyand CM. Immune aging and autoimmunity. Cell Mol Life Sci. 2012;69:1615–23.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Khine WWT, Zhang Y, Goie GJY, Wong MS, Liong M, Lee YY, et al. Gut microbiome of pre-adolescent children of two ethnicities residing in three distant cities. Sci Rep. 2019;9(1):7831.

    Article  PubMed  PubMed Central  Google Scholar 

  36. Chong CW, Ahmad AF, Lim YA, Teh CS, Yap IK, Lee SC, et al. Effect of ethnicity and socioeconomic variation to the gut microbiota composition among pre-adolescent in Malaysia. Sci Rep. 2015;5:13338.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Dwiyanto J, Hussain MH, Reidpath D, Ong KS, Qasim A, Lee SWH, et al. Ethnicity influences the gut microbiota of individuals sharing a geographical location: a cross-sectional study from a middle-income country. Sci Rep. 2021;11(1):2618.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. Gioia C, Lucchino B, Tarsitano MG, Iannuccelli C, Di Franco M. Dietary Habits and Nutrition in Rheumatoid Arthritis: Can Diet Influence Disease Development and Clinical Manifestations? Nutrients. 2020;12(5):1456.

    Article  CAS  PubMed Central  Google Scholar 

  39. Horta-Baas G, Romero-Figueroa MDS, Montiel-Jarquín AJ, Pizano-Zárate ML, García-Mena J, Ramírez-Durán N. Intestinal Dysbiosis and Rheumatoid Arthritis: A Link between Gut Microbiota and the Pathogenesis of Rheumatoid Arthritis. J Immunol Res. 2017;2017:4835189. Epub 2017 Aug 30.

  40. Song H, Fang F, Tomasson G, Arnberg FK, Mataix-Cols D, Fernández de la Cruz L, et al. Association of Stress-Related Disorders With Subsequent Autoimmune Disease. JAMA. 2018;319(23):2388–400.

    Article  PubMed  PubMed Central  Google Scholar 

  41. Lee YC, Agnew-Blais J, Malspeis S, Keyes K, Costenbader K, Kubzansky LD, et al. Post-Traumatic Stress Disorder and Risk for Incident Rheumatoid Arthritis. Arthritis Care Res. 2016;68(3):292–8.

    Article  Google Scholar 

  42. O'Donovan A, Cohen BE, Seal KH, Bertenthal D, Margaretten M, Nishimi K, et al. Elevated risk for autoimmune disorders in iraq and afghanistan veterans with posttraumatic stress disorder. Biol Psychiatry. 2015;77(4):365–74.

    Article  PubMed  Google Scholar 

  43. Zhao SS, Holmes MV, Zheng J, Sanderson E, Carter AR. The impact of education inequality on rheumatoid arthritis risk is mediated by smoking and body mass index: Mendelian randomization study. Rheumatology (Oxford). 2022;61(5):2167–75.

    Article  Google Scholar 

  44. Kuhn A, Bonsmann G, Anders HJ, Herzer P, Tenbrock K, Schneider M. The Diagnosis and Treatment of Systemic Lupus Erythematosus. Dtsch Arztebl Int. 2015;112(25):423–32.

    PubMed  PubMed Central  Google Scholar 

  45. Kumar K, Reehal J, Stack RJ, Adebajo A, Adams J. Experiences of South Asian patients in early inflammatory arthritis clinic: a qualitative interview study. Rheumatol Adv Pract. 2019;3(2):rkz017.

    Article  PubMed  PubMed Central  Google Scholar 

  46. Kumar K, Daley E, Carruthers DM, et al. Delay in presentation to primary care physicians is the main reason why patients with rheumatoid arthritis are seen late by rheumatologists. Rheumatology. 2007;46:1438–40.

    Article  CAS  PubMed  Google Scholar 

  47. Kumar K, Gordon C, Barry R, et al. “It's like taking poison to kill poison but I have to get better”: a qualitative study of beliefs about medicines in RA and SLE patients of South Asian origin. Lupus. 2011;20:837–44.

    Article  CAS  PubMed  Google Scholar 

  48. Chambers SA, Charman SC, Rahman A, Isenberg DA. Development of additional autoimmune diseases in a multiethnic cohort of patients with systemic lupus erythematosus with reference to damage and mortality. Ann Rheum Dis. 2007;66(9):1173–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references


The views expressed here are those of the authors and not necessarily the NHS, NIHR or Department of Health and Social Care.


This work was supported by a grant from EULAR and FOREUM. JML and KR are supported by the NIHR Birmingham BRC. DTZ was supported by the MRC-Versus Arthritis Centre for Musculoskeletal Ageing Research.

Author information

Authors and Affiliations



JML, A S-O and KR conceived the study; A S-O, DZ, JAW, LB and VRC analysed the data; A S-O and DZ drafted the manuscript. JAR, KR, KN and AJ provided clinical input to the study. KK and GG reviewed and revised the manuscript. All authors have read and approved the final manuscript.

Authors’ information

Archana Sharma-Oates and Dawit T Zemedikun contributed equally to the manuscript.

Corresponding author

Correspondence to Archana Sharma-Oates.

Ethics declarations

Ethics approval and consent to participate

Data for this study were obtained from the data provider IMRD-UK to the University of Birmingham. Studies using the IMRD-UK database have had initial ethical approval from the NHS South-East Multicentre Research Ethics Committee, subject to prior independent scientific review. The Scientific Review Committee of the data provider approved the study protocol (SRC Reference Number: 18THIN064) prior to its undertaking. The permission to use data from UKB was granted under project number 31224. UKB has an ethics permit from the National Research Ethics Committee (REC reference 11 / NW / 0382).

Consent for publication

Not applicable.

Competing interests

Not applicable.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

A list of the Read codes used to identify all cases of each of the IMDs used in the study from IMRD.

Additional file 2: Table S1.

The ICD-10 codes used to identify all cases of each of the IMDs, used in the study, from UKB.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Sharma-Oates, A., Zemedikun, D.T., Kumar, K. et al. Early onset of immune-mediated diseases in minority ethnic groups in the UK. BMC Med 20, 346 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Autoimmune inflammatory diseases
  • Immune-mediated diseases
  • Ethnicity
  • South Asian
  • African-Caribbean
  • Diagnosis
  • Rheumatic diseases
  • Ageing