Skip to main content
  • Research article
  • Open access
  • Published:

Profiling Mycobacterium tuberculosis transmission and the resulting disease burden in the five highest tuberculosis burden countries



Tuberculosis (TB) control efforts are hampered by an imperfect understanding of TB epidemiology. The true age distribution of disease is unknown because a large proportion of individuals with active TB remain undetected. Understanding of transmission is limited by the asymptomatic nature of latent infection and the pathogen’s capacity for late reactivation. A better understanding of TB epidemiology is critically needed to ensure effective use of existing and future control tools.


We use an agent-based model to simulate TB epidemiology in the five highest TB burden countries—India, Indonesia, China, the Philippines and Pakistan—providing unique insights into patterns of transmission and disease. Our model replicates demographically realistic populations, explicitly capturing social contacts between individuals based on local estimates of age-specific contact in household, school and workplace settings. Time-varying programmatic parameters are incorporated to account for the local history of TB control.


We estimate that the 15–19-year-old age group is involved in more than 20% of transmission events in India, Indonesia, the Philippines and Pakistan, despite representing only 5% of the local TB incidence. According to our model, childhood TB represents around one fifth of the incident TB cases in these four countries. In China, three quarters of incident TB were estimated to occur in the ≥ 45-year-old population. The calibrated per-contact transmission risk was found to be similar in each of the five countries despite their very different TB burdens.


Adolescents and young adults are a major driver of TB in high-incidence settings. Relying only on the observed distribution of disease to understand the age profile of transmission is potentially misleading.

Peer Review reports


Tuberculosis (TB) is now the leading cause of death worldwide from a single infectious agent [1]. While effective prevention and treatment tools have been available for many decades, their impact on the global epidemic has been limited by challenges that TB control programs still face today. Among them, the difficulties in identifying diseased individuals and providing them with adequate care may be the most critical, with only 61% of cases receiving effective treatment [1]. Even more alarming is that the global case detection rate could be as low as 35% in children [2]. As well as ensuring that control policies are as effective as possible, comprehensive knowledge of the epidemic age-profile is essential for estimating burden of disease and predicting the course of the epidemic.

TB epidemiology is also clouded by the propensity of Mycobacterium tuberculosis (M.tb) to enter a latent infection state within its host (latent TB infection, LTBI), in which it may persist for many years before reactivating [3]. Source tracing is therefore difficult due to the unknown time lag between infection and activation, making transmission events even more challenging to infer than disease burden. While modelling estimates of the global prevalence of LTBI were recently published [4], better understanding pathogen transmission in the population and the resulting infection burden would enable better targeting of high-risk groups.

The slow dynamics of TB limit the feasibility of field investigations that could build epidemic knowledge and mean that historical trends for many decades into the past may have significance for the modern epidemic. For these reasons, mathematical modelling provides a valuable tool to investigate hidden features of the disease [5]. In particular, agent-based models, which explicitly simulate each individual in a population, together with their demographic characteristics, social contacts and infection history, capture important heterogeneities present in real-world populations [6, 7]. This faculty is critical when modelling TB, as M.tb transmission is subject to important heterogeneity in characteristics of the infectious host, susceptible host and environment [8]. Meanwhile, the recent availability of contact survey data has dramatically improved our understanding of social mixing [9,10,11]. In particular, estimates of age-specific contact frequency and intensity in different contexts/locations are now publicly available and provide empiric evidence of preferential mixing patterns, such as age assortativity. Agent-based models can capture specific patterns of social mixing with a high degree of fidelity. For example, they can account for contact saturation in households and other settings such as schools and workplaces where repeated contact is frequent [12]. Incorporating such contacts into a model can enable more accurate estimates of setting-specific contribution to transmission [13].

In this study, we combine data on social mixing and population demography with data on historical indicators of TB control to parametrise an agent-based model. We use the model to build a rich picture of the current profile of M.tb transmission and disease burden in the world’s five highest burden countries in 2016 according to the World Health Organization (WHO): India, Indonesia, China, the Philippines and Pakistan [1].


We developed the SNAP-TB platform (Social Network Abstraction to Profile TB Burden) to simulate M.tb transmission and the resulting burden of infection and disease. SNAP-TB is a stochastic agent-based model developed in Python that uses a household, school and workplace framework to generate realistic demographic patterns and social mixing. The population model is overlaid with a TB model that simulates infection, transmission and several existing control measures (Fig. 1). The main model principles are described in the following sections and detailed in Additional file 1, with a description of how data were incorporated in model development and calibration (Additional file 1: Figure S1, and Table S1).

Fig. 1
figure 1

Schematic illustration of the agent-based model. The upper panel represents the structure of the simulated population and the diverse types of contacts simulated (household, school, workplace, other location). The lower panel illustrates individuals’ progression through the various stages of life and infection/disease using diamonds to represent events and boxes for extended phases. Solid arrows indicate deterministic progressions that occur in all surviving individuals, while dashed arrows represent possible but not universal progressions. *Only a fraction of the individuals enters the organised workforce

Model initial conditions—replicating the demographic and epidemic configuration of year 2018—were reached by running a burn-in phase to allow demographic processes, age distributions and TB distribution to emerge naturally. The model was then run for five further years to produce outputs. Our approach to model calibration using Latin Hypercube Sampling is described in details in Section 4 of Additional file 1. This approach accounts for uncertainty around 11 important model parameters. Therefore, the results presented in this manuscript are not associated with a single parameter set but emerge from the various parameterisations that were found to produce realistic TB burden according to country-specific data.

Population model

All individuals are assigned a household at birth. Life events such as forming a couple, moving home and having babies are simulated, such that plausible household compositions emerge from the model. A Siler model is used to derive age-specific natural mortality rates [14], and back-calculated birth rates are used during burn-in to reproduce the desired modern country age distribution.

All children are assumed to attend school (commencing from 3 to 5 years old and completing by 15–21 years old), before optionally entering the workforce, with individuals explicitly assigned to specific schools and workplaces.

Social mixing

Individuals interact through social contacts that occur in four different contexts: households, schools, workplaces and other locations. A social contact is considered conceptually as either a physical contact or a two-way conversation involving three or more words (consistent with reporting of input data) [9, 15]. All individuals of the same household are assumed to contact each other every day [11, 16]. In contrast, social contacts occurring within congregate settings (schools and workplaces) and in other locations are generated stochastically at each time step. Their frequency and age assortativity pattern are derived for each country from estimates of the location- and age-specific contact matrices [15]. A detailed description of our approach to contact generation in the different settings is provided in Section 2.3 of Additional file 1.

TB model

Figure 1 illustrates the infection stages simulated. Age-specific parameters derived from empiric data are used to determine whether and when infected individuals progress to active disease [17]. Active cases may be smear-positive, smear-negative or extrapulmonary TB and will either spontaneously cure or die from their disease in the absence of treatment. The type of natural history outcome and the time at which it occurs are randomly generated based on the TB natural history characteristics observed during the pre-chemotherapy era (see Additional file 1, Section 3.1) [18].

M.tb transmission may occur when a person with active TB contacts a susceptible individual. The baseline probability that a social contact leads to transmission is calibrated to observed national TB prevalence aggregated for all ages (Additional file 1, Section 4). Empirical estimates of the age-specific TB prevalence (for all countries except India due to absence of data) were compared to model outputs for independent validation but were not used for calibration. We assume that school contacts are less likely to lead to transmission than household contacts (relative risk: RR = 0.89), as are work contacts (RR = 0.82) and other contacts (RR = 0.75). These relative risks are based on the reported proportions of high-intensity contacts by location [9, 10], combined with the assumption that low-intensity contacts are half as likely to lead to transmission as high-intensity contacts (Additional file 1, Section 3.2), with sensitivity analyses used to explore alternative assumptions. The transmission probability also depends on the characteristics of the two individuals making contact, as described in Table 1.

Table 1 Model assumptions regarding the factors affecting the risk of transmission

The time to detection of active TB is exponentially distributed, and the associated rate is calculated based on the country’s estimated case detection rate (Additional file 1, Section 3.3). Although a detection time is generated for all TB cases, detection only actually occurs if this time precedes the pre-determined time of the natural history outcome.

In our model, all detected cases are commenced on treatment between 0 and 14 days following detection. Successfully treated individuals (i.e. cured or completing treatment) are assumed to clear infection and become susceptible again. If unsuccessfully treated, patients remain active and the TB episode outcome (cure or death) and its timing remain as defined by the TB natural history that was originally generated. Time-variant parameters are used to specify Bacillus Calmette–Guérin (BCG) vaccine coverage, as well as rates of case detection and treatment success. The associated scale-up functions for BCG vaccine coverage and rates of case detection and treatment success are based on WHO data (Additional file 1: Figure S9). The parameters used to inform the model are presented in Table 2.

Table 2 Model parameters

In order to understand the role played by the past programmatic conditions in shaping the current epidemic picture, we run an additional analysis where all programmatic parameter values are assumed constant and equal to their most recent estimates.


Model calibration and validation against age-specific TB prevalence for all forms of TB

The crude probability of transmission per contact obtained from calibration was found to be very similar in each of the five countries, with median values ranging from 0.00361 in China to 0.00398 in Indonesia (Table 2 and Additional file 1: Figure S10). We validated the model by comparing the resulting age-specific prevalence estimates to those obtained from the prevalence surveys conducted in Indonesia (in 2014), China (in 2010), the Philippines (in 2016) and Pakistan (in 2011) (Fig. 2). Age-specific estimates of prevalence are also presented for India (Additional file 1: Figure S11), although comparison to data was impossible in the absence of a prevalence survey.

Fig. 2
figure 2

Validation of model outputs against prevalence survey estimates for the age-specific TB prevalence in Indonesia (2014), China (2010), the Philippines (2016) and Pakistan (2011). No data were available for the less than 15-year-old individuals from these surveys. Error bars represent the 95% confidence intervals of the survey estimates (in purple) and the 95% simulation intervals resulting from the stochastic variability of the model and the parameter uncertainty (in green)

Profile of M.tb transmission

In order to better characterise transmission, we recorded contacts’ characteristics by tracking their location (school, work, home, other) and the age of the individuals involved. The same information was recorded for each transmission event by location, along with whether the associated infection resulted in active TB (Fig. 3). According to our model, contacts occurring in locations other than home, school or workplace are a major driver of M.tb transmission in each of the five countries, with contributions ranging from 34% (28–40, 95% simulation interval) of the total number of transmission events in Pakistan to 49% (44–55) in China. Household contacts were estimated to be the predominant driver of M.tb transmission in Pakistan (40%, 35–46). The estimated proportion of active TB burden attributable to household contacts varies between 17% (4–32) in China and 44% (31–57) in Pakistan.

Fig. 3
figure 3

Contributions of the various locations to the burden of contact and transmission. Error bars represent the 95% simulation intervals

Figure 4 presents the age-specific contact and transmission patterns obtained from simulation. Contact patterns disaggregated by location are presented in Additional file 1: Figure S5. We note that our approach of allowing for household compositions to emerge naturally during simulation led to plausible age-specific contact patterns that are similar to those described in several social mixing studies [9,10,11]. The high-intensity contact zones naturally translate into high densities of M.tb transmission, except where index individuals are aged under 15 years (due to lack of infectiousness) and where contact recipients are young and therefore retain immunity from BCG vaccination. In contrast, the effect of immunity from infection was no longer observed when considering only contacts leading to active disease. This finding is due to the fact that young individuals are at higher risk of progression to active disease than adults [42, 43]. Our results highlight that the 15–19 years age category represents a critical driver of transmission in all countries except China. In India, Indonesia, the Philippines and Pakistan, we estimate that more than 20% of transmission events involve in this age category as either index or recipient (Table 3).

Fig. 4
figure 4

Age-specific pattern of social mixing and transmission

Table 3 Contributions of the 15–19-year-old individuals to the estimated total number of transmission events between 2018 and 2022

Age distribution and risk associated with the current latency reservoir

We estimated the country-level prevalence of LTBI in 2018 at 25% (14–36), 47% (35–55), 30% (18–41), 43% (34–52) and 25% (14–39) in India, Indonesia, China, the Philippines and Pakistan, respectively. These estimates are very similar to those obtained from a previous modelling work, and a comparison between the two studies is presented in Additional file 1: Figure S12 [4]. Figure 5 presents the age-specific size of the LTBI reservoir as estimated for 2018 (green spheres), as well as the risk that it represents in terms of future TB disease (purple spheres). The relative LTBI prevalence steadily increases with age in all countries, whereas the absolute LTBI burden decreases at advanced ages due to population mortality.

Fig. 5
figure 5

Age distribution of latent tuberculosis infection. Coloured discs should be interpreted as spheres (to increase the relative size of the smaller spheres), with the volume of the spheres being proportional to the following quantities: 2018 total population (grey), size of the LTBI pool in 2018 (green), and number of individuals currently infected in 2018 who will ever develop active TB (purple). The numbers surrounding each disc indicate the age categories represented. Note that LTBI prevalence is predicted to reach extremely high levels among the oldest age category, which is explained by the high historical intensity of transmission in these countries and by the fact that we do not incorporate LTBI clearance

According to our model, the age category that represents the highest risk in terms of future disease emanating from current infections is the “30–39-year-old” age category in India (29% of future TB burden), Indonesia (27%), the Philippines (26%) and Pakistan (31%), while the “40–49-year-old” age category was most prominent in China (33%). The youngest age category “0–9-year-old” includes very few infected individuals in all countries, although the per-infection risk of disease is much higher in this age category than in older populations.

Age profile of active TB

Figure 6 shows the estimated age distribution of TB cases in the five countries. In China, we estimate that TB affects the ≥ 45-year-old category much more severely than the younger age categories, accounting for 76% (73–79) of the national TB burden. In particular, the age category 55–59 is the most represented, alone contributing 13% (11–15) to the Chinese TB burden.

Fig. 6
figure 6

Age distribution of TB cases. The population age distribution (green) was captured at the starting time of analysis (year 2018). Age of TB cases at activation (red) was recorded over a period of 5 years starting from 2018. Error bars represent the 95% simulation intervals obtained for the TB age distribution

Young individuals (particularly those aged under 5 or 10–14) are severely affected with active TB in India, Indonesia, the Philippines and Pakistan. Although the Philippines and Pakistan present similar population pyramids, their TB age distributions differ noticeably. We find a prominent peak for the 40–49-year age category in Pakistan (contributing 21% of TB burden), which is not observed in the Philippines (14% for the same age category). In contrast, young adults (aged 20–24 years) constitute a considerably larger proportion of the burden in the Philippines (7%) than in Pakistan (4%). Finally, the youngest age category (0–4-year-old) was estimated to be a major contributor to the TB epidemic in the Philippines, with an estimated contribution reaching 9% (9–10). The proportion of paediatric TB (< 15 years old) among all TB cases is estimated at 17%, 15%, 2%, 22% and 18% in India, Indonesia, China, the Philippines and Pakistan, respectively.

Additional file 1: Figure S17 presents the TB age distribution obtained for the Philippines in the sensitivity analysis assuming constant historical programmatic conditions (i.e. removing time-variant programmatic parameters). We note that a substantial share of the estimated TB burden is shifted towards the youngest age categories under this scenario, making the TB age profile more similar to the population age distribution which is highly inconsistent with the 2016 prevalence survey results. In another sensitivity analysis where we assume that low-intensity contacts cannot result in transmission, households become the predominant context of M.tb transmission in all countries except China (Additional file 1: Figure S18). We observe that the calibrated crude probability of transmission per contact remains similar across each of the five countries, regardless of the assumption made around the relative risk of transmission through low-intensity contacts as compared to high-intensity contacts (Additional file 1: Figure S26).


We present a detailed representation of M.tb transmission and the resulting burden of infection and TB disease in the five highest TB burden countries. Using an agent-based model that combines household structure, social mixing matrices, age-specific infectiousness and reactivation rates, and the history of national TB control, we provide insights into major TB epidemic characteristics that would be otherwise unattainable. These include the age profile of M.tb transmission, the age-specific LTBI prevalence and associated risk of future disease, the age distribution of incident TB cases, and the contributions of different contact types to the burden of transmission and disease. Furthermore, we demonstrate that the demographic and programmatic model inputs alone are sufficient to explain the considerable heterogeneity in burden observed between countries, with calibrated per-contact transmission rates being very similar.

We show that the 15–19-year-old age category is a major driver of M.tb transmission in all countries except China. This observation, which is due to the high frequency of contacts and waning of immunity conferred by BCG at this age [9, 24], contrasts with the relatively low estimated burden of active disease observed in this age group. This finding highlights the marked difference between the age profile of M.tb transmission and that of TB burden and implies that relying only on the observed burden of active disease to understand the age profile of a TB epidemic would provide an incomplete and misleading picture. The relatively low TB burden estimated in the 15–19-year-old age group may explain why adolescents and young adults constitute a neglected group in global TB control and are rarely considered as a target population for preventive measures [44]. However, our model suggests that preventing infection (e.g. by raising TB awareness) and reactivation (through prophylaxis treatment) within this group could potentially yield significant burden reductions in the older age categories. Identifying individuals that should be targeted with TB prevention is critical to guide control policies, as world political leaders have recently declared their commitment to provide 30 million people with preventive treatment by 2022 [45].

Another age-specific transmission peak was identified between parents and their children in all settings, which is especially concerning for children under five, as they are more likely to progress to active disease once infected [42]. This observation underscores the critical importance of implementing rapid screening and control measures for the youngest contacts of identified adult pulmonary TB cases. We estimate that childhood TB (< 15 years old) contributes to around one fifth of the total TB incidence in India, the Philippines and Pakistan, as a consequence of the countries’ young populations and their high contact intensities. This is in line with previous estimates obtained in other high-incidence settings [21, 22]. Incorporating age-specific epidemiological characteristics such as infectiousness, risk of activation and waning BCG immunity allowed us to further refine the distribution of TB cases among < 15-year-olds using 5-year age brackets. This insight is particularly valuable because it is difficult to directly assess in real-world settings due to the challenges encountered with the diagnosis and surveillance of paediatric TB [22].

The TB age profile in China is dramatically different to that reported for the other four countries modelled in this study. China experiences TB principally in the oldest part of the population, with three quarters of the TB burden attributed to the ≥ 45-year-old category, although population ageing is not the only explanation for this phenomenon. The dramatic improvement in case detection since 2000 combined with high treatment success rates (over 90%) maintained over the last three decades has resulted in a dramatic fall in M.tb transmission over recent years, such that younger cohorts have now been much less exposed to the pathogen than preceding generations. This suggests that the current burden of active TB in China results primarily from reactivation of old infections that were acquired when transmission was still intense, consistent with previous work [46]. The importance of the programmatic history in shaping the current age profile of TB was further highlighted by the discrepancies observed in our sensitivity analysis performed without time-variant parameters and ignoring past TB control.

We provide estimates of the age-specific size of the LTBI reservoir, along with the risk that it represents in terms of future disease. Knowing who is latently infected provides valuable knowledge for policy-makers when designing contextualised preventive strategies. Our country-specific predictions could be used to estimate the yield of mass LTBI screening/treatment programs targeted at specific age categories, both in terms of the number of current infections treated and future disease episodes prevented. Although broad recommendations for the management of LTBI have been adopted [44], little is known about how best to adapt these to local programmatic and epidemiological contexts.

Social interactions occurring outside of homes, schools and workplaces were identified as the main driver of transmission in India, Indonesia, China and the Philippines. This finding implies that control measures focusing on close and easy-to-identify contacts of diagnosed TB cases may have a limited impact at the population level in these settings. This is consistent with other modelling works which suggest a limited role of household transmission due to contact saturation [12, 13]. In contrast, simulated M.tb transmission in Pakistan occurs primarily in homes due to Pakistan’s large average household size (6.8 persons). Therefore, interventions such as providing household contacts with screening and prophylaxis treatment are likely to be more efficient in Pakistan. We found that the contribution to the TB burden from household contacts and those occurring in “other locations” was sensitive to our assumptions about the relative risk of transmission through low-intensity contacts as compared to high-intensity contacts. However, it is important to note that the two scenarios considered in our sensitivity analyses are extreme and likely unrealistic, as they represented either a null risk of transmission for low-intensity contacts or a risk that is equal to that of high-intensity contacts.

The transmission probability, calibrated separately to the different TB burdens, was remarkably similar in the five countries we studied, providing confidence about model robustness. Moreover, it indicates that the socio-demographic characteristics included, along with the simulated time-variant programmatic changes, are able to account for the bulk of the heterogeneity in TB burden. This finding also suggests that the per-contact risk of transmission could be similar in all settings after adjustment for age, household composition and other factors relevant to infectiousness and susceptibility. The validity of our model was further reinforced by the closely matching estimates obtained when comparing our simulated age-specific prevalence to the equivalent estimates from the prevalence surveys conducted in the Philippines, Indonesia, China and Pakistan. Furthermore, our estimates of LTBI prevalence were remarkably close to those produced in a previous modelling study [4], although our 95% simulation intervals are much wider than those obtained in the previous work.

A limitation of this study is that the social mixing matrices that we incorporated into the model were not directly obtained from contact surveys. Instead, we used country-specific estimates generated by combining survey data from other countries with an extrapolation model [15]. Our estimates will therefore be refined further as local mixing data such as those provided by the POLYMOD study became available for a greater range of contexts [9]. Another potential limitation is that we opted for model parsimony in relation to factors including gender, comorbidities and sub-national geography, which are the subjects of current work. Due to high computational expense, we were unable to employ classic approaches such as Monte-Carlo Markov Chain methods to perform uncertainty analysis. However, the parameter values used in the model are based on empirical evidence and official reports, which has dramatically reduced the need to make assumptions. Furthermore, multi-dimensional uncertainty was included around 11 input parameters in order to explore various model configurations and selected parameters considered the most likely to affect model outputs were varied in sensitivity analyses, which did not jeopardise our main findings.


We show that it is possible to create new and valuable insights into the profile of local TB epidemics by combining agent-based simulation with social mixing data and TB control history. We demonstrate that social contacts involving 15–19-year-old individuals are a critical driver of TB which is not evident from the age distribution of TB cases. Our model also highlights the high burden of childhood TB in high-incidence settings and underlines the critical role played by parents-to-children transmission.

Availability of data and materials

All data generated or analysed during this study are included in this published article and its supplementary information file.


  1. WHO. Global Tuberculosis Report 2018. Geneva, Switzerland: World Health Organization; 2018.

  2. Dodd PJ, Gardiner E, Coghlan R, Seddon JA. Burden of childhood tuberculosis in 22 high-burden countries: a mathematical modelling study. Lancet Glob Health. 2014;2(8):e453–9.

    Article  Google Scholar 

  3. Lillebaek T, Dirksen A, Baess I, Strunge B, Thomsen VO, Andersen AB. Molecular evidence of endogenous reactivation of Mycobacterium tuberculosis after 33 years of latent infection. J Infect Dis. 2002;185(3):401–4.

    Article  CAS  Google Scholar 

  4. Houben RM, Dodd PJ. The global burden of latent tuberculosis infection: a re-estimation using mathematical modelling. PLoS Med. 2016;13(10):e1002152.

    Article  Google Scholar 

  5. Gomes MG, Franco AO, Gomes MC, Medley GF. The reinfection threshold promotes variability in tuberculosis epidemiology and vaccine efficacy. Proc Biol Sci. 2004;271:617–23.

    Article  Google Scholar 

  6. Ferguson NM, Cummings DA, Cauchemez S, Fraser C, Riley S, Meeyai A, et al. Strategies for containing an emerging influenza pandemic in Southeast Asia. Nature. 2005;437(7056):209–14.

    Article  CAS  Google Scholar 

  7. Geard N, Glass K, McCaw JM, McBryde ES, Korb KB, Keeling MJ, et al. The effects of demographic change on disease transmission and vaccine impact in a household structured population. Epidemics. 2015;13:56–64.

    Article  Google Scholar 

  8. Trauer JM, Dodd PJ, Gomes MGM, Gomez GB, Houben RM, McBryde ES, et al. The importance of heterogeneity to the epidemiology of tuberculosis. Clin Infect Dis. 2019;69(1):159–166.

  9. Mossong J, Hens N, Jit M, Beutels P, Auranen K, Mikolajczyk R, et al. Social contacts and mixing patterns relevant to the spread of infectious diseases. PLoS Med. 2008;5(3):e74.

    Article  Google Scholar 

  10. Horby P, Pham QT, Hens N, Nguyen TT, Le QM, Dang DT, et al. Social contact patterns in Vietnam and implications for the control of infectious diseases. PLoS One. 2011;6(2):e16965.

    Article  CAS  Google Scholar 

  11. Grijalva CG, Goeyvaerts N, Verastegui H, Edwards KM, Gil AI, Lanata CF, et al. A household-based study of contact networks relevant for the spread of infectious diseases in the highlands of Peru. PLoS One. 2015;10(3):e0118457.

    Article  Google Scholar 

  12. Eames KT. Modelling disease spread through random and regular contacts in clustered populations. Theor Popul Biol. 2008;73(1):104–11.

    Article  CAS  Google Scholar 

  13. McCreesh N, White RG. An explanation for the low proportion of tuberculosis that results from transmission between household and known social contacts. Sci Rep. 2018;8(1):5382.

    Article  Google Scholar 

  14. Siler W. A competing-risk model for animal mortality. Ecology. 1979;60(4):750–7.

    Article  Google Scholar 

  15. Prem K, Cook AR, Jit M. Projecting social contact matrices in 152 countries using contact surveys and demographic data. PLoS Comput Biol. 2017;13(9):e1005697.

    Article  Google Scholar 

  16. Goeyvaerts N, Santermans E, Potter G, Torneri A, Van Kerckhove K, Willem L, Aerts M, Beutels P and Hens N. Household members do not contact each other at random: implications for infectious disease modelling. Proc R Soc B. 2018;285(1893).

  17. Ragonnet R, Trauer JM, Scott N, Meehan MT, Denholm JT, McBryde ES. Optimally capturing latency dynamics in models of tuberculosis transmission. Epidemics. 2017;21:39–47.

    Article  Google Scholar 

  18. Tiemersma EW, van der Werf MJ, Borgdorff MW, Williams BG, Nagelkerke NJ. Natural history of tuberculosis: duration and fatality of untreated pulmonary tuberculosis in HIV negative patients: a systematic review. PLoS One. 2011;6(4):e17601.

    Article  CAS  Google Scholar 

  19. Behr MA, Warren SA, Salamon H, Hopewell PC, Ponce de Leon A, Daley CL, et al. Transmission of Mycobacterium tuberculosis from patients smear-negative for acid-fast bacilli. Lancet. 1999;353(9151):444–9.

    Article  CAS  Google Scholar 

  20. Tostmann A, Kik SV, Kalisvaart NA, Sebek MM, Verver S, Boeree MJ, et al. Tuberculosis transmission by patients with smear-negative pulmonary tuberculosis in a large cohort in the Netherlands. Clin Infect Dis. 2008;47(9):1135–42.

    Article  Google Scholar 

  21. Donald PR. Childhood tuberculosis: the hidden epidemic. Int J Tuberc Lung Dis. 2004;8(5):627–9.

    PubMed  Google Scholar 

  22. Marais BJ, Gie RP, Schaaf HS, Beyers N, Donald PR, Starke JR. Childhood pulmonary tuberculosis: old wisdom and new challenges. Am J Respir Crit Care Med. 2006;173(10):1078–90.

    Article  Google Scholar 

  23. Nguipdop-Djomo P, Heldal E, Rodrigues LC, Abubakar I, Mangtani P. Duration of BCG protection against tuberculosis and change in effectiveness with time since vaccination in Norway: a retrospective population-based cohort study. Lancet Infect Dis. 2016;16(2):219–26.

    Article  Google Scholar 

  24. Abubakar I, Pimpin L, Ariti C, Beynon R, Mangtani P, Sterne JA, et al. Systematic review and meta-analysis of the current evidence on the duration of protection by bacillus Calmette-Guerin vaccination against tuberculosis. Health Technol Assess. 2013;17(37):1–372 v-vi.

    Article  CAS  Google Scholar 

  25. Colditz GA, Brewer TF, Berkey CS, Wilson ME, Burdick E, Fineberg HV, et al. Efficacy of BCG vaccine in the prevention of tuberculosis. Meta-analysis of the published literature. JAMA. 1994;271(9):698–702.

    Article  CAS  Google Scholar 

  26. Andrews JR, Noubary F, Walensky RP, Cerda R, Losina E, Horsburgh CR. Risk of progression to active tuberculosis following reinfection with Mycobacterium tuberculosis. Clin Infect Dis. 2012;54(6):784–91.

    Article  Google Scholar 

  27. The United Nations - Population Division. Household size and composition. 2017. Available from: [cited 27/04/2018].

    Google Scholar 

  28. Republic of The Philippines Department of Education. Number of schools. 2018. Available from:

    Google Scholar 

  29. Shaweno D, Karmakar M, Alene KA, Ragonnet R, Clements AC, Trauer JM, et al. Methods used in the spatial analysis of tuberculosis epidemiology: a systematic review. BMC Med. 2018;16(1):193.

    Article  Google Scholar 

  30. China Ministry of Education. Number of schools. 2018. Available from:

    Google Scholar 

  31. The World Bank. World Bank and education in Indonesia. 2014. Available from:

    Google Scholar 

  32. Pakistan Education Statistics. Number of schools. 2016.

    Google Scholar 

  33. The World Bank. Labor force participation rate. 2018. Available from: [cited 27/04/2018].

    Google Scholar 

  34. Zumla A, Raviglione M, Hafner R, von Reyn CF. Tuberculosis. N Engl J Med. 2013;368(8):745–55.

    Article  CAS  Google Scholar 

  35. WHO. Global tuberculosis report 2017. Geneva: World Health Organization; 2017.

    Google Scholar 

  36. Official country estimates of immunization coverage for the year 2016. 2018. Available from: [cited 30/04/2018].

  37. WHO. Tuberculosis country profiles. Geneva; 2017. Available from:

  38. Belkina TV, Khojiev DS, Tillyashaykhov MN, Tigay ZN, Kudenov MU, Tebbens JD, et al. Delay in the diagnosis and treatment of pulmonary tuberculosis in Uzbekistan: a cross-sectional study. BMC Infect Dis. 2014;14:624.

    Article  Google Scholar 

  39. Sreeramareddy CT, Qin ZZ, Satyanarayana S, Subbaraman R, Pai M. Delays in diagnosis and treatment of pulmonary tuberculosis in India: a systematic review. Int J Tuberc Lung Dis. 2014;18(3):255–66.

    Article  Google Scholar 

  40. Asefa A, Teshome W. Total delay in treatment among smear positive pulmonary tuberculosis patients in five primary health centers, southern Ethiopia: a cross sectional study. PLoS One. 2014;9(7):e102884.

    Article  Google Scholar 

  41. Jurcev-Savicevic A, Mulic R, Kozul K, Ban B, Valic J, Bacun-Ivcek L, et al. Health system delay in pulmonary tuberculosis treatment in a country with an intermediate burden of tuberculosis: a cross-sectional study. BMC Public Health. 2013;13:250.

    Article  Google Scholar 

  42. Trauer JM, Moyo N, Tay EL, Dale K, Ragonnet R, McBryde ES, et al. Risk of active tuberculosis in the five years following infection . . . 15%? Chest. 2016;149(2):516–25.

    Article  Google Scholar 

  43. Sloot R, Schim van der Loeff MF, Kouw PM, Borgdorff MW. Risk of tuberculosis after recent exposure. A 10-year follow-up study of contacts in Amsterdam. Am J Respir Crit Care Med. 2014;190(9):1044–52.

    Article  Google Scholar 

  44. WHO. Latent tuberculosis infection: updated and consolidated guidelines for programmatic management. Geneva, Switzerland: World Health Organization; 2018.

  45. Union T. UN General Assembly adopts first-ever political agenda for ending tuberculosis epidemic. 2018. Available from:

    Google Scholar 

  46. Huynh GH, Klein DJ, Chin DP, Wagner BG, Eckhoff PA, Liu R, et al. Tuberculosis control strategies to reach the 2035 global targets in China: the role of changing demographics and reactivation disease. BMC Med. 2015;13:88.

    Article  Google Scholar 

Download references


Only the persons listed as authors contributed to this study.


RR was funded by the Australian Government through a Research Training Program Scholarship.

Author information

Authors and Affiliations



RR conceived the study, accessed data sources, developed the mathematical model, and did all computer programming. JMT, NS and ESM provided inputs on all aspects of the model throughout the process. NG provided input on the demographic component of the model. All authors edited and approved the final version of the manuscript.

Corresponding author

Correspondence to Romain Ragonnet.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1.

Technical appendix providing detailed descriptions of the methods and results of sensitivity analyses.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ragonnet, R., Trauer, J.M., Geard, N. et al. Profiling Mycobacterium tuberculosis transmission and the resulting disease burden in the five highest tuberculosis burden countries. BMC Med 17, 208 (2019).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: