Individual social contact data and population mobility data as early markers of SARS-CoV-2 transmission dynamics during the first wave in Germany—an analysis based on the COVIMOD study
BMC Medicine volume 19, Article number: 271 (2021)
The effect of contact reduction measures on infectious disease transmission can only be assessed indirectly and with considerable delay. However, individual social contact data and population mobility data can offer near real-time proxy information. The aim of this study is to compare social contact data and population mobility data with respect to their ability to reflect transmission dynamics during the first wave of the SARS-CoV-2 pandemic in Germany.
We quantified the change in social contact patterns derived from self-reported contact survey data collected by the German COVIMOD study from 04/2020 to 06/2020 (compared to the pre-pandemic period from previous studies) and estimated the percentage mean reduction over time. We compared these results as well as the percentage mean reduction in population mobility data (corrected for pre-pandemic mobility) with and without the introduction of scaling factors and specific weights for different types of contacts and mobility to the relative reduction in transmission dynamics measured by changes in R values provided by the German Public Health Institute.
We observed the largest reduction in social contacts (90%, compared to pre-pandemic data) in late April corresponding to the strictest contact reduction measures. Thereafter, the reduction in contacts dropped continuously to a minimum of 73% in late June. Relative reduction of infection dynamics derived from contact survey data underestimated the one based on reported R values in the time of strictest contact reduction measures but reflected it well thereafter. Relative reduction of infection dynamics derived from mobility data overestimated the one based on reported R values considerably throughout the study. After the introduction of a scaling factor, specific weights for different types of contacts and mobility reduced the mean absolute percentage error considerably; in all analyses, estimates based on contact data reflected measured R values better than those based on mobility.
Contact survey data reflected infection dynamics better than population mobility data, indicating that both data sources cover different dimensions of infection dynamics. The use of contact type-specific weights reduced the mean absolute percentage errors to less than 1%. Measuring the changes in mobility alone is not sufficient for understanding the changes in transmission dynamics triggered by public health measures.
The role of social contacts in the spread of respiratory infections has been discussed extensively in the year 2020 due to the global outbreak of severe acute respiratory syndrome coronavirus type 2 (SARS-CoV-2) [1, 2]. As of March 2021, over 100 million confirmed cases and over 2.5 million deaths have been recorded worldwide . SARS-CoV-2 is primarily transmitted via droplets and aerosols, so person-to-person contacts are a strong determinant of transmission dynamics [2,3,4]. Non-pharmaceutical interventions (NPIs) focusing on the reduction of person-to-person contacts are one of the cornerstones of the pandemic response. In the middle of March 2020, Germany mandated school and kindergarten closures, postponed academic semesters, prohibited visiting of nursing homes and restricted the number of people allowed at public and private gatherings in an attempt to protect the vulnerable groups . In the following weeks, contact reduction measures were implemented on a population level by regulating the maximum number of close social contacts outside one’s household and by closing non-essential shops as well as places for leisure activities . After a considerable reduction in reported case numbers, federal governments decided to ease these restrictions gradually starting at the beginning of May 2020.
Social contact patterns are known to be a critical factor for the transmission dynamics of respiratory infections [4, 6,7,8,9,10]. However, empirical social contact data have been scarce before the emergence of SARS-CoV-2 [11,12,13]. One exception is the POLYMOD study, a large-scale survey that described social mixing patterns in eight European countries . In 2005/2006, POLYMOD measured contacts of more than 7000 participants across eight European countries . Contact patterns observed in POLYMOD have been widely used to parametrize various mathematical models of infectious disease dynamics [3, 4, 12, 14].
During the SARS-CoV-2 pandemic, contact surveys were initiated in several countries to understand the effect of contact precaution measures on social contact patterns [3, 4, 10, 15,16,17,18,19]. While contact surveys offer a direct approach to social contact patterns, they are time- and cost-intensive and need to be initiated actively. Mobile phone-based mobility data offer a complementary approach to infer changes in contact patterns in a population. Google and Apple granted free access to anonymized mobility data in a global attempt to provide insights into the change of mobility during the pandemic given different physical distancing policies [20, 21]. Several SARS-CoV-2 modelling studies assumed that aggregated mobility data can be used as a proxy for the actual number and intensity of contacts of individuals in a defined population, although mobility data measure only certain dimensions of contact behaviour. In this article, we present survey-based social contact data for the first wave of the pandemic in Germany and assess their ability to reflect transmission dynamics 10 days later (measured by reported reproduction number (R estimates)) when compared to open source population mobility data from Google and Apple [20,21,22].
Pandemic contact survey—COVIMOD
The contact survey COVIMOD was initiated in April 2020 based on participants of the online panel i-say.com. To ensure the samples’ broad representativeness of the German population, participants were recruited by sending email invitations to existing members of the panel based on age, sex and regional quotas. To gain information on children’s social contacts, a defined subgroup of adult participants with under-aged children (< 18 years of age) living in their household were invited to provide information as a proxy for their children. This approach, however, resulted in the sample being no longer representative of the German population as we under-sampled the middle-aged participants who instead filled out the questionnaire for their children. The first COVIMOD survey wave was launched on 30/04/2020 corresponding to the time of the strictest contact reduction measures in Germany. Survey waves 2 to 4 were launched during a time of a gradual easing of the contact reduction measures in May and June 2020. For wave 1, a sample of 1500 participants was recruited with an expected response rate of 85% for the next survey waves. Before the launch of survey wave 4, the sample was increased by 1000 additional participants.
The COVIMOD questionnaire is based on the questionnaire of the CoMix study and includes questions on demographics, current behaviours, attitudes towards SARS-CoV-2 and the social contacts of participants . Participants were asked to provide each social contact between 5 am the preceding day and 5 am the day of the survey, the age and sex of the contact, the duration they spend with each contact, the setting where the contact occurred and if the contact was a household member or not. The questionnaire can be found in Additional file 1.
We defined a contact in COVIMOD in line with the POLYMOD study’s definition as “people who you met in person and with whom you exchanged at least a few words, or with whom you had physical contact” . During survey waves 1 and 2, participants were asked to provide each contact separately. Instead of providing each contact one by one, some participants included a group of contacts as one contact (e.g. “customers”). For these groups, we assumed a specific number of social contacts (Additional file 2). From survey wave 3 onwards, participants were offered the opportunity to provide a number of additional contacts (group contacts) they were not able to list individually in case they had too many contacts.
As participants were offered to enter these additional contacts separately, we used different analysis approaches to work with these contacts (sensitivity analyses). The main scenario includes all reported contacts plus group contacts weighted for the German population for COVIMOD and POLYMOD. Unweighted results and those without group contacts can be found in Additional file 3.
Pre-pandemic contact survey—POLYMOD
The European contact survey POLYMOD was used as a baseline pre-pandemic comparison. In Germany, POLYMOD was conducted paper-based with the help of a market research company in 2005/2006. Further details about POLYMOD can be found elsewhere . As in COVIMOD, participants in POLYMOD were also allowed to enter the number of additional contacts (group contacts) they had if participants had too many contacts to report them separately.
We obtained publicly available aggregated mobility data from the Google COVID-19 Community Mobility Reports and from the COVID-19 Apple Mobility trends for the times corresponding to the COVIMOD survey waves [20, 21].
Google COVID-19 Community Mobility Reports provide the percentage change in mobility from February 2020 onwards compared to the median of the corresponding weekday between 03/01/2020 and 06/02/2020. Google COVID-19 Community Mobility Reports use aggregated information about true individual movement histories to provide location-specific changes in mobility over time. Data are stratified by the destination of the movement, i.e. retail and recreation, grocery and pharmacy, transit stations, workplace, residential and parks. COVID-19 Apple Mobility trends provide information about the relative volume of requests for directions for all weeks in 2020 compared to a base volume on 13/01/2020.
Reproduction number estimates by the German Public Health Institute
R values used in our analysis as the “reference standard” for infection dynamics were obtained from the German Public Health Institute (Robert Koch Institute (RKI)) [22, 23]. The method applied by the RKI to obtain current R values is based on the reported numbers of individuals notified for being newly infected with SARS-CoV-2 and includes a nowcasting approach taking into account the delay in diagnosis, reporting and data delivery. If possible, incident cases are attributed to the day of first symptoms (an information available for the majority of cases in the German notification system). If this information is not available, it is imputed taking into account measured delays from the day of the first symptom to the notification date, age of the case and day and week of notification. Based on this nowcasting, RKI estimates the time-dependent reproduction number . The 4-day reproduction number calculated by the RKI provides information on the transmission dynamics 8 to 13 days prior . The R values are continuously corrected retrospectively for delayed notifications. We used R values provided by the RKI for 10 days after the timing of our survey waves as a reference for the comparison of infection dynamics. Since we extracted R values more than 1 year after the day they were calculated for, all delayed notifications were already accounted for. The R values based on case numbers as reported by RKI reflect both changes in transmission dynamics due to contact reduction measures as well as due to developing immunity in the population, while contact survey and mobility data cannot take into account population immunity. For this analysis, we assumed that SARS-CoV-2 immunity in the population is negligible for our analyses as this study only includes the first wave of the SARS-CoV-2 pandemic in Germany, and seroprevalence estimates for this period are below 1% in representative studies .
Data management and statistical analyses
As the COVIMOD sample is not fully representative of the German population, we used data from the 2011 census to apply survey weights based on the participants’ age, sex, household size and region of residence . The region of residence was not available for POLYMOD, so the POLYMOD data were only weighted according to the participants’ age, sex and household size using the R package “survey” . As the COVIMOD data collection was not always started on the same day of the week and the duration of the survey waves did vary slightly, we also weighted both COVIMOD and POLYMOD for weekdays/weekends.
We calculated the mean number of social contacts per participant per day as well as the 95% confidence interval of the bootstrapped mean of 1000 samples. We stratified social contacts by age group, sex, household size and the day of the week. Additionally, we assessed setting-specific contacts, i.e. home, childcare/school/university, work, public transport and others; childcare/school/university contacts were assessed in the subgroup of participants who reported to attend childcare, school or university, and work contacts were assessed in the subgroup of participants who worked full- or part-time. We calculated social contact matrices for the age-specific mean number of direct social contacts using the “socialmixr” package in R . To obtain the final contact matrices, the age-specific mean number of daily contacts were adjusted, so that the total number of contacts of one group with another was the same as vice versa . For the calculation of the contact matrices, participants who reported more than 100 group contacts were excluded from the analysis (COVIMOD: wave 3, 6 participants; wave 4, 13 participants; POLYMOD, 10 participants).
To assess how changes in infection dynamics are reflected by contact survey data, we applied two different approaches. First, we performed a simple analysis for which we calculated the mean relative reduction in contacts for each COVIMOD wave when compared to pre-pandemic data. For this, we translated the number of the mean contacts and the corresponding 95% confidence interval values into a mean relative reduction from baseline, i.e. in this case, the number of mean contacts before the SARS-CoV-2 pandemic as estimated in the POLYMOD study.
Second, we performed a more complex analysis by using additional information from the contact survey for calculating the next-generation matrix. We assumed that the next-generation matrix for SARS-CoV-2 is a function of the age-specific effective contact rate, given by the number of age-specific contacts multiplied by the probability of transmission per contact, and the duration of infectiousness . Hence, the basic reproduction number (R0) is proportional to the dominant eigenvalue of the contact matrix . To be able to calculate R as the result of a relative reduction in R0, we assumed that the social contact patterns before the implementation of the contact reduction measures were similar to the POLYMOD contact patterns and that the duration of infectiousness and the per-contact transmission probability remained constant. Additionally, we assumed that the transmission probability did not depend on age. Under these assumptions, the relative reduction of R compared to R0 is equivalent to the reduction in the contact matrices’ dominant eigenvalue allowing us to estimate the reproduction number corresponding to contacts recorded in COVIMOD. We assumed R0 during the first wave in Germany to follow a normal distribution with a mean of 2.6 and a standard deviation of 0.54 . We drew 10,000 bootstrap samples from POLYMOD and COVIMOD to assess uncertainty.
Similar to the first approach, we then translated the R estimates from the COVIMOD study into a mean relative reduction from baseline, i.e. in this case, the basic reproduction number (assumed as R0 = 2.6).
We used mobility data collected for the same time intervals as the COVIMOD waves’ timings and compared it to the pre-pandemic data available from the respective data sources. In addition to assessing the distinct movement types provided by Google, we also composed an indicator for overall mobility by averaging across all the movement types separately for both the Google mobility data and the Apple mobility data (with the exception of movements to parks as this is expected to vary considerably during seasons).
We calculated the mean relative change compared to pre-pandemic data within the time intervals corresponding to the COVIMOD waves as well as the 95% confidence interval of the bootstrapped mean of 1000 samples for Google and Apple mobility. In line with the approach we applied for COVIMOD and POLYMOD, we weighted the population mobility data for weekdays/weekends.
RKI reproduction number estimates
We calculated the mean R estimates for the corresponding time intervals 10 days after the COVIMOD waves as well as 95% confidence interval of the bootstrapped mean of 1000 samples based on the daily R estimates provided by the RKI, the German Public Health Institute. We then translated the mean and 95% confidence interval value into a relative reduction from baseline, i.e. in this case, the basic reproduction number (assumed as R0 = 2.6 during the first wave in Germany), to provide a reference standard for infection dynamics against which the changes in social contact data and population mobility data could be compared.
Weights by contact type and calibration of scaling factors
As the probability that a contact leads to a transmission varies according to the setting, we performed additional analyses using two different concepts to take this into account. First, we assigned different but specific weights to home contacts/home mobility and non-home contacts/non-home mobility (i.e. all other contact settings combined) based on setting-specific secondary attack rates (SAR) from a systematic review by Thompson et al. . Based on Thompson et al., the household SAR was estimated to be 21.1 and the SAR in a healthcare setting, at the workplace and with casual close contacts to be 3.6%, 1.9% and 1.2%, respectively. We used normalised weights based on household SAR and the average of the healthcare, workplace and casual close contacts (SAR = 2.23%) and applied the household weight to the home contacts/home mobility and the non-household weight to the non-home contacts/non-home mobility. We then allowed for an additional scaling factor per contact survey approach, i.e. simple approach—mean relative reduction in contacts, complex approach—contact data with next-generation matrix, google mobility data; the same scaling factor was used within each approach for all waves as well as for all types of contacts in the contact survey approaches and all types of mobility, in the mobility approach. We used this scaling approach with the aim to obtain the minimum residual sum of squares across the four survey waves when compared to our reference standard, i.e. relative reductions estimated based on R values reported by the RKI. For a better understanding of the effect of contact/mobility-type weights, we also performed an analysis in which we fitted the scaling factor with the same weight for all types of contacts and mobility. In the second concept, we did not apply pre-defined weights for home/non-home contacts and for home/non-home mobility but fitted them from the data by allowing independent scaling factors for home contacts and home mobility and non-home contacts and non-home mobility per approach, i.e. simple approach—mean relative reduction in contacts, complex approach—contact data with next-generation matrix, google mobility data. By doing so, we estimated the relative weights for both contact/mobility types based on the data collected for this study and did not take into account external information for transmission probabilities in different settings. The optim function in R was used for the fitting/scaling. Apple mobility data could not be used for these analyses as there is no differentiation in home/non-home mobility available.
Comparison of the results of the different approaches with the reference standard
For all analyses, we calculated the mean absolute percentage error of the estimates obtained by the approaches for the COVIMOD contact data as well as for the Google and Apple mobility data when compared to the reference standard of relative changes in infection dynamics based on R estimates from RKI. We did this in the base case concept without scaling factor and contact type-specific weighting, as well in all three concepts with scaling factors. Moreover, we applied repeated measures ANOVA to assess the differences between error rates provided by the different data sources.
Participant characteristics of POLYMOD and COVIMOD
During POLYMOD, 1341 participants were surveyed in Germany; they recorded a total of 27,154 contacts. In the first COVIMOD wave, we surveyed 1560 participants who recorded a total of 3256 social contacts; this changed to 1356 participants with a total of 4852 contacts in the second survey wave, 1081 participants with a total of 6344 in the third wave and 1890 participants with a total of 13,471 contacts in the fourth wave.
The youngest participants in all COVIMOD waves were younger than 1 (the parents were surveyed as a proxy), and the oldest was 91 years of age. Between 47% and 50% of all COVIMOD participants were female (Table 1). In POLYMOD and all COVIMOD waves, the median household size of the participants was 3 (POLYMOD IQR 2–4, COVIMOD wave 1 IQR 2–4, wave 2 IQR 2–3, wave 3 IQR 2–3, wave 4 IQR 1–3). In COVIMOD survey waves 1, 2 and 3, most participants reported their social contacts on a Thursday, whereas in wave 4, most contacts were reported on a Monday; less than a quarter of participants reported the contacts during the weekend (Table 1).
A comparison of the characteristics of the German population and the POLYMOD and COVIMOD participants can be found in Additional file 3, Table 1. Participant characteristics after weighting can be found in Additional file 3, Table 1.1a. The analyses hereafter are based on the weighted data including group contacts.
Number of social contacts
The mean number of contacts measured per participant during all COVIMOD waves (wave 1, 2.0 contacts (SD 1.9); wave 2, 3.3 contacts (SD 4.7); wave 3, 6.2 contacts (SD 18.4); wave 4, 6.9 contacts (SD 326.3)) was considerably lower in comparison with the 18.9 contacts (SD 24.6) measured in POLYMOD in the pre-pandemic period (Fig. 1C; Additional file 3 Table 1.2a). The reduction in the number of overall contacts between POLYMOD and COVIMOD was consistent across age, sex, household size and weekday (Fig. 1; Additional file 3 Table 1.2a).
While the mean number of home contacts was stable across all COVIMOD waves (and just a little bit lower than in POLYMOD, Fig. 1, Additional file 3 Table 1.2b), contacts at work and in educational settings were dramatically reduced during the first COVIMOD wave. Contacts at work increased gradually thereafter but remained much lower than in POLYMOD even for survey wave 4; educational contacts started to increase only at survey wave 4 as schools were closed before (Fig. 1, Additional file 3 Table 1.2b). Moreover, the distribution of contacts observed changed considerably over the different COVIMOD waves. While the maximum number of contacts reported overall and in specific settings was clearly reduced in the first COVIMOD wave, it approximated the one reported in POLYMOD already in waves 2 and 3 and reached it in wave 4 (although the mean and median contact numbers were still clearly reduced). The number of contacts in different settings for the other analyses can be found in Additional file 3 Tables 2.2b, 3.2b and 4.2b.
POLYMOD and COVIMOD participants in all age groups shared the majority of their contacts with individuals of similar age, demonstrating the expected age-assortative pattern (Additional file 3 Figure 1.5; Additional file 5). Contact matrices derived from the first two COVIMOD waves were dominated by contacts at home, revealing mainly contacts with life partners and children. This changed slowly through survey waves 3 and 4 due to the gradual increase in work and leisure time (“other”) contacts, which resulted in a broader distribution of the age of potential contact persons (Additional file 3 Figure 1.5).
Representation of transmission dynamics by contact survey and mobility data
In the base case approach without scaling factors and contact type-specific weighting, the mean R estimated based on the next-generation matrices of COVIMOD data was smaller than 1 in all COVIMOD waves (representing a mean relative reduction in contacts of at least 75%); we observed the highest mean relative reduction with 91% at the end of April (survey wave 1), which corresponds to the time of the strictest contact reduction measures. Subsequently, the mean relative reduction decreased with time as the contact reduction measures were loosened (wave 2, 87%; wave 3, 80%; wave 4, 74%; Figs. 2 and 3; Additional file 3 Table 1.3b).
A very similar pattern both in the lowering of the mean relative reduction and in the level of the relative reduction was seen with the simple approach based only on the reduction of the number of contacts itself. We observed a mean relative reduction between 89% at the end of April and 63% in the middle of June 2020 (survey wave 4; Figs. 2 and 3).
Compared to the relative reductions estimated based on R values reported by the RKI, relative reductions in contacts measured by COVIMOD both in the simple and the more complex approach were higher during the first survey wave but fit quite well during waves 2 to 4. Mobility reduction estimates based on Google and Apple data were considerably smaller than relative reductions estimated based on R values reported by the RKI throughout the entire study (Figs. 2 and 3; Additional file 3 Table 1.3a and b). Both mobility data sources found mobility patterns similar to pre-pandemic data already during the time of survey waves 1 and 2, while reported R values 10 days later were still considerably below 1.
The mean absolute percentage error of the relative reduction measured in COVIMOD based only on the reduction of contacts itself (the simple approach) was 19% (SD 12), measured in COVIMOD based on the more complex derivation of the next-generation matrix was 28% (SD 12), measured based on Google mobility data was 75% (SD 8) and measured based on Apple mobility data was 87% (SD 33). The mean absolute percentage error (MAPE) of the simple and more complex COVIMOD approach were smaller than the ones obtained via Google (p < 0.001 for both approaches) and Apple mobility data (p = 0.010 and p = 0.015). The introduction of a scaling factor reduced MAPE values considerably, especially for both COVIMOD approaches (Fig. 4).
According to the systematic review of Thompson et al. , the SAR for the healthcare/workplace/casual close contacts is around 10% of that of household contacts. When we fitted the reduction in social contacts based on the simple approach to the relative reductions estimated based on R values reported by the RKI, the best fit for the relative transmission risk for non-home contacts compared with home contacts was obtained with a very similar estimate of around 8% compared to around 20% in the mobility data. The mean absolute percentage error decreased to 5% (SD 0.7%) based on COVIMOD and to 18% (SD 14%) based on Google when we used this approach to derive contact/mobility type-specific weights and scaling factors. Applying the estimates from Thompson et al. , the mean absolute percentage error was very similar for COVIMOD (5%, SD 0.25%) but larger for estimates based on Google mobility (27%, SD 17%). An even lower mean absolute percentage error was obtained by using the more complex contact survey data approach based on the next-generation matrix (mean absolute percentage error of 1% (SD 1%) for both weighting based on estimates by Thompson et al.  and fitting of home/non-home contacts (Fig. 4).
In this study, we quantified the relative reduction in contacts based on contact survey data and publicly available mobility data. We found that both data sources represent different dimensions of transmission dynamics; changes in contact patterns measured in survey data represented transmission dynamics (measured as R) better than the changes measured in aggregated mobility data independently of the introduction of contact- and mobility type-specific weights and the use of scaling factors. Non-pharmaceutical interventions introduced in Germany during the first wave of the SARS-CoV-2 pandemic were, however, associated with both a considerable reduction in social contacts reported in contact surveys as well as with reductions in mobility patterns. The results of our study indicate that deriving contact behaviour from mobility data alone, as it was often the case in political decision-making during the first and second wave, is not suitable for making real-time inferences on the effects of public health measures on the transmission dynamics in a population. Mobility data used in this study suggested that contact behaviour went back to normal almost instantly after the contact reduction measures were relaxed, which did not reflect the observed R values. A reason for that might be that people still tried to minimise close contacts outside their own households and maximised distance to the contacts they had, although their mobility, e.g. back to work, already reached almost pre-pandemic levels. Therefore, a complementary approach including both aspects, i.e. social contact behaviour as well as mobility behaviour, is necessary to fully reflect transmissions dynamics. Although repeated contact surveys need considerable investment in terms of time and costs, the potential benefits and financial savings if used as a near real-time proxy for transmission dynamics on a population level are likely to outweigh the efforts needed. Benefits include a better preparedness towards expected case numbers as well as earlier information on the effect of newly introduced contact reduction measures, which allows timely adaptation if needed.
In our study, we found a 73% mean reduction in contacts across the first four waves of COVIMOD (i.e. from April to June 2020) which is consistent with studies from other European countries [3, 4]. Even though the reported number of daily contacts increased over the survey waves, it was still considerably lower than in POLYMOD, indicating sustainable behaviour change even after the end of the strictest contact reduction measures. We found an increased variance in the reported daily number of contacts as the COVIMOD waves progressed, with the maximum number of contacts increasing from 16 in survey wave 1 to 674 in wave 4, while median contact numbers were not affected similarly. Since SARS-CoV-2 has been shown to be associated with a high variance in the number of transmissions arising from one infectious individual , this sharp increase in the maximum number of contacts has huge implications for the risk of superspreading events as the direct aftermath of the end of public health interventions. Participants aged 60 and above reported fewer contacts in all COVIMOD waves as well as a larger reduction to pre-pandemic values when compared to children and middle-aged persons. This should be taken into account when assessing the effects of vaccination prioritisation strategies in combination with NPIs, as people in this age group are known to be more vulnerable to SARS-CoV-2 infections .
We further observed a smaller and more stable reduction in home contacts than in work, educational and leisure time contacts, which confirms that reduction in contacts is location-specific . This is reasonable as most of the social distancing measures implemented at that time had their main impact outside the household. We confirmed that the majority of remaining contacts under strict contact reduction measures happens between life partners and parents and children, which mirrors the huge role of this transmission setting under contact reduction measures [7, 34].
When introducing contact- and mobility type-specific weights representing different transmission probabilities for home and non-home contacts/mobility, we were able to considerably reduce the differences in estimates for transmission dynamics when compared to the reported R values 10 days later, even if scaling factors had been fitted to the different data source models before. However, the remaining differences were in all analyses much smaller for estimates based on contact survey data than for mobility data. These results show that the presented approach might be suitable for a near real-time estimation of transmission dynamics based on contact survey data alone or in combination with mobility data. A data-driven estimation (based on contact survey data) of the relative transmission risk at home compared to non-home transmission resulted in estimates very similar to those derived from setting-specific secondary attack rates reported in the literature. Our results indicate that the differentiation in home and non-home contacts based on contact survey data supports the representation of the true role of different types of contacts for transmission dynamics.
Our analyses suggest that the use of contact survey data, especially after weighing for home and non-home contacts together with an additional scaling factor, can indeed be used as an early marker of current transmission dynamics, especially if they are mainly determined by contact reduction measures. We show that aggregated mobility data offer a different behavioural perspective but can also contribute to a better understanding of how transmission dynamics might develop in near real-time. The analyses performed in this study were rather simplistic by nature, as they aimed to provide an overall estimate of transmission dynamics without differentiating by too many different factors and without a formal dynamic mathematical model. In reality, the information provided about changes over time in contact settings, intensities and frequencies with contact partners offers especially for contact survey data but also for mobility data much wider perspectives. Since these analyses require a dynamic modelling approach taking into account various other assumptions not necessarily available in the early phases of an epidemic, they might not be as suitable for near real-time communication with decision-makers as the simpler approaches presented here. However, future analyses should focus on using the available contact and mobility data to construct and validate multi-layer mathematical models which take into account mobility data for large scale movements and contact survey data for small scale effect contacts, and this combines the strengths of the different data sources.
Our study has several limitations. COVIMOD data are not fully representative of the German population since some adult participants with under-aged children living in their households were invited to provide information as a proxy for their children. Moreover, the elderly (> 70 years) and the very young (< 10 years of age) are underrepresented in COVIMOD. We tried to correct for that by introducing weights for sex, age and household size; however, there were no relevant differences in the results of the unweighted and weighted analyses. Participants in COVIMOD were asked to record their contacts retrospectively so that different forms of information bias could have been introduced. For example, it might be challenging to remember a higher number of contacts, or the participants’ willingness to report high numbers of contacts individually might be lower as this is quite tedious and time-consuming. We tried to minimise this by allowing the participants to record group contacts. We also cannot rule out that COVIMOD attracted specifically participants who adhered to social distancing rules as these individuals might be more likely to respond to health surveys. This could have led to an overestimation of the relative reduction of contacts and could explain the gap between relative reductions in social contacts and reported R values. We tried to minimise this bias by using an established online panel not focusing on healthcare questions as the platform for COVIMOD. Even though contact-related questions were similarly phrased between POLYMOD and COVIMOD, POLYMOD was paper-based, and COVIMOD surveys were web-based. Previous research suggested that participants might report more contacts in paper-based surveys than in web-based surveys [11, 35]. Future research will be conducted on the differences between web- and paper-based contacts during the pandemic. However, our findings are consistent with other studies that examined social contact patterns under strict contact reduction measures [3, 4, 15, 36]. We used aggregated mobility data in our study that were freely available and have been discussed as a potential real-time proxy for SARS-CoV-2 transmission dynamics. Although we took advantage of two different data sources representing complementary ways to define mobility, our results cannot be automatically generalised to other ways of measuring mobility (e.g. based on individual movement patterns). The R values derived from RKI represent the changes in transmission dynamics based on contact reduction measures as well as population immunity, while contact survey data and mobility data can only assess the former. Since population immunity was below 1% in the study period, this is unlikely to have played a major role in this analysis but needs to be taken into account for future studies. Application of scaling factors, which include information on developing population immunity, might be a useful tool for later phases of an epidemic.
In summary, our study provides a comprehensive quantification of social contacts and mixing patterns as well as aggregated mobility information relevant to the spread of SARS-CoV-2 during spring and summer 2020 in Germany. Our results indicate that population-based contact surveys provide a suitable platform for near real-time assessment of transmission dynamics for respiratory infections in a population in the absence of population immunity. Aggregated mobility data as a proxy for effective contacts did not show the same degree of persistent reduction. The introduction of contact and mobility type-specific weights led to a considerable improvement in the reflection of reported changes in case numbers 10 days later. Mobility data and social contact data provide information on different dimensions of human behaviour. A complementary approach including both aspects, social contact behaviour and mobility behaviour might be needed to reflect transmission dynamics best.
Availability of data and materials
The data are available from the corresponding author upon valid scientific request.
Coronavirus disease 2019
Basic reproduction number
- R :
Robert Koch Institute
Severe acute respiratory syndrome coronavirus 2
Cucinotta D, Vanelli M. WHO declares COVID-19 a pandemic. Acta Biomedica. 2020;91(1):157–60. https://doi.org/10.23750/abm.v91i1.9397.
World Health Organization. WHO announces COVID-19 outbreak a pandemic. 2020. https://www.euro.who.int/en/health-topics/health-emergencies/coronavirus-covid-19/news/news/2020/3/who-announces-covid-19-outbreak-a-pandemic. Accessed 6 Oct 2020.
Jarvis CI, Van Zandvoort K, Gimma A, Prem K, Auzenbergs M, O’Reilly K, et al. Quantifying the impact of physical distance measures on the transmission of COVID-19 in the UK. BMC Med. 2020;18(1):124. https://doi.org/10.1186/s12916-020-01597-8.
Coletti P, Wambua J, Gimma A, Willem L, Vercruysse S, Vanhoutte B, et al. CoMix: comparing mixing patterns in the Belgian population during and after lockdown. Sci Rep. 2020;10(1):21885. https://doi.org/10.1038/s41598-020-78540-7.
Besprechung der Bundeskanzlerin mit den Regierungschefinnen und Regierungschefs der Länder vom 22.03.2020. 2020. https://www.bundesregierung.de/breg-de/themen/coronavirus/besprechung-der-bundeskanzlerin-mit-den-regierungschefinnen-und-regierungschefs-der-laender-vom-22-03-2020-1733248. Accessed 1 Mar 2021.
Luh DL, You ZS, Chen SC. Comparison of the social contact patterns among school-age children in specific seasons, locations, and times. Epidemics. 2016;14:36–44. https://doi.org/10.1016/j.epidem.2015.09.002.
Mikolajczyk RT, Akmatov MK, Rastin S, Kretzschmar M. Social contacts of school children and the transmission of respiratory-spread pathogens. Epidemiol Infect. 2008;136(6):813–22. https://doi.org/10.1017/S0950268807009181.
Melegaro A, Del Fava E, Poletti P, Merler S, Nyamukapa C, Williams J, et al. Social contact structures and time use patterns in the Manicaland Province of Zimbabwe. PLoS One. 2017;12(1):1–17. https://doi.org/10.1371/journal.pone.0170459.
Glass LM, Glass RJ. Social contact networks for the spread of pandemic influenza in children and teenagers. BMC Public Health. 2008;8:1–15.
Backer JA, Mollema L, Vos ER, Klinkenberg D, van der Klis FR, de Melker HE, et al. Impact of physical distancing measures against COVID-19 on contacts and mixing patterns: repeated cross-sectional surveys, the Netherlands, 2016–17, April 2020 and June 2020. Eurosurveillance. 2021;26(8):2000994. https://doi.org/10.2807/1560-7917.ES.2021.26.8.2000994.
Leung K, Jit M, Lau EHY, Wu JT. Social contact patterns relevant to the spread of respiratory infectious diseases in Hong Kong. Sci Rep. 2017:7(1):7974. https://doi.org/10.1038/s41598-017-08241-1.
Mossong J, Hens N, Jit M, Beutels P, Auranen K, Mikolajczyk R, et al. Social contacts and mixing patterns relevant to the spread of infectious diseases. PLoS Med. 2008;5(3):0381–91. https://doi.org/10.1371/journal.pmed.0050074.
Klepac P, Kucharski AJ, Conlan AJK, Kissler S, Tang ML, Fry H, et al. Contacts in context: large-scale setting-specific social mixing matrices from the BBC Pandemic project. medRxiv. 2020. https://doi.org/10.1101/2020.02.16.20023754.
Vandendijck Y, Camarda CG, Hens N. Cohort-based smoothing methods for age-specific contact rates. bioRxiv. 2018. doi: https://doi.org/10.1101/290551
Zhang J, Litvinova M, Liang Y, Wang Y, Wang W, Zhao S, et al. Changes in contact patterns shape the dynamics of the COVID-19 outbreak in China. Science (80-). 2020;368:1481–6.
Fava E Del, Cimentada J, Perrotta D, Grow A, Rampazzo F, Gil-Clavel S, et al. The differential impact of physical distancing strategies on social contacts relevant for the spread of COVID-19. medRxiv. 2020. https://doi.org/10.1101/2020.05.15.20102657.
Feehan DM, Mahmud AS. Quantifying population contact patterns in the United States during the COVID-19 pandemic. Nat Commun. 2021;12:1–9.
Latsuzbaia A, Herold M, Bertemes JP, Mossong J. Evolving social contact patterns during the COVID-19 crisis in Luxembourg. PLoS One. 2020;15(8):e0237128. https://doi.org/10.1371/journal.pone.0237128.
Brankston G, Merkley E, Fisman DN, Tuite AR, Poljak Z, Loewen PJ, et al. Quantifying contact patterns in response to COVID-19 public health measures in Canada. medRxiv. 2021. https://doi.org/10.1101/2021.03.11.21253301.
Google. Mobilitätsberichte zur Coronakrise. 2020. https://www.google.com/covid19/mobility/. Accessed 20 Aug 2020.
Apple. COVID-19 – Berichte zu Mobilitätstrends. 2020. https://covid19.apple.com/mobility. Accessed 20 Aug 2020.
RKI - Coronavirus SARS-CoV-2 - Nowcasting und R-Schätzung: Schätzung der aktuellen Entwicklung der SARS-CoV-2-Epidemie in Deutschland. 2020. https://www.rki.de/DE/Content/InfAZ/N/Neuartiges_Coronavirus/Projekte_RKI/Nowcasting.html. Accessed 18 Mar 2021.
Robert Koch Institut. Erläuterung der Schätzung der zeitlich variierenden Reproduktionszahl R. 2020. https://www.rki.de/DE/Content/InfAZ/N/Neuartiges_Coronavirus/Projekte_RKI/R-Wert-Erlaeuterung.html. Accessed 6 Oct 2020.
an der Heiden M, Hamouda O. Schätzung der aktuellen Entwicklung der SARS-CoV-2- Epidemie in Deutschland – Nowcasting. Epidemiol Bull. 2020;17:10–5.
Fischer B, Knabbe C, Vollmer T. SARS-CoV-2 IgG seroprevalence in blood donors located in three different federal states, Germany, March to June 2020. Eurosurveillance. 2020;25(28):2001285. https://doi.org/10.2807/1560-7917.ES.2020.25.28.2001285.
Statistische Ämter des Bundes und der Länder. Zensus 2011 - Bevölkerungs- und Wohnungszählung 2011. 2011. https://www.zensus2011.de/DE/Home/home_node.html. Accessed 21 Sep 2020.
Lumley T. Survey: analysis of complex survey samples. 2020. http://r-survey.r-forge.r-project.org/survey/.
Funk S, Dunbar MB-N, Carl A. B. Pearson, Clifford S, Jarvis C, Robert A. socialmixr: social mixing matrices for infectious disease modelling. 2020. https://cran.r-project.org/web/packages/socialmixr/socialmixr.pdf.
Diekmann O, Heesterbeek JAPP, Roberts MG. The construction of next-generation matrices for compartmental epidemic models. J R Soc Interface. 2010;7(47):873–85. https://doi.org/10.1098/rsif.2009.0386.
Wallinga J, Teunis P, Kretzschmar M. Using data on social contacts to estimate age-specific transmission parameters for respiratory-spread infectious agents. Am J Epidemiol. 2006;164(10):936–44. https://doi.org/10.1093/aje/kwj317.
Thompson HA, Mousa A, Dighe A, Fu H, Arnedo-Pena A, Barrett P, et al. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) setting-specific transmission rates: a systematic review and meta-analysis. Clin Infect Dis. 2021;73(3):e754–64. https://doi.org/10.1093/cid/ciab100.
R Core Team. R: a language and environment for statistical computing. 2020.
Adam D, Wu P, Wong J, Lau E, Tsang T, Cauchemez S, et al. Clustering and superspreading potential of SARS-CoV-2 infections in Hong Kong. Nat Med. 2020;26(11):1714–9. https://doi.org/10.1038/s41591-020-1092-0.
Eames KTD, Tilston NL, Edmunds WJ. The impact of school holidays on the social mixing patterns of school children. Epidemics. 2011;3(2):103–8. https://doi.org/10.1016/j.epidem.2011.03.003.
Rübsamen N, Akmatov MK, Castell S, Karch A, Mikolajczyk RT. Comparison of response patterns in different survey designs: a longitudinal panel with mixed-mode and online-only design. Emerg Themes Epidemiol. 2017;14(1):1–11. https://doi.org/10.1186/s12982-017-0058-2.
Cowling BJ, Ali ST, Ng TWY, Tsang TK, Li JCM, Fong MW, et al. Impact assessment of non-pharmaceutical interventions against coronavirus disease 2019 and influenza in Hong Kong: an observational study. Lancet Public Heal. 2020;5(5):e279–88. https://doi.org/10.1016/S2468-2667(20)30090-6.
The authors would like to thank Christopher Jarvis, Kevin Van Zandvoort, Amy Gimma, John Edmunds and the entire CoMix team for giving us the chance to use an adapted version of the CoMix questionnaire and for the great cooperation. The authors would also like to thank the team at IPSOS-Mori for their work on implementing the COVIMOD survey.
COVIMOD is funded by intramural funds from the Institute for Epidemiology and Social Medicine, University of Münster, Institute for Medical Epidemiology Biometry and Informatics and Martin Luther University Halle-Wittenberg, as well as by funds of the Robert Koch Institute, Berlin; the Helmholtz-Gemeinschaft Deutscher Forschungszentren e.V. via the HZEpiAdHoc “The Helmholtz Epidemiologic Response against the COVID-19 Pandemic” project; the German Free State of Saxony via the SaxoCOV project; and the Federal Ministry of Education and Research (BMBF) as part of the Network University Medicine (NUM) via the egePan Unimed project (funding code: 01KX2021). Open Access funding enabled and organized by Projekt DEAL.
Ethics approval and consent to participate
COVIMOD was approved by the ethics committee of the Medical Board Westfalen-Lippe and the University of Münster, reference number 2020-473-f-s. Informed consent was obtained from all COVIMOD participants. The POLYMOD data collection was approved by national institutional review boards . As only anonymised POLYMOD data were used in this study, an institutional review was not required for reanalysis.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
COVIMOD questionnaire. This additional file includes the questionnaire.
Consideration of additional contacts. This file illustrates how additional contacts were dealt with in the data management process.
Additional results. This file provides results additional to the ones provided in the manuscript.
Additional data analyses information. This file provides more data analyses details.
About this article
Cite this article
Tomori, D.V., Rübsamen, N., Berger, T. et al. Individual social contact data and population mobility data as early markers of SARS-CoV-2 transmission dynamics during the first wave in Germany—an analysis based on the COVIMOD study. BMC Med 19, 271 (2021). https://doi.org/10.1186/s12916-021-02139-6
- Contact patterns
- Contact surveys