Skip to main content

Table 1 Demographic distribution of UCSF and Vanderbilt cohorts. We identified women with preterm and not preterm deliveries at UCSF and Vanderbilt using similar ascertainment (the “Methods” section). For each woman, we predicted the earliest delivery in their EHR. We report age at delivery (patient age) as mean with standard deviation (SD) in parenthesis and self- or third-party-reported race for both cohorts as the count and the column-wise proportion in parenthesis. The T-tests and chi-squared tests of independence were used to compare distributions stratified by delivery label

From: Dense phenotyping from electronic health records enables machine learning-based prediction of preterm birth

 

UCSF

Vanderbilt

Not preterm

Preterm

p-value

Not preterm

Preterm

p-value

n

5615

363

 

18,498

2651

 

Patient age (mean (SD))

36.65 (5.08)

36.54 (5.96)

0.691

27.71 (5.75)

27.73 (6.38)

0.876

Patient race (%)

  

< 0.001

  

< 0.001

American Indian or Alaska Native

26 (0.5)

3 (0.8)

 

47 (0.2)

4 (0.01)

 

Asian

1336 (23.8)

51 (14.0)

 

1051 (5.8)

100 (3.8)

 

Black or African American

336 (6.0)

31 (8.5)

 

2962 (16.5)

486 (18.8)

 

Declined

72 (1.3)

5 (1.4)

 

NA

NA

 

Hispanic

NA

NA

 

2379

322

 

Native Hawaiian/Pacific Islander

86 (1.5)

3 (0.8)

 

NA

NA

 

Others

866 (15.4)

77 (21.2)

 

162 (0.9)

12 (0.04)

 

Unknown

200 (3.6)

32 (8.8)

 

619 (3.3)

69 (2.6)

 

White or Caucasian

2693 (48.0)

161 (44.4)

 

11,278 (63.0)

1658 (64.2)