- Research article
Development and external validation of a faecal immunochemical test-based prediction model for colorectal cancer detection in symptomatic patients
BMC Medicine volume 14, Article number: 128 (2016)
Risk prediction models for colorectal cancer (CRC) detection in symptomatic patients based on available biomarkers may improve CRC diagnosis. Our aim was to develop, compare with the NICE referral criteria and externally validate a CRC prediction model, COLONPREDICT, based on clinical and laboratory variables.
This prospective cross-sectional study included consecutive patients with gastrointestinal symptoms referred for colonoscopy between March 2012 and September 2013 in a derivation cohort and between March 2014 and March 2015 in a validation cohort. In the derivation cohort, we assessed symptoms and the NICE referral criteria, and determined levels of faecal haemoglobin and calprotectin, blood haemoglobin, and serum carcinoembryonic antigen before performing an anorectal examination and a colonoscopy. A multivariate logistic regression analysis was used to develop the model with diagnostic accuracy with CRC detection as the main outcome.
We included 1572 patients in the derivation cohort and 1481 in the validation cohort, with a 13.6 % and 9.1 % CRC prevalence, respectively. The final prediction model included 11 variables: age (years) (odds ratio [OR] 1.04, 95 % confidence interval [CI] 1.02–1.06), male gender (OR 2.2, 95 % CI 1.5–3.4), faecal haemoglobin ≥20 μg/g (OR 17.0, 95 % CI 10.0–28.6), blood haemoglobin <10 g/dL (OR 4.8, 95 % CI 2.2–10.3), blood haemoglobin 10–12 g/dL (OR 1.8, 95 % CI 1.1–3.0), carcinoembryonic antigen ≥3 ng/mL (OR 4.5, 95 % CI 3.0–6.8), acetylsalicylic acid treatment (OR 0.4, 95 % CI 0.2–0.7), previous colonoscopy (OR 0.1, 95 % CI 0.06–0.2), rectal mass (OR 14.8, 95 % CI 5.3–41.0), benign anorectal lesion (OR 0.3, 95 % CI 0.2–0.4), rectal bleeding (OR 2.2, 95 % CI 1.4–3.4) and change in bowel habit (OR 1.7, 95 % CI 1.1–2.5). The area under the curve (AUC) was 0.92 (95 % CI 0.91–0.94), higher than the NICE referral criteria (AUC 0.59, 95 % CI 0.55–0.63; p < 0.001). On the basis of the thresholds with 90 % (5.6) and 99 % (3.5) sensitivity, we divided the derivation cohort into three risk groups for CRC detection: high (30.9 % of the cohort, positive predictive value [PPV] 40.7 %, 95 % CI 36.7–45.9 %), intermediate (29.5 %, PPV 4.4 %, 95 % CI 2.8–6.8 %) and low (39.5 %, PPV 0.2 %, 95 % CI 0.0–1.1 %). The discriminatory ability was equivalent in the validation cohort (AUC 0.92, 95 % CI 0.90–0.94; p = 0.7).
COLONPREDICT is a highly accurate prediction model for CRC detection.
Colorectal cancer (CRC) is the most common tumour, the seventh cause of death and the fourth cause of years of life lost in Western Europe. Health authorities have developed two strategies to reduce CRC-related impact: CRC screening and prompt diagnosis in symptomatic patients [2–6]. In order to reduce the delay between the onset of symptoms and diagnosis and improve prognosis, several criteria with high probability for CRC detection have been established. In this regard, the best known guidelines are the National Institute for Health and Care Excellence (NICE) criteria for suspected cancer. Although patients meeting these criteria are more likely to have CRC, the specificity of the criteria is low [7–9]. Moreover, these criteria depend on the physician’s subjective evaluation.
In recent years, several CRC prediction models have been designed and validated in different settings. Although their diagnostic accuracy is acceptable and better than that of the existing referral criteria, these prediction models have not been widely implemented [11–13]. Several potential biomarkers are now available that could be used to determine the risk of CRC detection in symptomatic patients. A faecal immunochemical test (FIT) has proven to be a useful diagnostic test both for CRC screening in asymptomatic individuals and for diagnosis in symptomatic patients [8, 14–18]. Quantitative FIT allows for quantification of faecal haemoglobin (f-Hb) concentration. There are several FIT-based prediction models for CRC detection in asymptomatic individuals. However, no study has evaluated the effect of FIT together with other clinical parameters to determine the risk of CRC in symptomatic patients [7–10].
On the basis of the hypothesis that a predictive model for CRC diagnosis based on symptoms, biomarkers and demographic information could improve on the diagnostic accuracy of the NICE referral criteria, we carried out a cross-sectional study on symptomatic patients referred for colonoscopy to develop a CRC prediction model, and subsequently validated it externally in a different set of patients.
COLONPREDICT is a multicentre, cross-sectional, blinded study of diagnostic tests. The study aimed to create and validate a CRC prediction index based on available biomarkers and clinical and demographic data.
The derivation cohort consisted of consecutive patients with gastrointestinal symptoms referred for colonoscopy from primary and secondary health care to Complexo Hospitalario Universitario de Ourense, Spain. Exclusion criteria were age under 18, pregnancy, asymptomatic individuals who were undergoing colonoscopy for CRC screening, patients with a previous history of colonic disease who underwent a surveillance colonoscopy, patients requiring hospital admission, patients whose symptoms had ceased within 3 months before evaluation, and patients who declined to participate after reading the informed consent form. The study was approved by the Clinical Research Ethics Committee of Galicia (Code 2011/038). Patients provided written informed consent.
The Colonoscopy Research Into Symptom Prediction questionnaire was used to record symptoms and demographic data. This had been translated into Spanish after receiving permission from the authors. Nurses specifically trained in the assessment of gastrointestinal symptoms administered the questionnaire to the patients. They also collected administrative information and determined if patients met any of the NICE referral criteria for CRC detection: patients ≥40 years with rectal bleeding and a change of bowel habit persisting ≥6 weeks; patients ≥60 years with rectal bleeding persisting ≥6 weeks without a change in bowel habit and without anal symptoms; patients ≥60 years with a change in bowel habit persisting ≥6 weeks without rectal bleeding; patients presenting a right lower abdominal mass consistent with involvement of the large bowel; patients presenting with a palpable rectal mass; or patients with unexplained iron deficiency anaemia (<11 g/100 mL in men, <10 g/100 mL in non-menstruating women).
All individuals collected a faeces sample from one bowel movement without specific diet or medication restrictions the week before the colonoscopy. They were specifically instructed to sample a stool where no blood was visible. f-Hb concentration was assessed using the automated OC-SENSOR™ (Eiken Chemical Co., Tokyo, Japan) and faecal calprotectin was determined using a commercial ELISA kit (Bühlmann fCAL ELISA calprotectin, Bühlmann Laboratories AG, Basel, Switzerland). The stool sample for the f-Hb determination was collected using the OC-SENSOR probe. The stool sample for the faecal calprotectin determination was collected independently. We determined blood haemoglobin (b-Hb) and mean corpuscular volume with a Beckman Coulter Autoanalyzer (Beckman Coulter Inc., CA, USA) and serum carcinoembryonic antigen (CEA) using a chemiluminescent microparticle immunoassay (UniCel DXI 800; Beckman Coulter).
Endoscopists performing the colonoscopy were blinded to the questionnaire and analytical results. Before the colonoscopy, endoscopists performed a digital rectal examination as well as an anoscopy to determine anorectal findings. Bowel cleansing and sedation were performed as previously described. We considered colonoscopy complete if caecal intubation was achieved. All colonoscopies were performed by experienced endoscopists (>200 colonoscopies per year). Endoscopists described all colorectal lesions and obtained biopsies if appropriate.
The main outcome was CRC. We recorded the location of CRC as rectum, distal to the splenic flexure or proximal to the splenic flexure. Tumour staging was performed according to the American Joint Committee on Cancer (AJCC) classification, 7th edition. The secondary outcomes were advanced neoplasia (AN) and significant colonic lesion (SCL). We defined AN as CRC or advanced adenoma (≥10 mm, villous histology, high-grade dysplasia). SCL was defined as CRC, advanced adenoma, polyposis (>10 polyps of any histology, including serrated lesions), histologically confirmed colitis (any aetiology), polyps ≥10 mm, complicated diverticular disease (diverticulitis, bleeding), colonic ulcer and bleeding angiodysplasia. The remaining lesions were considered non-significant colonic lesions. Data from each individual were registered in an online database.
Sample size calculation
The sample size for the derivation cohort was calculated on the hypothesis that our prediction index sensitivity for CRC detection would be better than the NICE referral criteria. Assuming that CRC prevalence was between 5 and 10 %, NICE referral criteria sensitivity for CRC was 80 % and our prediction index sensitivity for CRC was 90 %, a sample size of 2526 patients would provide 80 % power at a 5 % significance level using a two-sided test. Assuming 10 % losses we would need a final sample size of 2778 patients. An interim analysis was performed after including 800 patients. In this intermediate analysis, CRC prevalence was 12 % and the number of losses was 5 %. On the basis of these data, the final sample size required to include in the derivation cohort was 1607 patients.
Development of the prediction model
Initially we performed a descriptive analysis where continuous variables were expressed as median [minimum–maximum] and qualitative variables as frequency and percentage. We determined potential associations between CRC and the independent variables with parametric/nonparametric tests (Chi-square, Student’s t test, Mann–Whitney). We explored correlations in the data to detect relationships or interactions between the different variables. Before logistic regression, we performed a univariate analysis using generalised additive models with smoothing splines for continuous variables. The objective of this analysis was to determine the different strata or classes for the non-linear variables. We entered into the multivariate logistic regression analysis the variables that were significant in this first analysis and those that could be of clinical interest (we eliminated those showing collinearity or that were linear combinations of others). We used the regression coefficients to construct a CRC prediction score, where the dependent variable was presence/absence of CRC. We calculated the R2 (a measure of variation) of the model for CRC detection and the area under the curve (AUC) in the receiver operating characteristic (ROC) curve. Finally, we also assessed the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC). The final model was chosen on the basis of the highest discriminatory ability measured with the AUC.
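As an illustration of the discrimination and model-fit measures named above, the following minimal sketch computes the AUC as the probability that a randomly chosen patient with CRC has a higher score than a randomly chosen patient without CRC (the Mann–Whitney formulation), together with the textbook definitions of AIC and BIC. The function names and the toy data are hypothetical; the study itself used SPSS and EPIDAT, not this code.

```python
import math


def auc_from_scores(scores, labels):
    """AUC as the share of (case, non-case) pairs in which the case scores higher.

    Ties count as half a win. `labels` uses 1 for CRC and 0 for no CRC.
    """
    cases = [s for s, y in zip(scores, labels) if y == 1]
    controls = [s for s, y in zip(scores, labels) if y == 0]
    if not cases or not controls:
        raise ValueError("need at least one case and one non-case")
    wins = 0.0
    for c in cases:
        for n in controls:
            if c > n:
                wins += 1.0
            elif c == n:
                wins += 0.5
    return wins / (len(cases) * len(controls))


def aic(n_params, log_likelihood):
    """Akaike Information Criterion: 2k - 2 ln(L)."""
    return 2 * n_params - 2 * log_likelihood


def bic(n_params, n_obs, log_likelihood):
    """Bayesian Information Criterion: k ln(n) - 2 ln(L)."""
    return n_params * math.log(n_obs) - 2 * log_likelihood


# Toy data (hypothetical, not from the study): five patients, three with CRC.
toy_scores = [6.1, 5.0, 4.2, 3.0, 2.5]
toy_labels = [1, 1, 0, 1, 0]
toy_auc = auc_from_scores(toy_scores, toy_labels)
```

The pairwise loop is quadratic in the number of patients, which is acceptable for cohorts of this size; a rank-based implementation would give the same value.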
In order to evaluate the diagnostic yield of the final prediction model, we established two example thresholds with a 90 and 99 % sensitivity for CRC detection, and we determined the diagnostic accuracy for CRC and SCL at each threshold. According to these thresholds, we divided the cohort into three groups: high (values over the 90 % threshold), intermediate (values between the 90 and 99 % threshold) and low risk (values below the 99 % threshold) for CRC detection. We calculated the number of patients, the positive predictive value (PPV) and the number needed to endoscopy to detect a CRC and an SCL in each group. We compared our predictive model with the NICE referral criteria in two ways: (1) AUC using the Chi-square test of homogeneity of areas and (2) comparison of the sensitivity and specificity at the sensitivity thresholds established with the McNemar’s test. We additionally calculated the diagnostic accuracy in two additional example thresholds: 50 % sensitivity and 90 % specificity for CRC detection.
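The threshold selection and three-way grouping described above can be sketched as follows. This is a minimal illustration under stated assumptions: a patient counts as test-positive when the score is greater than or equal to the cut-off, boundary values fall into the higher group, and all function names and toy scores are hypothetical rather than taken from the study.

```python
def threshold_for_sensitivity(case_scores, target_percent):
    """Highest cut-off at which at least `target_percent` % of cases score >= cut-off."""
    ordered = sorted(case_scores, reverse=True)
    # Integer ceiling of target_percent * n / 100 avoids floating-point surprises.
    needed = (target_percent * len(ordered) + 99) // 100
    return ordered[needed - 1]


def risk_group(score, cutoff_90, cutoff_99):
    """Assign a risk group from the 90 % and 99 % sensitivity cut-offs.

    Assumption: a score equal to a cut-off belongs to the higher group.
    """
    if score >= cutoff_90:
        return "high"
    if score >= cutoff_99:
        return "intermediate"
    return "low"


# Toy scores for ten patients with CRC (hypothetical values).
toy_case_scores = [8.0, 7.0, 6.5, 6.0, 5.8, 5.6, 5.0, 4.5, 4.0, 3.5]
toy_cutoff_90 = threshold_for_sensitivity(toy_case_scores, 90)
toy_cutoff_99 = threshold_for_sensitivity(toy_case_scores, 99)
toy_groups = [risk_group(s, toy_cutoff_90, toy_cutoff_99) for s in (7.2, 3.8, 1.0)]
```

Because the cut-offs are fixed from the distribution of scores among patients with CRC only, they do not depend on CRC prevalence in the cohort.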
External validation of the prediction model
The validation cohort included a prospective cohort of patients with gastrointestinal symptoms referred for colonoscopy in 11 hospitals in Spain. We collected the variables included in the model prospectively and we used the coefficients to calculate the COLONPREDICT score for each patient in the validation dataset. We also determined those patients that met the criteria for 90 and 99 % sensitivity. We compared the discriminatory ability of the model in the derivation and the validation cohorts with ROC curves and AUC on one side, and with the Chi-square test to determine differences in sensitivity and specificity at the established thresholds between both cohorts for CRC, AN and SCL detection.
Diagnostic accuracy according to healthcare level
Finally, we performed a post hoc analysis of our model to determine if its diagnostic accuracy was modified on the basis of the healthcare level referring the patient for colonoscopy: primary versus secondary healthcare. In order to perform this analysis we grouped derivation and validation cohorts and we compared discriminatory ability with ROC curves, AUC, and sensitivity and specificity with the Chi-square test.
We report differences with 95 % confidence intervals (95 % CI). A p-value <0.05 was considered to be statistically significant. Analysis was carried out using SPSS statistical software, version 15.0 (SPSS Inc., Chicago, IL, USA) and EPIDAT 3.1 (Dirección Xeral de Saúde Pública, Santiago de Compostela, Spain).
Description of the derivation cohort
Between March 2012 and September 2013, 2381 patients were referred for colonoscopy for the evaluation of symptoms. After excluding 745 patients due to exclusion criteria, 1636 patients were included in the initial cohort. Finally, 64 patients did not complete the study protocol, so there were 1572 evaluable patients (Fig. 1). We show the baseline characteristics of the patients included in Table 1. We detected CRC in 214 (13.6 %) patients, located in the rectum (37.4 %) and colon (43.5 % of total CRC distal and 19.2 % proximal to the splenic flexure). Tumour staging was 0 (2.8 %), I (18.6 %), II (25.1 %), III (37.7 %) and IV (15.8 %). Additionally, we found advanced adenomas in 201 patients (12.8 %), a polyp ≥10 mm with non-adenoma histology in 6 patients (0.4 %), colitis in 36 patients (2.3 %) and other SCLs in 6 patients (0.4 %). Overall, we detected an SCL in 463 patients (29.5 %).
Development of the prediction model
In Table 1 we show the results from the initial analyses performed to determine which variables were associated with the risk of detecting CRC. Several variables – age, sex, rectal bleeding, primary healthcare referral, change in bowel habit, symptoms lasting 1–12 months, rectal mass and laboratory results – were associated with an increased risk of CRC detection. On the other hand, the presence of abdominal or anal pain, the detection of benign anorectal lesions, a previous colonoscopy or a family history of CRC reduced the risk of CRC detection on colonoscopy. Age had a normal distribution and a linear relationship with the risk of CRC. In contrast, we transformed the rest of the continuous variables into categorical variables before introducing them into the multivariate logistic regression. We introduced the variables on account of their statistical relationship and their clinical relevance. Our final model (Fig. 2) consisted of 11 variables. The mathematical formula to calculate the COLONPREDICT score is as follows: 0.789 × rectal bleeding + 0.536 × change in bowel habit + 2.694 × rectal mass − 1.283 × benign anorectal lesions + 2.831 × f-Hb ≥20 μg/g of faeces + 1.561 × b-Hb <10 g/dL + 0.588 × b-Hb 10–12 g/dL + 1.511 × CEA ≥3 ng/mL + 0.040 × age (years) + 0.813 × sex (male) − 2.073 × previous colonoscopy (last 10 years) − 0.849 × continuous treatment with aspirin. The intercept of the logistic regression that the COLONPREDICT score is based on is −7.807. As an example, a 70-year-old man with rectal bleeding, no change in bowel habit, haemorrhoids with no rectal mass on anorectal examination, no previous colonoscopy, continuous treatment with aspirin, serum CEA = 0.2 ng/mL, b-Hb = 14 g/dL and an f-Hb of 50 μg/g of faeces would have a COLONPREDICT score of 5.1.
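The formula can be written as a short function, which also reproduces the worked example. This is a minimal sketch, not the authors' implementation: the function and parameter names are hypothetical, the handling of the b-Hb stratum boundaries (10 and 12 g/dL inclusive for the middle stratum) is an assumption because the text does not state it, and the conversion to a probability simply applies the standard logistic function to the reported intercept plus the score.

```python
import math

# Coefficients copied from the published formula (log-odds units).
COEFFICIENTS = {
    "rectal_bleeding": 0.789,
    "change_in_bowel_habit": 0.536,
    "rectal_mass": 2.694,
    "benign_anorectal_lesion": -1.283,
    "f_hb_ge_20": 2.831,
    "b_hb_lt_10": 1.561,
    "b_hb_10_to_12": 0.588,
    "cea_ge_3": 1.511,
    "male": 0.813,
    "previous_colonoscopy": -2.073,
    "aspirin": -0.849,
}
AGE_COEFFICIENT = 0.040
INTERCEPT = -7.807


def colonpredict_score(age_years, male, rectal_bleeding, change_in_bowel_habit,
                       rectal_mass, benign_anorectal_lesion, f_hb_ug_g,
                       b_hb_g_dl, cea_ng_ml, previous_colonoscopy, aspirin):
    """Sum of coefficients for the findings present, plus 0.040 per year of age."""
    findings = {
        "rectal_bleeding": rectal_bleeding,
        "change_in_bowel_habit": change_in_bowel_habit,
        "rectal_mass": rectal_mass,
        "benign_anorectal_lesion": benign_anorectal_lesion,
        "f_hb_ge_20": f_hb_ug_g >= 20,
        "b_hb_lt_10": b_hb_g_dl < 10,
        # Assumption: the 10-12 g/dL stratum includes both end points.
        "b_hb_10_to_12": 10 <= b_hb_g_dl <= 12,
        "cea_ge_3": cea_ng_ml >= 3,
        "male": male,
        "previous_colonoscopy": previous_colonoscopy,
        "aspirin": aspirin,
    }
    total = sum(COEFFICIENTS[name] for name, present in findings.items() if present)
    return total + AGE_COEFFICIENT * age_years


def crc_probability(score):
    """Logistic transform of intercept + score (standard logistic regression)."""
    return 1.0 / (1.0 + math.exp(-(INTERCEPT + score)))


# Worked example from the text: 70-year-old man on aspirin with rectal bleeding,
# haemorrhoids, CEA 0.2 ng/mL, b-Hb 14 g/dL and f-Hb 50 ug/g.
example_score = colonpredict_score(
    age_years=70, male=True, rectal_bleeding=True, change_in_bowel_habit=False,
    rectal_mass=False, benign_anorectal_lesion=True, f_hb_ug_g=50,
    b_hb_g_dl=14, cea_ng_ml=0.2, previous_colonoscopy=False, aspirin=True,
)
print(round(example_score, 1))  # 5.1, matching the example in the text
```

Note that the score excludes the intercept, which is why the worked example yields 5.1 rather than a negative log-odds value.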
The R2 of our prediction model was 0.55 and the AUC was 0.92 (95 % CI 0.91–0.93). The AIC and BIC were 1213 and 1220. Before settling on the final model, we built several prediction models with different combinations of variables. As examples, we show some of the models evaluated: FIT and rectal mass (AUC 0.85, 95 % CI 0.80–0.85); FIT, CEA, blood haemoglobin and rectal mass (AUC 0.88, 95 % CI 0.86–0.90); and FIT, age, sex, CEA, blood haemoglobin, rectal mass and previous colonoscopy (AUC 0.90, 95 % CI 0.88–0.92). All of them had a significantly inferior discriminatory ability when compared with the final COLONPREDICT model. Finally, a prediction model with the same variables as the COLONPREDICT score but with f-Hb introduced in four strata (undetectable, between 0 and 20 μg Hb/g faeces, between 20 and 200 μg Hb/g faeces, and at least 200 μg Hb/g faeces) had the same discriminatory ability as the final model (AUC 0.92, 95 % CI 0.91–0.94).
Diagnostic accuracy of the model
We compared the discriminatory ability of our prediction model with the NICE referral criteria. Overall, the AUC of the COLONPREDICT score was significantly higher than the NICE referral criteria (0.59, 95 % CI 0.55–0.63; p < 0.001), as shown in Fig. 3. The example thresholds of the b-coefficient of our prediction model with 90 % and 99 % sensitivity were 5.6 and 3.5, respectively. When comparing the sensitivity and the specificity with the NICE referral criteria, the COLONPREDICT score had higher sensitivity at both thresholds. In contrast, the COLONPREDICT score was less specific than the NICE referral criteria at the 3.5 threshold. The diagnostic accuracy analysis for CRC detection of the NICE referral criteria and the COLONPREDICT score is shown in Table 2. At the example threshold with 50 % sensitivity, the sensitivity, specificity, PPV, negative predictive value (NPV) and proportion of positive results were 53.1 % (46.1–59.9), 96.5 % (95.3–97.4), 71.1 % (63.3–77.8), 92.7 % (91.1–94.0) and 10.4 %. In the same way, at the example threshold with 90 % specificity, the sensitivity, specificity, PPV, NPV and proportion of positive results were 77.5 % (71.1–82.8), 89.3 % (87.4–90.9), 53.9 % (48.2–59.6), 96.1 % (94.8–97.1) and 20.0 %.
We also analysed the discriminatory ability of the COLONPREDICT score for AN and SCL detection in symptomatic patients. The AUC of the model was 0.83 (0.80–0.85) and 0.82 (0.80–0.84), respectively. The analysis of the sensitivity and specificity at the two example thresholds is shown in Table 3. According to these thresholds, we divided our derivation cohort into three risk groups: high, intermediate and low. We show the diagnostic yield of this classification for CRC, AN and SCL detection in Table 4. In sum, while the number needed to endoscopy to detect a CRC or an SCL was 603 and 11.8 in the low-risk group, the number needed to endoscopy to detect a CRC or an SCL in the high-risk group was 2.5 and 1.6, respectively. The odds ratio (OR) in the high-risk group for CRC detection was 17 (95 % CI 10.5–27) when compared with the intermediate-risk group and 413 (95 % CI 57.5–2961) when compared with the low-risk group. In the same way, patients in the high-risk group had a higher risk of SCL detection than those in the intermediate- (OR 4.9, 95 % CI 3.7–6.5) and low-risk groups (OR 17.2, 95 % CI 12.3–24.3).
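For readers who want to recompute the measures reported at each threshold, the sketch below derives them from a 2 × 2 table of score result (positive when the score is at or above the threshold) against colonoscopy outcome. The function name and the counts are hypothetical and chosen only to make the arithmetic easy to follow; they are not the study data.

```python
def diagnostic_accuracy(tp, fp, fn, tn):
    """Standard diagnostic accuracy measures from the four cells of a 2 x 2 table.

    tp: score positive, CRC present     fp: score positive, CRC absent
    fn: score negative, CRC present     tn: score negative, CRC absent
    """
    total = tp + fp + fn + tn
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "proportion_positive": (tp + fp) / total,
    }


# Hypothetical cohort of 1000 patients with 100 cancers (illustrative counts only).
toy_accuracy = diagnostic_accuracy(tp=90, fp=60, fn=10, tn=840)
```

With these toy counts the sensitivity is 90 % and the PPV is 60 %, while 15 % of the cohort would be classified as positive.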
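The two summary measures used in this paragraph can be sketched as follows: the number needed to endoscopy is the reciprocal of the PPV, and the odds ratio between two risk groups comes from a 2 × 2 table. The confidence interval uses the Woolf (log) method, which is an assumption because the text does not state which method was used; the function names and the counts in the odds ratio example are hypothetical.

```python
import math


def number_needed_to_endoscopy(ppv):
    """Colonoscopies needed to detect one lesion: the reciprocal of the PPV."""
    if ppv <= 0:
        raise ValueError("PPV must be greater than zero")
    return 1.0 / ppv


def odds_ratio_with_ci(a, b, c, d, z=1.96):
    """Odds ratio and Woolf 95 % CI for a 2 x 2 table.

    a: lesion present in group 1     b: lesion absent in group 1
    c: lesion present in group 2     d: lesion absent in group 2
    """
    odds_ratio = (a * d) / (b * c)
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log)
    upper = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, lower, upper


# PPV for CRC in the high-risk group of the derivation cohort (40.7 %).
nne_high_risk = number_needed_to_endoscopy(0.407)

# Hypothetical counts: 40 of 100 patients with a lesion versus 5 of 100.
toy_or, toy_lower, toy_upper = odds_ratio_with_ci(40, 60, 5, 95)
```

Applied to the high-risk group PPV of 40.7 %, the reciprocal rounds to the 2.5 colonoscopies per CRC reported above.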
Validation of the prediction model
The validation cohort consisted of 1481 patients referred for colonoscopy in 11 hospitals in Spain between March 2014 and March 2015. We show the characteristics of the validation cohort and its comparison with the derivation cohort in Table 5. The validation cohort differed from the derivation cohort with respect to age, primary health care referral, symptoms, treatment with aspirin, benign anorectal lesions, a positive FIT result (≥20 μg Hb/g of faeces), caecal intubation and CRC prevalence. FIT was measured with a qualitative test (HEM-CHECK-2, VEDA-LAB, Alençon Cedex, France) in 22 patients and with a quantitative test in the remaining 1459 patients: 725 with the OC-SENSOR™ (Eiken Chemical Co.), 202 with the OC-Auto 3 Latex™ (Eiken Chemical Co.), 35 with the FOB Gold Test (Sentinel Diagnostics, Milan, Italy), and 497 with Linear i-FOB (Leti, Barcelona, Spain). After using the coefficients to calculate the COLONPREDICT score for each patient in the validation dataset, we compared the discriminatory ability for CRC and SCL detection between both cohorts. We show the results in Fig. 4 and Table 3. The AUC for CRC (0.92, 95 % CI 0.90–0.94; p = 0.7), AN (0.82, 95 % CI 0.79–0.85; p = 0.5) and SCL (0.78, 95 % CI 0.75–0.81; p = 0.05) detection in the validation cohort was similar to the derivation cohort. The −2 log likelihood and the R2 of the model for CRC prediction were 501.1 and 0.49 in the validation dataset, respectively. The Hosmer–Lemeshow test significance was p = 0.9 and the calibration plot for CRC detection of the model is shown in Additional file 1: Figure S1. Furthermore, we found no differences in sensitivity or specificity for CRC or SCL detection between both cohorts at the 5.6 and 3.5 thresholds.
In the validation cohort, 401 patients (27.1 %) met high-risk group criteria with a 30.3 % (95 % CI 25.8–35.3 %) PPV for CRC detection; 453 (30.6 %) met intermediate-risk group criteria with a 3.5 % (95 % CI 2.1–5.9 %) PPV and 628 patients (42.4 %) met low-risk group criteria with a 0.0 % PPV for CRC detection.
Diagnostic accuracy comparison between primary and secondary healthcare referrals
In our post-hoc analysis comparing patients referred from primary and secondary healthcare, we found no significant differences either for CRC (primary 0.91, 95 % CI 0.89–0.94, secondary 0.93, 95 % CI 0.91–0.94; p = 0.3), AN (primary 0.83, 95 % CI 0.80–0.87, secondary 0.81, 95 % CI 0.79–0.84; p = 0.4) or SCL (primary 0.80, 95 % CI 0.77–0.84, secondary 0.80, 95 % CI 0.77–0.82; p = 0.8) detection in the AUC analysis. In addition, apart from a significant difference in specificity for CRC detection at the 90 % sensitivity threshold between both cohorts, we found no differences in the diagnostic accuracy of the COLONPREDICT model as shown in Table 6.
Discussion and conclusions
Statement of principal findings
We have developed and externally validated a prediction model for CRC and SCL detection in symptomatic patients referred for colonoscopy. The COLONPREDICT model is based on easily obtainable variables – demographic, laboratory results, symptoms and anorectal examination findings – and is thus applicable both in primary and secondary healthcare. This prediction model is highly accurate, as the calibration plot shows, and allows for differentiation of a high-risk group and, especially, a low-risk group with a probability of CRC detection below 1 %.
Strengths and weaknesses of the study
We have designed and validated a CRC prediction model on the basis of the hypothesis that symptom-based models had a limited accuracy for CRC detection. We designed our study to compare our prediction model with the NICE referral criteria, the most widely evaluated and implemented criteria for CRC detection. Finally, we were able to validate our prediction model in an external cohort prospectively recruited in several hospitals in Spain, in accordance with the TRIPOD Statement recommendations.
However, we believe that the diagnostic accuracy of our prediction model should be externally evaluated in a population with gastrointestinal symptoms attended in primary care before its use is recommended. Hypothetically, we believe that the diagnostic accuracy of the COLONPREDICT score may increase in CRC low-prevalence populations due to an increase in specificity [10, 12]. Furthermore, although our research has answered the three questions related to a diagnostic test performance identified by Sackett and Haynes before incorporating tests into clinical practice, we cannot answer the fourth question: whether patients undergoing the diagnostic test fare better than similar untested patients. Specific research should be carried out in order to evaluate the diagnostic performance in patients with gastrointestinal symptoms evaluated in primary care as well as the efficiency.
A secondary outcome of our study is that we have produced the first SCL prediction model in symptomatic patients available in the literature. Furthermore, our score is highly accurate with an AUC of 0.82 and 64.2 and 83.1 % sensitivity at the two thresholds evaluated. We are aware that this score does not exclude the detection of significant colonic lesions, mainly advanced adenomas. Although advanced adenoma detection is a secondary endpoint of a CRC screening programme, it is not clear that this should be the endpoint in the evaluation of symptomatic patients.
Strengths and weaknesses in relation to other studies, discussing important differences in results
We have made two main contributions in the design of CRC prediction models. The first one is the inclusion of laboratory findings, mainly FIT, in the prediction model. FIT has recently been evaluated for CRC diagnosis in symptomatic patients and compared with available referral criteria. The available studies show that FIT has a high diagnostic accuracy for CRC detection and our results confirm these findings [8, 15–18]. In fact, the COLONPREDICT score is the first FIT-based CRC prediction model in this setting. Recently, NICE published a new version of the NICE referral criteria for suspected cancer. This new guideline recommends offering faecal occult blood testing to patients whose symptoms carry a PPV below 3 %, such as abdominal pain, weight loss, changes in bowel habits or anaemia. Our results suggest that, if faeces are handled appropriately, patients with gastrointestinal symptoms should be evaluated with FIT-based prediction models, even when they present with rectal bleeding. Unfortunately, we could not compare the new NICE referral criteria with our score because the new criteria were published after the study was completed.
Our second main contribution is to determine thresholds based on sensitivity rather than on PPV. The diagnosis of CRC is a balance between the risk of CRC detection and the resources required for the evaluation of patients. Any diagnostic strategy should determine a high-risk group where most of the CRCs are detected and which requires a fast-track referral to colonoscopy. But, at the same time, it should also establish a low-risk group where no additional explorations are recommended. In this low-risk group, the probability of missing CRC should be well below 1 %, so that the risk of missing CRC is balanced with the risk of colonoscopy complications, mainly perforation. In this regard, the thresholds with 90 and 99 % sensitivity in our model meet these criteria. In fact, the 99 % sensitivity threshold is consistent with the new NICE guidelines, which aimed to be less specific in order to miss fewer CRCs. Another limitation of the prediction models based on PPVs is that they cannot be transferred to low-prevalence populations. In our opinion, the COLONPREDICT model solves this problem, as we base the referral criteria on sensitivity thresholds. In fact, the number of patients meeting low-risk group criteria would probably increase in low CRC prevalence populations, limiting the resources required for further evaluation [10, 12].
Another important finding of our study is the relevance of the anorectal examination in the evaluation of patients with gastrointestinal symptoms. Although anorectal examination is included within practice guidelines for rectal bleeding evaluation, this information is not included in most of the CRC prediction models available [5, 10, 12, 28]. Moreover, we have confirmed the protective effect of previous colonoscopy and treatment with aspirin in symptomatic patients [29, 30]. However, in the univariate analysis, we did not find a relationship between aspirin and the risk of CRC, which was due to the effect of two confounders: male sex and advanced age. After adjusting for these two variables, aspirin had a protective effect on the risk of detecting a CRC on colonoscopy. Similarly, we found a significant reduction in the risk of detecting a CRC when symptomatic patients had a first-degree relative with CRC in the univariate analysis that was due to the effect of two confounders: female sex and younger age. After adjusting for these two variables, family history had no effect on the risk of detecting a CRC on colonoscopy. Finally, another contribution of our study is the introduction of age as a continuous variable. Available referral criteria use age cut-off points (40, 50 or 60 years) to determine which patients with gastrointestinal symptoms should be evaluated [3, 5, 26], thus hindering the diagnosis of CRC in young patients.
Unanswered questions and future research
Two main questions need to be answered in the future. First, as stated before, the diagnostic accuracy and applicability of the COLONPREDICT model in a primary care setting must be addressed in a prospective study. Second, simpler prediction models with similar performance based on laboratory findings must be designed and evaluated in a primary care setting. In this respect, the introduction of new CRC biomarkers may ease the CRC diagnosis process in symptomatic patients.
Abbreviations
AIC: Akaike Information Criterion
AUC: Area under the curve
BIC: Bayesian Information Criterion
FIT: Faecal immunochemical test
NICE: National Institute for Health and Care Excellence
NNE: Number needed to endoscopy
NPV: Negative predictive value
PPV: Positive predictive value
ROC: Receiver operating characteristic
SCL: Significant colonic lesion
CIBERehd is funded by Instituto de Salud Carlos III. We acknowledge Rebeca Gimeno, Romina Fernández-Poceiro, María Luisa de Castro, Lucía Cid, José Ignacio Rodríguez-Prada, María del Carmen González-Mao, María Ángeles López-Martínez and Manuel Rubio for their technical support in the inclusion of the validation cohort.
This study was funded by a grant from Instituto de Salud Carlos III (PI11/00094). JC and VH have received an intensification grant through the European Commission-funded "BIOCAPS" project (FP–7–REGPOT 2012–2013–1, Grant agreement no. FP7–316265). The validation cohort recruitment was funded by a grant from Fundació de la Marató TV3 2012 (785/U/2013). The funding institutions had no role in the study design; in the collection, analysis, and interpretation of data; in the writing of the report; or in the decision to submit the article for publication.
Availability of data and materials
Input data for the model are available from the corresponding author on request.
JCubiella, PV, MTA and JFS participated in the model development design; JCubiella, MS, MDO, PV, MTA and LB in the collection, analysis, and derivation of the prediction model; PV, VAS, MS, MDO, LB, EQ, FFB, RC, JB, RC, LB, JB, JClofent, AF, LT, VP, DRA and VH in the design of the validation study and the recruitment of the validation cohort; JCubiella in the validation analysis and the writing of the report; and all the authors decided to submit the article for publication. All authors had full access to all of the data (including statistical reports and tables) in the study and can take responsibility for the integrity of the data and the accuracy of the data analysis. JCubiella had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis. All authors read and approved the final manuscript.
All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf and declare: JC and MS had financial support from Instituto de Salud Carlos III for the submitted work but they had no financial relationships with any organisations that might have an interest in the submitted work in the previous three years, and no other relationships or activities that could appear to have influenced the submitted work. The remaining authors had no support from any organisation for the submitted work; no financial relationships with any organisations that might have an interest in the submitted work in the previous three years and no other relationships or activities that could appear to have influenced the submitted work.
Ethics approval and consent to participate
The study was approved by the Clinical Research Ethics Committee of Galicia (Code 2011/038), Santiago de Compostela, Spain.
Copyright for authors
The Corresponding Author has the right to grant on behalf of all authors and does grant on behalf of all authors, a worldwide licence to the Publishers and its licensees in perpetuity, in all forms, formats and media (whether known now or created in the future), to i) publish, reproduce, distribute, display and store the Contribution, ii) translate the Contribution into other languages, create adaptations, reprints, include within collections and create summaries, extracts and/or abstracts of the Contribution, iii) create any other derivative work(s) based on the Contribution, iv) exploit all subsidiary rights of the Contribution, v) include electronic links from the Contribution to third–party material wherever it may be located and, vi) licence any third party to do any or all of the above.
The lead author affirms that the manuscript is an honest, accurate and transparent account of the study being reported; that no important aspects of the study have been omitted; and that any discrepancies from the study as planned have been explained.
This research was presented in part at the XVII Reunión Nacional de la Asociación Española de Gastroenterología, Madrid, 26–28 March 2014, and at the 22nd United European Gastroenterology Week, Vienna, 20–22 October 2014.
Investigators of the COLONPREDICT study
Complexo Hospitalario Universitario de Ourense: Joaquín Cubiella, Pablo Vega, María Salve, Marta Díaz-Ondina, Irene Blanco, Pedro Macía, Eloy Sánchez, Javier Fernández-Seara.
University of Vigo: María Teresa Alves.
Hospital Universitario de Canarias: Enrique Quintero, Natalia González-López.
Complejo Hospitalario de Pontevedra: Victoria Álvarez Sánchez, José Mera, Juan Turnes.
Hospital Universitari Mútua de Terrassa: Fernando Fernández-Bañares, Victoria Gonzalo, Mar Pujals.
Registre del Càncer de Catalunya Pla Director d’Oncologia de Catalunya, Hospital Duran i Reynals, L’Hospitalet de Llobregat: Josepa Ribes, Ramón Cleries, Xavier Sanz.
Consorci Sanitari de Terrassa: Jaume Boadas, Sara Galter.
Corporació Sanitària i Universitària Parc Taulí: Rafel Campo, Marta Pujol, Eva Martínez-Bauer.
Departamento de Bioquímica, CATLAB, Viladecavalls, Barcelona: Antonio Alsius.
Donostia Hospital: Luis Bujanda, Jesús Bañales, María J Perugorria.
Hospital de Sagunto: Joan Clofent, Ana Garayoa.
Hospital Clínico Universitario de Zaragoza: Ángel Ferrández, Marina Solano Sánchez.
Hospital Dr. Josep Trueta: Leyanira Torrealba, Virginia Piñol.
Hospital Universitario de Móstoles: Daniel Rodríguez-Alcalde, Jorge López-Vicente.
Complexo Hospitalario Universitario de Vigo: Vicent Hernández, Felipe Iglesias.
Calibration plot of COLONPREDICT model for colorectal cancer detection in the validation cohort. The calibration plot is calculated from the observed and expected proportions within the groups formed by the Hosmer–Lemeshow test. The reference line from the equation is shown. (TIF 739 kb)
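The grouping behind a calibration plot of this kind can be illustrated with a minimal sketch. It assumes equal-sized groups ordered by predicted risk, which is the usual Hosmer–Lemeshow grouping; the function name `calibration_groups` and the toy data are hypothetical and are not taken from the study.

```python
def calibration_groups(predicted, observed, n_groups=10):
    """Return (expected, observed) proportions for each risk group.

    predicted: model-estimated probabilities of the outcome (0..1)
    observed:  actual outcomes coded 0 (absent) or 1 (present)
    Groups are formed by sorting on predicted risk and splitting
    into n_groups chunks of (nearly) equal size.
    """
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed must have the same length")

    pairs = sorted(zip(predicted, observed))
    n = len(pairs)
    groups = []
    for g in range(n_groups):
        start = g * n // n_groups
        end = (g + 1) * n // n_groups
        chunk = pairs[start:end]
        if not chunk:
            continue  # fewer patients than groups: skip empty chunks
        expected = sum(p for p, _ in chunk) / len(chunk)  # mean predicted risk
        actual = sum(y for _, y in chunk) / len(chunk)    # observed proportion
        groups.append((expected, actual))
    return groups


# Toy example (illustrative values only): two groups of two patients each.
points = calibration_groups([0.1, 0.9, 0.1, 0.9], [0, 1, 0, 1], n_groups=2)
```

Plotting each pair with the expected proportion on one axis and the observed proportion on the other, alongside the identity line, gives a plot of this type; points close to the line indicate good calibration.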