Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
00
Printed in Great Britain.All rightsreserved Copyright0 1991PergamonPressplc
ways. First, the cumulative logit for each out- 42 patients (9%); 5% had a previous diagnosis
come category was plotted vs quintiles of age. of genital or breast cancer and 5 patients had
Second, plots of log odds ratios with 95% radiation therapy in the past. A remote history
confidence limits for each variables in the model of peptic ulcer disease was given by 12% of
were plotted vs the point of dichotomization patients, a history of cholecystectomy in 10%
(i.e. 0 vs 1, 2, 3; 0, 1 vs 2, 3; and 0, 1, 2 vs 3). and a family history of colon cancer in 10%.
Finally, a partial proportional odds model Stools were guaiac positive in 34% of patients;
was fitted to test the proportional odds 114 patients did not have a FOBT. The
assumption [3]. median hematocrit was 40% (interquartile
Two different validation methods were used range 36-44%). Eight patients had missing
to assess the predictive accuracy of the model hematocrit values. These 8 patients were not
developed in the training sample when applied included in the logistic regression analysis.
to the test sample. In the first method, patients Most patients underwent colonoscopy to
were divided into two age and two sex groups. evaluate occult or overt bleeding (38%). Other
The predicted probability of outcome 1 or worse recorded reasons included: abnormal barium
(any colorectal neoplasia), outcome 2 or worse enema (32%), abnormal proctoscopy (14%),
(adenomatous polyp a5 mm or colorectal abdominal pain (9%), iron deficiency (6%),
cancer), and outcome 3 (colorectal cancer) was history of colorectal neoplasia (5%), and IBD
calculated for each patient and compared with (3%) (more than one reason possible per
the observed outcome status: patient in outcome patient).
group 1 or worse, 2 or worse, and 3 [4]. There were 4 complications including 3 post-
In the second method, the reliability relating polypectomy bleeding episodes requiring trans-
predicted probabilities to observed outcomes is fusion only and 1 cecal perforation requiring
estimated by logistic-linear calibration. In this surgery. There were no deaths.
method, a reliability curve is estimated by fitting Neoplastic (adenoma or carcinoma) lesions
a second logistic model, with the independent were confirmed by histopathology in 163
variable being the predicted log odds of the patients. One hundred twenty-nine had ade-
outcome (formulated from the training sample) nomatous polyps (87% 20.5 cm) and 34 had
and the dependent variable being a dichoto- cancer. An additional 4 patients with colon
mous indicator of whether or not the outcome tumors were identified. Two patients had
actually occurred (from the test sample). If the metastatic lesions to the colon and 2 had colonic
predictions are reliable, the reliability curve lymphoma. These patients were not considered
generated should be a 45” line. Tests for overall to have colorectal neoplasia for purposes of
unreliability are reported. analysis. Of the 34 colorectal cancers, 27 (79%)
Somer’s rank correlation [17] between the were localized to the bowel wall. Six had spread
linear combination of factors in the logistic to regional lymph nodes (Dukes’ Stage C) and
regression model and the outcome level was only one patient had distant metastases at
computed in the training and test sample to presentation.
quantify predictive ability. A nomogram was Plots of the cumulative logit of lesion vs age
generated from the training sample to allow demonstrated near linearity. A score test of
predictions to be made in individual patients in linearity (vs a restricted cubic spline) was
the test sample [18]. insignificant (x: 4.79, p = 0.2). Conversely, the
restricted cubic spline of hematocrit demon-
RESULTS strated significant deviation from linearity (x$
8.35, p = 0.04). A plot of the restricted cubic
Endoscopy of the entire colon was accom- spline function suggested a quadratic relation-
plished in 93% of the 461 patients; 63% ship. The model using the quadratic term was
were outpatients. Nearly two-thirds of the 461 superior to the model using the restricted cubic
patients were female and over half were over spline (x: 6.77, p = 0.03 vs x: 9.78, p = 0.04).
60 years of age (median = 61; interquartile There were no significant interactions
range 48-71). Common presenting symptoms between the dummy variable coding for a
included: abdominal pain (46%), hematochezia missing FOBT and the 5 predictors ultimately
(37%), constipation (23%), and diarrhea (17%). chosen for the model (xi 3.89, p = 0.57). A plot
A history of IBD was given by 20 patients. of the cumulative logit of lesion vs FOBT
A history of colorectal neoplasia was given by negative, missing and positive showed the
1266 SCXXTR. BRAZERet al.
missing values between negative and positive. The predicted probability of disease is very
Therefore, missing FOBT were set to the mean. similar to the actual prevalence when patients
Clinical characteristics associated with col- were divided into four age and sex groups
orectal neoplasia as determined by univariate [Figs l(A-D)]. Figures 2(A) and (B) demon-
analysis are shown in Table 1. strate the reliability of predictions for any col-
Independent, statistically important char- orectal neoplasia and for large polyps (2 5 mm)
acteristics which correlated with colorectal or cancer respectively in the test sample. An
neoplasia are shown in Table 2. These char- overall test for unreliability was insignificant
acteristics included not having diarrhea or both for any colorectal neoplasia (x: 2.35,
pain as the sole indication for evaluation p = 0.3) and for large polyps or cancer (xi 0.45,
(x: 42.23, p < O.OOOl),advancing age xi 33.49, p = 0.8).
p < O.OOOl), the quadratic hematocrit term The Somer’s correlation was 0.475 in the
(x: 15.95, p <O.OOOl), male sex (x: 6.65, training sample and 0.463 in the test sample,
p < O.Ol), a positive FOBT (x: 5.12, p = 0.02), indicating that slight overfltting to the training
and hematocrit (xi 3.27, p = 0.07). Iron de- sample caused slight inability of the model to
ficiency, hematochezia, a positive history risk validate in test sample.
factor score, constipation, and abdominal pain
did not correlate with colorectal neoplasia after DISCUSSION
adjustment for the variables included in the
model (p > 0.05). A global test of all two way This study demonstrates the ability of ordinal
interactions was not signifiant (x:, 31.87, logistic regression analysis to provide accurate
p = 0.8). predictions of risk for colorectal neoplasia in
The model does not violate the proportional selected patients undergoing colonoscopy. The
odds assumption. The cumulative logit plot of logistic regression model makes few assump-
lesion vs age reveals three nearly parallel lines. tions which should be verified and tested.
Plots of the log odds ratios for the FOBT, sex, The five variables included in the model meet
age, hematocrit and hematocrit x hematocrit the linearity, additivity and proportional odds
show no clear deviation from a line with a assumptions. These five characteristics, avail-
slope of zero. Finally, a global test of pro- able from a standard clinical exam, include the
portional odds suggested no deviation from patient’s age, sex, FOBT results, hematocrit,
proportionality (& 9.57, p = 0.48). and indication for colonoscopy. The study
”
ANY NEOPLASIA BIG POLYP OR WNOER CIHOER
(D) (Cl
Fig. 1. Predictions in test population. Clockwise from upper left: (A) represents women 16-62 years,
(B) women 63-93 years, (C) men aged 24-58 years, and (D) men 5949 years. Average predicted
probabilities of colorectal neoplasia, colorectal adenomatous polyps 2 5 mm or cancer, and colorectal
cancer compared with the observed prevalence. Sex-specific median age was used to derive age groupings.
demonstrates that the information contained The results of our study demonstrate that risk
in each characteristic contributes independent estimates of colorectal neoplasia in a randomly
information to the other characteristics. Risk selected test sample closely correspond to that
estimates may be easily generated from a predicted. Because estimates were only devel-
nomogram (Fig. 3). [N.B. The nomogram is oped and validated in patients who had been
provided to demonstrate the feasibility of such referred to a gastroenterologist for evaluation
estimates and should not be used to develop risk and ultimately underwent colonoscopy, it is
estimates in individual patients.] Estimates possible that risk estimates in patients not
based on the model developed in the training referred for colonoscopy might be substan-
sample are reliable when applied to patients in tially different. Several types of bias could be
the test sample. introduced through the referral process for
(4 (6)
, OBSERVED PROPORTION OBSERVED PROPORTION
1 44,0
,I_
_,._ _.*_
w
v
0 0.1 0.2 0.3 0.4 0.6 0.3 0.7 a3 a3 1 0 0.1 0.2 0.3 0.4 0.6 0.3 0.7 0.3 0.3 1
PREDICTED PROBABILITY PREDICTED PROBABILITY
Fig. 2. Reliability of the predictions. Predictions of any colorectal neoplasia (A) and adenomatous polyps
25 mm or cancer (B) based on the training sample were applied to all patients in the test sample.
Shown are the probabilities of colorectal neoplasia estimated by logistic-linear calibration (solid line).
If predictions are reliable, the solid line should fall on the line with a slope of 1 (dashed line).
1268 San-r R. BRAZLXet al.
PREDICTED PROBABILITY
* * 3c
0.1 - ----
3c ” % m m * *
I I I I I I I I I I I I I I
0
50 52 54 56 58 60 62 64 66 68 70 72 74 76 78 80
AGE
PREVALENCE OF NEOPLASIA
0.6
+
I I 4 I 1 I I I I I -I
01
30 35 40 45 50 55 60 65 70 75 80 85
AGE
Fig. 5. Lack of interaction between age and FOBT. The patients have been grouped by their FOBT result
(positive = solid line, negative = dashed line) and thereafter divided into four quantiles by age. The mean
prevalence of any colorectal neoplasia is plotted. The lack of interaction is shown by the two parallel lines.
a continuous scale (Fig. 4). Furthermore, the from those not referred. However, an over-
study demonstrates that the information con- whelming percentage (96%) of patients with
tent in the FOBT is similar at different ages, colorectal neoplasia in the current study had
as demonstrated by the lack of an important localized (Duke’s Stage A or B) colon cancer
interaction between the two characteristics or colorectal polyps. These lesions rarely cause
(Fig. 5). pain, altered bowel habits, hematochezia, or
While the presence or absence of given pre- iron deficiency [33].
dictors in the model may make biologic or Endoscopic examination of the entire colon is
epidemiologic “sense”, their presence (or the gold standard for the diagonsis of colon
absence) may only represent biases present in neoplasia. Several reasons, including unaccept-
the patient population studied. For example, able cost and risk proscribe routine application
while some investigators have found an of colonoscopy unless a high risk group is the
increased prevalence of colorectal neoplasia in target. Until more sophisticated diagnostic tests
men undergoing screening colonoscopy [24], (e.g. accurate genetic and/or biochemical tests)
the association of male sex with colorectal neo- become available, patients at high risk can only
plasia seen in this study may have resulted from be identified by analysis of clinical character-
the increased investigation of women with the istics obtained from patients with localized col-
irritable bowel syndrome. Known risk factors ore&al neoplasia. Although a selection bias is
for the development of colon cancer: IBD implicit in the present referral pattern, the
[25-27J; family [28,29] or personal [30,31] his- model developed from analysis of clinical vari-
tory of colorectal neoplasia; cholecystectomy ables reliably predicted colon neoplasia in a
[3 11;and history of breast or female genital tract test sample of this population. Particularly
cancer [l 1,321; were not included in the model. important, is the application of this model to a
Again, this may reflect peculiarities of the popu- relatively large number of patients harboring
lation studied or the low prevalence of these risk localized disease. Application of similar tech-
factors in the subjects studied. niques to independent patient populations could
Symptoms and signs commonly used to provide rationale for this approach to early
detect colon cancer (abdominal pain, consti- diagnosis of colon neoplasia.
pation, hematochezia, and iron deficiency) were One may conclude that accurate estimates of
not helpful in the identification of patients at colorectal neoplasia in selected patients are
high risk for colorectal neoplasia. These systems possible using ordinal logistic regression analy-
may have prompted the referral of these patients sis. It is hoped that this technique, not the
to the gastroenterologist; their value may have specific model developed, is applied to an
been to discriminate a moderate risk subset unselected population in order to obtain a
1270 Scorr R. BRAZERet al.
model providing unbiased predictions. Future is. Banks J. Nomograms. In: Kotz S, Johnson NL, Eds.
Encyclopedia of StatisticaI Sciences. Vol. 6: Multi-
studies should include the clinical characteristics variate Analysis-PIackett and Burman Designs.
identified in our study as candidate variables as New York: Wiley; 1985: 265-271.
well as a detailed dietary [34-361 and bowel 19. Philbrick JT, Horwitz RI, Feinstein AR. Methodo-
logic problems of exercise testing for coronary artery
habit history [371. disease: groups, analysis and bias. Ant J Cardiol 1980;
46: 807-812.
Acknowledgements-This work was supported,in part, by 20. Sackett DL. Bias in analytical research. J Ckron Dis
research grants from the National Institutes of Health 1979; 32: 51-63.
AM0716610, the National Center for Health Services 21. Boland CB, Itzbowitz SH, Kim YS. Colonic
Research HS04873, HS05635, the National Heart, Lung, polyps and the gastrointestinal polyposis syn-
and Blood Institute HL36587, the National Library of dromes. In: Sleisenger MH, Fordtran JS, Eds.
Medicine lGO8 LM04613, and a grant from the Andrew Gastrointestinal Disease: Patkophysiology, Diagno&,
W. Mellon Foundation, New York. Management. Philadelphia: W.B. Saunders; 1989:
1487-1489.
22. Simon JB. Occult blood screening for colorectal carci-
REFERENCES noma: a critical review. Gsstroenterology 1985; 88:
820-837.
1. Hlatky MA, Califf RM, Harrell FE Jr, Lee KL, Mark 23. Gilbertsen VA, McHugh R, Schuman L, Williams SE.
DB, Muhlbaier LH, Pryor DB. Clinical judgement and The earlier detection of colorectal cancers. Cancer
therapeutic decision making. J Ant CoII Cardlol 1990, 1980; 45: 2899-2901.
15: 1-14. 24. Rex D, Lehman G, Hawes R et al. Screening
2. Armstrong BG, Sloan M. Ordinal regression models colonoscopy in asymptomatic average-risk volunteers
for epidemiologic data. Am J Epidemiol 1989; 129: with negative fecal occult blood tests; age and sex
191-204. differences. Am J Gastroenterol 1990; 85: 274A.
3. Peterson BL, Harrell FE Jr. Partial proportional odds 25. Devroede GJ, Taylor WF, Sauer WG, Jackman RJ,
models for ordinal response variables. Appl Stat 1990; Stickler GB. Cancer risk and life expectancy of
39: 205-217. children with ulcerative colitis. N EngI J Med 1971;
4. Harrell FE, Lee KL. The practical value of logistic 285: 17-21.
regression. Pram 1OtkAnoual SAS User’s Group Inter- 26. Ekbom A, Helmick C, Zack M, Adami H. Ulcerative
national Conference, Cat-y, NC. SAS Institute; 1985: colitis and colorectal cancer. N Enel J Med 1990; 323:
1031-1036. 1228-1233.
5. Lee KL, Pryor DB, Harrell FE Jr et al. Predicting 27. Katzka I, Brody RS, Morris E, Katz S. Assessment of
outcome in coronary disease: statistical models versus colorectal cancer risk in patients with ulcerative colitis:
exnert clinicians. Am J Med 1986: 80: 553-560. experience from a private practice. GastroenteroIogy _.
6. Kong DF, Lee KL, Harrell FE Jr et al. Clinical 1983; 85: 22-29. - -
expehence and predicting survival in coronary disease. 28. Kussin SZ. Linkin M. Winawer SJ. Inherited colon
Arch Intern Mad 1989: 149: 1177-1181. cancer: clinical’implications. Am J Gastroenterol 1979;
7. Silverberg E, Boring CC, Squires TS. Cancer Statistics, 72: 448-457.
1990. CA 1990; 40: 9-26. 29. Schottenfeld D. Patient risk factors and the detection
8. Silverberg E, Holleb AI. Major trends in cancer: of early cancer. Prev Med 1972; 1: 335-351.
25 year survey. CA 1975; 25: 2-7. 30. Lotfi AM, Spencer RJ, Ilsturp DM, Melton J.
9. Winawer SJ. Screening for colorectal cancer: an Colorectal polyps and the risk of subsequent
overview. Cancer 1980; 45: 1093-1098. carcinoma. Mayo CBn Proc 1986; 61: 337-343.
10. Winawer SJ, Sherlock P. Surveillance for colorectal 31. Vemick LJ, Kuller LH. A case-control study
cancer in average-risk patients, familial high-risk of cholecystectomy and right-sided colon cancer:
groups, and patients with adenomas. Cancer 1982; 50: the influence of alternative data sources and
2609-2614. differential interview participation proportion on
11. Schottenfield D, Winawer SJ. Large intestine. In: odds ratio estimates. Ant J Epidemiol 1982; 116:
Schottenfeld D, Fraumen J Jr, Eds. Cancer Epidemi- 86-101.
ology and Prevention. Philadelphia: W.B. Saunders; 32. Schottenfeld D, Berg JW, Vitsky B. Incidence of
1982: 703-727. multiple primary cancers. II. Index cancers arising in
12. Winawer SJ, Sherlock, P, Schottenfeld D ef al. the stomach and lower digestive system. J Natl Cancer
Screening for colon cancer. Gastroenterology 1976; Illst 1969; 43: 77-86.
70: 783-789. 33. Ma&se FA, St John JB. Relationship between pat-
13. Sherlock P, Winawer SJ. The role of early diagnosis terns of bleeding and hemoccult sensitivity in patients
in controlling large bowel cancer. Cancer 1977; 40: with colorectal cancers or adenomas. Gastroenterology
2609-2615. 1982; 82: 891-898.
14. Walker SH, Duncan DB. Estimation of the prob- 34. Modan B. Role of diet in cancer etiology. Cancer
ability of an event as a function of several independent 1977; 40: 1887-1891.
variables. Biometrlkn 1967; 34: 167-179. 35. Hoff G, Moen IE, Trygg K et al. Epidemiology
15. Harm.11FE. The LOGIST procedure. In: Hastings RP, of polyps in the rectum and sigmoid colon. Scand J
Ed. SUGI Supplemental Llbnry User’s Guide, version Gastroenterol 1986; 21: 199-204.
5 edn. Cary, NC: SAS Institute Inc.; 1986: 269-293. 36. Hoff G, Larsen S. Epidemiology of polyps in the
16. Harrell FE, Lee KL, Califf RM, Pryor DB, Rossati rectum and sigmoid colon. Discriminant analysis for
RA. Regression modelling strategies for improved identification of individuals at risk of developing col-
prognostic prediction. Stat Med 1984; 3: 143-153. ore&d neoplasia. Bcand J Gastroenterol 1986; 7:
17. Goodman LA, Kruskal WI-I. Measwas of Assad&ion 848-852.
for Cross-Clam&ations. New York: Springer-Verlag; 37. Burkitt DP, Walker ARP, Painter NS. Dietary fiber
1979. and disease. JAMA 1974; 229: 1068-1074.