
ORIGINAL ARTICLE

Histologic Severity of Appendicitis Can Be Predicted by Computed Tomography


Adam J. Hansen, MD; Scott W. Young, MD; Giovanni De Petris, MD; Deron J. Tessier, MD; Jose L. Hernandez, BA; Daniel J. Johnson, MD

Hypothesis: A regression model based on computed tomographic (CT) findings alone can accurately predict the histologic severity of acute appendicitis in patients who have a high disease likelihood.

Design: Retrospective study.

Setting: Mayo Clinic in Scottsdale, Ariz.

Patients: Consecutive sample of 105 patients (50 women and 55 men, aged 15-89 years) undergoing nonincidental appendectomy within 3 days of nonfocused abdominal CT.

Interventions: Computed tomographic scans and histologic features were retrospectively reinterpreted. Each patient's histologic and CT findings were scored by standardized criteria. An ordinal logistic regression model was constructed with a subset of CT findings that statistically correlated best with the final histologic features. Predicted severity values were then generated from the model.

Main Outcome Measure: Agreement between predicted and actual histologic severity, using weighted κ measurement.

Results: Computed tomography variables used in the model were fat stranding, appendix diameter, dependent fluid, appendolithiasis, extraluminal air, and the radiologist's overall confidence score. The weighted κ measurement of agreement between predicted and actual histologic severity was 0.75 (95% confidence interval, 0.59-0.90).

Conclusions: Computed tomographic findings, when used with the regression model developed from this pilot study, can accurately predict the histologic severity of acute appendicitis in patients initially seen with a high clinical suspicion of the disease. These findings provide a platform from which to prospectively test the model.

Arch Surg. 2004;139:1304-1308

Author Affiliations: Division of General Surgery, Departments of Surgery (Drs Hansen, Tessier, and Johnson), Radiology (Dr Young), Pathology (Dr De Petris), and Biostatistics (Mr Hernandez), Mayo Clinic in Scottsdale, Ariz.

(REPRINTED) ARCH SURG/ VOL 139, DEC 2004 1304
WWW.ARCHSURG.COM
©2004 American Medical Association. All rights reserved.

Appendectomy is the most common abdominal surgical procedure performed on an emergent basis worldwide.1 Early diagnosis is crucial in preventing perforation and resultant morbidity. Historically, the appendix is normal in approximately 20% of suspected cases of appendicitis that proceed to surgery, although modern negative laparotomy rates are significantly lower.2-5 Computed tomography (CT) has been shown to aid in the diagnosis of acute appendicitis, with up to 98% sensitivity, 98% specificity, and an overall accuracy of 98%.2,3,6-10 Computed tomography has contributed greatly to decreasing the performance of negative laparotomies.

Although the accuracy of CT has been clearly demonstrated, areas of diagnostic uncertainty still exist. Uncertainty in the diagnosis of acute appendicitis, even in the presence of a high clinical suspicion, may result in unnecessary morbidity (4.60%) and mortality (0.14%) through negative laparotomy or delay in surgical therapy.11,12 Prior studies have identified common radiologic findings in acute appendicitis.10,13-16 Identification of a set of these findings may lead an interpreter to accept or to reject the radiologic diagnosis of acute appendicitis. Clinicians interpreting CT scans and clinical information must ultimately rely on their subjective experience to establish a diagnosis of acute appendicitis.

A wealth of literature has been published citing experience with and methods of accurately diagnosing or ruling out the disease. Previous publications have described scoring systems that grade appendicitis clinically, based on various combinations of CT, surgical, and pathologic findings.17-20 To our knowledge, no formal, predictive model has been created to ascertain the pathologic severity of the disease, given a set of radiologic findings. Such a system could provide a surgical consultant with a powerful tool to aid in clinical decision making. The purpose of this study is to devise a system that may accurately predict the histologic severity of acute appendicitis, as proven by final pathologic findings, based on CT findings. Such a scoring system could potentially facilitate improved judgment in operative timing and potential nonoperative management of patients initially seen with a highly probable clinical picture of acute appendicitis.
METHODS

After obtaining approval of the Mayo Clinic in Scottsdale, Ariz, institutional review board, the medical records of all patients (Table 1) evaluated at our institution during the study period (January 1, 2001, to April 30, 2003) with suspected acute appendicitis were reviewed. Only those patients who proceeded to laparotomy or laparoscopy for nonincidental appendectomy are included in this study. The number of initial cases was 148.

On initial medical record review, some variability in CT technique was observed. To establish uniformity of analysis, patients were selected who had undergone preoperative, nonfocused, abdominal CT scan, using enteric and intravenous contrast media. These factors were chosen based on their demonstrated results reported in recent literature.3,6,9,21 Rectal contrast or air enema was used in most patients but was not required as an inclusion criterion. The selected group also proceeded to appendectomy, via laparoscopy or laparotomy, within 3 days of the CT. One patient was eliminated because of the inability to visualize the appendix on CT, which precluded the accurate diagnosis of appendicitis. Another patient was the only one in the study group to have the finding of diffuse cecal wall thickening. This patient was excluded from the final statistical analysis with the intent to use findings that are commonly seen on CT performed for suspected appendicitis. The final number of patients totaled 105.

A board-certified staff radiologist (S.W.Y.), fellowship trained in body imaging, interpreted all CT scans. The radiologist was provided with the list of selected patients, in addition to the following predetermined list of possible CT findings in acute appendicitis that was based on current literature13: abscess, adenopathy, phlegmon, cecal bar, fat stranding, enlarged appendix, focal cecal apical thickening, appendolithiasis, arrowhead sign, dependent fluid, extraluminal air, terminal ileal wall thickening, sigmoid wall thickening, focal cecal wall thickening, and diffuse cecal wall thickening. The radiologist was blinded to the outcome of each individual case, although, by necessity of the electronic viewing system's retrieval function, the radiologist was provided with the patients' identification numbers. The original radiology interpretation was not accessed. Instead, the CT scans were reinterpreted using the standardized list of potential findings. A value of 0 through 3 was assigned to each item (0, absent; 1, mild; 2, moderate; and 3, severe). Appendix diameter was not graded, but instead measured in millimeters. For comparison with usual clinical practice, an overall confidence of diagnosis score was assigned to each CT scan by the radiologist (0, appendicitis absent/unlikely; 1, equivocal; 2, appendicitis likely; and 3, appendicitis strongly suspected).

Next, a board-certified staff pathologist (G.D.), fellowship trained in both gastrointestinal and oncologic pathology, was provided with the histologic slides from the cases corresponding to the aforementioned CT scans. The pathologist was also blinded to the outcome of each individual case, and the slides were assigned research numbers for anonymity. The original pathologic interpretations were not accessed, owing to lack of uniformity of diagnostic criteria. Each patient's slides were then

Table 1. Study Group Demographics

Variable                                            Female/Male    Total
No. of patients                                     50/55          105
Age, y
  Mean                                              48/45          47
  Median                                            50/43          45
  Range                                             16-89/15-80    15-89
Patients who underwent laparoscopic appendectomy    42/37          79
Patients who underwent open appendectomy            8/18           26

Figure. Frequency of individual computed tomographic findings in acute appendicitis, tabulated by severity level (absent, mild, moderate, or severe). Horizontal axis: % patients with each finding (by severity), 0 to 100. Findings shown: fat stranding, wall enhancement, appendix diameter, focal cecal apical thickening, adenopathy, appendolithiasis, arrowhead sign, dependent fluid, abscess, cecal bar, extraluminal air, phlegmon, terminal ileal wall thickening, sigmoid wall thickening, and diffuse cecal wall thickening. All findings were graded on a 0 (absent) through 3 (severe) scale, except for appendiceal diameter, which was separated into the following 3 groups: 5 to 9, 10 to 14, and 15 to 19 mm.

reinterpreted, assigning a score from 0 through 3, based on the following criteria: 0 indicates no acute inflammation; 1, mucosal infiltrate of neutrophils (acute mucosal appendicitis); 2, inflammation into submucosa and/or muscle (acute suppurative appendicitis); and 3, extensive necrosis of appendiceal wall (gangrenous appendicitis).

The relationship between radiologic findings (predictor variables) and pathologic findings (outcome variables) was statistically analyzed. Each radiologic finding, including the overall confidence of diagnosis score, was correlated with the actual severity score of appendicitis as determined by pathologic review. The Spearman correlation for ranks was used to assess statistical significance. The predictor variables with the strongest correlation coefficients were then selected for inclusion in an ordinal logistic regression model. A predicted histologic severity value was obtained from the model for each patient, and the predicted value was compared with the corresponding actual histologic severity score. The weighted κ measurement of agreement was used to analyze the comparison, providing a measure of accuracy of the regression model.

RESULTS

The Figure graphically displays the frequency of each radiologic finding, tabulated by severity levels. All findings were graded on a 0- through 3-point scale, except for appendiceal diameter, which was separated into the following 3 groups: 5 to 9, 10 to 14, and 15 to 19 mm. Table 2 lists the Spearman correlations, which correlate the presence of each radiologic finding to the actual histologic severity. The variables that were included in the ordinal logistic regression model, owing to their relatively higher correlative values, include the following: (1) extraluminal air, (2) the radiologist's overall confidence score, (3) appendix diameter, (4) fat stranding, (5) appendolithiasis, and (6) dependent fluid. Table 3 gives the comparison between the predicted histologic severities from the regression model and the actual pathologic values. The weighted κ value derived from the data was 0.75 (95% confidence interval, 0.59-0.90).

Table 2. Correlation of the Presence/Severity of Computed Tomographic (CT) Findings to Actual Histologic Severity Using Spearman Correlations

CT Finding                                 Spearman Correlation
Extraluminal air                           0.31
Radiologist's overall confidence score     0.29
Appendix diameter                          0.28
Fat stranding                              0.27
Appendolithiasis                           0.26
Dependent fluid                            0.21
Abscess                                    0.20
Phlegmon                                   0.19
Arrowhead sign                             0.14
Wall enhancement                           0.10
Focal cecal wall thickening                0.10
Adenopathy                                 0.10
Cecal bar                                  0.08
Terminal ileal wall thickening             0.08
Sigmoid wall thickening                    0.04
Diffuse cecal wall thickening              0.21
Focal cecal wall thickening                NA

Abbreviation: NA, not applicable.

Table 3. Comparison Between Predicted Histologic Severities Using the Regression Model and the Actual Pathologic Values*

                             Actual Pathologic Scores
Predicted Severity Scores    0      1      2      3      Total
0                            2      1      0      0      3
1                            0      1      2      0      3
2                            0      0      75     3      78
3                            0      0      9      12     21
Total                        2      2      86     15     105

*The severity scores indicate the following: 0, absent; 1, mild; 2, moderate; and 3, severe.
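The agreement statistic can be recomputed from the cell counts in Table 3. The following is a minimal sketch, not the authors' code: the article does not state which weighting scheme was used, so the quadratic disagreement weights (and the linear alternative shown for comparison) are assumptions, and the names `TABLE_3` and `weighted_kappa` are illustrative.

```python
from typing import List

# Cell counts from Table 3: rows = predicted severity 0..3,
# columns = actual histologic score 0..3.
TABLE_3: List[List[int]] = [
    [2, 1, 0, 0],
    [0, 1, 2, 0],
    [0, 0, 75, 3],
    [0, 0, 9, 12],
]


def weighted_kappa(table: List[List[int]], power: int = 2) -> float:
    """Weighted kappa using disagreement weights |i - j| ** power.

    power=2 gives quadratic weights (ASSUMED here; the article does not
    name its weighting scheme); power=1 gives linear weights.
    """
    k = len(table)
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(k)) for j in range(k)]

    observed = 0.0  # weighted disagreement actually observed
    expected = 0.0  # weighted disagreement expected by chance
    for i in range(k):
        for j in range(k):
            weight = abs(i - j) ** power
            observed += weight * table[i][j]
            expected += weight * row_totals[i] * col_totals[j] / n
    return 1.0 - observed / expected


print(round(weighted_kappa(TABLE_3, power=2), 3))  # quadratic weights
print(round(weighted_kappa(TABLE_3, power=1), 3))  # linear weights
```

Under the quadratic-weight assumption the Table 3 counts give a value of about 0.75, in line with the reported figure; linear weights give about 0.66. All 15 disagreements in Table 3 are off by a single severity grade, which is why the weighted statistic is higher than an unweighted one would be.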
COMMENT

A model that accurately predicts the histologic status of patients who had a CT scan consistent with acute appendicitis could prove to be a useful clinical tool, especially in cases of uncertain patient disposition. Given the common
presentation of this surgical entity, many patients could potentially be affected if even a small percentage were triaged with greater accuracy, based on mathematical prediction of the histologic severity of their disease state. A numeric score falling within the range indicating definite acute appendicitis would support the decision to proceed immediately to surgery. However, patients with scores at either end of the spectrum would benefit from more specific predictive classification of their illness. It has been shown that some patients with symptoms consistent with, but uncertain for, acute appendicitis may avoid surgery altogether, instead being observed, then discharged if their symptoms resolve.18,19 It is plausible that these patients, indeed, have early or mild appendicitis, although this would be impossible to confirm without appendectomy. Some patients with the diagnosis of appendicitis by CT scan have been treated nonoperatively with reasonable results.22 Additionally, nonoperative treatment of perforated appendicitis has been documented with good success.23,24 The traditional practice of interval appendectomy has been called into question by some, indicating that patients who do not have recurrent episodes of appendicitis within 3 to 6 months may never need an appendectomy.23 Any improvement to the current process of diagnosis and determination of patient disposition by an accurate predictive model could help streamline and potentially decrease the costs and morbidity associated with this significant health problem. This study was designed as a pilot study to lay the foundation for a future randomized controlled trial. Although the results are encouraging, further research will be required before they may be used in clinical practice. A strength of the current study was its strict use of objective, standardized radiologic and pathologic data evaluated retrospectively by independent observers. 
Although the outcome of appendectomy was known for every patient, the actual outcome being measured was histologic severity, which allowed the opportunity to blind the radiologist and the pathologist interpreting the CT scans and slides. The study was designed with the specific intent of avoiding potential sources of error, such as reporting bias, to strengthen the assumptions made regarding the validity and applicability of the regression model. One radiologist interpreted all of the CT scans, and one pathologist interpreted all of the histologic specimens. This maximized precision of interpretation. However, no specific attempt was made to determine the reproducibility of the individual interpreters' findings, which may reduce the accuracy of interpretation.

A weakness of the study was selection of a group of patients who had the known clinical outcome of appendectomy. As discussed by Raptopoulos et al17 in a related study, a very high probability of acute appendicitis introduces test review bias. This may in turn lead to overestimation of the sensitivity of CT and the regression model in predicting the actual histologic severity of acute appendicitis. However, the study was intended to determine a useful regression model that would predict histologic severity based on CT findings, not to determine how accurately CT scanning can distinguish the presence or absence of acute appendicitis. Additionally, the radiologist and pathologist were blinded to the pathologic findings to decrease test review bias.

A further weakness of the study was its inability to include all patients with appendicitis. Our patient study group was compiled based on performance of appendectomy shortly after having an abdominopelvic CT scan. It is likely that there were patients seen in the study period who had acute appendicitis but were treated for other conditions and recovered without appendectomy. The patients in the study group with low predicted and actual histologic severity scores were too few to state firmly that the model would be truly useful for predicting histologic severity in patients with early or mild appendicitis. Although some patients may have been unintentionally excluded, no patients were intentionally treated nonoperatively for acute appendicitis during the study period. At the other extreme, patients undergoing interval appendectomy would not have been included in the study group, owing to the inclusion requirement of having a CT scan within 3 days of appendectomy. This amounted to only one patient in the study period. These exclusions, although minimized, could be an important source of bias, and could weaken the assumption that the devised model can accurately predict histologic severity, based on exclusion of some patients at either end of the spectrum of severity.

A thorough description of the statistical methods used in the current study may be obtained elsewhere.25,26 However, certain points warrant mention. Spearman correlations associate predictor variables with outcome measures. With respect to the study population collectively, a high value would indicate the absence of a particular finding when appendicitis is absent or mild, and the high-grade presence of the same finding in severe appendicitis. Conversely, a low or negative value would indicate weak or inverse correlation, respectively. As expected, the relationships between each individual radiologic finding and the actual histologic severity were relatively weak.
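The interpretation of Spearman correlations described above can be illustrated with a small calculation. This is a minimal sketch under stated assumptions: the patient grades are invented for illustration and are not study data, and the helper names `average_ranks` and `spearman` are hypothetical. Tied grades, which are unavoidable on a 0 through 3 scale, are handled by assigning the mean rank to each tie group and then taking the Pearson correlation of the ranks.

```python
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the tie group while the next sorted value is identical.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1  # 1-based mean rank of the group
        for pos in range(start, end + 1):
            ranks[order[pos]] = mean_rank
        start = end + 1
    return ranks


def spearman(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman correlation: Pearson correlation of the average ranks.

    Assumes neither input is constant (otherwise the denominator is zero).
    """
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# HYPOTHETICAL grades for 8 patients (illustration only, not study data).
histology = [0, 1, 2, 2, 2, 3, 3, 2]
finding_tracks_severity = [0, 0, 1, 2, 1, 3, 2, 1]  # rises with severity
finding_inverse = [3 - g for g in finding_tracks_severity]  # falls with severity

print(spearman(histology, finding_tracks_severity))  # positive correlation
print(spearman(histology, finding_inverse))          # negative correlation
```

A finding whose grade rises with histologic severity yields a value near +1, and one that falls with severity yields a value near -1; the correlations in Table 2 (0.04 to 0.31) sit much closer to zero.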
These data show that no single finding on CT scan can reliably and specifically predict the severity of appendicitis, which substantiates the findings of previous studies.14,15 Instead, multiple concurrent findings contribute to a common diagnosis. Although all of the listed radiologic findings correlated to some extent with the final histologic diagnoses in the study patients, the ones that were shown to correlate most strongly were (1) extraluminal air, (2) the radiologist's overall confidence score, (3) appendix diameter, (4) fat stranding, (5) appendolithiasis, and (6) dependent fluid. Thus, these findings were used in the predictive model. The radiologist's overall confidence score is not a specific, objective finding on CT, but it was shown to be a highly correlative factor in predicting final histology. As may be noted from the Figure, inclusion in the regression model did not directly depend on the frequency of occurrence. In fact, the findings of dependent fluid, appendolithiasis, and extraluminal air were relatively infrequent, which suggests that their occurrence, in the presence of the more common findings of fat stranding and appendiceal dilation, should help to solidify the diagnosis of appendicitis.

The ordinal logistic regression model was chosen specifically to relate multiple independent radiologic variables to one dependent variable (predicted histologic severity of appendicitis). Since the dependent outcome variable was ranked on a scale of 0 through 3, rather than a simple binary outcome variable, simple logistic regression could not be used. Rather, the order of the ranked scale was considered and used in the ordinal logistic regression.

The weighted κ measurement of agreement was used in this study to measure the regression model's overall suitability in predicting actual histologic severity of acute appendicitis. A weighted κ measurement considers complete and partial agreement, and assigns a weight related to the degree of disagreement.26 The weighted κ value derived from the data was 0.75 (95% confidence interval, 0.59-0.90). A value of this magnitude is generally interpreted as strong and suggests a useful predictive value of the proposed regression model. A study designed to rely on analysis such as a weighted κ measurement has the inherent problem of generalizability.25 Therefore, application of the model should be reserved for the type of population in which it was developed, namely, patients having a working diagnosis of acute appendicitis.
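To make the ordinal logistic regression step concrete, the sketch below shows how a proportional-odds model turns a set of CT findings into a predicted severity score of 0 through 3. It is an illustration only: the article reports neither coefficients nor cut points, so every number in `COEFFICIENTS` and `CUT_POINTS` is hypothetical, as are the function names, and choosing the most probable category as the prediction is an assumption about how predicted values were derived.

```python
import math
from typing import Dict, List

# HYPOTHETICAL coefficients for the six predictors named in the article.
# The article does not report fitted values; these are placeholders.
COEFFICIENTS: Dict[str, float] = {
    "extraluminal_air": 0.9,      # graded 0-3
    "confidence_score": 0.6,      # graded 0-3
    "appendix_diameter_mm": 0.15,  # measured in millimeters
    "fat_stranding": 0.5,         # graded 0-3
    "appendolithiasis": 0.4,      # graded 0-3
    "dependent_fluid": 0.3,       # graded 0-3
}
# HYPOTHETICAL cut points between scores 0|1, 1|2, and 2|3 (must increase).
CUT_POINTS: List[float] = [1.0, 2.5, 6.0]


def logistic(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def category_probabilities(findings: Dict[str, float]) -> List[float]:
    """P(score = 0..3) under a proportional-odds model.

    Cumulative form: P(score <= k) = logistic(cut_k - linear_predictor).
    """
    eta = sum(COEFFICIENTS[name] * findings[name] for name in COEFFICIENTS)
    cumulative = [logistic(cut - eta) for cut in CUT_POINTS] + [1.0]
    probs = [cumulative[0]]
    probs += [cumulative[k] - cumulative[k - 1] for k in range(1, len(cumulative))]
    return probs


def predicted_severity(findings: Dict[str, float]) -> int:
    """Return the most probable severity score (an assumed decision rule)."""
    probs = category_probabilities(findings)
    return max(range(len(probs)), key=probs.__getitem__)


# Two invented patients, for illustration only.
mild_case = {
    "extraluminal_air": 0, "confidence_score": 0, "appendix_diameter_mm": 6,
    "fat_stranding": 0, "appendolithiasis": 0, "dependent_fluid": 0,
}
severe_case = {
    "extraluminal_air": 3, "confidence_score": 3, "appendix_diameter_mm": 16,
    "fat_stranding": 3, "appendolithiasis": 2, "dependent_fluid": 2,
}

print(predicted_severity(mild_case), predicted_severity(severe_case))
```

Because a single linear predictor is compared against ordered cut points, the model respects the ranking of the 0 through 3 scale, which a set of separate binary logistic regressions would not.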
CONCLUSIONS

Computed tomography is a commonly used clinical tool that has been clearly demonstrated to contribute to the accurate diagnosis of acute appendicitis. Once a strong clinical suspicion of acute appendicitis has been affirmed, deciding on the most advantageous treatment is key. This study accomplished the goal of devising a predictive model that in the near future may help stratify such patients based on prediction of the histologic severity of their disease. It provides a foundation for a subsequent prospective trial that will use predictive stratification to determine the need for and timing of appropriate operative intervention. Such prospective study is necessary before the model can be applied to widespread clinical practice.

Accepted for Publication: May 20, 2004.

Correspondence: Daniel J. Johnson, MD, Department of Surgery, Division of General Surgery, Mayo Clinic Scottsdale, 13400 E Shea Blvd, Scottsdale, AZ 85259 (johnson.daniel1@mayo.edu).
REFERENCES
1. Korner H, Sondenaa K, Soreide JA, et al. Incidence of acute nonperforated and perforated appendicitis: age-specific and sex-specific analysis. World J Surg. 1997;21:313-317.
2. Balthazar EJ, Rofsky NM, Zucker R. Appendicitis: the impact of computed tomography imaging on negative appendectomy and perforation rates. Am J Gastroenterol. 1998;93:768-771.
3. Stroman DL, Bayouth CV, Kuhn JA, et al. The role of computed tomography in the diagnosis of acute appendicitis. Am J Surg. 1999;178:485-489.
4. Colson M, Skinner KA, Dunnington G. High negative appendectomy rates are no longer acceptable. Am J Surg. 1997;174:723-727.
5. Rao PM, Rhea JT, Rattner DW, Venus LG, Novelline RA. Introduction of appendiceal CT: impact on negative appendectomy and appendiceal perforation rates. Ann Surg. 1999;229:344-349.
6. Balthazar EJ, Megibow AJ, Siegel SE, Birnbaum BA. Appendicitis: prospective evaluation with high-resolution CT. Radiology. 1991;180:21-24.
7. Raman SS, Lu DS, Kadell BM, Vodopich DJ, Sayre J, Cryer H. Accuracy of nonfocused helical CT for the diagnosis of acute appendicitis: a 5-year review. AJR Am J Roentgenol. 2002;178:1319-1325.
8. Rao PM, Rhea JT, Novelline RA, Mostafavi AA, McCabe CJ. Effect of computed tomography of the appendix on treatment of patients and use of hospital resources. N Engl J Med. 1998;338:141-146.
9. Rao PM, Rhea JT, Novelline RA, Mostafavi AA, Lawrason JN, McCabe CJ. Helical CT combined with contrast material administered only through the colon for imaging of suspected appendicitis. AJR Am J Roentgenol. 1997;169:1275-1280.
10. Choi YH, Fischer E, Hoda SA, et al. Appendiceal CT in 140 cases: diagnostic criteria for acute and necrotizing appendicitis. Clin Imaging. 1998;22:252-271.
11. Velanovich V, Satava R. Balancing the normal appendectomy rate with the perforated appendicitis rate: implications for quality assurance. Am Surg. 1992;58:264-269.
12. Berry J Jr, Malt RA. Appendicitis near its centenary. Ann Surg. 1984;200:567-575.
13. Rao PM, Rhea JT, Novelline RA. Sensitivity and specificity of the individual CT signs of appendicitis: experience with 200 helical appendiceal CT examinations. J Comput Assist Tomogr. 1997;21:686-692.
14. Rettenbacher T, Hollerweger A, Macheiner P, et al. Outer diameter of the vermiform appendix as a sign of acute appendicitis: evaluation at US. Radiology. 2001;218:757-762.
15. Weyant MJ, Eachempati SR, Maluccio MA, et al. Interpretation of computed tomography does not correlate with laboratory or pathologic findings in surgically confirmed acute appendicitis. Surgery. 2000;128:145-152.
16. Gwynn LK. The diagnosis of acute appendicitis: clinical assessment versus computed tomography evaluation. J Emerg Med. 2001;21:119-123.
17. Raptopoulos V, Katsou G, Rosen MP, Siewert B, Goldberg SN, Kruskal JB. Acute appendicitis: effect of increased use of CT on selecting patients earlier. Radiology. 2003;226:521-526.
18. van den Broek WT, Bijnen BB, Rijbroek B, Gouma DJ. Scoring and diagnostic laparoscopy for suspected appendicitis. Eur J Surg. 2002;168:349-354.
19. Christian F, Christian GP. A simple scoring system to reduce the negative appendicectomy rate. Ann R Coll Surg Engl. 1992;74:281-285.
20. Alvarado A. A practical score for the early diagnosis of acute appendicitis. Ann Emerg Med. 1986;15:557-564.
21. Jacobs JE, Birnbaum BA, Macari M, et al. Acute appendicitis: comparison of helical CT diagnosis focused technique with oral contrast material versus nonfocused technique with oral and intravenous contrast material. Radiology. 2001;220:683-690.
22. Kirshenbaum M, Mishra V, Kuo D, Kaplan G. Resolving appendicitis: role of CT. Abdom Imaging. 2003;28:276-279.
23. Oliak D, Yamini D, Udani VM, et al. Nonoperative management of perforated appendicitis without periappendiceal mass. Am J Surg. 2000;179:177-181.
24. Bagi P, Dueholm S. Nonoperative management of the ultrasonically evaluated appendiceal mass. Surgery. 1987;101:602-605.
25. Soeken K, Prescott P. Issues in the use of kappa to estimate reliability. Med Care. 1986;24:733-741.
26. Norman GR, Streiner DL. Biostatistics: The Bare Essentials. 2nd ed. Hamilton, Ontario: BC Decker Inc; 2000:324.
27. Ransohoff D, Feinstein A. Problems of spectrum and bias in evaluating the efficacy of diagnostic tests. N Engl J Med. 1978;299:926-930.

