Background
The indications for surgical treatment of patients with head and neck cancer can be limited by the risk of perioperative complications. Prediction of outcome is important in disease stratification and the subsequent decision-making process.
Objective
To assess the value of the APACHE II (Acute Physiology and Chronic Health Evaluation II) score, POSSUM (Physiological and Operative Severity Score for the Enumeration of Mortality and Morbidity), and ASA (American Society of Anesthesiologists) classification in the prediction of complications in patients with oral or oropharyngeal cancer.
Methods
Five hundred thirty patients with oral or oropharyngeal carcinomas who underwent surgical treatment were evaluated using ASA, POSSUM, and APACHE II scores. The outcome measure was morbidity within 30 days. Logistic regression and receiver operating characteristic curve analyses were used to estimate the predictive ability of the scoring systems.
Results
The overall complication rate was 58.9%. Most of the patients had local complications. The mortality was 2.6%. The results showed that APACHE II (relative risk, 1.09; P = .001) and POSSUM (relative risk, 1.09; P<.001) equally predicted perioperative complications and were superior to the ASA system (relative risk, 0.98; P = .89) (area under the curve, 0.65 for APACHE II, 0.68 for POSSUM, and 0.56 for ASA).
Conclusions
The POSSUM and APACHE II scores were useful in predicting perioperative morbidity for patients with oral or oropharyngeal cancer, serving as objective methods to assist the surgeon in classifying patients into risk groups with different probabilities of perioperative complications. The poorer results achieved with the ASA classification are possibly because this system is primarily based on subjective clinical judgments.
THE ESTIMATION of outcome is of paramount importance in disease stratification and subsequent management, particularly in severely ill surgical patients.1 The American Society of Anesthesiologists (ASA) began using preoperative evaluation in 1963 to predict perioperative mortality and morbidity.2
In recent years, improvements in advanced statistical methods have allowed the creation of new indexes such as the APACHE (Acute Physiology and Chronic Health Evaluation) score3 and POSSUM (Physiological and Operative Severity Score for Enumeration of Mortality and Morbidity).4
In the development of the APACHE index, it was initially assumed that disease severity could be evaluated by quantifying 34 physiological variables.3 In 1985, use of the initial APACHE system was replaced by APACHE II, which considers 12 routinely available physiological and laboratory measurements, with additional points for age and previous health status.5 The APACHE index was primarily designed to predict mortality of patients in intensive care units, but attempts have been made to apply this scoring system to patients with severe trauma,6 abdominal complications,7,8 and acute pancreatitis.9 The results suggested that it is possible to use this scoring system to predict perioperative events in patients who undergo various surgical procedures.
The POSSUM was specially developed for surgical patients and includes 12 physiological and 6 operative variables.4 Its ability to accurately predict not only morbidity but also postoperative mortality was proven, and it has been used for surgical audits in general surgery and in patients with lung and colorectal cancer. Recent investigations have indicated that the POSSUM is the most appropriate score in the establishment of operative risk.10 Also, improvements in anesthetic and surgical techniques, including microvascular reconstruction, have allowed the implementation of radical oncological procedures for treating advanced head and neck cancer.11 However, the curative intent can be limited by the hazard of perioperative complications, which increase the length of hospital stay and require diagnostic and therapeutic procedures, adding to treatment costs.12 There is no specific scoring system to predict morbidity or mortality in patients with head and neck cancer.
The objective of this study was to compare the APACHE II, POSSUM, and ASA scoring systems in the prediction of perioperative complications in patients undergoing curative surgical treatment for oral or oropharyngeal cancer.
Five hundred thirty medical records of patients with squamous cell carcinoma of the oral cavity or oropharynx admitted to the Centro de Tratamento e Pesquisa Hospital do Câncer A. C. Camargo from January 1, 1990, to December 31, 1997, were reviewed. The following criteria were used for inclusion in the study: a histologically confirmed diagnosis, absence of previous oncological treatment for the primary tumor, no distant metastasis, and surgical treatment (exclusive or as part of a multidisciplinary approach) with a curative purpose.
The APACHE II classification (a revised version of the prototype APACHE system) includes 12 physiological measures (temperature, mean arterial pressure, heart rate, respiratory rate, oxygenation, arterial pH, serum sodium, serum potassium, serum creatinine, hematocrit, white blood cell count, and Glasgow Coma Scale score); age; and severe chronic health problems. The physiological score is determined by the worst value (eg, the lowest hematocrit or the highest respiratory rate) found during the initial 24 hours after intensive care unit admission.5 In our study, we also applied APACHE II to patients not referred to the intensive care unit, based on the findings of the first postoperative 24-hour period (Table 1).
The POSSUM was developed through multivariate discriminant analysis to obtain a method of risk assessment. This 12-factor, 4-grade physiological score includes the following: age; cardiac status; pulse rate; systolic blood pressure; respiratory status; Glasgow Coma Scale score; serum concentrations of urea, potassium, and sodium; hemoglobin concentration; white blood cell count; and findings on electrocardiography. It is subsequently combined with a 6-factor operative score that adjusts for the type of surgical procedure and includes the type and number of procedures, volume of blood loss, peritoneal contamination, presence and extent of malignancy, and timing of the operation.4 In this study, the operative extent was defined as minor (partial glossectomy and other intraoral resections without neck dissection), moderate (en bloc or discontinuous wide excision of an oral or oropharyngeal cancer and a unilateral neck dissection), major (any surgical procedure with myocutaneous flap reconstruction or bilateral neck dissection), or major plus (any surgical procedure with microvascular reconstruction). For both indexes, if an item was not evaluated or if the patient did not have any disease that could be responsible for an abnormal result, the lowest score was assigned (Table 2).
The ASA classification, assigned preoperatively by the anesthesiologist during a preoperative outpatient consultation, was also recorded. The POSSUM and APACHE II scores were calculated based on medical chart review.
Morbidity was defined as any complication occurring within the first 30 postoperative days. Local complications were those that arose on the surgical wound without systemic repercussions. Systemic complications were defined as those that affected the whole body, including pneumonia, sepsis, respiratory failure, cerebral vascular accident, acute myocardial infarction, massive bleeding requiring reoperation, and operative death. Any death within 30 days of operation was classified as an operative death.
The statistical analysis included Spearman rank correlation coefficient, 1-way analysis of variance (with the Tukey HSD [Honestly Significantly Different] test for multiple comparisons), and χ2 and Fisher exact tests. Logistic regression analysis was applied to estimate the predictive ability of the ASA, APACHE II, and POSSUM scoring systems in assessing perioperative morbidity. The dependent variable was the postoperative course (uncomplicated or complicated), and the independent variables were the ASA classification and APACHE II and POSSUM scores. The significance of the independent variable in each model was tested with G statistics, which (according to the hypothesis that β1 is equal to zero) follow a χ2 distribution.13
The second step in the statistical analysis compared the predictive ability between the 3 scores using receiver operating characteristic (ROC) curve analysis. The area under the ROC curve indicates the probability of concordance between the predicted probability of postoperative complications and the actual postoperative state, and has been described as the best "index of detectability."14 The area ranges from 0.50 for chance performance to 1.00 for perfect prediction. Each area ± SD referring to a predictive model was estimated and compared with each other using the method proposed by Metz et al.15 Data were considered statistically significant at P<.05.
We reviewed the medical records of 530 consecutive patients, 439 men (82.8%) and 91 women (17.2%), with a mean (SD) age of 57.3 (10.8) years. There was a predominance of white ethnicity (84.2%), tumors located in the oral tongue (26.0%), and TNM clinical stage IV (48.3%). The TNM clinical stage (P<.001) and the type of surgical procedure (P = .003) showed a statistically significant association with the occurrence of perioperative complications. Tumor location was not associated with perioperative complications (P = .09) (Table 3).
The overall perioperative mortality was 2.6% (n = 14). All deaths were attributed to systemic complications. The postoperative morbidity was 58.9% (312/530 patients). Among the perioperative complications, 257 (82.4%) were local complications and 55 (17.6%) were local plus systemic (Table 4). The length of hospital stay ranged from 0 to 71 days (median, 8 days).
The median APACHE II scores were 7.0 in patients with uncomplicated courses and 8.5 in those who developed perioperative complications. The median (interquartile) POSSUM scores were 27 (range, 24-31) overall, 25 (range, 23-28) in patients with uncomplicated courses, and 28 (range, 25-32) in those with complicated ones.
There was a statistically significant difference between the means of the APACHE II and POSSUM scores (P<.001 for both) according to the type of complication (none, local, or local plus systemic) (Table 5). The postoperative mortality was higher among patients presenting with an APACHE II score higher than 10 (5.3%) compared with subjects with an APACHE II score of 10 or lower (1.8%) (P = .052). A statistically significant difference between the operative mortality was also noted between patients with POSSUM scores higher than 24 (3.7%) vs 24 or lower (0.6%) (P = .04).
The correlation between the ASA classification and the length of hospitalization was weak (r = 0.14; P = .002). The APACHE II showed a weak positive correlation with the length of hospitalization (r = 0.22), while the POSSUM showed a modest positive correlation (r = 0.41) (P<.001 for both) (Figure 1).
The results of the logistic regression analysis showed a significant relationship between the risk of complications and the predictive score of each index, as indicated by the G statistics: ASA, 6.18 (P = .01); APACHE II, 32.98 (P<.001); and POSSUM, 39.82 (P<.001) (Table 6). The positivity of the logistic regression coefficient of all scores revealed that the risk increases along with the scores. The results showed that APACHE II (relative risk, 1.09; P = .001) and POSSUM (relative risk, 1.09; P<.001) equally predicted perioperative complications and were superior to the ASA system (relative risk, 0.98; P = .89) (area under the curve, 0.65 for APACHE II, 0.68 for POSSUM, and 0.56 for ASA).
The ROC curve analysis demonstrated that the POSSUM index (area under the ROC curve, 0.68) had a statistically significant better prognostic capacity than ASA classification (area under the ROC curve, 0.56) (P = .002). The APACHE II score (area under the ROC curve, 0.65) also showed a better prognostic ability than ASA classification, with statistical significance (P = .04). There was no statistically significant difference between the areas under the curve for APACHE II and POSSUM scores (P = .23) (Figure 2).
Any surgical procedure should be routinely preceded by a satisfactory preoperative assessment with the main objective of reducing risks. Improvements in methods and equipment and the development of new drugs have decreased the risks associated with anesthesiological and surgical procedures. The acceptable level of risk has probably not changed, with a significant number of unhealthy and older patients still undergoing extensive surgical procedures. Therefore, the role of the preoperative evaluation in the planning and management of anesthesia, surgery, and postoperative care continues to be a relevant issue.16 Several scoring systems can calculate the risk of death, but unfortunately they are inclined to ignore morbidity.4,17
Perioperative complications are defined as unpredicted but preventable events that occur during surgery or in the postoperative period.16,18 Complications after major surgery for head and neck cancer raise treatment costs, delay adjuvant treatment, increase late sequelae, affect quality of life, and can cause a patient's death if not diagnosed and treated quickly.19
Our results support an equal value of POSSUM and APACHE II scores for predicting perioperative complications in patients undergoing surgical treatment for oral or oropharyngeal cancer. The APACHE II also showed a better ability to distinguish between prediction of local and local plus systemic complications. Both systems are superior to the ASA classification, which depends more on subjective clinical decisions.20,21 These results are in agreement with a previous study22 that described the APACHE II score as an independent risk factor for perioperative complications in oral cancer.
The ASA classification system is concisely defined using simplistic terms. This fact paradoxically may have contributed to its extensive and worldwide application, but it is recognized that the use of the system in clinical practice is subject to large variation among assessors.21,23
The APACHE II system has been described as flexible, with good prognostic capacity and with no significant differences in the prediction of outcomes between elective and emergency surgery or between benign and malignant disease, and it is able to predict increasing levels of complications (minor, major, and death).1 However, its weaknesses are that it ignores conditions such as the nutritional status of the patient, cardiological findings such as acute ischemic electrocardiographic alterations, presence of severe arrhythmias, or a history of recent myocardial infarction, which would increase the operative risk.24,25
The POSSUM system has been recommended as an accurate method in evaluating the outcome of surgical patients and allowing direct comparisons, despite distinct patterns of referral and populations.4 It has the benefit of being simple and includes variables that are easy to collect. It considers the physiological condition of the patient at admission and the severity of the surgical procedure to predict the rates of morbidity and mortality.
The POSSUM index aims to draw attention to differences in surgical care and outcome. The inclusion of operative factors adjusts for dependencies on surgical procedure, but adjustments for differences among surgeons, anesthetists, and operating time are lacking.26 The large acceptance and precision in recording a score demands that all the data be promptly accessible from the patient's regular assessment. The POSSUM uses complex data, but all the physiological information is available from a routine preoperative evaluation, and the operative data are included in an ordinary operative note.10 Scores that require nonroutine information may be of value in patients who are prone to be at higher risk from surgery, but they are not appropriate for broad use.10
The accurate measurement of outcomes in surgical patients is a challenge, requiring variables that should be easily quantifiable preoperatively and intraoperatively.27 Some scores are accurate in assessing the risk of morbidity and mortality in particular patients, and the APACHE II is probably the most extensively used and recognized.27 This index is good for intensive care unit patients but has the disadvantages of requiring 24 hours of surveillance28 and not taking into account the extent of the surgical procedure. The POSSUM has the advantages of inclusion of operative complexity, extent of malignant spread, and correction for comorbidities.4
The effect of comorbidities on the prognosis of patients with oral or oropharyngeal cancer has already been demonstrated.29-31 It is well-known that comorbidities contribute to a higher incidence of perioperative complications in this group of patients,11,32 which is subsequently related to a worse prognosis.20 This study demonstrated a good performance of APACHE II and POSSUM scores in the prediction of perioperative complications in patients with oral or oropharyngeal carcinomas. However, current scoring systems for the assessment of surgical risk of complications and death are not completely satisfactory for these patients, who usually have cardiac, pulmonary, and hepatic disorders. Therefore, the development of a new scoring system for patients with head and neck cancer, comprising perioperative and operative factors, should be included as an essential part of clinical outcomes research.
Corresponding author: Luiz Paulo Kowalski, MD, PhD, Hospital Cancer Registry and Department of Head and Neck Surgery and Otorhinolaryngology, Centro de Tratamento e Pesquisa Hospital do Câncer A. C. Camargo, R. Professor Antônio Pudente, 211, CEP 01509-010, São Paulo-SP, Brazil (e-mail: lp_kowalski@uol.com.br).
Accepted for publication December 4, 2002.
This study was presented at the annual meeting of the American Head and Neck Society, Boca Raton, Fla, May 11, 2002.
1.Goffi
LSaba
VGhiselli
RNecozione
SMattei
ACarle
F Preoperative APACHE II and ASA scores in patients having major general surgical operations: prognostic value and potential clinical applications.
Eur J Surg.1999;165:730-735.
PubMedGoogle Scholar 2.Saklad
M Grading of patients for surgical procedures.
Anesthesiology.1941;2:281-285.
Google Scholar 3.Knaus
WAZimmerman
JEWagner
DPDraper
EALawrence
DE APACHE–Acute Physiology and Chronic Health Evaluation: a physiologically based classification system.
Crit Care Med.1981;9:591-597.
PubMedGoogle Scholar 4.Copeland
GPJones
DWalters
M POSSUM: a scoring system for surgical audit.
Br J Surg.1991;78:355-360.
PubMedGoogle Scholar 5.Knaus
WADraper
EAWagner
DPZimmerman
JE APACHE II: a severity of disease classification system.
Crit Care Med.1985;13:818-829.
PubMedGoogle Scholar 6.Rutledge
RFakhry
SRutherford
EMuakkassa
FMeyer
A Comparison of APACHE II, Trauma Score, and Injury Severity Score as predictors of outcome in critically injured trauma patients.
Am J Surg.1993;166:244-247.
PubMedGoogle Scholar 7.Bohnen
JMMustard
RASchouten
BD Steroids, APACHE II score, and the outcome of abdominal infection.
Arch Surg.1994;129:33-37.
PubMedGoogle Scholar 8.Altomare
DFSerio
GPannarale
OC
et al Prediction of mortality by logistic regression analysis in patients with postoperative enterocutaneous fistulae.
Br J Surg.1990;77:450-453.
PubMedGoogle Scholar 9.Fan
STLai
ECMok
FPLo
CMZheng
SSWong
J Prediction of the severity of acute pancreatitis.
Am J Surg.1993;166:262-268.
PubMedGoogle Scholar 11.Singh
BCordeiro
PGSantamaria
EShaha
ARPfister
DGShah
JP Factors associated with complications in microvascular reconstruction of head and neck defects.
Plast Reconstr Surg.1999;103:403-411.
PubMedGoogle Scholar 12.Durazzo
AESMachado
FSCaramelli
B Avaliação cardiológica pré-operatória para a cirurgia não cardíaca.
Rev Bras Cir Cabeça Pescoço.1999;23:7-13.
Google Scholar 13.Hosmer
DWLemeshow
S Applied Logistic Regression. New York, NY: Wiley & Sons; 1989.
14.Hanley
JAMcNeil
BJ The meaning and use of the area under a receiver operating characteristic (ROC) curve.
Radiology.1982;143:29-36.
PubMedGoogle Scholar 15.Metz
CEWang
PLKronman
HB A new approach for testing the significance of differences between ROC curves measured from correlated data.
In: Deconik
F, ed.
Information Processing in Medical Imaging. Vol 8. Boston, Mass: Martinus Nijhoff; 1984:286-295.
Google Scholar 16.Arvidsson
SOuchterlony
JNilsson
SSjöstedt
LSvärdsudd
K The Gothenburg study of perioperative risk, I: preoperative findings, postoperative complications.
Acta Anaesthesiol Scand.1994;38:679-690.
PubMedGoogle Scholar 17.Sagar
PMHartley
MNMancey-Jones
BSedman
PCMay
JMacfie
J Comparative audit of colorectal resection with the POSSUM scoring system.
Br J Surg.1994;81:1492-1494.
PubMedGoogle Scholar 18.Shaheen
OH Problems in Head and Neck Surgery. London, England: Balliere Tindall; 1984.
19.Evans
PHR Complications in head and neck surgery and how to avoid trouble.
J Laryngol Otol.1989;103:926-929.
PubMedGoogle Scholar 20.Clavien
PASanabria
JRMentha
G
et al Recent results of elective open cholecystectomy in a North American and a European center: comparison of complications and risk factors.
Ann Surg.1992;216:618-626.
PubMedGoogle Scholar 21.Owens
WDFelts
JASpitznagel
ELJ ASA physical status classifications: a study of consistency of ratings.
Anesthesiology.1978;49:239-243.
PubMedGoogle Scholar 22.De Melo
GMRibeiro
KCBKowalski
LPDeheinzelin
D Risk factors for postoperative complications in oral cancer and their prognostic implications.
Arch Otolaryngol Head Neck Surg.2001;127:828-833.
PubMedGoogle Scholar 23.Haynes
SRLawler
PG An assessment of the consistency of ASA physical status classification allocation.
Anaesthesia.1995;50:195-199.
PubMedGoogle Scholar 24.Gagner
MFranco
DVons
CSmadja
CRossi
RLBraasch
JW Analysis of morbidity and mortality rates in right hepatectomy with the preoperative APACHE II score.
Surgery.1991;110:487-492.
PubMedGoogle Scholar 25.Schein
MGecelter
G APACHE II score in massive upper gastrointestinal haemorrhage from peptic ulcer: prognostic value and potential applications.
Br J Surg.1989;76:733-736.
PubMedGoogle Scholar 26.Whiteley
MSPrytherch
DRHiggins
BWeaver
PCProut
WG An evaluation of the POSSUM surgical scoring system.
Br J Surg.1996;83:812-815.
PubMedGoogle Scholar 27.Shuhaiber
JH Quality measurement of outcome in general surgery revised: commentary and proposal.
Arch Surg.2002;137:52-54.
PubMedGoogle Scholar 28.Civetta
JMHudson-Civetta
JANelson
LD Evaluation of APACHE II for cost containment and quality assurance.
Ann Surg.1990;212:266-276.
PubMedGoogle Scholar 29.Pugliano
FAPiccirillo
JFZequeira
MR
et al Clinical-severity staging system for oropharyngeal cancer: five-year survival rates.
Arch Otolaryngol Head Neck Surg.1997;123:1118-1124.
PubMedGoogle Scholar 30.Pugliano
FAPiccirillo
JFZequeira
MRFredrickson
JMPerez
CASimpson
JR Clinical-severity staging system for oral cavity cancer: five-year survival rates.
Otolaryngol Head Neck Surg.1999;120:38-45.
PubMedGoogle Scholar 31.Ribeiro
KCKowalski
LPLatorre
MR Impact of comorbidity, symptoms, and patients' characteristics on the prognosis of oral carcinomas.
Arch Otolaryngol Head Neck Surg.2000;126:1079-1085.
PubMedGoogle Scholar 32.Singh
BBhaya
MZimbler
M
et al Impact of comorbidity on outcome of young patients with head and neck squamous cell carcinoma.
Head Neck.1998;20:1-7.
PubMedGoogle Scholar