Data were obtained from the American College of Surgeons’ National Surgical Quality Improvement Program (ACS NSQIP) (2006-2011). CPT indicates Current Procedural Terminology.
To convert creatinine to micromoles per liter, multiply by 88.4. Solid vertical line indicates 0, dashed lines, 0.1 and −0.1. ASA indicates American Association of Anesthesiologists; AST, aspartate aminotransferase; BP, blood pressure; CNS, central nervous system; CVA, cerebrovascular accident; DNR, do not resuscitate; INR, international normalized ratio; and PT, prothrombin time.
eTable 1. Bivariable Association of Patient and Operative Characteristics With Pregnancy in the Unmatched Cohort (N=519,469)
eTable 2. Propensity Model Predicting the Probability of a Female Patient Being Pregnant Based on Patient and Operative Characteristics
Customize your JAMA Network experience by selecting one or more topics from the list below.
Moore HB, Juarez-Colunga E, Bronsert M, et al. Effect of Pregnancy on Adverse Outcomes After General Surgery. JAMA Surg. 2015;150(7):637–643. doi:10.1001/jamasurg.2015.91
The literature regarding the occurrence of adverse outcomes following nonobstetric surgery in pregnant compared with nonpregnant women has conflicting findings. Those differing conclusions may be the result of inadequate adjustment for differences between pregnant and nonpregnant women. It remains unclear whether pregnancy is a risk factor for postoperative morbidity and mortality of the woman after general surgery.
To compare the risk of postoperative complications in pregnant vs nonpregnant women undergoing similar general surgical procedures.
Design, Setting, and Participants
In this retrospective cohort study, data were obtained from the American College of Surgeons’ National Surgical Quality Improvement Program participant user file from January 1, 2006, to December 31, 2011. Propensity-matched females based on 63 preoperative characteristics were matched 1:1 with nonpregnant women undergoing the same operations by general surgeons. Operations performed between January 1, 2006, and December 31, 2011, were analyzed for postoperative adverse events occurring within 30 days of surgery.
Main Outcomes and Measures
Rates of 30-day postoperative mortality, overall morbidity, and 21 individual postoperative complications were compared.
The unmatched cohorts included 2764 pregnant women (50.5% underwent emergency surgery) and 516 705 nonpregnant women (13.2% underwent emergency surgery) undergoing general surgery. After propensity matching, there were no meaningful differences in all 63 preoperative characteristics between 2539 pregnant and 2539 nonpregnant patients (all standardized differences, <0.1). The 30-day mortality rates were similar (0.4% in pregnant women vs 0.3% in nonpregnant women; P = .82), and the rate of overall morbidity was also not significantly different between pregnant vs nonpregnant patients (6.6% vs 7.4%; P = .30).
Conclusions and Relevance
There was no significant difference in overall morbidity or 30-day mortality rates in pregnant and nonpregnant propensity-matched women undergoing similar general surgical operations. General surgery appears to be as safe for pregnant women as it is for nonpregnant women.
Historical data1 suggest that 1 in 500 pregnant patients require nonobstetric surgery. Pregnancy is associated with physiologic changes in body habitus and the coagulation,2 cardiovascular,3 pulmonary,4 and immune5 systems. These changes pose a diagnostic and treatment challenge to surgeons because physical examination findings and laboratory test values are different from those routinely encountered.6,7 Therefore, it might be expected that postoperative complications in pregnant patients are increased compared with those in nonpregnant patients. Several retrospective studies8,9 have supported this hypothesis. However, equivalent complication rates between pregnant and nonpregnant patients have also been reported.10,11 Review of these disparate studies suggests that the heterogeneity of outcomes is likely the result of differences in the types of operations studied and the inability to account for differences in patient characteristics of pregnant and nonpregnant women in the statistical analyses.
Although several investigators9-11 report that they have adjusted for differences in patient baseline characteristics, none of these studies has attempted to do this using propensity matching. Because pregnancy usually occurs in younger women as well as considering the broad alterations across the body’s physiology as a result of pregnancy, we anticipated differences in preoperative characteristics between nonpregnant and pregnant patients. Therefore, we used propensity matching, a technique recommended12 for comparison of groups of interest with low event rates.
Using the American College of Surgeons’ National Surgical Quality Improvement Program (ACS NSQIP) participant use file (PUF), we evaluated adverse operative outcome rates in general surgery contrasting pregnant with propensity-matched nonpregnant women. We hypothesized that pregnant women are at greater risk of complications than are comparable nonpregnant women undergoing similar general surgical operations.
This retrospective cohort study compared 30-day postoperative surgical outcomes of pregnant vs nonpregnant women undergoing nonobstetric operations by general surgeons. Patients were identified from the ACS NSQIP PUF, a database of surgical procedures performed in hospitals participating in the ACS NSQIP from January 1, 2006, to December 31, 2011. Patients were excluded if they were male, underwent obstetric surgery, or were missing one or more of the preoperative patient characteristics used in the study. Nonpregnant women were also excluded if they underwent operations that were not performed in the group of pregnant patients. The Colorado Multiple Institutional Review Board classified this study as not involving human subject research.
The primary outcome variables for the analyses in this study were death from any cause and overall morbidity (≥1 of the 21 ACS NSQIP perioperative complications) occurring within 30 days of the index operation. These complications included acute renal failure requiring dialysis or hemofiltration; progressive renal insufficiency; bleeding requiring transfusion of more than 4 U of packed red blood cells; cardiac arrest requiring cardiopulmonary resuscitation; Q-wave myocardial infarction; deep venous thrombosis or thrombophlebitis requiring treatment; pulmonary embolism; pneumonia; prolonged (>48 hours) intubation; unplanned intubation; septic shock; cerebrovascular accident (including trauma such as a fall, resulting in an injury to the head) or stroke with subsequent neurologic deficit; sepsis; superficial surgical site infection; deep incisional surgical site infection; organ or organ space surgical site infection; urinary tract infection; wound disruption; peripheral nerve injury; graft, prosthesis, or flap failure; and coma lasting longer than 24 hours.13
Differences between pregnant and nonpregnant patients were compared with χ2 tests for categorical variables and 2-tailed independent t tests for continuous variables in the unmatched cohorts. Differences between the groups were also evaluated using the standardized differences14 to enable comparison of covariate imbalance between the matched and unmatched cohorts. The absolute value of the standard difference of less than 0.1 indicates that the groups are well balanced for that characteristic; differences greater than 0.1 or less than −0.1 indicate some imbalance. The McNemar test, either in its large sample size approximation or exact form, was used to compare 30-day mortality and morbidity in the propensity-matched cohorts.
Propensity-score matching methods were used to reduce confounding related to nonrandom assignment of pregnancy.15 A propensity score is the predicted probability, based on logistic regression, that a given woman will be pregnant. This approach was used because of its performance and simplicity. Pregnant patients were propensity matched 1:1 to controls with a greedy algorithm.16 The propensity score logit model included 63 patient preoperative characteristics.
Despite the large sample of nonpregnant women, we conducted one-to-one matching to avoid the possible bias of many-to-one matching.17 Each pregnant patient was matched to a single nonpregnant control patient if her predicted propensity scores were identical to 8 decimal places. If such a match was not found, the pregnant patient was matched to a nonpregnant patient on the basis of a 7–, 6–, 5–, 4–, 3–, 2–, or 1–decimal place match, tested sequentially. Missing values were treated as a separate category for the categorical variables of race/ethnicity, body mass index, and the 12 preoperative laboratory test values. Laboratory test values were coded as missing, abnormal low, normal, and abnormal high according to values presented in a widely used medical textbook.18
Differences in complications and mortality rates were also analyzed in subgroups of emergency and nonemergency operations. To retain high power, we included an interaction term for pregnancy and emergency in a conditional logistic regression model. Evidence of an interaction would indicate that the association between pregnancy and complication rates was different in emergency vs nonemergency cases. The a priori level of statistical significance was set at α = .05 for all analyses, which were 2-tailed. Statistical analyses were performed with SAS, version 9.4 (SAS Institute Inc).
There were 651 594 adult women undergoing operations by general surgeons in the ACS NSQIP PUF from 2006 to 2011. Exclusion criteria and sample size of patients in this analysis are demonstrated in the Strengthening the Reporting of Observational Studies in Epidemiology diagram (Figure 1). Of the 651 594 patients, 49 977 (7.7%) were excluded because they were missing 1 or more preoperative patient characteristic, and 2011 patients (0.3%) were excluded because they underwent an obstetric operation. An additional 80 137 nonpregnant patients (12.3%) were excluded because they underwent an operation that was not performed in the group of pregnant patients. A total of 519 469 patients remained: 2764 (0.5%) were pregnant and 516 705 (99.5%) were nonpregnant.
Characteristics of unmatched pregnant patients and nonpregnant patients are presented in eTable 1 in the Supplement. The unmatched pregnant patients were significantly younger than the nonpregnant patients (29.8 vs 53.0 years; P < .001). Compared with the nonpregnant patients, the pregnant women were more likely to undergo surgery as an inpatient (2074 [75.0%] vs 308 375 [59.7%]; P < .001) and undergo an emergency operation (1396 [50.5%] vs 68 156 [13.2%]; P < .001). Pregnant patients generally had lower rates of preoperative comorbidities but higher rates of abnormal laboratory test results (high white blood cell count and low blood urea nitrogen, hematocrit, serum creatinine, serum sodium, and serum albumin levels) compared with nonpregnant patients.
The standardized differences for the proportions or means between the pregnant and nonpregnant unmatched cohorts are reported in eTable 1 in the Supplement. The standardized differences were greater than 0.1 or less than −0.1 for 31 of the 63 preoperative patient characteristics (49.2%), indicating the expected, important imbalances between pregnant and nonpregnant patients in the unmatched cohorts.
In the unmatched cohort, 10 of 2764 pregnant patients (0.4%) died within 30 days of surgery compared with 5759 of 516 705 nonpregnant patients (1.1%) (P < .001) (Table). The overall morbidity rate was also lower for pregnant patients (183/2764 [6.6%] vs 48 394/516 705 [9.4%]; P < .001) than nonpregnant patients. Pregnant patients had significantly lower rates of superficial surgical site infection, urinary tract infection, bleeding requiring transfusion of more than 4 U of packed red blood cells, myocardial infarction, and unplanned intubation compared with nonpregnant patients in the unmatched cohort (P value range, .005-.049).
The propensity model is provided in eTable 2 in the Supplement. Thirty-seven of the preoperative patient characteristics were significant predictors of pregnancy, with the C statistic for the full model of 0.939. A total of 2539 of the 2764 pregnant patients (91.9%) were matched to 2539 of 516 705 (0.5%) nonpregnant patients. The standardized differences in baseline characteristics between the groups before and after matching on the propensity score are shown in Figure 2. In the propensity-matched cohort, none of the 63 patient characteristics had standardized differences greater than 0.1 or less than −0.1, indicating that the propensity-matched samples were well balanced.
As reported in the Table for the propensity-matched cohort, there was no significant difference in the 30-day mortality rates between pregnant and nonpregnant patients (0.4% vs 0.3%; P = .82) or in the overall morbidity rate in the pregnant patients vs nonpregnant women (6.6% vs 7.4%; P = .30). No significant differences were found when we compared the rates of the 21 individual complications in the pregnant vs nonpregnant patients after propensity matching. There was no evidence of a different association between pregnancy and overall morbidity or mortality rates in the emergency and nonemergency subgroups (interaction P values: overall morbidity, P = .11; mortality, P = .74).
We performed an analysis of pregnant women matched to nonpregnant women undergoing general surgical operations using the ACS NSQIP PUF to determine whether pregnancy was associated with an increased rate of postoperative adverse outcomes. We observed that pregnant patients had different preoperative risk factors than nonpregnant women: the pregnant women were younger, had fewer comorbidities, and more frequently had abnormal laboratory test values. Pregnant women were also more likely to undergo emergency procedures. Unbalanced preoperative risk factors between the groups were balanced after propensity matching, thereby minimizing bias in comparison of outcomes between the 2 patient populations. Analysis of matched cohorts showed no significant differences in 30-day mortality or the occurrence of 1 or more complications between the groups. Nonobstetric general surgery appears to be as safe in pregnant women as in nonpregnant women.
Prior studies reporting increased complication rates in pregnant vs nonpregnant women undergoing nonobstetric surgery come from analysis of the Health Care Utilization Project Nationwide Inpatient Sample. Kuy et al9 reported increased rates of complications, length of stay, and cost for pregnant women undergoing thyroid and parathyroid surgery despite risk adjustment with logistic regression analysis for the dichotomous outcome (complications) and linear regression for the continuous variables (length of stay and cost). Using a similar time period, the same group11 evaluated pregnant women undergoing cholecystectomy. Prior to regression analysis, the pregnant patients had an increased complication rate. However, after age and procedure matching, as well as adjustment for insurance, race, and surgeon case volume, pregnancy was not associated with an increased risk of surgical complications. A recent publication8 using the Health Care Utilization Project Nationwide Inpatient Sample reported that postoperative complication rates following appendectomy were higher in pregnant vs nonpregnant women. In that study, Abbasi et al8 matched more than 7000 pregnant women to 35 000 nonpregnant women based on age and then performed multivariable logistic regression on categories of race, obesity, income, and insurance type. Although postoperative complication rates were higher in the pregnant group, the most notable finding was that peritonitis on presentation was the highest predictor of postoperative complication rates. This study identified that pregnant women more frequently present with peritonitis than do nonpregnant women. The authors concluded that this factor was the causality for this discrepancy between the groups.
In contrast, some studies report low postoperative complication rates in pregnant patients. Erekson et al19 analyzed the ACS NSQIP PUF. Their descriptive findings parallel our results of low maternal postoperative complication rates, but they did not contrast pregnant and nonpregnant women. Silvestri et al10 found a similar rate of morbidity between pregnant and nonpregnant women undergoing cholecystectomy and appendectomy. McMaster et al20 also found that pregnant patients had postoperative complication rates similar to those of nonpregnant women after breast surgery.
Because a prospective randomized clinical trial to identify whether pregnancy is a risk factor for postoperative complications is not feasible, only observational studies are available. The latter are dependent upon statistical adjustment to account for significant baseline differences between pregnant and nonpregnant patients. The ACS NQIP PUF during our study time frame contained data on more than 500 000 nonpregnant women undergoing operations similar to those of pregnant patients. Propensity matching is well suited for this type of observational study in which a “large reservoir” of potential controls is contrasted to a moderately sized group.21 Propensity matching controls for measured baseline covariates before analysis of the outcomes.22 This technique does not require the complexity of forming multiple strata to balance covariates and is superior in reducing bias.12 Propensity matching is used frequently in medical studies because of its simplicity and robust performance, but it is not always reported appropriately.15 Key elements of propensity modeling that are often neglected include reporting the model construct, assessment of prematching and postmatching differences between groups, and appropriate outcome analysis.
In our study, the maximum C statistic for the propensity model was 0.939 (eTable 2 in the Supplement) compared with the value of 1.0 for a perfect model. This finding supports the conclusion that we have developed a reasonable model to predict pregnancy based on preoperative characteristics. The model eliminated measured, unbalanced preoperative variables quantified by standardized differences (Figure 2), and outcomes were appropriately assessed with a paired analysis contrasting pregnant to nonpregnant women. Other approaches, such as double robust inverse probability weighting, requiring specification of an outcomes regression model would not have been a feasible approach with these data given the small number of outcome events.12
This study has strengths and limitations. Strengths include (1) a large sample from a broad range of hospitals, (2) a broad range of operations included in the database using a systematic sampling method, and (3) a standardized protocol for collection of the ACS NSQIP PUF, with central auditing of the data. The primary limitations of this study include (1) the observational design so that only association (ie, not causation) may be concluded and (2) a lack of data on fetal outcomes. There is clearly a risk to the fetus when a pregnant woman undergoes surgery. Fetal loss after appendectomy was found to be 4% in women with a normal appendix.23 The increasing number of reports indicates that infectious surgical indications, such as appendicitis and cholecystitis, are associated with an unfavorable outcome for the fetus24,25 and that advanced disease is a risk factor for fetal and maternal complications.23,26 Attempting medical management of surgical diseases (eg, appendicitis and cholecystitis) is associated with a worse outcome compared with early operative management.8,11 Therefore, the well-being of the fetus represents an additional risk-benefit factor to consider in pregnant women, and an unclear diagnosis may require further expeditious evaluation to minimize delay of definitive management.
Pregnant patients undergoing emergency and nonemergency general surgery do not appear to have elevated rates of mortality or morbidity. We did not account for fetal complications in this study and would not advocate that our findings be generalized to elective surgical situations that can be postponed until after delivery. Therefore, general surgery appears to be as safe in pregnant as it is in nonpregnant women. These findings support previous reports that pregnant patients who present with acute surgical diseases should undergo the procedure if delay in definitive care will lead to progression of disease.
Accepted for Publication: December 1, 2014.
Corresponding Author: Robert A. Meguid, MD, MPH, Surgical Outcomes and Applied Research Program, Department of Surgery, School of Medicine, University of Colorado, Mail Stop C310, 12631 E 17th Ave, Aurora, CO 80045 (firstname.lastname@example.org).
Published Online: May 13, 2015. doi:10.1001/jamasurg.2015.91.
Author Contributions: Drs H. B. Moore and Juarez-Colunga had full access to all the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis.
Study concept and design: H. B. Moore, Hammermeister, Henderson, Meguid.
Acquisition, analysis, or interpretation of data: All authors.
Drafting of the manuscript: H. B. Moore, Juarez-Colunga, Hammermeister, E. E. Moore, Meguid.
Critical revision of the manuscript for important intellectual content: All authors.
Statistical analysis: Juarez-Colunga, Bronsert, Henderson, Meguid.
Obtained funding: Meguid.
Administrative, technical, or material support: E. E. Moore, Meguid.
Study supervision: Hammermeister, Henderson, E. E. Moore, Meguid.
Conflict of Interest Disclosures: None reported.
Funding/Support: This project was supported by funding from the Department of Surgery, Adult and Child Center for Health Outcomes Research and Delivery Science Joint Surgical Outcomes and Applied Research Program at the University of Colorado.
Role of the Funder/Sponsor: The funding sources had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.