aRefers to the 91.25-day period relative to the index procedure. Note that data from quarters −2, −1, and 0 were omitted from analysis. Number (%) of patients enrolled during the presurgery and postsurgery periods are as follows: In the adjustable gastric banding group, there were 4935 (100%) in quarter −4, 4935 in quarter 0, 4935 (100%) in quarter 4, 3272 (66%) in quarter 8, and 1993 (40%) in quarter 12. In the Roux-en-Y gastric bypass group, there were 4935 (100%) in quarter −4, 4935 in quarter 0, 4935 (100%) in quarter 4, 3018 (61%) in quarter 8, and 1770 (36%) in quarter 12. Values are plotted using a smoothing function and no actual values go below zero.
Quarters −2, −1, and 0 were omitted from the analysis and are not shown here because their extreme high values would have made visually interpreting preoperative to postoperative trend and level changes difficult.
aRefers to the 91.25-day period relative to the index procedure.
eTable 1. Size of Surgical Groups During Postoperative Observation Period
eTable 2. Sensitivity Analysis: Within and Between Procedure Changes for Utilization Outcomes – Pre- to Post-surgery for 7170 Propensity-Score Matched Individuals With at Least 2 Years of Preoperative Enrollment Data
eFigure. Flow Diagram of Cohort Selection and Inclusion Criteria
Lewis KH, Zhang F, Arterburn DE, Ross-Degnan D, Gillman MW, Wharam JF. Comparing Medical Costs and Use After Laparoscopic Adjustable Gastric Banding and Roux-en-Y Gastric Bypass. JAMA Surg. 2015;150(8):787-794. doi:10.1001/jamasurg.2015.1081
There is conflicting evidence about how different bariatric procedures impact health care use.
To compare the impact of laparoscopic adjustable gastric banding (AGB) and laparoscopic Roux-en-Y gastric bypass (RYGB) on health care use and costs.
Design, Setting, and Participants
Retrospective interrupted time series with comparison series study using a national claims data set. The data analysis was initiated in September 2011 and completed in January 2015. We identified bariatric surgery patients aged 18 to 64 years who underwent a first AGB or RYGB between 2005 and 2011. We propensity score matched 4935 AGB to 4935 RYGB patients according to baseline age group, sex, race/ethnicity, socioeconomic variables, comorbidities, year of procedure and baseline costs, emergency department (ED) visits, and hospital days. Median postoperative follow-up time was 2.5 years.
Main Outcomes and Measures
Quarterly and yearly total health care costs, ED visits, hospital days, and prescription drug costs. We used segmented regression to compare pre-to-post changes in level and trend of these measures in the AGB vs the RYGB groups and difference-in-differences analysis to estimate the magnitude of difference by year.
Both AGB and RYGB were associated with downward trends in costs; however, by year 3, AGB patients had total annual costs that were 16% higher than RYGB patients (P < .001; absolute change: $818; 95% CI, $278 to $1357). In postoperative years 1 and 2, AGB was associated with 27% to 29% fewer ED visits than RYGB (P < .001; absolute changes: −0.6; 95% CI, −0.9 to −0.4 and −0.4; 95% CI, −0.6 to −0.1 visits/person, respectively); however, by year 3, there were no detectable differences. Postoperative annual hospital days were not significantly different between the groups. Although both procedures lowered prescription costs, annual postoperative prescription costs were 17% to 32% higher for AGB patients than RYGB patients (P < .001).
Conclusions and Relevance
Both laparoscopic AGB and RYGB were associated with flattened total health care cost trajectories but RYGB patients experienced lower total and prescription costs by 3 years postsurgery. On the other hand, RYGB was associated with increased ED visits in the 2 years after surgery. Clinicians and policymakers should weigh such differences in use and costs when making recommendations or shaping regulatory guidance about these procedures.
The prevalence of severe obesity (body mass index [BMI, calculated as weight in kilograms divided by height in meters squared] ≥ 40) is rising faster than that of obesity (BMI ≥ 30) in the United States.1 Patients with severe obesity have greater health care use rates2 and higher levels of morbidity/mortality. Thus, it is increasingly important to evaluate the comparative effectiveness of treatments for this condition.
Bariatric surgery results in dramatic weight loss, as well as remission of many comorbidities,3 but different procedure types vary substantially in their effects and mechanisms of action.4- 11 Additionally, the procedures are costly and can result in a number of short- and long-term complications.5
Laparoscopic Roux-en-Y gastric bypass (RYGB) and laparoscopic adjustable gastric banding (AGB) represent 2 of the most common bariatric procedures in the United States.5,11 Roux-en-Y gastric bypass is more effective at producing weight loss and diabetes mellitus remission than AGB; however, it is also a more complex procedure.4,5 Despite the relative simplicity of AGB, its use has declined owing to high reoperation rates related to device failures, gastric erosions, and the inability to achieve or maintain weight loss.12- 14
To our knowledge, few studies have addressed how these procedures affect total health care use or the extent to which they differentially influence postoperative use and expenditures. Most prior studies have not distinguished between surgical types15- 20 or have included a substantial proportion of outdated procedures.21 For patients, health care professionals, and payers deciding between current surgical modalities, more comparative effectiveness research is needed.
Our objective was to assess the impact of laparoscopic AGB and RYGB on emergency department (ED) visits, hospital days, prescription drug costs, and total health care costs, comparing the procedures using propensity-matched groups.
Our data source was 2000-2011 claims from a US-wide commercial insurer including enrollment and demographic information and inpatient, outpatient, and pharmacy claims for all members. For claims from 2004 onwards, the data vendor (OptumInsight) also calculated standardized costs using an algorithm to closely approximate the health plan’s amount allowed (ie, the total cost of the claim), while eliminating variability in pricing across geography and time. This study was approved by the Harvard Pilgrim Health Care institutional review board, and a waiver of informed patient consent was obtained.
We identified members aged 18 to 64 years who underwent AGB or RYGB between January 2005 and December 2011 using Current Procedural Terminology (CPT), and International Classification of Diseases, Ninth Revision (ICD-9) codes. We classified members with CPT code 43770 or ICD-9 procedure code 44.95 as having undergone AGB, and those with CPT codes 43644 or 43645 or with ICD-9 code 44.38 and a code for morbid obesity (278.01) as having undergone laparoscopic RYGB (eFigure in the Supplement). We used additional codes to identify other bariatric procedures, and if members had coding for more than 1 procedure type during their enrollment, we defined the earliest as their index procedure. We excluded members with CPT codes for revisional procedures (43848 and 43771-43774) occurring on the index date or within a year prior. We limited our analyses to members who were continuously enrolled for at least 1 year before and after their procedures. We excluded members with evidence of gastrointestinal malignancy within 6 months prior to surgery or perforated gastrointestinal ulcer on the date of surgery to avoid including those with surgical indications other than obesity.
The data analysis was initiated in September 2011 and completed in January 2015.
Demographic covariates included age group (18-34, 35-49, and 50-64 years), sex, race/ethnicity (non-Hispanic white, non-Hispanic black, Hispanic, and other), and region of the United States (Pacific, Mountain, West-North Central, West-South Central, East-South Central, East-North Central, South Atlantic, Mid-Atlantic, and New England). We classified race/ethnicity according to a variable constructed using place of residence (geocoding) and surname analysis.22 Geocoding is sensitive in detecting black or white race and surname analysis provides accurate information on Asian and Hispanic ethnic group.23 To generate proxy measures of socioeconomic status, we created previously established24 categorical variables of census block group poverty and education levels derived from 2000 US Census reports.25 Variables were as follows: census block group education (level 1: <15% of people with < high school education, level 2: 15%-24.9%, level 3: 25%-39.9%, and level 4: ≥40%) and 2000 census block group poverty level (level 1: <5% of persons below poverty, level 2: 5%-9.9%, level 3: 10%-19.9%, and level 4: ≥20%). We determined baseline disease burden using claims from 12 to 3 months prior to the index procedure (9-month period), including Charlson Comorbidity Index score26 (0, 1, 2, 3, 4, and ≥5) and diagnoses of type 2 diabetes mellitus, kidney disease, liver disease, hypertension, and dyslipidemia. Finally, we included year of procedure (before or after 2008) as a covariate to minimize bias due to secular trends.
We used 1:1 caliper propensity score matching to create AGB and RYGB groups that were balanced with respect to baseline characteristics including age group, sex, neighborhood education and poverty levels, race/ethnicity, region of the United States, Charlson Comorbidity Index score, several comorbid conditions, and year of procedure. We also matched on standardized costs, ED visits, and hospital days for quarters −4, −3, and −2 (excluding the 3 months immediately before surgery which included a spike in use owing to preoperative workup). Propensity score matching is a well-established method that assists in generating a comparison group with similar measured characteristics when patients have not been randomly allocated into study groups.27- 30 We used a caliper equal to 0.6 times the standard deviation of the propensity score, and diagnostics of the propensity score distributions revealed excellent overlap (Table 1). Our prematch pool included 5608 AGB and 6459 RYGB patients with complete covariate data, and the final sample comprised 4935 patients in each group.
We examined 4 measures of health care use at quarterly (91.25 days) and yearly intervals relative to a patient’s index procedure: ED visits, hospital days, standardized prescription costs, and total health care costs. We assessed person-level ED visits per quarter and year, as well as hospital days per quarter and year as count variables, with a maximum of 15 preoperative and 22 postoperative quarters, based on the distribution of follow-up time in our cohort.
We measured standardized total health care costs as a proxy for overall use. We summed prescription drug and total standardized costs at the individual level per quarter and per year. Because costs were non-normally distributed and contained some extreme outliers, we capped them at the 99th percentile. Additionally, because cost data were not available before 2004, cost outcomes relied on baseline period data from 2004 onwards, regardless of whether an individual had been enrolled earlier.
To visualize changes in the levels and trends of outcomes, we generated time series plots of the unadjusted mean values in each quarter. We used patient-level segmented regression analysis to test for postsurgical level or trend changes using 1-part generalized estimating equations speciﬁed with the log link and Poisson variance function. We applied a ﬁrst-order autoregressive working model to adjust for autocorrelation between adjacent individual quarterly measurements and estimated the variance using the empirical sandwich estimator. This approach is statistically valid even when a substantial fraction of members do not have outcomes.31 The analytic model produced population-level use or cost-rate ratios using the quarterly person-level outcomes. The primary independent variables in our analyses were time (in quarters from the start of the baseline period through postsurgical follow-up), intervention (whether a given quarter was before or after the index [surgical] date), and time after intervention (time in quarters after the index date). Thus, the exponentiated coefﬁcient for time indicated the baseline trend of use of the referent group (the ratio of 2 adjacent quarterly use rates); the exponentiated coefﬁcient for intervention indicated a level change (a rate ratio) in use immediately after the index date compared with the prior quarter; and the exponentiated coefﬁcient for time after intervention indicated a trend change (a rate ratio) in use after the index date compared with the baseline trend. Models were run with each group (AGB and RYGB) serving as referent to obtain estimates for within-group changes. To calculate the differential changes in the level and trend of use between AGB and RYBG, we examined interactions between the study groups and primary independent variables listed here to estimate the ratio of rate ratios. Because differential dropout between groups over time could introduce imbalance with respect to confounders, we built multivariable models to adjust for changing population characteristics over baseline and follow-up, including all propensity-matching covariates except for outcome variables and less-common comorbidities.
Although interrupted time series designs are among the most robust for establishing causal inference, interpreting the ratio of ratios from segmented regression models can be challenging. Therefore, we performed multivariable difference-in-differences analysis on all outcomes, comparing AGB with RYGB in postoperative years 1 to 3, with a baseline consisting of the analyzed portion of the preoperative year (months −12 to −6). Estimates from these models are presented as the annual pre-to-post change for the procedure of interest (AGB) compared with the control procedure (RYGB).
Outcome analyses excluded 6 months before and 3 months after surgery, a period of higher use of medical care (eg, owing to preoperative testing) not representative of the general preoperative or postoperative course.17,32 Given that many cohort members had little more than a year of baseline data available (thus, little more than 6 months of baseline), we examined whether a longer baseline would alter our findings. We conducted a sensitivity analysis, repeating segmented regression models on a subset of patients with at least 2 years of preoperative baseline, still excluding months −6 through 3 months postoperative.
Our propensity-matched cohort included 4935 AGB and 4935 RYGB patients. The groups were balanced with respect to all measured baseline characteristics (Table 1). The mean (SD) age of patients was 43.8 (10.2) years at the time of the index procedure. Seventy-nine percent were female, 70.4% were non-Hispanic white, and just over half (53.5%) resided in census tracts where 85% or more of residents had at least a high school education. Mean postoperative follow-up time was nearly 3 years (2.9 for AGB and 2.8 for RYGB; P < .001). Attrition owing to disenrollment occurred over the postoperative follow-up period, and 3763 patients (38%) (1993 AGB and 1770 RYGB) remained enrolled by the end of the third postoperative year (eTable 1 in the Supplement).
In time series analyses, AGB patients experienced a significantly lower level of ED visits relative to RYGB patients immediately after surgery; however, there was no significant difference in trend change between the 2 procedures (Figure 1; Table 2). In the corresponding multivariable difference-in-differences models, AGB patients had fewer annual ED visits during early follow-up than RYGB patients (Table 3). In postoperative years 1 and 2, AGB patients had 29% (95% CI, −39% to −20%) and 27% (95% CI, −41% to −13%) fewer ED visits than RYGB patients, respectively (absolute changes: −0.6; 95% CI, −0.9 to −0.4 and −0.4; 95% CI, −0.6 to −0.1 visits per member per year, respectively). There was no detectable difference between procedures by year 3 (absolute change: −0.1; 95% CI, −0.4 to 0.1 visits per member per year).
Time series analyses of quarterly hospital days revealed that AGB patients had a significantly smaller immediate increase in postoperative level of hospital days than RYGB patients but neither group experienced significant changes in trend (Figure 1; Table 2). Multivariable difference-in-differences models, which are less sensitive to immediate postoperative changes, showed no significant differences in annual hospitalization days at years 1, 2, or 3 in the AGB relative to the RYGB group (Table 3).
Adjustable gastric banding patients experienced a smaller immediate postoperative drop in prescription costs but a greater downward trend compared with RYGB patients (Figure 2; Table 2). Similarly, in difference-in-differences models, AGB patients’ prescription costs were 17% to 32% (P < .001) higher than those of RYGB patients during all 3 postoperative years (absolute differences in postoperative years 1, 2, and 3: $324, $245, and $159, respectively; P < .001; Table 3).
Mean total costs in the quarter immediately following the surgical date were $19 731 for AGB and $28 664 for RYGB patients. Total costs initially trended down slightly more after AGB relative to RYGB, but there was no significant difference in level change (Table 2; Figure 2). In difference-in-differences analyses, the cohorts had no detectable differences in total costs for the first 2 postoperative years, but by year 3, AGB patients had 16% (95% CI, 4% to 24%) higher total costs than RYGB patients (absolute increase: $818 per year; P < .01; Table 3).
In our subset of 3585 AGB and 3585 propensity score–matched RYGB patients with at least 2 years of baseline data, all observed patterns for level and trend change from preoperative to postoperative were nearly identical to the interrupted time series results from our main segmented regression analysis (eTable 2 in the Supplement).
We found that both laparoscopic AGB and RYGB patients experienced reduced drug and total cost trends in the first 3 years after surgery, despite initial relative increases in ED visits and hospital days among RYGB patients. By year 3, AGB patients experienced total annual medical costs that were 16% higher and prescription costs 17% higher on average compared with RYGB patients. The prescription drug findings are unsurprising given RYGB’s greater impact on weight loss and chronic disease resolution.5,11 However, RYGB’s increased risk for early complications, implied by higher initial ED visits and hospital days, may have nullified any potential early use or spending reductions from remission of comorbid disease.
To our knowledge, this is the first study to compare the impact of laparoscopic AGB and RYGB on costs and use. Most previous comparative effectiveness studies have included a large proportion of patients receiving open procedures and may be less useful for health care professionals and patients deciding between current surgical modalities. Our early postoperative use findings support prior clinical and claims-based studies. The increased ED visits we observed among RYGB patients are consistent with higher early complication rates vs AGB.5,11,32 Also, many prior studies have revealed short-term increases in hospitalizations after surgery, although few have differentiated by surgical type.15,17,19,21,32,33 Our findings of reduced prescription costs are not surprising, given prior studies demonstrating reductions in non-hospital–based health care use after bariatric surgery.21,33 Our findings do differ from those of a Veterans Affairs–based analysis that compared bariatric surgery patients to nonoperative control individuals and found no decreases in total costs during 3 follow-up years.17 This discrepancy may be related to the Veterans Affairs study’s focus on open rather than laparoscopic procedures34,35 and its greater inclusion of higher-morbidity male patients. Prior modeling work by Campbell et al36 suggested that bariatric procedures are most likely to be cost-effective in younger women than older men.
Our study adds several insights potentially useful to patients, clinicians, and policymakers. Our finding that both procedures were associated with flattened cost trends among commercially insured patients is novel. Presuming that the middle-aged patients comprising our sample continue on the trajectories demonstrated in the 3 years after their procedures, our analyses suggest both AGB and RYGB might be associated with long-term cost savings, although our follow-up period prevented us from definitively saying so. Therefore, payers might consider whether the higher upfront costs of RYGB are balanced by potential longer-term savings. To our knowledge, our study is also one of the few to demonstrate that ED and hospital use, while higher among RYGB patients for 2 years, is not detectably different by year 3. Therefore, clinicians and patients choosing between the procedures might weigh these early changes in ED and hospital use against expected weight loss or other factors.
Our study had several potential limitations. First, we did not attempt cost-effectiveness or return-on-investment analyses, as some prior work has done,35- 37 limiting our ability to make inferences about relative changes in shorter-term costs from the societal or payer perspective. However, our longitudinal approach did permit key inferences about longer-term use trends and costs. Second, our study could be subject to confounding by indication if patients and health care professionals chose a certain procedure type based on unmeasured factors that differed between the groups and that also predicted outcomes. Related to this issue, we had no weight or BMI data. If RYGB patients were heavier than AGB patients, their higher starting BMI could have been partially responsible for greater cost/use changes independent of the procedure itself.36 Nevertheless, our propensity score–matching algorithm generated study groups with nearly identical baseline trends and measured characteristics such as rates of diabetes mellitus, hypertension, and hyperlipidemia. Controlled interrupted time series designs are among the most robust for causal inference, and accumulating evidence suggests that matching on baseline trend generates effect estimates similar to randomized clinical trials.38
Requiring at least a year of postsurgical follow-up could have biased results if sicker patients, likely to have higher costs, disenrolled early. Attrition after the first follow-up year did not differ substantially between the groups and we controlled for it using patient-level segmented regression models. Importantly, our analysis captured outcomes of surgical patients independent of whether they later had reversal of their procedure (eg, band removal). Complications, such as AGB removal, are an important component of the costs of and clinical decision-making around procedure selection. To the extent that AGB removals increased health care use or costs, we expect that these differences are captured in our study. Our analysis was also unable to explore health care provider–level factors, such as designation as a surgical Center of Excellence, that might have influenced outcomes. However, the procedures we analyzed require prior authorization from the health insurer, likely directing both groups to similar surgical facilities. Therefore, the lack of detail about this information was unlikely to impact our results.
Finally, although AGB and RYGB were the most popular procedures during much of the period we examined, surgeons are increasingly using the newer vertical sleeve gastrectomy as an alternative to both of these procedures.39 Future studies should compare this procedure relative to RYGB and AGB as well.
We found that laparoscopic AGB and RYGB were associated with reduced health care costs among commercially insured patients, partially driven by reductions in prescription costs. Although RYGB patients experienced higher rates of hospitalizations and ED visits than AGB patients in the first 2 postoperative years, rates were similar by year 3. For patients, clinicians, and policymakers interested in reducing overall use and prescription use, both of these laparoscopic procedures offer promise. However, future studies should assess emerging procedures, such as vertical sleeve gastrectomy, to more fully inform patients and health care professionals.
Corresponding Author: Kristina H. Lewis, MD, MPH, SM, Kaiser Permanente Georgia, Center for Clinical and Outcomes Research, Bldg 11, 3495 Piedmont Rd NE, Atlanta, GA 30304 (firstname.lastname@example.org).
Accepted for Publication: March 13, 2015.
Published Online: June 3, 2015. doi:10.1001/jamasurg.2015.1081.
Author Contributions: Dr Lewis had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.
Study concept and design: Lewis, Arterburn, Ross-Degnan, Gillman, Wharam.
Acquisition, analysis, or interpretation of data: All authors.
Drafting of the manuscript: Lewis, Wharam.
Critical revision of the manuscript for important intellectual content: All authors.
Statistical analysis: Lewis, Zhang, Wharam.
Obtained funding: Lewis, Wharam.
Administrative, technical, or material support: Lewis, Gillman.
Study supervision: Lewis, Arterburn, Ross-Degnan, Gillman, Wharam.
Conflict of Interest Disclosures: None reported.
Funding/Support: This work was funded in part by a P30 grant from the National Institutes of Health (P30 HL101312; principal investigator, Dr Gillman).
Role of the Funder/Sponsor: The National Institutes of Health had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.