Butzlaff RL, Hooley JM. Expressed Emotion and Psychiatric RelapseA Meta-analysis. Arch Gen Psychiatry. 1998;55(6):547-552. doi:10.1001/archpsyc.55.6.547
Expressed emotion (EE) is a measure of the family environment that has been demonstrated to be a reliable psychosocial predictor of relapse in schizophrenia. However, in recent years some prominent nonreplications of the EE-relapse relationship have been published. To more fully address the question of the predictive validity of EE, we conducted a meta-analysis of all available EE and outcome studies in schizophrenia. We also examined the predictive validity of the EE construct for mood disorders and eating disorders.
An extensive literature search revealed 27 studies of the EE-outcome relationship in schizophrenia. Using meta-analytic procedures, we combined the findings of these investigations to provide an estimate of the effect size associated with the EE-relapse relationship. We also used meta-analysis to provide estimates of the effect sizes associated with EE for mood and eating disorders.
The results confirmed that EE is a significant and robust predictor of relapse in schizophrenia. Additional analyses demonstrated that the EE-relapse relationship was strongest for patients with more chronic schizophrenic illness. Interestingly, although the EE construct is most closely associated with research in schizophrenia, the mean effect sizes for EE for both mood disorders and eating disorders were significantly higher than the mean effect size for schizophrenia.
These findings highlight the importance of EE in the understanding and prevention of relapse in a broad range of psychopathological conditions.
EXPRESSED EMOTION (EE) is a measure of the family environment that is based on how the relatives of a psychiatric patient spontaneously talk about the patient. Assessed during the Camberwell Family Interview (CFI), relatives are classified as being high in EE if they make more than a specified threshold number of critical comments or show any signs of hostility or marked emotional overinvolvement.1
In the last 15 years, the EE construct has been extensively studied.2,3 More than 20 studies, conducted in many countries, have investigated the EE-relapse relationship in patients with schizophrenia. In addition, there is a growing literature concerning the role of EE in unipolar depression,4,5 bipolar disorder,6 and eating disorders.7 Expressed emotion has also been used in outcome studies of patients with dementia8 and diabetes mellitus.9 The results of these investigations make 2 things clear. First, rather than being a construct of interest solely with respect to schizophrenia, EE is a more general predictor of poor outcome across a range of conditions. Second, EE is a construct that is modifiable. Results from several trials of family-based treatment indicate that when family EE levels decrease, patients' relapse rates also fall.10 From a clinical perspective, these findings are clearly very encouraging.
Given this, it is surprising that EE remains a somewhat controversial construct. However, based on the results of a small number of nonreplications, some clinicians and researchers have been quick to conclude that EE is not a reliable predictor of relapse. This article represents an effort to examine this issue in a statistically rigorous manner. Although aggregate analyses of the EE literature do exist,11,12 we chose to meta-analyze the studies because of the dangers of aggregating or pooling raw data without blocking, especially when using 2×2 tables.13 Moreover, because meta-analysis provides a way to combine similar studies in a manner that allows contrast analyses to be applied to the data, it allows us to consider several factors that might increase or attenuate the strength of the EE-relapse link. Finally, because we investigate the effect size of EE as a predictor of outcome in mood disorders and eating disorders, this article provides the first estimates of the effect sizes of EE for nonschizophrenia-related conditions.
To be included, articles had to meet the following criteria: (1) patient diagnosis of schizophrenia or schizoaffective disorder or mood or eating disorder; (2) EE assessed using the CFI administered at the time of the index hospitalization (one exception14 was published before the CFI became the standard method for assessing EE); (3) EE used to predict relapse for 9 to 12 months; and (4) published data allowed an estimate of the effect size and significance level to be calculated. A total of 27 articles met these criteria. We excluded 22 experimental articles for the following reasons: (1) the CFI was not administered at intake; (2) the CFI was used to measure EE in nonfamily members (eg, nursing staff); (3) EE was used to predict something other than adult psychiatric relapse; (4) the sample subjects did not include both low- and high-EE families or the relapse data were not reported for both groups; and (5) the report described data that were also published elsewhere.
The studies' data were cast into 2×2 tables of counts—high vs low EE by relapsed vs not relapsed status. We chose to use φ as our measure of effect size because of the problems associated with other indices.15 In cases where authors did not report the number of subjects relapsing in the high- and low-EE groups,16 we used a reliable formula for calculating an effect size estimate.15 Three other studies17- 19 reported relapse rates of 0 for one of the EE groups. We used a correction suggested by Overall20 in calculating the effect size estimates for these studies.
All effect size estimates were transformed into Fisher z scores before any other calculations were done to account for the nonnormal distribution of r.13 We also calculated the associated standard normal deviate z score for each study. This summary statistic is analogous to a t or F test in studies comparing differences between groups. Combining these z scores meta-analytically served as a test statistic for the estimate of the overall significance of the combined average effect size estimate of the studies. Table 1 details the studies used in this meta-analysis, noting the corrections described above.5,14,16- 19,21- 41
A distinct advantage of meta-analytic work is that it allows us to use contrast analyses to statistically test hypotheses using all of the studies as our sample population. Below we describe the methods used to code for these contrast analyses.
In an earlier review,42 one of us (J.M.H.) suggested that there might be a relationship between the duration of schizophrenic illness and the magnitude of the EE-relapse link, with longer durations of illness being associated with greater effect sizes. We classified the EE reports according to the mean chronicity of the patient sample studied (Table 1).
Our categorization criteria were as follows. In the first group were studies in which the majority of patients (>50%) were experiencing their first hospitalization. Our second category included studies with more heterogeneous samples, in which the mean chronicity of the patient sample was neither very recent nor very chronic (eg, a sample that contained 30% recent-onset patients but where the average number of prior hospitalizations was 2.8). The third category included studies with more chronic patients, where chronic was defined as more than 3 prior hospital admissions or a mean duration of illness of at least 5 years.
Recently, Bebbington and Kuipers12 used visual inspection of a graph to conclude that there was no variation in EE findings based on geographic location. We tested this hypothesis using an analysis of variance (ANOVA) method43 involving the Q statistic, which is much the same as a χ2 statistic but using meta-analytic data. Following Bebbington and Kuipers, studies were grouped according to their most obvious broad geographic location (ie, Northern Europe, Southern Europe, North America, Australia, and Asia).
Expressed emotion was developed as a psychosocial predictor of relapse in schizophrenia. However, several researchers have documented the link between EE and relapse in patients with mood disorders and eating disorders, such as anorexia and obesity. We chose to meta-analyze these studies to establish the effect size of EE disorders other than schizophrenia. Because the number of studies in these areas is limited, readers should view these findings as preliminary.
Six studies have examined the relationship between EE and relapse in patients with major mood disorders (Table 2).4- 6,44- 46 All found a positive association between EE and relapse. However, because of the relatively small number of studies that have been conducted, the number of critical comments required to classify families as high-EE has not been firmly established. Cutoff scores of 2 criticisms5 and 3 criticisms4 seem to have validity in unipolar patients. For patients with bipolar disorder, a cutoff score of 6 critical comments (ie, the cutoff score used for schizophrenia) is the most appropriate.6 Our results are based on cutoff scores of 2 and 3 for unipolar samples and 6 for bipolar samples. To facilitate future meta-analysis, we encourage researchers to report relapse by varying levels of critical comments, in addition to reporting the cutoff score that proves most significantly predictive.
The simple answer to this question is: quite well. All but 3 of the studies described in Table 1 (89%) showed a significant association between EE and patient relapse. If there were no relationship, we would expect 50% of the studies to have positive effect sizes and 50% to have negative effect sizes. The mean effect size for EE predicting relapse was r=0.30 (z=11.30, P< .001). Weighting by degrees of freedom (which is preferable because it takes into account the number of subjects in each study) resulted in a weighted mean r=0.31. Thus, family levels of EE are significantly predictive of elevated rates of relapse in schizophrenia patients. Moreover, our 95% confidence intervals (CIs) suggest there is only a 5% probability that the effect size does not lie between 0.23 and 0.37.
It is not always easy to grasp the practical importance of a meta-analytic effect size r=0.31. However, for a hypothetical sample of 200 patients (high EE=100; low EE=100), an effect size r=0.30 translates into a high and low EE relapse rate of 65% and 35%, respectively.47 In this model, EE is associated with approximately one third of the relapses that do occur and with two thirds of the relapses that do not occur.
The issue of unpublished research is often called the "file-drawer" problem. It is addressed using a calculation suggested by Rosenthal.13 To lower the significance level of this meta-analysis to just barely significant (P=.05), there would have to be 1246 new, unpublished, or undiscovered studies averaging null results. When we consider that research that does not confirm the relationship between EE and relapse often receives more attention than experimental replications, this seems unlikely.
Combining the effect sizes of individual studies in a meta-analysis requires that the assumption of homogeneity of variance be tested and met. We used the χ2 statistic to test for the heterogeneity of the effect sizes.13,48 The analyses resulted in a significant χ2=54.01, df=26, P=.001. We therefore conducted additional analyses to identify the variable(s) that accounted for the heterogeneity. These are described below.
The linear contrast of the relationship between effect size for the EE-relapse association and the chronicity category of the patients resulted in a contrast z =1.93, P=.03. As previously noted,42 EE appears to be a stronger predictor of relapse in patients with more longstanding illnesses.
Grouping studies according to patient chronicity accounted for the heterogeneity of the effect sizes for the recent-onset and the mixed chronicity groups. This suggests that illness chronicity was a hidden variable in the earlier combined analysis. Interestingly, however, the effect sizes of studies in the most chronic category still showed significant heterogeneity (χ2=31.53, df=12, P=.001). Further examination revealed that this was largely attributable to Parker et al.34 This study accounted for 12.94 of the χ2 value of 31.53. Methodological problems associated with this study may explain its outlier status.2
Does the magnitude of the EE-relapse association vary according to the location of the research site? The test of mean effect sizes across locations was not significant(Qbetween=5.91, df=4). When categorized according to Bebbington and Kuipers,12 geographic location is not related to variations in effect size.
One advantage of this analytic approach was that it followed the grouping method described by Bebbington and Kuipers.12 Unfortunately, when we did this, the ANOVA assumption of homogeneity of variance was violated (Qwithin=48.1, df =22, P =.001). Although ANOVA is relatively robust in this regard, we reanalyzed the data employing more narrow geographic boundaries. We assumed that accounting for more of the variance using location as a blocking variable would allow us to replicate the previously found nonsignificant effect of geographic region. However, this was not the case. Regrouping the studies as United Kingdom, Northern Europe, Southern Europe, Eastern Europe, North America, Australia, or Asia resulted in a significant variation in effect sizes between locations (Qbetween=19.21, df=6, P=.004). Further examination revealed that the studies from Eastern Europe accounted for much of this heterogeneity (6.25 of the Qbetween statistic total). The second largest contributors to the total heterogeneity were the Australian studies,34,41 accounting for 4.46 of the total Qbetween statistic. These results suggest that the effect sizes of EE in Eastern Europe are unusually high. In contrast, the effect size associated with 1 of the 2 EE studies conducted in Australia34 is particularly low when compared with effect size estimates from other parts of the world.
With high EE defined as 2 or more critical comments, the mean and the weighted mean effect sizes were both r =0.39. This effect size was associated with z=5.21, P<.001. There was no significant heterogeneity in the effect sizes. When critical comments were set to 3 or more for high EE, the mean and weighted mean effect sizes rose to 0.45 (z=6.12, P<.001), again with no significant heterogeneity in the data.
It is important to note that even when the EE cutoff for critical comments was set at 2 or more, we found a meta-analytic effect size r=0.39. With a hypothetical group of 200 patients, this translates into expected relapse rates of 69.5% for patients with high-EE relatives and 30.5% for patients with low-EE relatives.47 The 95% CIs were also narrow (0.28-0.50). Our file drawer calculations show that there would need to be 54 unpublished studies to reduce the significance level to P=.05. When a critical comments cutoff of 3 or more was used, the 95% CIs narrowed even further (0.40-0.50) with a file-drawer statistic of 77. These findings provide strong support for the role of EE in the course of mood disorders.
Three studies have reported on the relationship between EE and outcome in patients being treated for eating disorders. These studies differed from the schizophrenia studies in that outcome was measured by diet compliance,49 weight gain after treatment for obesity,50 and premature termination of treatment in patients with anorexia.7 The cutoffs for determining high EE also varied across the 3 studies. For our analysis, where more than 1 cutoff was provided we selected the cutoff score that would result in the smallest effect size.
All of the studies found a positive association between high EE and poor outcome. The weighted mean effect size was 0.51 (z=5.05, P<.001), with 95% CIs of 0.36 to 0.70 and a file-drawer statistic of 25. As was the case for mood disorders, there was no significant heterogeneity of the effect sizes. Although only based on a few studies, our analyses suggest that EE is a strong predictor of early treatment outcome for patients with weight or eating disorders.
The substantial and robust effect sizes for EE in mood and eating disorders raise the question of whether EE is a significantly better predictor of outcome when patients have disorders other than schizophrenia. We therefore compared the effect sizes associated with EE in schizophrenia, mood disorders, and eating disorders. The analyses revealed that EE was a significantly better predictor of outcome for mood disorders than it was for schizophrenia (t=1.93, P =.03, using 3 critical comments as a cutoff for the depression studies). The same was also true for the comparison of the effect sizes of EE in schizophrenia vs eating disorders (t=2.03, P=.03). The effect sizes for EE in mood disorders and eating disorders did not differ significantly, however. These findings suggest that although EE is a reliable predictor of poor outcome for schizophrenia, mood disorders, and eating disorders, EE is a significantly better predictor for the latter 2 disorders than it is for schizophrenia. Given the overwhelming amount of research that examines EE in families of patients with schizophrenia, this difference in effect sizes is an interesting and potentially important finding.
The results of this meta-analysis confirm that EE is a reliable predictor of relapse in patients with schizophrenia. Although several nonreplications exist, these do not require that the predictive validity of the construct be called into question.
The association between geographic location and the magnitude of the EE-relapse link is less clear. Although our primary analysis revealed no significant relationship between study site and effect size, additional analysis indicated that this might not invariably be the case. Specifically, the effect size for EE appears to be unusually high in studies coming from Eastern Europe, and unusually low in one Australian investigation.34 More research from Eastern Europe would obviously be valuable, as would further examination of the EE-relapse link in Australian samples.
Although EE predicts relapse regardless of the chronicity of the patients being studied, the magnitude of the EE-relapse association increases when research samples contain more chronically ill patients. One explanation for this is that EE is a more reliable measure of the family environment when patients have been ill longer.42 Another possibility is that patients may become more sensitive to EE as the illness continues, perhaps through a process that resembles kindling or sensitization.51,52 This, of course, assumes that EE does indeed play some causal role in the relapse process. However, there is some evidence that this is the case.2,19
Schizophrenia is a disorder in which biological factors play a very central role. The role that psychosocial factors play might thus be somewhat restricted. That psychosocial factors are important is indicated by the significant EE-relapse relationship and by the success of family-based interventions designed to reduce patient relapse rates.53,54 However, EE may play an even more important role in the course of mood and eating disorders than schizophrenia. Examining the effect of family-based interventions on relapse rates and outcome for mood-disordered and eating-disordered patients is also likely to be a worthwhile avenue of empirical inquiry.
Finally, it warrants mention that more studies of the EE-relapse link in schizophrenia will not influence the effect sizes reported here in any appreciable way. Almost 40 years after the initial observation of Brown et al,14 the elevated risk for relapse associated with high EE family environments appears well established. The time has now come for creative and sophisticated research that will tell us why EE is associated with relapse in such a wide range of psychopathological conditions.
Accepted for publication February 12, 1998.
We thank Robert Rosenthal, PhD, for statistical consultation and Lisa Vagge for assistance with the literature search.
Reprints: Jill M. Hooley, DPhil, Department of Psychology, Harvard University, 33 Kirkland St, Cambridge, MA 02138 (e-mail: firstname.lastname@example.org).