Predicting Optimal Outcomes in Cognitive Therapy or Interpersonal Psychotherapy for Depressed Individuals Using the Personalized Advantage Index Approach
JPR Vol 7 #1
Predicting Optimal Outcomes in Cognitive Therapy or Interpersonal Psychotherapy for Depressed Individuals Using the Personalized Advantage Index Approach
Affiliations Department of Clinical Psychology, VU University Amsterdam, Amsterdam, The Netherlands, Department of Psychology, University of Pennsylvania, Philadelphia, United States of America
- Marcus J. H. Huibers,
- Zachary D. Cohen,
- Lotte H. J. M. Lemmens,
- Arnoud Arntz,
- Frenk P. M. L. Peeters,
- Pim Cuijpers,
- Robert J. DeRubeis
- Published: November 10, 2015
5 Feb 2016: Huibers MJH, Cohen ZD, Lemmens LHJM, Arntz A, Peeters FPML, et al. (2016) Correction: Predicting Optimal Outcomes in Cognitive Therapy or Interpersonal Psychotherapy for Depressed Individuals Using the Personalized Advantage Index Approach. PLOS ONE 11(2): e0148835. https://doi.org/10.1371/journal.pone.0148835 View correction
Although psychotherapies for depression produce equivalent outcomes, individual patients respond differently to different therapies. Predictors of outcome have been identified in the context of randomized trials, but this information has not been used to predict which treatment works best for the depressed individual. In this paper, we aim to replicate a recently developed treatment selection method, using data from an RCT comparing the effects of cognitive therapy (CT) and interpersonal psychotherapy (IPT).
134 depressed patients completed the pre- and post-treatment BDI-II assessment. First, we identified baseline predictors and moderators. Second, individual treatment recommendations were generated by combining the identified predictors and moderators in an algorithm that produces the Personalized Advantage Index (PAI), a measure of the predicted advantage in one therapy compared to the other, using standard regression analyses and the leave-one-out cross-validation approach.
We found five predictors (gender, employment status, anxiety, personality disorder and quality of life) and six moderators (somatic complaints, cognitive problems, paranoid symptoms, interpersonal self-sacrificing, attributional style and number of life events) of treatment outcome. The mean average PAI value was 8.9 BDI points, and 63% of the sample was predicted to have a clinically meaningful advantage in one of the therapies. Those who were randomized to their predicted optimal treatment (either CT or IPT) had an observed mean end-BDI of 11.8, while those who received their predicted non-optimal treatment had an end-BDI of 17.8 (effect size for the difference = 0.51).
Depressed patients who were randomized to their predicted optimal treatment fared much better than those randomized to their predicted non-optimal treatment. The PAI provides a great opportunity for formal decision-making to improve individual patient outcomes in depression. Although the utility of the PAI approach will need to be evaluated in prospective research, this study promotes the development of a treatment selection approach that can be used in regular mental health care, advancing the goals of personalized medicine.
Citation: Huibers MJH, Cohen ZD, Lemmens LHJM, Arntz A, Peeters FPML, Cuijpers P, et al. (2015) Predicting Optimal Outcomes in Cognitive Therapy or Interpersonal Psychotherapy for Depressed Individuals Using the Personalized Advantage Index Approach. PLoS ONE 10(11): e0140771. https://doi.org/10.1371/journal.pone.0140771
Editor: Igor Branchi, Istituto Superiore di Sanità, ITALY
Received: March 4, 2015; Accepted: September 30, 2015; Published: November 10, 2015
Copyright: © 2015 Huibers et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Data Availability: The data are available from the Dryad database (accession number doi:10.5061/dryad.m112v).
Funding: Robert J. DeRubeis and Zachary D. Cohen are supported in part by a grant from MQ: Transforming mental health MQ14PM_27. The opinions and assertions contained in this article should not be construed as reflecting the views of the sponsor.
Competing interests: The authors have declared that no competing interests exist.
Of all the effective psychotherapies for depression, cognitive therapy (CT) and interpersonal psychotherapy (IPT) are most prominent and extensively researched. A recent meta-analysis  identified five head-to-head comparisons of CT and IPT. In three studies [2–4], individual therapy was studied and in the remaining two a group format was used [5, 6]. Results showed no significant differences between CT and IPT, although severely depressed patients fared better in IPT in one study  and better in CT in another .
Despite the fact that these psychotherapies produced equivalent outcomes on average, individual patients might respond differently to different therapies. Prediction studies have indeed identified differential treatment effects for subgroups of patients. Before we review some of those findings in the context of this paper, it is important to highlight the distinction between prognostic factors (predictors) and prescriptive factors (moderators). Prognostic variables derive from designs that hold treatment constant (or ignore differences in modality) and seek to determine whether individual differences measured at baseline predict subsequent variation in response . Prognostic indices tell us which patients benefit most from a given treatment or set of treatments, but not which of two or more treatments is likely to be the best one for a given patient. Prescriptive variables on the other hand derive from comparative treatment designs and seek to determine whether individual baseline differences predict subsequent variation in response as a function of treatment type . Prescriptive indices can in principle be used to guide treatment decisions for a given patient.
The National Institute of Mental Health Treatment of Depression Collaborative Research Project (TDCRP) compared the effects of CT, IPT, antidepressants (ADM) with clinical management, and placebo with clinical management and found no overall outcome differences among the groups , although a secondary analysis of the data indicated that CT was less efficacious than ADM and IPT in the more severely depressed patients in their sample [2, 8]. Sotsky et al. investigated other potential predictors and moderators of outcome in a multivariate analysis . They found six predictors: social dysfunction; cognitive dysfunction; (low) expectation of improvement; “endogenous” depression; double depression; and duration of current episode. They also identified three moderators of treatment outcome relative to pill-placebo; patients with low social dysfunction showed a better (specific) response to IPT; patients with low cognitive dysfunction showed a better (specific) response to CT or ADM; and patients with high work dysfunction showed a better (specific) response to ADM. None of these factors predicted differential response among the active treatments.
Luty et al. compared the effectiveness of CT and IPT for major depression and found no differences in outcome between the two psychotherapies in the full sample, although severely depressed patients responded better to CT than to IPT , suggesting severity is a moderator of response. A subsequent multivariate prediction analysis identified three other predictors and one moderator : a single episode of depression (versus recurrent depression), a higher perceived logicalness of therapy, and a moderate belief that childhood factors caused the depression were all associated with better overall outcomes post-treatment (predictors), whereas patients with more comorbid personality disorder symptoms did better in CT than in IPT (moderators) .
Using prescriptive indices to predict and select the best available treatment for a given individual (what works best for whom?) has been referred to as personalized medicine, and is considered to be one of the major challenges in health care research today . Personalized medicine requires that the individual differences that predict different outcomes in different treatments are known. Simon and Perlis reviewed the available evidence relevant to personalized medicine for depression and found that only a few studies examined true moderators of outcome , in part because the statistical power of most existing trials is too limited to generate the interactions that reveal moderation. Some clinical characteristics, such as the presence of a personality disorder, might inform the choice between antidepressants and psychotherapy, or the choice between CT and IPT. Others have proposed that the preferences of the individual should guide treatment selection, especially in the choice between antidepressants and psychotherapy, but the results from studies assessing the impact of patient preference on treatment outcomes in depression have been mixed, mostly showing no association between preference and outcome . Given the large number of trials comparing different antidepressant medications with each other, surprisingly little data guide treatment selection of specific antidepressants. Cuijpers et al. concluded in their review of the literature that the development of personalized medicine in this area has just begun .
But even if more moderators are identified in future studies, how can we actually use this information to prescribe the best available treatment to individual patients? Most prediction studies stop at reporting the predictors or moderators considered in isolation. When multivariate analyses are presented, it is often to show that the individual variables continue to exhibit an interaction effect with treatment condition, even when other variables are in the model. Clinicians are then left with no guidance as to how to use this information, especially if multiple moderators point in different directions . An exception to the rule is a prediction analysis by Barber and Muenz based on the TDCRP data . In this paper, patients are matched to treatment (CT versus IPT) using a matching factor that combined three prescriptive indices (i.e. marital status, avoidant personality style, obsessive personality style) in a regression formula, indicating a preference for CT (positive matching factor) or IPT (negative matching factor) for individual patients.
Nearly 20 years later, DeRubeis et al.  returned to the subject of treatment selection. Noting that the statistical approach and clinical recommendations made by Barber and Muenz (1996) had not been followed up in subsequent years, they introduced a new method to combine multiple predictors and moderators in a statistical model that produces the Personalized Advantage Index (PAI). This PAI identifies the treatment that is predicted to be more efficacious for an individual patient, and it provides an index that reflects the magnitude of the predicted advantage. Using pre-identified predictors and moderators [18–20] from a large-scale trial comparing antidepressant medication (ADM) and cognitive therapy for depression , it was found that for 60% of the sample a “clinically significant advantage” was predicted in either ADM or CT, compared to the other treatment. Moreover, among those patients with a clinically significant advantage predicted, those who had been randomized to their predicted optimal treatment evidenced far better outcomes than those randomized to their predicted non-optimal treatment, with a significant effect size of 0.58 between the groups.
In this paper, we aim to replicate the treatment selection method that DeRubeis et al.  introduced, but this time by comparing two theoretically distinct psychotherapies that are known to result in overall equivalent outcomes . Data come from a recently published trial in which we found no significant differences between CT and IPT for depression in the acute phase . First, we identified predictors and moderators using a modified domain approach  that was replicated successfully . Second, we combined these prescriptive indices in models that we used to predict optimal treatment outcomes in individual trial patients, employing the PAI approach as a post-hoc analysis.
Design and participants
Data come from a single-center RCT into the effectiveness and mechanisms of change of individual CT and IPT for depression . Details about the study design have been fully described elsewhere . A total of 182 depressed outpatients were randomly assigned to (a) CT (n = 76), (b) IPT (n = 75), (c) or a 2-month waiting-list control condition followed by treatment of choice (n = 31). Only the two active groups are part of the present analysis, as are the data collected pre-treatment at baseline and post-treatment (after 7 months). The primary outcome measure was depression severity as measured with the Beck Depression Inventory-II . The Medical Ethics Committee of Maastricht University approved the study protocol, and all participants provided written informed consent. The study is registered at the Netherlands Trial Register, part of the Dutch Cochrane Centre (ISRCTN67561918).
Participants were adult outpatients (18–65 years) referred to the mood disorder program of the Academic Community Mental Health Centre Maastricht with a primary diagnosis of MDD, confirmed by the Structured Clinical Interview for DSM-IV Axis I disorders (SCID-I) . Further inclusion criteria were: internet access, an e-mail address, and sufficient knowledge of the Dutch language. Exclusion criteria were: bipolar or chronic (current episode > 5 years) depression, elevated acute suicide risk, current use of antidepressant medication, concomitant psychological treatment, drug and/or alcohol abuse or dependence, and mental retardation (IQ < 80).
Treatment (CT or IPT) contained 12 to 20 individual sessions of 45 minutes, depending on the individual progress of patients. CT was based on the manual by Beck et al. , and IPT followed the guidelines by Klerman et al. .
The ten therapists (five in each condition) that delivered the interventions were licensed psychologists, psychotherapists and psychiatrists who received adequate training in CT and/or IPT, with an average of 9.1 years of clinical experience (range 4 to 21 years). To prevent contamination, therapists were uniquely assigned to either CT or IPT. Quality of therapy was rated by independent assessors as being good to excellent in both conditions .
The Beck Depression Inventory II (BDI-II)  assesses the severity of depression symptoms. In this analysis, baseline (0 months) and post-treatment (7 months) scores on the BDI-II were used. Treatment condition was a binary variable and referred to either cognitive therapy (CT) or interpersonal therapy (IPT).
Putative predictors were all measured at baseline and are summarized in Table 1. Instruments used to assess these predictors included the Attributional Style Questionnaire (ASQ); the Beck Hopelessness Scale (BHS); the Brief Symptom Inventory (BSI); the Dysfunctional Attitudes Scale-17 (DAS17); the EuroQol; the Inventory of Interpersonal Problems (IIP); the Leiden Index of Cognitive Reactivity (LEIDS); the Quick Inventory of Depressive Symptoms (QIDS); the RAND-36; the Ruminative Response Scale (RRS); the Structured Clinical Interview for DSM-IV axis I Disorders (SCID-I); the Structured Clinical Interview for DSM-IV axis II Disorders (SCID-II); the Self-Liking and Self-Competence Scale (SLSC-R); and the Work and Social Adjustment Scale (WSAS) (for a complete description and references to these instruments, we refer to our design paper ).
For 134 participants (89% of total sample) BDI-II post-treatment scores (7 months) were available. We used the square root of the BDI in all analyses because tests showed the residuals of the raw BDI scores were non-normally distributed, whereas the transformed variable yielded model residuals that were not significantly different from a normal distribution. Following the recommendations of Kraemer et al. , we centered the treatment variable at -0.5 (CT) and +0.5 (IPT). All predictor variables that were binary were also centered -0.5 and + 0.5, and continuous variables were mean-centered.
To identify predictors and moderators in this sample, we adopted the principles of the domain approach outlined by Fournier et al. . First, we grouped potential predictors in separate domains (Table 1). In the depression domain, for example, we grouped predictors that relate to the depression concept. We built separate linear regression models for each domain, and each model included all the domain variables and their interaction with treatment (CT or IPT), as well as the main effect of treatment and BDI baseline.
Table 1. Variables per domain.
In step 1, for each domain model, the variables that were significant at a threshold of α = 0.2 were carried into step 2. The main effect of treatment and baseline BDI were always included in every model (despite the fact that treatment main effect was non-significant). If an interaction between a variable and treatment was significant, the main effect was carried through to the next step regardless of whether or not it was significant. In step 2, a new model was built with main effects of treatment and baseline BDI, as well as any main effects and interactions that were carried through from step 1. The same significance threshold process was then applied using the new threshold value of α = 0.1. In Step 3, the same process was repeated but with the threshold of α = 0.05. Any interactions that were significant at α = 0.05 were carried into the final model with their main effects (regardless of significance), as were any main effects that were significant. The final model was then created using the main effect of treatment and baseline BDI, as well as any interactions or main effects that emerged from step 3 of the domain approach–combining all of the variables from step 3 from all of the domains.
Building the Personalized Advantage Index
The leave-one-out cross-validation (LOO) approach [30, 31] was used so that a new model, containing all previously identified predictors and moderators, was built for each individual patient. The regression formula then looks like this:SQRT (End-BDI) = k + BDI pre + Tx (CT or IPT) + prognostic (main) + prescriptive (Tx * main). This creates an analogous situation to what would happen if we built a model on a group of patients in a clinic and then a new patient walked in the door. It protects against overfitting by ensuring that a patient’s data cannot contribute to or inform the model that will be used to predict that patient’s outcome. A more detailed description of the approach can be found elsewhere . A new model (using the exact same set of variables) is built for the 134th patient using data from the other 133 patients. Next, the 134th patient’s observed values on all the variables in the model are entered into each of two models, one in which treatment is set to -0.5 (CT) and one in which treatment is set to 0.5 (IPT). The prediction generated for the treatment the patient actually received, the factual prediction, is produced by inputting all the real values for that patient, including their treatment assignment. The counterfactual prediction is produced by substituting the opposite value of the treatment variable (e.g., from -0.5 to +0.5) wherever it appears in the model (i.e., as main effect and in all treatment by predictor interaction terms). The factual and counterfactual predictions are then squared to convert them back to units of the BDI. The predicted end of treatment BDI score in CT minus the predicted end of treatment BDI score in IPT is termed the PAI, and represents an index of the expected advantage of one treatment over another. The sign of the PAI indicates which treatment is predicted to be optimal for a patient. A negative PAI indicates that the first treatment (CT) is predicted to be optimal, whereas a positive PAI indicates that the second treatment (IPT) is predicted to be optimal. Higher absolute values of PAIs indicate a stronger predicted benefit of one treatment over the other.
Note that for any given individual, both prognostic variables (predictors) and prescriptive variables (moderators) are used to derive the PAI for reasons of precision, and add to the predicted end-score on the BDI. However, only the moderators in the equation are responsible for the magnitude of an individual’s PAI, as these factors are indicators of the predicted difference in CT and IPT.
Effectiveness of cognitive therapy and interpersonal psychotherapy
Effectiveness findings were published in a previous paper  and showed that both CT (n = 76) and IPT (n = 75) were effective interventions, compared to waiting-list control condition (n = 31). Depressive symptoms decreased significantly in the 7-month course of both therapies (Cohen’s d for within-group effect size of 1.71 for CT and 1.72 for IPT), and the symptom reduction remained stable in the 5 months following treatment termination. No significant differences between CT and IPT were observed.
Identification of predictors and final prediction model
Of the 151 patients who were randomized to CT or IPT, 134 patients (CT = 69; IPT = 65) provided post-treatment BDI scores (end-BDI) and were included in the present analysis. In total 61 potential predictors at baseline and their interaction with treatment condition (CT or IPT) were evaluated within 6 domains (see Table 1 for an overview).
Table 2 presents the variables that were used in the prediction models. Five predictors (main effects) and six moderators (interaction effects) emerged as significant at α = 0.05 within their separate domains, and were thus included in the final prediction model. Note that some of these predictors and moderators did not remain significant when entered in the final model.
Table 2. Final Prediction Model.
Female gender, being actively employed, low anxiety scores, the absence of a personality disorder and a high quality of life all predicted lower depression symptoms after treatment, irrespective of the therapy received. These were the prognostic variables, represented in the models as main effects only.
Somatic complaints, cognitive problems, paranoid symptoms, interpersonal self-sacrificing (i.e., caring for others, even when it requires sacrificing one’s own needs), attributional style focused on achievement goals and the number of life events in the past year predicted a differential response in CT and IPT, with cognitive problems predicting a better response to IPT and the other five moderators predicting a better response to CT.
Predicted outcomes and PAIs
Using the identified predictors and moderators, Personalized Advantage Indexes (PAI) were calculated for each individual patient, by subtracting the predicted outcomes in each of the two therapies for that patient. We refer to the therapy that is predicted to produce the greater benefit for the patient as the optimal treatment, whereas the other therapy is referred to as non-optimal treatment. In Fig 1, we present the frequency of predicted end-BDI scores in both the optimal and non-optimal treatment for every individual patient, a lower BDI score representing a more favorable outcome.
Fig 1. Frequency histogram showing predicted end-BDI scores for each patient in their Optimal and their Non-Optimal treatment, as indicated by the treatment selection algorithm.
In Fig 2, the actual sizes of the PAIs we calculated are presented according to frequency. The average absolute value of the PAIs was 8.9 (7.7), which means an average 8.9 point difference predicted on the BDI between optimal (predicted mean = 8.5; sd = 6.6) and non-optimal treatment (predicted mean = 17.3; sd = 10.1). Approximately 17% of the PAIs fell between 0 and 2, indicating a small or negligible predicted advantage of one therapy over the other. However, the PAI was 5 or greater for 63% of the patients in the sample, indicating that a substantial difference was predicted between the two treatments on the BDI .
Fig 2. Frequency histogram showing Personalized Advantage Index (PAI) scores for all patients in the sample.
Observed outcomes in optimal and non-optimal treatment
Fig 3 presents the observed outcomes of the patients in our sample, stratified according to their predicted optimal treatment and the actual treatment they were randomized to. For those who were predicted to do better in CT (CT-Optimal) and who actually received CT, the observed mean end-BDI was 10.7 (sd = 12.3; n = 31), whereas for those in the CT-Optimal group who received IPT, the mean end-BDI was 19.8 (sd = 15.2; n = 30). The effect size estimate (Cohen’s d) for this difference is 0.66. A similar, albeit less strong difference was observed in the IPT-Optimal patients. The mean end-BDI for those who received IPT was 12.7 (sd = 10.8; n = 35), whereas IPT-Optimal patients who received CT had a mean end-BDI of 16.1 (sd = 8.9; n = 38, effect size for the difference is 0.34).
Fig 3. Comparison of observed mean end-BDI scores for patients randomly assigned to their Optimal treatment versus those assigned to their Non-Optimal treatment, by psychotherapy type.
On an aggregated level, those who received their predicted optimal treatment (either CT or IPT) had an observed mean end-BDI of 11.8 (sd = 11.7; n = 66), whereas those who received their predicted non-optimal treatment had an observed mean end-BDI of 17.8 (sd = 12.1; n = 68). The effect size for this difference is 0.51.
In this study, we aimed to identify predictors and moderators of outcome in two evidence-based psychotherapies for depression, cognitive therapy and interpersonal therapy, that yielded similar overall distributions of outcomes in a randomized comparative trial. We further aimed to use these prognostic and prescriptive indices to identify the predicted optimal treatment option for each individual patient, to provide an estimate of the advantage that might be gained by applying a multivariate-based treatment selection procedure prospectively.
Using a state-of-the-art data analytic approach, we were able to identify five predictors and six moderators of outcome. Female gender, active employment, low anxiety, the absence of a personality disorder and high quality of life were all indicators of a favorable prognosis. Somatic complaints, cognitive problems, paranoid symptoms, interpersonal self-sacrificing, attributional style (achievement) and the number of life events in the past year were indicators of a differential treatment response; cognitive problems predicted a better response to IPT, while the other five moderators predicting a better response to CT. When compared to previously reported predictors and moderators, we find little overlap. Sotsky et al.  reported cognitive dysfunction as both a predictor and a moderator, but only between the active treatments CT and ADM in comparison to pill-placebo. Carter et al.  found that patients with more comorbid personality disorder symptoms responded better to CT than to IPT, while we identified the absence of personality disorder only as a general prognostic factor.
We then replicated the procedure as described by DeRubeis et al.  to build a treatment selection algorithm using the identified predictors and moderators, and calculate the PAIs for all individual patients in our sample. PAI sizes had a considerable range, with 63% of our sample having a PAI larger than 5, in units of the BDI, which corresponds to a clinically meaningful advantage. The distribution of observed outcomes for those who had been randomized to the psychotherapy they were predicted to benefit the most from (optimal treatment) was substantially better than for those assigned to their non-optimal treatment. The effect size estimate for the difference was 0.51.
This is the first successful replication of the PAI approach that was developed by DeRubeis et al. . Within the context of a randomized trial comparing CT and antidepressant medication (ADM), they combined four predictors (depression severity; chronicity; age; IQ) and five moderators (marital status; employment status; comorbid personality disorder; number of life stresses; number of prior ADM trials) of outcome into a treatment selection algorithm, and found that those assigned to their predicted optimal treatment fared much better than those assigned to their non-optimal treatment.
The effect size estimate they presented for the difference in observed outcome between the optimal and non-optimal groups in the subgroup of patients who showed a clinically meaningful advantage was 0.58, similar to the effect size we estimated in the entire sample in the present study. Cognitive therapy was one of the treatments in each of these studies, but the treatment to which it was compared was very different in the two studies: antidepressant medications in the DeRubeis study and interpersonal therapy in the present case. Thus, it was not expected that the same variables would feature in the respective prescriptive models. The magnitude of the effect sizes in both studies does suggest that combining a relatively large number of moderators into a treatment selection algorithm leads to substantial differences in predicted outcomes, on average and for a majority of the individual patients. Moreover, the differences in observed outcomes we found between the optimal and the non-optimal group are large and clinically relevant, especially considering that most available depression treatments, both psychological and pharmacological, yield similar outcomes , as did the treatments in the present study, in aggregate.
Finding moderators of the effect of two equally effective psychotherapies might point to different mechanisms of change , in line with the theorized therapy models or with other causal pathways. The moderators that we identified cannot be directly translated or interpreted as fitting the mechanisms of one therapy model or the other. However, whatever the active mechanisms in the two psychotherapies are, they seem to yield different outcomes for different subgroups of patients.
Noteworthy in this respect is that the effect size for the observed outcomes in those who received their predicted optimal treatment versus those who received their predicted non-optimal treatment was twice as high in CT (0.66) compared to IPT (0.34). This suggest that for those who are predicted to do well in CT, the expected impact of receiving CT is bigger on average than the impact of receiving IPT for those who are predicted to do well in IPT. As the moderators that add to the calculation of the PAI might point to underlying mechanisms of change, the difference in effect size between CT and IPT might indicate a difference in the importance and magnitude of certain change processes, or the therapeutic procedures that bring about change. Future research on the underlying mechanisms of change and the associated change procedures in CT and IPT could shed more light on this.
We were able to replicate the PAI treatment selection approach using data from an RCT, while protecting against the risk of statistical overfitting, which is one of the major advantages of the leave-one-out method (LOO) we applied. In a similar vein however, one could argue that the LOO approach should also be used to identify predictors and moderators, especially if these are used to generate individual treatment recommendations in a subsequent step. Future research should explore the use of LOO predictor selection, and weigh the advantages and disadvantages of this method against those of less complicated approaches like the one presented here. It should also be noted that we replicated the PAI method, and not the findings that were reported by DeRubeis et al  specifically. The generalization of the predictors and moderators we found is therefore limited, as they have yet to confirmed in a subsequent study comparing CT and IPT.
Despite the advantages of LOO, the use of data from an existing RCT to generate treatment recommendations has its limitations, most importantly combining predictions and actual observations within the same study population and time frame. Our analysis demonstrating how PAI-derived predictions are associated with observed end-scores for the participants in our trial is a post-hoc analysis, much like unplanned subgroup analyses in the context of an RCT. The ultimate validation of the PAI approach will come from prospective tests in which predictors and moderators are identified in one cohort of patients, after which the treatment selection algorithm is tested in a new cohort of patients seeking health care, for example by randomizing them to their predicted optimal treatment versus “allocation as usual” or a treatment the patient prefers. For this purpose, prospective testing within routine practice settings appears most suitable and informative. Predictors and moderators of treatment outcome reported in the literature so far showed only limited overlap. Many factors could account for this heterogeneity, including differences in study populations, study designs, measurement instruments and data analytic approaches. Predictive indices might therefore vary between populations, stressing the need for population-specific treatment selection algorithms. We believe that, ultimately, patient data routinely assessed in a given mental health clinic can be used to build and implement PAI-based treatment selection systems in that clinic, even in the absence of a randomized comparison between treatments, as the practical added value of the PAI will lie in its use and contribution to clinical practice. A PAI approach applied in a clinical setting will be inexpensive, such that even if the added value is not large, or is confined to patients whose PAI is large (in absolute value), the benefit to cost ratio will be favorable. But for now, this is an empirical question.
The findings from this study suggest the promise of a simple method that combines prognostic and prescriptive information in a treatment selection approach that produces treatment recommendations for individual patients. Depressed patients who were randomized to the psychotherapy they were predicted to benefit the most from fared much better than those randomized to their non-optimal psychotherapy. When given a choice, most patients prefer psychotherapy compared to medications . A data-informed choice between two equally effective psychotherapies not only improves outcomes for the individual patient, but also facilitates efficient use of mental health care services.
As inter-individual heterogeneity in treatment response tends to be large, and evidence-based predictive information is rarely used in decision-making, there exists a great opportunity for formal decision-making to improve patient outcomes in depression. Despite its promising nature and appeal, personalized medicine is a concept that is rarely applied in clinical practice. Developments in this direction have started to begin however, and a related, yet conceptually different approach to combining multiple moderators has recently been introduced by Kraemer . In line with these developments, the present study encourages the introduction of treatment selection methods that can be used in regular mental health care, matching patients to the treatment option they are predicted to benefit the most from, increasing the a priori chance of recovery for the individual patient, and thereby advancing the goals of personalized medicine.
Robert J. DeRubeis and Zachary D, Cohen are supported in part by a grant from MQ: Transforming mental health MQ14PM_27. The opinions and assertions contained in this article should not be construed as reflecting the views of the sponsor.
Conceived and designed the experiments: MH AA FP. Performed the experiments: MH LL AA FP. Analyzed the data: MH ZC RD. Contributed reagents/materials/analysis tools: ZC RD. Wrote the paper: MH ZC LL AA FP PC RD.
- Jakobsen JC, Hansen JL, Simonsen S, Simonsen E, Gluud C. Effects of cognitive therapy versus interpersonal psychotherapy in patients with major depressive disorder: a systematic review of randomized clinical trials with meta-analyses and trial sequential analyses. Psychol Med. 2012;42(7):1343–57. Epub 2011/11/05. pmid:22051174.
- Elkin I, Shea MT, Watkins JT, Imber SD, Sotsky SM, Collins JF, et al. NIMH treatment of depression collaborative research program. General effectiveness of treatments. Archives of General Psychiatry. 1989;46:971–82. pmid:2684085
- Luty SE, Carter JD, McKenzie JM, Rae AM, Frampton CM, Mulder RT, et al. Randomised controlled trial of interpersonal psychotherapy and cognitive-behavioural therapy for depression. Br J Psychiatry. 2007;190:496–502. Epub 2007/06/02. doi: 190/6/496 [pii] pmid:17541109.
- Quilty LC, McBride C, Bagby RM. Evidence for the cognitive mediational model of cognitive behavioural therapy for depression. Psychol Med. 2008;38(11):1531–41. Epub 2008/06/27. pmid:18578895.
- Bellino S, Zizza M, Rinaldi C, Bogetto F. Combined therapy of major depression with concomitant borderline personality disorder: comparison of interpersonal and cognitive psychotherapy. Canadian journal of psychiatry Revue canadienne de psychiatrie. 2007;52(11):718–25. Epub 2008/04/11. pmid:18399039.
- Neimeyer RA, Weiss ME. Cognitive and symptomatic predictors of outcome of group therapies for depression. Journal of Cognitive Psychotherapy: An International Quaterly. 1990;4:23–32.
- Kraemer HC, Wilson GT, Fairburn CG, Agras WS. Mediators and moderators of treatment effects in randomized clinical trials. Archives of General Psychiatry. 2002;59(10):877–83. pmid:12365874.
- Elkin I, Gibbons RD, Shea MT, Sotsky SM, Watkins JT, Pilkonis PA, et al. Initial severity and differential treatment outcome in the National Institute of Mental Health Treatment of Depression Collaborative Research Program. J Consult Clin Psychol. 1995;63(5):841–7. Epub 1995/10/01. pmid:7593878.
- Sotsky SM, Glass DR, Shea MT, Pilkonis PA, Collins JF, Elkin I, et al. Patient predictors of response to psychotherapy and pharmacotherapy: findings in the NIMH Treatment of Depression Collaborative Research Program. Am J Psychiatry. 1991;148(8):997–1008. Epub 1991/08/01. pmid:1853989.
- Carter JD, Luty SE, McKenzie JM, Mulder RT, Frampton CM, Joyce PR. Patient predictors of response to cognitive behaviour therapy and interpersonal psychotherapy in a randomised clinical trial for depression. J Affect Disord. 2011;128(3):252–61. Epub 2010/08/03. doi: S0165-0327(10)00470-2 [pii] pmid:20674982.
- Joyce PR, McKenzie JM, Carter JD, Rae AM, Luty SE, Frampton CM, et al. Temperament, character and personality disorders as predictors of response to interpersonal psychotherapy and cognitive-behavioural therapy for depression. Br J Psychiatry. 2007;190:503–8. Epub 2007/06/02. doi: 190/6/503 [pii] pmid:17541110.
- Hamburg MA, Collins FS. The path to personalized medicine. The New England journal of medicine. 2010;363(4):301–4. Epub 2010/06/17. pmid:20551152.
- Simon GE, Perlis RH. Personalized Medicine for Depression: Can We Match Patients With Treatments? Am J Psychiatry. 2010;(Epub ahead of print). pmid:20843873.
- Winter SE, Barber JP. Should treatment for depression be based more on patient preference? Patient preference and adherence. 2013;7:1047–57. Epub 2013/10/22. pmid:24143080; PubMed Central PMCID: PMCPmc3797653.
- Cuijpers P, Reynolds CF 3rd, Donker T, Li J, Andersson G, Beekman A. Personalized treatment of adult depression: medication, psychotherapy, or both? A systematic review. Depress Anxiety. 2012;29(10):855–64. Epub 2012/07/21. pmid:22815247.
- DeRubeis RJ, Cohen ZD, Forand NR, Fournier JC, Gelfand LA, Lorenzo-Luaces L. The Personalized Advantage Index: translating research on prediction into individualized treatment recommendations. A demonstration. PLoS One. 2014;9(1):e83875. Epub 2014/01/15. pmid:24416178; PubMed Central PMCID: PMCPmc3885521.
- Barber JP, Muenz LR. The role of avoidance and obsessiveness in matching patients to cognitive and interpersonal psychotherapy: empirical findings from the treatment for depression collaborative research program. J Consult Clin Psychol. 1996;64(5):951–8. Epub 1996/10/01. pmid:8916624.
- Fournier JC, DeRubeis RJ, Shelton RC, Gallop R, Amsterdam JD, Hollon SD. Antidepressant medications v. cognitive therapy in people with depression with or without personality disorder. Br J Psychiatry. 2008;192(2):124–9. Epub 2008/02/05. doi: 192/2/124 [pii] pmid:18245030; PubMed Central PMCID: PMC2682552.
- Fournier JC, DeRubeis RJ, Shelton RC, Hollon SD, Amsterdam JD, Gallop R. Prediction of response to medication and cognitive therapy in the treatment of moderate to severe depression. J Consult Clin Psychol. 2009;77(4):775–87. Epub 2009/07/29. doi: 2009-11168-016 [pii] pmid:19634969; PubMed Central PMCID: PMC2810269.
- Leykin Y, Amsterdam JD, DeRubeis RJ, Gallop R, Shelton RC, Hollon SD. Progressive resistance to a selective serotonin reuptake inhibitor but not to cognitive therapy in the treatment of major depression. J Consult Clin Psychol. 2007;75(2):267–76. Epub 2007/05/02. doi: 2007-04141-007 [pii] pmid:17469884.
- DeRubeis RJ, Hollon SD, Amsterdam JD, Shelton RC, Young PR, Salomon RM, et al. Cognitive therapy vs medications in the treatment of moderate to severe depression. Arch Gen Psychiatry. 2005;62(4):409–16. Epub 2005/04/06. doi: 62/4/409 [pii] pmid:15809408.
- Lemmens LH, Arntz A, Peeters F, Hollon SD, Roefs A, Huibers MJ. Clinical effectiveness of cognitive therapy vs. interpersonal psychotherapy for depression: results of a randomized controlled trial. Psychological Medicine. 2015:1–16.
- Huibers MJ, van Breukelen G, Roelofs J, Hollon SD, Markowitz JC, van Os J, et al. Predicting response to cognitive therapy and interpersonal therapy, with or without antidepressant medication, for major depression: a pragmatic trial in routine practice. J Affect Disord. 2014;152–154:146–54. Epub 2013/09/26. pmid:24060588.
- Lemmens LH, Arntz A, Peeters FP, Hollon SD, Roefs A, Huibers MJ. Effectiveness, relapse prevention and mechanisms of change of cognitive therapy vs. interpersonal therapy for depression: Study protocol for a randomised controlled trial. Trials. 2011;12:150. Epub 2011/06/16. pmid:21672217; PubMed Central PMCID: PMCPmc3136406.
- Beck AT, Steer R, Brown GK. Beck Depression Inventory II: Manual. Boston: Hartcourt Brace.; 1996.
- First MB, Spitzer RL, Gibbon M, Williams JBW. Structured Clinical Interview for DSM-IV Axis I Disorders. New York: Biometrics Research Department, New York State Psychiatric Institute; 1996.
- Beck AT, Rush AJ, Shaw BF, Emery G. Cognitive therapy of depression. New York: Guilford; 1979.
- Klerman GL, Weissman MM, Rounsaville BJ, Chevron ES. Interpersonal psychotherapy for depression. New York: Basic Books; 1984.
- Kraemer HC, Blasey CM. Centring in regression analyses: a strategy to prevent errors in statistical inference. International journal of methods in psychiatric research. 2004;13(3):141–51. Epub 2004/08/07. pmid:15297898.
- Efron B, Gong G. A leisurely look at the bootstrap, the jackknife and cross-validation. The American Statistician. 1983;37:36–48.
- Harrell FE, Lee KL, Mark DB. Tutorial in biostatistics mutlivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Statistics in Medicine. 1996;15:361–87.
- Hiroe T, Kojima M, Yamamoto I, Nojima S, Kinoshita Y, Hashimoto N, et al. Gradations of clinical severity and sensitivity to change assessed with the Beck Depression Inventory-II in Japanese patients with depression. Psychiatry research. 2005;135(3):229–35. Epub 2005/07/06. pmid:15996749.
- Cuijpers P, Dekker J, Hollon SD, Andersson G. Adding psychotherapy to pharmacotherapy in the treatment of depressive disorders in adults: a meta-analysis. J Clin Psychiatry. 2009;70(9):1219–29. Epub 2009/10/13. pmid:19818243.
- Kazdin AE. Mediators and mechanisms of change in psychotherapy research. Annual Review of Clinical Psychology. 2007;3:1–27. pmid:17716046.
- van Schaik DJ, Klijn AF, van Hout HP, van Marwijk HW, Beekman AT, de Haan M, et al. Patients’ preferences in the treatment of depressive disorder in primary care. Gen Hosp Psychiatry. 2004;26(3):184–9. Epub 2004/05/04. pmid:15121346.
- Kraemer HC. Discovering, comparing, and combining moderators of treatment on outcome after randomized clinical trials: a parametric approach. Stat Med. 2013;32(11):1964–73. Epub 2013/01/11. pmid:23303653.