WEAVE
WEAVE
Centre for Integrative Psychiatry
Weave Psychotherapy — Vol. 12
Landmark Papers & Trials
The Evidence Base — Key RCTs and Meta-Analyses in Psychotherapy
Dodo Bird · NIMH TDCRP · Linehan 1991 · Giesen-Bloo 2006 · Shedler 2010 · Common Factors
Dr. Wilfred D'souza
MD Psychiatry
wilfred.desouza1996@gmail.com

Contents

01Deep Study
02Clinical Quick Reference
Weave Psychotherapy · www.weave.clinic
01
Deep Study
Landmark Papers & Trials — Weave Psychotherapy Vol. 12
www.weave.clinic
WEAVE Weave Psychotherapy Vol. 12 | Landmark Papers & Trials Chapter 01 · Deep Study

D1: Landmark Papers & Trials in Psychotherapy — Deep Study

Table of Contents

  1. The Dodo Bird Verdict and Common Factors
  2. CBT Landmark Trials
  3. DBT Landmark Trials
  4. Psychodynamic Landmark Trials
  5. Schema Therapy Landmark Trials
  6. Exposure Therapy and EMDR
  7. Family and Couples Therapy
  8. ACT and MBCT
  9. Comparative and Integration Studies
  10. Process Research

1. THE DODO BIRD VERDICT AND COMMON FACTORS

1.1 Luborsky, Singer & Luborsky (1975) — "Everybody Has Won and All Must Have Prizes"

This paper is the origin of the "Dodo bird verdict" in psychotherapy research. Luborsky and colleagues reviewed comparative outcome studies and concluded that different forms of psychotherapy produce roughly equivalent outcomes. The title references the Dodo bird from Alice in Wonderland, who declares after a race that "everybody has won and all must have prizes."

Design: Narrative review of comparative outcome studies

Key finding: No consistent differences in efficacy between bona fide psychotherapies

Clinical implication: The factors common to all therapies (alliance, empathy, expectation) may matter more than technique-specific ingredients

Exam Pearl

The Dodo bird verdict does NOT claim that therapy is ineffective or that all therapies are equally good for all conditions. It claims that when two bona fide treatments are compared head-to-head for the same condition, differences in outcome are typically small or absent. This is the most misunderstood finding in psychotherapy research.

1.2 Lambert (1992) — The Pie Chart of Outcome Variance

Michael Lambert proposed a model of what accounts for therapeutic change, based on his review of outcome research. His framework became one of the most cited in psychotherapy training.

The Four Factors:

FactorVariance ExplainedDescription
Extratherapeutic change40%Client factors, spontaneous remission, life events
Therapeutic relationship30%Alliance, empathy, warmth, positive regard
Expectancy (placebo)15%Hope, credibility of treatment rationale
Technique15%Specific model-driven interventions

Clinical implication: The relationship accounts for roughly twice the variance of technique. Training that focuses exclusively on technique while neglecting relational skills is misallocated.

Exam Pearl

Lambert's 40/30/15/15 split is the single most frequently examined breakdown of therapeutic outcome factors. The exact percentages have been debated, but the rank order is consistent across re-analyses: client/extratherapeutic factors > relationship > expectancy >= technique.

Exam Strategy

When asked "what is the most important factor in psychotherapy outcome," the answer is extratherapeutic/client factors (40%). When asked "what is the most important therapist-delivered factor," the answer is the therapeutic relationship (30%).

1.3 Wampold (2001, 2015) — The Great Psychotherapy Debate

Bruce Wampold's book and subsequent meta-analytic work provided the most rigorous quantitative test of the Dodo bird verdict. He introduced the distinction between the medical model (specific ingredients cure specific disorders) and the contextual model (common factors account for outcomes).

Design: Meta-analyses of comparative outcome studies

Key findings:

Clinical Anchor

Wampold's work means that who delivers the therapy matters more than which therapy is delivered. A skilled CBT therapist and a skilled psychodynamic therapist will get comparable outcomes. A poor therapist delivering an "evidence-based" protocol will get poor outcomes. This has direct implications for training, supervision, and quality assurance.

Exam Pearl

Wampold's effect size for the overall benefit of psychotherapy (d = 0.80) is comparable to the effect of many medical interventions. Psychotherapy is not a "soft" intervention — its effect size exceeds that of many medications for common conditions.


2. CBT LANDMARK TRIALS

2.1 Rush, Beck, Kovacs & Hollon (1977) — The First CBT vs Pharmacotherapy Trial

This is the trial that put CBT on the map. Aaron Beck's cognitive therapy was tested against imipramine (then the gold-standard antidepressant) for outpatient depression.

Design: RCT

Sample: N = 41 outpatients with major depression

Intervention: Cognitive therapy (max 20 sessions over 12 weeks) vs imipramine

Key findings:

Clinical implication: A psychological treatment could match or exceed pharmacotherapy for depression. This was revolutionary in an era when depression was considered a purely biological illness requiring medication.

Exam Pearl

The Rush et al. (1977) trial has been criticised for its small sample, non-blind design, and possible inadequate pharmacotherapy dosing. But its historical significance is enormous — it launched the entire CBT for depression research programme.

2.2 Elkin et al. (1989) — NIMH Treatment of Depression Collaborative Research Program (TDCRP)

The TDCRP was the first large-scale, multi-site, federally funded trial comparing psychotherapies head-to-head with pharmacotherapy and placebo. It remains one of the most discussed trials in psychotherapy research.

Design: Multi-site RCT, 4 arms

Sample: N = 250 outpatients with major depression

Arms: CBT vs IPT vs imipramine + clinical management vs placebo + clinical management

Duration: 16 weeks

Key findings:

Exam Pearl

The TDCRP is frequently cited as evidence that CBT is less effective for severe depression. This interpretation is contested: CBT performance varied dramatically across sites, suggesting therapist competence issues rather than a limitation of the model. Site 2 (University of Oklahoma), where CBT therapist competence was rated lowest, dragged down the CBT results.

Clinical Anchor

The TDCRP taught the field three lasting lessons: (1) multi-site trials require rigorous therapist training and monitoring; (2) severity moderates treatment response; and (3) the placebo response rate in depression trials is substantial (~29%).

2.3 DeRubeis et al. (2005) — CBT vs Medications for Moderate-to-Severe Depression

This trial directly addressed the TDCRP criticism that CBT does not work for severe depression.

Design: RCT, 2 sites (University of Pennsylvania, Vanderbilt)

Sample: N = 240 outpatients with moderate-to-severe MDD (HRSD >= 20)

Arms: Cognitive therapy vs paroxetine vs placebo (8 weeks, then active arms continued to 16 weeks)

Key findings:

Clinical implication: CBT is effective for moderate-to-severe depression, directly refuting the TDCRP conclusion.

Exam Pearl

DeRubeis et al. (2005) is the definitive rebuttal to "CBT doesn't work for severe depression." The response rates were identical to medication at 58% each. Know this trial and the TDCRP as a pair — examiners love this comparison.

2.4 Hollon et al. (2005) — The Enduring Effects of Cognitive Therapy

This is the companion paper to DeRubeis et al. (2005), following patients after treatment discontinuation.

Design: Naturalistic follow-up of the DeRubeis (2005) RCT

Sample: Responders from CT (n = 52) vs responders from paroxetine randomised to continuation medication or placebo withdrawal

Duration: 12-month follow-up after 16-week acute treatment

Key findings:

Exam Pearl

Hollon et al. (2005) established that CBT has an enduring effect that persists after treatment ends. Medication works while you take it; CBT continues to protect after you stop. This is the single most important argument for CBT in depression guidelines.

Clinical Anchor

The relapse rate after medication discontinuation (76%) is strikingly high. This is why most guidelines now recommend either long-term maintenance medication OR a course of CBT for recurrent depression — not just a single course of antidepressants.

2.5 Clark et al. (2003, 2006) — CBT for Social Anxiety and PTSD

David M. Clark's Oxford group produced two landmark trials that established disorder-specific cognitive therapy as a front-runner for anxiety disorders.

Clark et al. (2003) — Social Anxiety Disorder

Design: RCT

Sample: N = 60 with social phobia

Arms: Individual CT (Clark & Wells model) vs fluoxetine + self-exposure vs placebo

Key findings: CT was significantly superior to both fluoxetine and placebo at post-treatment and follow-up. Effect sizes were very large (d > 1.0 vs placebo).

Ehlers, Clark et al. (2003, 2005) — PTSD

Design: RCT

Sample: N = 85 with chronic PTSD (Ehlers et al., 2005 trial)

Arms: Cognitive therapy (Ehlers & Clark model) vs self-help booklet vs waitlist

Key findings: CT produced large effects (d = 1.27 vs waitlist). 71% achieved reliable and clinically significant change.

Exam Pearl

Clark's cognitive therapy models for social anxiety and PTSD are now considered among the most effective psychological treatments for any condition. The effect sizes (d > 1.0) exceed typical psychotherapy benchmarks.


3. DBT LANDMARK TRIALS

3.1 Linehan et al. (1991) — The First RCT of DBT for BPD

Marsha Linehan's first RCT was a watershed moment for BPD treatment. Before this trial, the prevailing view was that BPD was untreatable and that patients should be avoided.

Design: RCT

Sample: N = 44 women with BPD and recent parasuicide

Arms: DBT (1 year) vs treatment as usual (TAU)

Key findings:

Exam Pearl

Linehan (1991) showed DBT reduces self-harm and hospitalisation but NOT depression or suicidal ideation. This distinction matters: DBT targets behavioural dysregulation, not mood per se. When asked about DBT efficacy, specify the outcomes — it reduces the behaviour, not necessarily the distress.

3.2 Linehan et al. (2006) — DBT vs Community Expert Treatment

The major criticism of early DBT trials was that the comparison condition (TAU) was a weak control. This trial addressed that directly.

Design: RCT

Sample: N = 101 women with BPD, recent suicidal or self-injurious behaviour

Arms: DBT (1 year) vs community treatment by experts (CTBE) — therapists nominated as expert BPD treaters

Key findings:

Exam Pearl

Linehan (2006) is the strongest evidence for DBT superiority over genuine expert treatment (not just TAU). The comparison was deliberately rigorous — community experts who treated BPD regularly, not randomly assigned novices.

Clinical Anchor

A key finding was that even expert clinicians without a structured BPD protocol achieved significantly worse outcomes than DBT. Structure, protocol adherence, and skills training matter — clinical experience alone is insufficient for this population.

3.3 Rathus & Miller (2002) — Adolescent DBT

Design: Quasi-experimental (non-randomised comparison)

Sample: N = 111 suicidal adolescents with BPD features

Arms: DBT-A (12 weeks, adapted for adolescents) vs TAU (12-week supportive-psychodynamic therapy)

Key findings: DBT-A had significantly fewer psychiatric hospitalisations and higher treatment completion. No suicide attempts in the DBT-A group vs 3 in TAU.

Exam Pearl

Rathus & Miller (2002) is the foundational study for adolescent DBT (DBT-A). Although not a true RCT, it demonstrated feasibility and safety, leading to the later McCauley et al. (2018) definitive trial showing DBT-A reduces self-harm in adolescents.


4. PSYCHODYNAMIC LANDMARK TRIALS

4.1 Shedler (2010) — The Efficacy of Psychodynamic Psychotherapy

Jonathan Shedler's American Psychologist paper was a landmark review that challenged the narrative that psychodynamic therapy lacked an evidence base.

Design: Meta-analytic review

Key findings:

Exam Pearl

Shedler (2010) showed that the effect size for psychodynamic therapy (d = 0.97) matches CBT benchmarks. The "sleeper effect" — continuing improvement after termination — was unique to psychodynamic therapy and was not seen with CBT or medication.

Exam Strategy

If asked "is psychodynamic therapy evidence-based?", cite Shedler (2010) and Leichsenring & Rabung (2008, 2011). The evidence base is smaller than CBT's, but the effect sizes are comparable.

4.2 Leichsenring & Rabung (2008, 2011) — Long-Term Psychodynamic Psychotherapy Meta-Analyses

Design: Meta-analyses of RCTs and controlled trials of long-term psychodynamic psychotherapy (LTPP, defined as > 1 year or > 50 sessions)

Samples: 2008 meta-analysis: 23 studies (N = 1,053). 2011 update: 10 RCTs

Key findings:

Clinical Anchor

"Complex mental disorders" in these meta-analyses means personality disorders, chronic depression, multiple comorbidities, and treatment-resistant presentations. The finding that LTPP is superior for complexity, not for simple single-diagnosis cases, is clinically important — it guides appropriate treatment matching.

4.3 Bateman & Fonagy (1999, 2001, 2008) — Mentalisation-Based Treatment for BPD

Anthony Bateman and Peter Fonagy developed MBT specifically for BPD and tested it in a landmark partial hospitalisation trial.

Bateman & Fonagy (1999) — Original Trial

Design: RCT

Sample: N = 38 with BPD, partial hospitalisation setting

Arms: MBT-based partial hospitalisation (18 months) vs standard psychiatric care

Key findings: MBT significantly reduced self-harm, suicide attempts, inpatient days, depression, anxiety, and interpersonal distress. Effects appeared after 6 months and continued to increase.

Bateman & Fonagy (2001, 2008) — Follow-Up Studies

18-month follow-up (2001): MBT gains were maintained and continued to improve, while the control group showed no change.

8-year follow-up (2008): MBT group maintained significantly better functioning. Suicidality and service use remained lower. 13% of MBT patients still met BPD criteria vs 87% of control.

Exam Pearl

Bateman & Fonagy's 8-year follow-up (2008) is the longest follow-up in any BPD treatment trial. Only 13% of MBT patients still met BPD criteria at 8 years vs 87% of controls. This is the strongest longitudinal evidence for any BPD treatment.

4.4 Clarkin et al. (2007) — TFP vs DBT vs Supportive Therapy for BPD

Design: RCT

Sample: N = 90 women with BPD

Arms: Transference-Focused Psychotherapy (TFP, 1 year) vs DBT (1 year) vs psychodynamic supportive therapy (1 year)

Key findings:

Exam Pearl

Clarkin (2007) is the only trial comparing TFP, DBT, and supportive therapy head-to-head. TFP was the only treatment that improved reflective functioning and attachment. DBT was the only one that improved impulsivity. Each modality has a distinct mechanism signature.


5. SCHEMA THERAPY LANDMARK TRIALS

5.1 Giesen-Bloo et al. (2006) — Schema Therapy vs TFP for BPD

This is the largest and most methodologically rigorous trial of any psychotherapy for BPD. It was conducted across four sites in the Netherlands and directly compared two structured, long-term treatments.

Design: Multi-centre RCT

Sample: N = 86 with BPD (44 ST, 42 TFP), 3-year treatment

Arms: Schema therapy (2 sessions/week, 3 years) vs Transference-Focused Psychotherapy (2 sessions/week, 3 years)

Key findings:

Exam Pearl

Giesen-Bloo et al. (2006) produced the highest recovery rate ever reported for BPD in an RCT (45.5%). Know this number. The dropout rate (26.7%) was also notably low for a BPD trial, reflecting ST's emphasis on limited reparenting and alliance.

Clinical Anchor

The TFP dropout rate (50.6%) was comparable to dropout from unstructured treatment. The ST dropout rate (26.7%) was half that of TFP. For a population notorious for treatment attrition, this is clinically significant. Limited reparenting and the warm therapeutic stance in ST appear to be protective against dropout.

Exam Strategy

If asked to compare ST, TFP, DBT, and MBT for BPD, know the trial evidence for each: Giesen-Bloo (2006) for ST, Clarkin (2007) for TFP, Linehan (2006) for DBT, Bateman & Fonagy (1999/2008) for MBT. All show efficacy; ST has the highest recovery rate, MBT has the longest follow-up, DBT has the strongest evidence for reducing self-harm behaviour.

5.2 Nadort et al. (2009) — Schema Therapy With and Without Phone Crisis Support

Design: Multi-centre RCT

Sample: N = 62 with BPD

Arms: ST with therapist phone availability between sessions vs ST without phone availability

Key findings:

Exam Pearl

Nadort et al. (2009) showed that ST's efficacy does not depend on between-session phone contact, making it more practical and less burdensome than DBT for therapists. ST works through the therapeutic relationship in-session, not through crisis coaching.

5.3 Bamelis et al. (2014) — Schema Therapy for Cluster C Personality Disorders

Design: Multi-centre RCT

Sample: N = 323 with Cluster C PDs (avoidant, dependent, obsessive-compulsive), PD NOS, or paranoid PD

Arms: ST (50 sessions) vs Clarification-Oriented Psychotherapy (COP) vs TAU

Key findings:

Exam Pearl

Bamelis (2014) is the largest trial of any psychotherapy for personality disorders (N = 323). It established ST's efficacy beyond BPD — specifically for Cluster C PDs. The 81% recovery rate for ST is remarkable for personality disorders.


6. EXPOSURE THERAPY AND EMDR

6.1 Foa, Dancu, Hembree et al. (1999, 2005) — Prolonged Exposure for PTSD

Edna Foa's work on Prolonged Exposure (PE) is the foundation of trauma-focused treatment.

Foa et al. (1999)

Design: RCT

Sample: N = 96 female sexual assault survivors with chronic PTSD

Arms: PE vs stress inoculation training (SIT) vs combined PE + SIT vs waitlist

Key findings:

Foa et al. (2005)

Design: RCT

Sample: N = 179 women with PTSD

Arms: PE vs PE + cognitive restructuring vs waitlist

Key findings: PE alone and PE + CR were equally effective. Adding cognitive restructuring did not improve outcomes beyond exposure alone.

Exam Pearl

Foa's work consistently shows that exposure is the critical ingredient in PTSD treatment. Adding cognitive restructuring, stress inoculation, or other components does not significantly improve outcomes beyond PE alone. Exposure is necessary and may be sufficient.

6.2 Resick et al. (2002) — Cognitive Processing Therapy vs Prolonged Exposure

Design: RCT

Sample: N = 171 female sexual assault survivors with PTSD

Arms: CPT (12 sessions) vs PE (9 sessions) vs minimal attention control

Key findings:

Clinical Anchor

CPT and PE are the two front-line trauma treatments recommended by the VA/DoD guidelines. For clinical matching: PE is preferred when avoidance of trauma memories is the primary maintaining factor. CPT is preferred when "stuck points" (maladaptive beliefs about the trauma) are prominent.

6.3 Shapiro (1989) — The Original EMDR Study

Design: RCT

Sample: N = 22 with traumatic memories (rape, molestation, Vietnam veterans)

Arms: EMDR (single session) vs modified flooding with saccades

Key findings: EMDR produced significant reductions in subjective distress (SUD scores) and increases in positive cognition validity (VOC scores) after a single session.

Exam Pearl

Shapiro (1989) has been heavily criticised for: small sample, therapist allegiance (Shapiro administered all treatments), non-blind assessment, and inadequate comparison condition. However, it generated a research programme that has since accumulated strong evidence. Know the criticisms but also know the subsequent positive trials.

6.4 Bisson et al. (2007) — Cochrane Review of Psychological Treatments for PTSD

Design: Cochrane systematic review and meta-analysis

Sample: 38 RCTs

Key findings:

Exam Pearl

The Cochrane review (Bisson, 2007) gave NICE and WHO their evidence base for recommending TF-CBT and EMDR as first-line PTSD treatments. The specific finding that trauma-focused approaches outperform non-trauma-focused ones is critical — supportive counselling is not adequate for PTSD.


7. FAMILY AND COUPLES THERAPY

7.1 Gottman (1999) — The Mathematics of Divorce Prediction

John Gottman's research programme, culminating in his book "The Seven Principles for Making Marriage Work," used observational coding of marital interactions to predict divorce with remarkable accuracy.

Design: Prospective longitudinal studies with observational coding (Specific Affect Coding System, SPAFF)

Samples: Multiple cohorts totalling over 3,000 couples

Key findings:

Exam Pearl

Gottman's Four Horsemen of the Apocalypse: Criticism, Contempt, Defensiveness, Stonewalling. Contempt is the most damaging. The 5:1 positive-to-negative ratio during conflict is the stability threshold. These are among the most asked questions in couples therapy examinations.

7.2 Johnson, Hunsley, Greenberg & Schindler (1999) — EFT for Couples

Design: Meta-analysis

Sample: 7 studies of Emotionally Focused Couple Therapy (EFT-C)

Key findings:

Exam Pearl

EFT-C's effect size of d = 1.30 is the largest reported for any couples therapy approach. Know that EFT-C (Johnson) works through accessing attachment emotions and restructuring interactional cycles — it is NOT the same as Greenberg's EFT for individuals, though they share theoretical roots.

7.3 Leff et al. (2000) — Family Intervention for Schizophrenia

Design: Meta-analysis and critical review of family intervention trials

Key findings:

Clinical Anchor

High expressed emotion (criticism, hostility, emotional over-involvement) in family members is one of the most replicated predictors of relapse in schizophrenia. Family interventions targeting EE are now recommended in every major schizophrenia guideline (NICE, APA, PORT). The evidence is strong enough that NOT offering family intervention represents a gap in care.


8. ACT AND MBCT

8.1 Teasdale et al. (2000) — MBCT Prevents Depression Relapse

Design: Multi-site RCT

Sample: N = 145 with recurrent MDD (>= 3 prior episodes), currently in remission

Arms: MBCT (8-week group programme) + TAU vs TAU alone

Key findings:

Exam Pearl

Teasdale (2000) showed MBCT prevents relapse in patients with >= 3 prior episodes of depression, NOT in those with only 2 episodes. This specificity is crucial: MBCT works by disrupting cognitive reactivity, which is more established in highly recurrent depression. The 3+ episodes threshold is the standard clinical indication.

8.2 Kuyken et al. (2015) — MBCT vs Maintenance Antidepressants

Design: Multi-centre RCT (the PREVENT trial)

Sample: N = 424 with recurrent MDD (>= 3 episodes), on maintenance antidepressants

Arms: MBCT with support to taper antidepressants vs maintenance antidepressants

Duration: 24-month follow-up

Key findings:

Exam Pearl

Kuyken (2015) is the definitive MBCT trial. It established MBCT as a viable alternative to long-term antidepressant maintenance. The finding that childhood abuse survivors benefited more from MBCT than medication suggests MBCT may particularly help patients whose depression is maintained by ruminative processing of early adversity.

Exam Strategy

Know the triad of MBCT evidence: Teasdale (2000) = prevention, Ma & Teasdale (2004) = replication, Kuyken (2015) = non-inferiority to medication.

8.3 A-Tjak et al. (2015) — ACT Meta-Analysis

Design: Meta-analysis of RCTs

Sample: 39 RCTs across multiple conditions

Key findings:

Exam Pearl

A-Tjak (2015) is the benchmark ACT meta-analysis. ACT works — it outperforms control conditions with medium effect sizes. But it does NOT outperform traditional CBT. The claim that ACT is "better" than CBT is not supported; the claim that ACT is an effective, evidence-based alternative is.


9. COMPARATIVE AND INTEGRATION STUDIES

9.1 Cuijpers et al. (2008, 2019) — Meta-Analyses of Psychotherapy for Depression

Pim Cuijpers and his group at VU Amsterdam have produced the most comprehensive meta-analytic programme in psychotherapy research.

Cuijpers et al. (2008)

Design: Meta-analysis

Sample: 53 RCTs of psychotherapy for adult depression

Key findings:

Exam Pearl

Cuijpers (2008) estimated that publication bias inflates psychotherapy effect sizes by approximately 25%. The corrected effect size (d = 0.42) is still clinically meaningful but more modest than the uncorrected estimate. Always consider publication bias when interpreting psychotherapy meta-analyses.

Cuijpers et al. (2019) — Updated Network Meta-Analysis

Design: Network meta-analysis

Sample: 331 RCTs (N > 34,000)

Key findings:

Clinical Anchor

The consistent finding across Cuijpers' meta-analyses is that combination treatment (therapy + medication) outperforms either alone. For moderate-to-severe depression, the evidence strongly supports combination. This is reflected in NICE, APA, and CANMAT guidelines.

9.2 Steinert et al. (2017) — Psychodynamic Therapy vs CBT

Design: Systematic review and meta-analysis

Sample: 23 RCTs directly comparing psychodynamic therapy with CBT

Key findings:

Exam Pearl

Steinert (2017) is the most cited direct comparison meta-analysis of psychodynamic vs CBT. The effect difference was essentially zero (g = -0.07). Use this to counter the narrative that CBT is "more evidence-based" — it has more trials, but when directly compared, outcomes are equivalent.

9.3 Barlow's Unified Protocol (UP)

The Unified Protocol for Transdiagnostic Treatment of Emotional Disorders (Barlow et al., 2011) represents the integration movement in CBT — treating the shared mechanisms underlying all emotional disorders rather than using disorder-specific protocols.

Core modules: Mindful emotion awareness, cognitive flexibility, countering emotional behaviours (exposure), interoceptive awareness

Key trials:

Exam Pearl

Barlow's Unified Protocol is the most researched transdiagnostic CBT. It treats neuroticism (the shared vulnerability across emotional disorders) rather than disorder-specific symptoms. The non-inferiority finding vs disorder-specific protocols is significant — one protocol can treat comorbid anxiety and mood disorders simultaneously.


10. PROCESS RESEARCH

10.1 Bordin (1979) — The Working Alliance

Edward Bordin proposed a pantheoretical model of the therapeutic alliance with three components:

  1. Tasks — agreement on what happens in therapy
  2. Goals — agreement on desired outcomes
  3. Bond — mutual trust, liking, and attachment

Impact: The Working Alliance Inventory (WAI), based on Bordin's model, became the most widely used alliance measure. Subsequent meta-analyses (Horvath & Symonds, 1991; Horvath et al., 2011) confirmed a consistent relationship between alliance and outcome: r = 0.275 (medium effect), stable across therapies, raters, and time of measurement.

Exam Pearl

Bordin's three alliance components: Tasks, Goals, Bond. The alliance-outcome correlation (r = 0.275) is one of the most replicated findings in psychotherapy research. It accounts for roughly 7.5% of outcome variance — modest but consistent and significant.

Clinical Anchor

The alliance is not just "rapport." It includes agreement on what you are doing (tasks) and why (goals). A therapist who has excellent rapport but uses interventions the patient does not understand or agree with has a poor task alliance — and this predicts poorer outcomes.

10.2 Safran & Muran (2000) — Alliance Rupture and Repair

Jeremy Safran and J. Christopher Muran's programme of research on rupture-repair sequences revolutionised how clinicians understand the therapeutic relationship.

Key concepts:

Research findings:

Exam Pearl

Safran & Muran's key finding: repaired ruptures predict BETTER outcomes than no rupture at all. A therapy that goes smoothly may mean the patient is being compliant rather than engaged. The rupture-repair cycle is itself therapeutic — it provides a corrective emotional experience.

Exam Strategy

There are two types of rupture: confrontation (the patient pushes back) and withdrawal (the patient shuts down). Withdrawal ruptures are more dangerous because they are harder to detect. If a patient suddenly becomes agreeable and compliant after previously being engaged, suspect a withdrawal rupture.

10.3 Kazdin (2007) — Mediators and Mechanisms of Change

Alan Kazdin's landmark paper challenged the field to move beyond "does therapy work?" to "how and why does therapy work?"

Key arguments:

Exam Pearl

Kazdin (2007) distinguished mediators from mechanisms. A mediator is a statistical finding (X accounts for the Y-Z relationship). A mechanism requires temporal precedence, specificity, gradient, and plausibility. Very few psychotherapy "mechanisms" have been truly demonstrated — most remain at the mediator level.

Clinical Anchor

This paper has practical implications. If we do not know how therapy works, we cannot efficiently train therapists, improve protocols, or match patients to treatments. Kazdin's call to study mechanisms has shaped the next generation of psychotherapy research, including the dismantling studies and process-outcome studies that now dominate the field.


MASTER TABLE: LANDMARK TRIALS IN PSYCHOTHERAPY

TrialYearDesignNConditionComparisonKey FindingEffect Size
Luborsky et al.1975ReviewMultipleVariousVarious psychotherapies"Dodo bird verdict" — equivalent outcomes
Rush et al.1977RCT41DepressionCT vs imipramineCT superior on all measures
Bordin1979TheoryWorking alliance: tasks, goals, bondr = 0.275
Shapiro1989RCT22TraumaEMDR vs flooding variantEMDR reduced distress rapidly
Elkin et al. (TDCRP)1989Multi-site RCT250DepressionCBT vs IPT vs imipramine vs placeboAll active > placebo; imipramine best for severed = 0.37 (vs placebo)
Linehan et al.1991RCT44BPDDBT vs TAUDBT reduced parasuicide + hospitalisations
Lambert1992ReviewVarious40/30/15/15 outcome variance model
Bateman & Fonagy1999RCT38BPDMBT partial hospital vs standard careMBT reduced self-harm, depression, hospitalisations
Foa et al.1999RCT96PTSDPE vs SIT vs combined vs waitlistPE alone best at follow-up
Gottman1999Longitudinal>3000CouplesObservational codingFour Horsemen predict divorce (93.6%)
Johnson et al.1999Meta-analysis7 studiesCouplesEFT-C vs controlsEFT-C d = 1.30; 70-73% recoveryd = 1.30
Leff et al.2000Meta-analysisMultipleSchizophreniaFamily intervention vs standard care50% relapse reduction
Safran & Muran2000Process researchMultipleVariousRupture-repair predicts better outcomes
Teasdale et al.2000RCT145Recurrent MDDMBCT + TAU vs TAUMBCT halved relapse for 3+ episodes
Wampold2001Meta-analysisMultipleVariousBetween-therapy comparisonsTherapy differences d = 0.00-0.20d = 0.80 (vs no tx)
Rathus & Miller2002Quasi-exp.111Adolescent BPDDBT-A vs TAUDBT-A reduced hospitalisations, 0 suicide attempts
Resick et al.2002RCT171PTSDCPT vs PE vs controlCPT = PE; both superior to control
Clark et al.2003RCT60Social anxietyCT vs fluoxetine vs placeboCT superior to fluoxetine and placebod > 1.0
DeRubeis et al.2005RCT240Moderate-severe MDDCT vs paroxetine vs placeboCT = paroxetine (58% response each)
Foa et al.2005RCT179PTSDPE vs PE + CR vs waitlistPE alone = PE + CR
Hollon et al.2005Follow-up104DepressionPrior CT vs medication continuation vs withdrawalPrior CT relapse 31% vs withdrawal 76%
Giesen-Bloo et al.2006Multi-site RCT86BPDST vs TFP (3 years)ST recovery 45.5% vs TFP 24.4%
Linehan et al.2006RCT101BPDDBT vs community expertsDBT halved suicide attempts (23% vs 46%)
Bisson et al.2007Cochrane review38 RCTsPTSDMultiple comparisonsTF-CBT and EMDR recommended; equivalent to each other
Clarkin et al.2007RCT90BPDTFP vs DBT vs supportiveAll improved; TFP improved mentalisation; DBT improved impulsivity
Kazdin2007ConceptualMediators are not mechanisms; criteria for mechanism evidence
Cuijpers et al.2008Meta-analysis53 RCTsDepressionMultiple psychotherapiesd = 0.67 (0.42 corrected for pub bias); no between-therapy differencesd = 0.67
Leichsenring & Rabung2008Meta-analysis23 studiesComplex disordersLTPP vs shorter therapiesLTPP d = 0.96 for complex presentationsd = 0.96
Bateman & Fonagy20088-year follow-up38BPDMBT vs standard care13% still met BPD criteria vs 87% control
Nadort et al.2009RCT62BPDST + phone vs ST aloneNo added benefit of phone support
Shedler2010Meta-reviewMultipleVariousPsychodynamic vs benchmarksd = 0.97; "sleeper effect" unique to psychodynamicd = 0.97
Bamelis et al.2014Multi-site RCT323Cluster C PDST vs COP vs TAUST 81% recovery; largest PD trial
A-Tjak et al.2015Meta-analysis39 RCTsVariousACT vs controls and CBTACT g = 0.57 vs controls; g = 0.06 vs CBT (ns)g = 0.57
Kuyken et al.2015RCT424Recurrent MDDMBCT + taper vs maintenance medsMBCT non-inferior to medication over 24 months
Steinert et al.2017Meta-analysis23 RCTsVariousPsychodynamic vs CBTNo significant difference: g = -0.07g = -0.07
Cuijpers et al.2019Network meta-analysis331 RCTsDepression7 psychotherapy typesAll effective; no between-therapy differences; combo > monotherapy

KEY NUMBERS TO MEMORISE

NumberWhat It RepresentsSource
40/30/15/15Outcome variance: client/relationship/expectancy/techniqueLambert (1992)
d = 0.80Overall effect of psychotherapy vs no treatmentWampold (2001)
r = 0.275Alliance-outcome correlationHorvath et al. (2011)
5:1Positive-to-negative interaction ratio for stable couplesGottman (1999)
45.5%ST recovery rate for BPDGiesen-Bloo (2006)
31% vs 76%Relapse after prior CT vs medication withdrawalHollon (2005)
d = 1.30EFT-C effect sizeJohnson et al. (1999)
3+ episodesMBCT relapse prevention thresholdTeasdale (2000)
93.6%Gottman's divorce prediction accuracyGottman (1999)
13% vs 87%BPD criteria at 8-year follow-up (MBT vs control)Bateman & Fonagy (2008)

Weave Psychotherapy Vol. 12 — Landmark Papers & Trials

Dr. Wilfred Dsouza

www.weave.clinic wilfred.desouza1996@gmail.com IG: @weave.clinic
02
Clinical Quick Reference
Landmark Papers & Trials — Weave Psychotherapy Vol. 12
www.weave.clinic
WEAVE Weave Psychotherapy Vol. 12 | Landmark Papers & Trials Chapter 02 · Clinical Quick Reference

D6: Clinical Quick Reference — Landmark Papers & Trials in Psychotherapy


1. Master Landmark Trials Table

Common Factors & Outcome Research

TrialYearDesignNKey Finding
Luborsky et al.1975Narrative reviewMultipleDodo bird verdict — no consistent differences between bona fide therapies
Lambert1992ReviewOutcome variance: 40% client, 30% relationship, 15% expectancy, 15% technique
Wampold2001Meta-analysisMultipleTherapy vs no treatment d = 0.80; between-therapy differences d = 0.00-0.20
Wampold2015Meta-analysisMultipleTherapist effects (5-9%) > treatment differences; alliance r = 0.28

CBT Trials

TrialYearDesignNConditionKey Finding
Rush et al.1977RCT41DepressionFirst CT vs medication trial; CT superior to imipramine
Elkin et al. (TDCRP)1989Multi-site RCT250DepressionAll active > placebo; CBT struggled in severe depression
DeRubeis et al.2005RCT240Moderate-severe MDDCT = paroxetine (58% response each); refuted TDCRP
Hollon et al.2005Follow-up104DepressionPrior CT relapse 31% vs medication withdrawal 76%
Clark et al.2003RCT60Social anxietyCT > fluoxetine > placebo; d > 1.0
Ehlers & Clark2005RCT85Chronic PTSDCT d = 1.27 vs waitlist; 71% reliable change

DBT Trials

TrialYearDesignNConditionKey Finding
Linehan et al.1991RCT44BPD (women)DBT reduced parasuicide + hospitalisations; NOT depression
Rathus & Miller2002Quasi-exp111Adolescent BPDDBT-A reduced hospitalisations; 0 suicide attempts
Linehan et al.2006RCT101BPD (women)DBT halved suicide attempts vs expert clinicians (23% vs 46%)

Psychodynamic Trials

TrialYearDesignNConditionKey Finding
Bateman & Fonagy1999RCT38BPDMBT reduced self-harm, depression, hospitalisations
Bateman & Fonagy20088-yr follow-up38BPD13% still met BPD criteria vs 87% control
Clarkin et al.2007RCT90BPDTFP improved mentalisation; DBT improved impulsivity
Leichsenring & Rabung2008Meta-analysis1,053Complex disordersLTPP d = 0.96 for complex presentations
Shedler2010Meta-reviewMultipleVariousPsychodynamic d = 0.97; unique "sleeper effect"

Schema Therapy Trials

TrialYearDesignNConditionKey Finding
Giesen-Bloo et al.2006Multi-site RCT86BPDST recovery 45.5% vs TFP 24.4%; ST dropout 26.7% vs TFP 50.6%
Nadort et al.2009RCT62BPDST effective without phone crisis support
Bamelis et al.2014Multi-site RCT323Cluster C PDST recovery 81% vs COP 60% vs TAU 52%; largest PD trial

Trauma Trials (PE, CPT, EMDR)

TrialYearDesignNConditionKey Finding
Shapiro1989RCT22TraumaOriginal EMDR study; heavily criticised but generative
Foa et al.1999RCT96PTSD (sexual assault)PE alone best at follow-up; combined NOT superior
Resick et al.2002RCT171PTSD (sexual assault)CPT = PE; both superior to control
Foa et al.2005RCT179PTSDPE alone = PE + cognitive restructuring
Bisson et al.2007Cochrane review38 RCTsPTSDTF-CBT and EMDR recommended; equivalent to each other

Family & Couples Therapy

TrialYearDesignNConditionKey Finding
Gottman1999Longitudinal>3,000CouplesFour Horsemen predict divorce (93.6%); 5:1 ratio
Johnson et al.1999Meta-analysis7 studiesCouplesEFT-C d = 1.30; 70-73% recovery
Leff et al.2000Meta-analysisMultipleSchizophreniaFamily intervention halved relapse (~25% vs ~50%)

ACT & MBCT

TrialYearDesignNConditionKey Finding
Teasdale et al.2000RCT145Recurrent MDDMBCT halved relapse for 3+ episodes (37% vs 66%)
Kuyken et al.2015RCT424Recurrent MDDMBCT non-inferior to maintenance antidepressants over 24 months
A-Tjak et al.2015Meta-analysis39 RCTsVariousACT g = 0.57 vs controls; equivalent to CBT (g = 0.06, ns)

Comparative & Process Research

TrialYearDesignNKey Finding
Bordin1979Theory paperWorking alliance = tasks + goals + bond; r = 0.275 with outcome
Safran & Muran2000Process researchMultipleRepaired ruptures predict better outcomes than no rupture
Kazdin2007ConceptualMediators are not mechanisms; five criteria for mechanism evidence
Cuijpers et al.2008Meta-analysis53 RCTsDepression: d = 0.67 (0.42 corrected for pub bias)
Steinert et al.2017Meta-analysis23 RCTsPsychodynamic vs CBT: g = -0.07 (no difference)
Cuijpers et al.2019Network MA331 RCTsAll 7 therapy types effective for depression; combo > mono
Barlow et al.2017RCT223Unified Protocol non-inferior to disorder-specific CBT

2. Key Numbers to Memorise

Effect Sizes

NumberMeaningSource
d = 0.80Overall psychotherapy vs no treatmentWampold (2001)
d = 0.00-0.20Differences between bona fide therapiesWampold (2001)
d = 0.67Psychotherapy for depression (uncorrected)Cuijpers (2008)
d = 0.42Psychotherapy for depression (corrected for pub bias)Cuijpers (2008)
d = 0.97Psychodynamic therapyShedler (2010)
d = 0.96LTPP for complex disordersLeichsenring & Rabung (2008)
d = 1.30EFT for couplesJohnson et al. (1999)
d > 1.0Clark's CT for social anxietyClark et al. (2003)
d = 1.27Clark's CT for PTSDEhlers & Clark (2005)
g = 0.57ACT vs control conditionsA-Tjak et al. (2015)
g = -0.07Psychodynamic vs CBT (no difference)Steinert et al. (2017)
r = 0.275Alliance-outcome correlationHorvath et al. (2011)

Percentages and Ratios

NumberMeaningSource
40/30/15/15Outcome variance: client / relationship / expectancy / techniqueLambert (1992)
5-9%Outcome variance explained by therapist effectsWampold (2001)
7.5%Outcome variance explained by allianceHorvath et al. (2011)
5:1Positive-to-negative ratio in stable couplesGottman (1999)
93.6%Gottman's divorce prediction accuracyGottman (1999)
45.5% vs 24.4%ST vs TFP recovery in BPDGiesen-Bloo (2006)
31% vs 76%Relapse: prior CT vs medication withdrawalHollon et al. (2005)
23% vs 46%Suicide attempts: DBT vs community expertsLinehan (2006)
37% vs 66%Relapse: MBCT vs TAU (3+ episodes)Teasdale (2000)
13% vs 87%BPD criteria at 8-year follow-up: MBT vs controlBateman & Fonagy (2008)
58% vs 58%CT vs paroxetine response rate in moderate-severe MDDDeRubeis (2005)
81%ST recovery rate for Cluster C PDsBamelis (2014)
70-73%EFT-C couple recovery rateJohnson et al. (1999)
50%Relapse reduction from family intervention in schizophreniaLeff et al. (2000)

3. "Who Proved What" Quick Reference

Depression

BPD

Personality Disorders (non-BPD)

PTSD

Anxiety Disorders

Couples & Family

Common Factors

Psychodynamic Therapy


4. Timeline of Psychotherapy Research Milestones

YearMilestoneSignificance
1975Luborsky — Dodo bird verdictLaunched the common factors debate
1977Rush et al. — CT vs imipramineCBT entered the evidence arena
1979Bordin — Working Alliance modelPantheoretical alliance framework
1989Elkin et al. — NIMH TDCRPFirst large-scale psychotherapy horse race
1989Shapiro — EMDR introducedNew trauma treatment paradigm
1991Linehan — First DBT trialBPD became treatable
1992Lambert — 40/30/15/15 modelDefined outcome variance factors
1999Bateman & Fonagy — MBT for BPDMentalisation-based treatment enters evidence
1999Foa — PE for PTSDExposure confirmed as key ingredient
1999Gottman — Divorce predictionCouples research became empirical
1999Johnson — EFT-C meta-analysisAttachment-based couples therapy validated
2000Teasdale — MBCT prevents relapseMindfulness entered evidence-based practice
2000Safran & Muran — Rupture-repairAlliance became a dynamic process, not a static trait
2001Wampold — Great Psychotherapy DebateQuantified common vs specific factors
2002Resick — CPT = PE for PTSDTwo front-line trauma treatments established
2003Clark — CT for social anxietyDisorder-specific CBT benchmarks set
2005DeRubeis — CT = medication for severe MDDCBT's severe depression limitation refuted
2005Hollon — CT's enduring effectCBT protects after discontinuation
2006Giesen-Bloo — ST vs TFP for BPDHighest BPD recovery rate in any trial
2006Linehan — DBT vs expert cliniciansDBT superiority confirmed against rigorous comparator
2007Bisson — Cochrane PTSD reviewTF-CBT and EMDR established as first-line
2007Kazdin — Mechanisms of changeShifted field from "does it work" to "how does it work"
2008Leichsenring & Rabung — LTPP meta-analysisLong-term psychodynamic therapy validated for complex disorders
2010Shedler — Psychodynamic efficacy reviewPsychodynamic therapy shown to match CBT effect sizes
2014Bamelis — ST for Cluster C PDsLargest personality disorder trial; ST efficacy broadened
2015Kuyken — MBCT = maintenance medsMBCT established as medication alternative
2015A-Tjak — ACT meta-analysisACT validated; equivalent to CBT
2017Barlow — Unified ProtocolTransdiagnostic CBT non-inferior to disorder-specific
2017Steinert — Psychodynamic vs CBTDirect comparison: no difference (g = -0.07)
2019Cuijpers — Network meta-analysis331 RCTs: all therapies work; combo best

5. Meta-Analyses Summary Table

AuthorsYearFocusStudiesNMain Result
Wampold2001Therapy vs no txMultipleMultipled = 0.80
Cuijpers et al.2008Psychotherapy for depression53 RCTs>5,000d = 0.67 (0.42 corrected)
Leichsenring & Rabung2008LTPP23 studies1,053d = 0.96
Shedler2010Psychodynamic therapyMultipleMultipled = 0.97
Horvath et al.2011Alliance-outcome>200 studies>14,000r = 0.275
A-Tjak et al.2015ACT39 RCTs>3,000g = 0.57 vs controls
Steinert et al.2017PDT vs CBT23 RCTs>2,500g = -0.07 (no difference)
Cuijpers et al.2019Psychotherapy for depression331 RCTs>34,000All 7 types effective; combo > mono
Bisson et al.2007PTSD treatments38 RCTs>3,000TF-CBT and EMDR first-line
Johnson et al.1999EFT-C7 studies>300d = 1.30

6. Viva Questions

Q1. What is the Dodo bird verdict and who proposed it?

Luborsky et al. (1975). When bona fide psychotherapies are compared head-to-head, outcome differences are small or absent. Named after the Dodo bird in Alice in Wonderland: "everybody has won and all must have prizes." Supported by Wampold (2001): between-therapy differences d = 0.00-0.20.

Q2. What are Lambert's four factors of therapeutic outcome?

Extratherapeutic/client factors (40%), therapeutic relationship (30%), expectancy/placebo (15%), technique (15%). The relationship accounts for twice the variance of technique.

Q3. What is the strongest evidence that CBT works for severe depression?

DeRubeis et al. (2005): N = 240, moderate-to-severe MDD. CT and paroxetine had identical response rates (58%). This directly refuted the NIMH TDCRP (Elkin, 1989) finding that CBT was less effective for severe depression.

Q4. What evidence supports CBT's enduring effect?

Hollon et al. (2005): relapse rate after prior CT was 31% vs 76% after medication withdrawal. Prior CT was equivalent to continuing medication (47%). CBT provides lasting protection after discontinuation; medication does not.

Q5. Compare the evidence for DBT, MBT, ST, and TFP in BPD.

DBT (Linehan, 2006): halved suicide attempts vs expert treatment. MBT (Bateman & Fonagy, 2008): 8-year follow-up, 13% still met BPD criteria. ST (Giesen-Bloo, 2006): highest recovery rate (45.5%), lowest dropout. TFP (Clarkin, 2007): uniquely improved mentalisation. All effective; different mechanisms.

Q6. What is the evidence for MBCT in preventing depression relapse?

Teasdale (2000): MBCT halved relapse for patients with 3+ episodes (37% vs 66%). NOT effective for patients with only 2 episodes. Kuyken (2015): MBCT non-inferior to maintenance antidepressants over 24 months.

Q7. What are the recommended first-line treatments for PTSD?

Bisson et al. (2007, Cochrane): TF-CBT (including PE and CPT) and EMDR. Both are equivalent to each other. Non-trauma-focused therapies (supportive counselling, relaxation) are not recommended. Exposure is the critical ingredient (Foa, 1999, 2005).

Q8. What is the working alliance and how does it relate to outcome?

Bordin (1979): tasks, goals, bond. Alliance-outcome correlation r = 0.275 (~7.5% variance). Most replicated finding in psychotherapy research. Safran & Muran (2000): repaired alliance ruptures predict better outcomes than no rupture.

Q9. What did Kazdin (2007) argue about mechanisms of change?

Most claimed mechanisms are actually mediators. True mechanism evidence requires: temporal precedence, specificity, gradient, strong association, and plausibility. Very few psychotherapy mechanisms meet all criteria.

Q10. What is the evidence that psychodynamic therapy is effective?

Shedler (2010): d = 0.97, matching CBT benchmarks. Unique "sleeper effect" (continuing improvement post-treatment). Leichsenring & Rabung (2008): LTPP d = 0.96 for complex disorders. Steinert (2017): direct comparison with CBT shows g = -0.07 (no difference).

Q11. What is Gottman's contribution to couples therapy evidence?

Prospective longitudinal research with >3,000 couples. Four Horsemen (criticism, contempt, defensiveness, stonewalling) predict divorce at 93.6% accuracy. Stable couples: 5:1 positive-to-negative ratio during conflict. Contempt is the most toxic behaviour.

Q12. What is the largest psychotherapy trial for personality disorders?

Bamelis et al. (2014): N = 323. Schema therapy for Cluster C PDs (avoidant, dependent, OCPD) and PD NOS. ST recovery 81% vs COP 60% vs TAU 52%. ST was also more cost-effective.

Exam Strategy

For viva, know one landmark trial per modality plus one meta-analysis. Minimum set: Rush (1977), Linehan (1991/2006), Bateman & Fonagy (1999), Giesen-Bloo (2006), Foa (1999), Teasdale (2000), Wampold (2001), Shedler (2010), Cuijpers (2019).


Weave Psychotherapy Vol. 12 — Landmark Papers & Trials

Dr. Wilfred Dsouza

www.weave.clinic wilfred.desouza1996@gmail.com IG: @weave.clinic
WEAVE
Centre for Integrative Psychiatry
Psychiatry that listens before it prescribes.
Website
www.weave.clinic
Instagram
@weave.clinic · @doc.wilfred.md · @doc.niharika.md
Email
Two doctors. Two approaches. One practice.
ADHD · Relationships · Mood · Addiction · Children
Online across India