are different methods to reduce placebo response in cns ...€¦ · affect placebo response when...

1
The SAFER Interview The SAFER Interview is an operationalized version of the SAFER (Specificity, Assessability, Face Validity, Ecological Validity, and Rule of the 3 Ps) principles developed by clinicians at Massachusetts General Hospital (MGH) 4 . In a recent analysis of five trials in treatment resistant depression (TRD) all subjects had passed screening procedures at the site and were considered to be eligible for the trial. In addition to the 45-minute SAFER Interview performed remotely by MGH clinicians who called subjects (N=1935) directly, a structured severity interview was performed; 1562 (81 percent) of the subjects were deemed eligible for continued screening. (Figure 5) Of the 373 (19 percent) subjects deemed ineligible: 97 (5 percent) did not meet severity criteria; 39 (2 percent) did not meet only SAFER criteria; 151 (8 percent) did not meet ATRQ criteria for treatment resistance; and, 86 (4 percent) did not meet criteria on more than one component of the SAFER Interview. Across all of these five TRD studies, placebo response rates were within a range of 18 to 28 percent, below the 30 to 37 percent average in studies of treatments approved for TRD (olanzapine-fluoxetine combination, quetiapine and aripiprazole). 19% 81% Zip Mono TRD-1 ADAPT-A TRD-2 ALK Average 12 68 54 52 81 53 100 80 60 40 20 0 % Reduction Randomize Placebo Active Treatment (Dose X) Active Treatment (Dose X) Active Treatment (Dose X) Placebo Responder Responder Non Responder Non Responder Placebo Active Treatment (Dose X) Randomize Randomize Treatment Phase 1 Results Treatment Phase 2 600 400 200 0 Frequency Frequency Frequency Frequency BPRS Total Score (n=382) PANSS Total Score (n=563) HAM-A Total Score (n=462) MADRS Total Score (n=4140) 80 60 40 20 0 100 80 60 40 20 0 50 40 30 20 10 0 0 4 8 12 16 20 24 28 32 36 40 44 48 6 10 14 18 22 26 30 34 38 42 50 60 70 80 90 100 110 120 130 140 24 28 32 36 40 44 48 52 56 60 64 68 Number of Central Raters Estimated number of site raters Number of Raters 180 160 140 120 100 80 60 40 20 0 STUDY A STUDY B STUDY C STUDY D STUDY E STUDY F STUDY G STUDY I STUDY J STUDY K STUDY L STUDY M STUDY H STUDY N STUDY O STUDY P STUDY Q STUDY R STUDY S STUDY T STUDY U STUDY V STUDY W STUDY X Rater Cohort Characteristics 1.00 80% 100 Experienced, calibrated 0.93 3 78% 108 0.80 71% 125 Inexperienced, calibrated 0.77 3 70% 130 0.60 59% 167 Experienced, calibrated 0.55 3 55% 181 Reliability (ICC) Study Power 1 Sample Size per Arm 2 Fava,M 1,2 , Williams JBW 3,4 , Freeman, M 1,2 , Detke MJ 3,5,6 1 Massachusetts General Hospital Dept. of Psychiatry, 2 Clinical Trials Network and Institute 3 MedAvante, Inc., 4 Columbia University Depts. of Psychiatry and Neurology, 5 Detke Biopharma Consulting LLC, 6 Indiana University School of Medicine Are Different Methods to Reduce Placebo Response in CNS Trials Mutually Exclusive or Have Additive Effects? ©2014 MedAvante Inc. RESEARCH INSTITUTE INTRODUCTION Over the past two decades, numerous methodologies have been developed with the specific goal of reducing the placebo effect in CNS trials. The number of failed trials in psychiatry has increased substantially over time. This may be due to baseline score inflation and the inclusion of inappropriate subjects, variability in baseline and post-baseline clinical ratings, and expectation or relationship biases associated with the perceptions of subjects, clinicians or both that can affect placebo response when clinical raters function in multiple roles at study sites. This presentation will discuss how a combination of three strategies, Central Ratings, use of the SAFER Interview, and use of Sequential Parallel Comparison Design (SPCD), may offer a “triple safety” net to ensure maximum reduction of the risk of a failed trial. THE THREE STRATEGIES COMBINING THE THREE STRATEGIES Each of these risk reduction strategies can improve results, but combining all three methods to work in sequence in a trial can increase study power and most effectively minimize the methodology factors that can cause a trial to fail to detect a signal. Analyses have shown that the cost of reducing the risk of trial failure increases the expected net present value of the overall program investment, many times over 5 . Calibrated independent central raters avoid the biases and reduce the rating variability that adversely affects subject selection, accurate baseline and post-baseline ratings. The SAFER Interview, administered by independent central raters, can help ensure selection of appropriate subjects. SPCD methodology, in conjunction with the accuracy and precision achievable with remote central ratings, can decrease placebo response and allow for a smaller sample size. In combination, these three unique but complementary strategies offer a “triple safety net” that may dramatically reduce the risk of a failed trial and offer the best chance for a trial to succeed. References 1 Mueller & Szegedi. (2002). Effects of Interrater Reliability of Psychopathologic Assessment on Power and Sample Size Calculations in Clinical Trials. Journal of Clinical Psychopharmacology. 22(3), 318-325. Calculated where n = 100 subjects per arm and effect size = .40 2 Perkins DO et al. (2000). Penny-wise and pound foolish: The impact of measurement error on sample size requirements in clinical trials. Biological Psychiatry. 47, 762-766. Calculated where n = 100 subjects per arm 3 Kobak KA et al. (2009). Sources of Unreliability in Depression Ratings. .Journal of Clinical Psychopharmacology. 29(1), 82-85. 4 Targum SD, Pollack MH, Fava M. (2008). Redefining affective disorders: Relevance for drug development. CNS Neuroscience & Therapeutics. 14, 2-9; doi: 10.1111/j.1527-3458.2008.00038.x. 5 Greenblatt W et al. (2012). The Value of Risk Reduction in CNS Drug Development: Use of Expected Net Present Value (eNPV) as a Model. Presented at American College of Neuropsychopharmacology 51st Annual Meeting, Hollywood FL. Remote Independent Central Ratings The use of well-trained and blinded, calibrated expert central raters minimizes perception biases and reduces variability, which enhances separation between placebo and active treatment. Remote independent central raters screening subjects via evaluation of symptom severity and the SAFER Interview help ensure the unbiased selection of appropriate subjects while avoiding baseline score inflation. (Figure 1) Figure 1. Sample of Baseline Score Distributions in Completed Studies MDD GAD SCHIZOPHRENIA PSYCHOSIS Central ratings ensure accurate post-baseline ratings and increase study power via high ICCS further enhancing the impact of the SPCD identification of placebo non-responders for randomization in Stage 2. Central raters reduce variability by limiting the number of raters needed for any one trial. (Figure 2) Higher interrater reliability means higher study power. Experienced and calibrated raters ensure the highest level of reliability. (Figure 3) Ongoing calibration of central raters prevents rater drift throughout the course of the trial. (Figure 4) Figure 3. Impact of Experience vs. Calibration on Interrater Reliability Figure 4. Impact of Continuous Calibration on Interrater Reliability Figure 2. Centralized vs. Decentralized Study: Number of Raters Figure 5. SAFER Interview Pass/Fail Rate Sequential Parallel Comparison Design (SPCD) Goals of the Sequential Parallel Comparison Design (SPCD) are to reduce the impact of high placebo response and decrease study sample size. The SPCD model (Figure 6) incorporates two relatively short phases of treatment of equal duration: Phase 1: all subjects randomized to placebo or active treatment; more subjects than usual assigned to placebo arm to enrich phase 2; Phase 2: subjects on active treatment and placebo non-responders in Phase 1 are randomized to placebo or active treatment. Figure 6. Sequential Parallel Comparison Design In Phase 2 the blind is maintained for all subjects as those on active drug in Phase 1 remain on active treatment and placebo non-responders are randomized to placebo or active treatment. Standard analyses of SPCD studies combine data in Phase 1 from all subjects and in Phase 2 only from placebo non-responders during Phase 1. Pooling the data with a pre-specified weighted average significantly reduces the overall placebo response, as it is the average of the placebo response in Phase 1 (standard response) and in Phase 2 (lower response among subjects who failed to respond to placebo prospectively). In five completed SPCD trials, this method has reduced the overall placebo response rate on average by 53 percent compared to the Phase 1 placebo response. (Figure 7) Figure 7. Percent Reduction in Placebo Response Rates in Stage 2 SPCD Trials (Chronological Order) Passes Fails Ziprasadone Monotherapy Study: Papakostas et al, J Clin Psychiatry. 2012 Dec; 73(12): 1541-7; TRD-1 and TRD-2: Papakostas et al, Am J Psychiatry. 2012 Dec 1; 169(12): 1267-74; ADAPT-A: Fava et al, Psychother Psychosom. 2012; 81(2): 87-97; Alkermes ALK 5461: Poster, ISCTM, 2013 Oct Note: ALK response rates relate to MADRS (secondary) outcomes. Q1 Q2 Q3 Q4 HAM-A (N=100) (Study 1)* 0.90 0.96 0.95 0.97 HAM-A (N=68) (Study 2)* 0.98 0.97 0.97 0.96 PANSS (N=131) (Study 3)* 0.90 0.90 0.96 0.88 PANSS (N=67) (Study 4)* 0.98 0.97 0.98 0.98 *MedAvante study data on file and made available to sponsor during study.

Upload: others

Post on 31-May-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Are Different Methods to Reduce Placebo Response in CNS ...€¦ · affect placebo response when clinical raters function in multiple roles at study sites.This presentation will discuss

The SAFER Interview• TheSAFERInterviewisanoperationalizedversionoftheSAFER(Specificity, Assessability, Face Validity, Ecological Validity, and Rule of the 3 Ps)principlesdevelopedbycliniciansatMassachusettsGeneralHospital(MGH)4.

• Inarecentanalysisoffivetrialsintreatmentresistantdepression(TRD)allsubjectshadpassedscreeningproceduresatthesiteandwereconsideredtobeeligibleforthetrial.

• Inadditiontothe45-minuteSAFERInterviewperformedremotelybyMGHclinicianswhocalledsubjects(N=1935)directly,astructuredseverityinterviewwasperformed;1562(81percent)ofthesubjectsweredeemedeligibleforcontinuedscreening.(Figure 5)

• Ofthe373(19percent)subjectsdeemedineligible:97(5percent)didnotmeetseveritycriteria;39(2percent)didnotmeetonlySAFERcriteria;151(8percent)didnotmeetATRQcriteriafortreatmentresistance;and,86(4percent)didnotmeetcriteriaonmorethanonecomponentoftheSAFERInterview.

• AcrossallofthesefiveTRDstudies,placeboresponserateswerewithinarangeof18to28percent,belowthe30to37percentaverageinstudiesoftreatmentsapprovedforTRD(olanzapine-fluoxetinecombination,quetiapineandaripiprazole).

Ziprasadone Monotherapy Study: Papakostas et al, J Clin Psychiatry. 2012 Dec; 73(12): 1541-7; TRD-1 and TRD-2: Papakostas et al, Am J Psychiatry. 2012 Dec 1; 169(12): 1267-74; ADAPT-A: Fava et al, Psychother Psychosom. 2012; 81(2): 87-97; Alkermes ALK 5461: Poster, ISCTM, 2013 Oct Note: ALK response rates relate to MADRS (secondary) outcomes.

19%

81%Zip Mono TRD-1 ADAPT-A TRD-2 ALK Average

12

68

54 52

81

53

100

80

60

40

20

0

% Reduction

Randomize

PlaceboActive

Treatment (Dose X)

Active Treatment

(Dose X)

Active Treatment

(Dose X)Placebo

ResponderResponderNon

ResponderNon

Responder

PlaceboActive

Treatment (Dose X)

Randomize

Randomize

TreatmentPhase 1

Results

TreatmentPhase 2

Rater Cohort Characteristics 1.00 80% 100

Experienced, calibrated 0.933 78% 108

0.80 71% 125

Inexperienced, calibrated 0.773 70% 130 0.60 59% 167

Experienced, calibrated 0.553 55% 181

Reliability (ICC) Study Power1 Sample Size per Arm2

Q1 Q2 Q3 Q4

HAM-A (N=100) (Study 1)* 0.90 0.96 0.95 0.97

HAM-A (N=68) (Study 2)* 0.98 0.97 0.97 0.96

PANSS (N=131) (Study 3)* 0.90 0.90 0.96 0.88

PANSS (N=67) (Study 4)* 0.98 0.97 0.98 0.98

*MedAvante study data on file and made available to sponsor during study.

Ziprasadone Monotherapy Study: Papakostas et al, J Clin Psychiatry. 2012 Dec; 73(12): 1541-7; TRD-1 and TRD-2: Papakostas et al, Am J Psychiatry. 2012 Dec 1; 169(12): 1267-74; ADAPT-A: Fava et al, Psychother Psychosom. 2012; 81(2): 87-97; Alkermes ALK 5461: Poster, ISCTM, 2013 Oct Note: ALK response rates relate to MADRS (secondary) outcomes.

19%

81%Zip Mono TRD-1 ADAPT-A TRD-2 ALK Average

12

68

54 52

81

53

100

80

60

40

20

0

% Reduction

Randomize

PlaceboActive

Treatment (Dose X)

Active Treatment

(Dose X)

Active Treatment

(Dose X)Placebo

ResponderResponderNon

ResponderNon

Responder

PlaceboActive

Treatment (Dose X)

Randomize

Randomize

TreatmentPhase 1

Results

TreatmentPhase 2

Rater Cohort Characteristics 1.00 80% 100

Experienced, calibrated 0.933 78% 108

0.80 71% 125

Inexperienced, calibrated 0.773 70% 130 0.60 59% 167

Experienced, calibrated 0.553 55% 181

Reliability (ICC) Study Power1 Sample Size per Arm2

Q1 Q2 Q3 Q4

HAM-A (N=100) (Study 1)* 0.90 0.96 0.95 0.97

HAM-A (N=68) (Study 2)* 0.98 0.97 0.97 0.96

PANSS (N=131) (Study 3)* 0.90 0.90 0.96 0.88

PANSS (N=67) (Study 4)* 0.98 0.97 0.98 0.98

*MedAvante study data on file and made available to sponsor during study.

Ziprasadone Monotherapy Study: Papakostas et al, J Clin Psychiatry. 2012 Dec; 73(12): 1541-7; TRD-1 and TRD-2: Papakostas et al, Am J Psychiatry. 2012 Dec 1; 169(12): 1267-74; ADAPT-A: Fava et al, Psychother Psychosom. 2012; 81(2): 87-97; Alkermes ALK 5461: Poster, ISCTM, 2013 Oct Note: ALK response rates relate to MADRS (secondary) outcomes.

19%

81%Zip Mono TRD-1 ADAPT-A TRD-2 ALK Average

12

68

54 52

81

53

100

80

60

40

20

0

% Reduction

Randomize

PlaceboActive

Treatment (Dose X)

Active Treatment

(Dose X)

Active Treatment

(Dose X)Placebo

ResponderResponderNon

ResponderNon

Responder

PlaceboActive

Treatment (Dose X)

Randomize

Randomize

TreatmentPhase 1

Results

TreatmentPhase 2

Rater Cohort Characteristics 1.00 80% 100

Experienced, calibrated 0.933 78% 108

0.80 71% 125

Inexperienced, calibrated 0.773 70% 130 0.60 59% 167

Experienced, calibrated 0.553 55% 181

Reliability (ICC) Study Power1 Sample Size per Arm2

Q1 Q2 Q3 Q4

HAM-A (N=100) (Study 1)* 0.90 0.96 0.95 0.97

HAM-A (N=68) (Study 2)* 0.98 0.97 0.97 0.96

PANSS (N=131) (Study 3)* 0.90 0.90 0.96 0.88

PANSS (N=67) (Study 4)* 0.98 0.97 0.98 0.98

*MedAvante study data on file and made available to sponsor during study.

Number of Central Raters Estimated number of site raters

Num

ber o

f Rat

ers

180

160

140

120

100

80

60

40

20

0

STUDY A

STUDY B

STUDY C

STUDY D

STUDY E

STUDY F

STUDY G

STUDY I

STUDY J

STUDY K

STUDY L

STUDY M

STUDY H

STUDY N

STUDY O

STUDY P

STUDY Q

STUDY R

STUDY S

STUDY T

STUDY U

STUDY V

STUDY W

STUDY X

600

400

200

0

Freq

uenc

y

Freq

uenc

y

Freq

uenc

y

Freq

uenc

y

BPRS Total Score (n=382)PANSS Total Score (n=563)HAM-A Total Score (n=462)MADRS Total Score (n=4140)

80

60

40

20

0

100

80

60

40

20

0

50

40

30

20

10

00 4 8 12 16 20 24 28 32 36 40 44 48 6 10 14 18 22 26 30 34 38 42 50 60 70 80 90 100 110 120 130 140 24 28 32 36 40 44 48 52 56 60 64 68

Number of Central Raters Estimated number of site raters

Num

ber o

f Rat

ers

180

160

140

120

100

80

60

40

20

0

STUDY A

STUDY B

STUDY C

STUDY D

STUDY E

STUDY F

STUDY G

STUDY I

STUDY J

STUDY K

STUDY L

STUDY M

STUDY H

STUDY N

STUDY O

STUDY P

STUDY Q

STUDY R

STUDY S

STUDY T

STUDY U

STUDY V

STUDY W

STUDY X

600

400

200

0

Freq

uenc

y

Freq

uenc

y

Freq

uenc

y

Freq

uenc

y

BPRS Total Score (n=382)PANSS Total Score (n=563)HAM-A Total Score (n=462)MADRS Total Score (n=4140)

80

60

40

20

0

100

80

60

40

20

0

50

40

30

20

10

00 4 8 12 16 20 24 28 32 36 40 44 48 6 10 14 18 22 26 30 34 38 42 50 60 70 80 90 100 110 120 130 140 24 28 32 36 40 44 48 52 56 60 64 68

Ziprasadone Monotherapy Study: Papakostas et al, J Clin Psychiatry. 2012 Dec; 73(12): 1541-7; TRD-1 and TRD-2: Papakostas et al, Am J Psychiatry. 2012 Dec 1; 169(12): 1267-74; ADAPT-A: Fava et al, Psychother Psychosom. 2012; 81(2): 87-97; Alkermes ALK 5461: Poster, ISCTM, 2013 Oct Note: ALK response rates relate to MADRS (secondary) outcomes.

19%

81%Zip Mono TRD-1 ADAPT-A TRD-2 ALK Average

12

68

54 52

81

53

100

80

60

40

20

0

% Reduction

Randomize

PlaceboActive

Treatment (Dose X)

Active Treatment

(Dose X)

Active Treatment

(Dose X)Placebo

ResponderResponderNon

ResponderNon

Responder

PlaceboActive

Treatment (Dose X)

Randomize

Randomize

TreatmentPhase 1

Results

TreatmentPhase 2

Rater Cohort Characteristics 1.00 80% 100

Experienced, calibrated 0.933 78% 108

0.80 71% 125

Inexperienced, calibrated 0.773 70% 130 0.60 59% 167

Experienced, calibrated 0.553 55% 181

Reliability (ICC) Study Power1 Sample Size per Arm2

Q1 Q2 Q3 Q4

HAM-A (N=100) (Study 1)* 0.90 0.96 0.95 0.97

HAM-A (N=68) (Study 2)* 0.98 0.97 0.97 0.96

PANSS (N=131) (Study 3)* 0.90 0.90 0.96 0.88

PANSS (N=67) (Study 4)* 0.98 0.97 0.98 0.98

*MedAvante study data on file and made available to sponsor during study.

Fava,M1,2, Williams JBW3,4, Freeman, M1,2 , Detke MJ3,5,6

1Massachusetts General Hospital Dept. of Psychiatry, 2Clinical Trials Network and Institute 3MedAvante, Inc., 4Columbia University Depts. of Psychiatry and Neurology, 5Detke Biopharma Consulting LLC, 6Indiana University School of Medicine

Are Different Methods to Reduce Placebo Response in CNS Trials Mutually Exclusive or Have Additive Effects?

©2014MedAvanteInc.

RESEARCH INSTITUTE

INTRODUCTION

Overthepasttwodecades,numerousmethodologieshavebeendevelopedwiththespecificgoalofreducingtheplaceboeffectinCNStrials.Thenumberoffailedtrialsinpsychiatryhasincreasedsubstantiallyovertime.Thismaybeduetobaselinescoreinflationandtheinclusionofinappropriatesubjects,variabilityinbaselineandpost-baselineclinicalratings,andexpectationorrelationshipbiasesassociatedwiththeperceptionsofsubjects,cliniciansorboththatcanaffectplaceboresponsewhenclinicalratersfunctioninmultiplerolesatstudysites.Thispresentationwilldiscusshowacombinationofthreestrategies,CentralRatings,useoftheSAFERInterview,anduseofSequentialParallelComparisonDesign(SPCD),mayoffera“triplesafety”nettoensuremaximumreductionoftheriskofafailedtrial.

THE THREE STRATEGIES

COMBINING THE THREE STRATEGIES

Get the free mobile app athttp:/ /gettag.mobi

Eachoftheseriskreductionstrategiescanimproveresults,butcombiningallthreemethodstoworkinsequenceinatrialcanincreasestudypowerandmosteffectivelyminimizethemethodologyfactorsthatcancauseatrialtofailtodetectasignal.Analyseshaveshownthatthecostofreducingtheriskoftrialfailureincreasestheexpectednetpresentvalueoftheoverallprograminvestment,manytimesover5.• Calibratedindependentcentralratersavoidthebiasesandreducetheratingvariabilitythatadverselyaffectssubjectselection,accuratebaselineandpost-baselineratings.

• TheSAFERInterview,administeredbyindependentcentralraters,canhelpensureselectionofappropriatesubjects.

• SPCDmethodology,inconjunctionwiththeaccuracyandprecisionachievablewithremotecentralratings,candecreaseplaceboresponseandallowforasmallersamplesize.

Incombination,thesethreeuniquebutcomplementarystrategiesoffera“triplesafetynet”thatmaydramaticallyreducetheriskofafailedtrialandofferthebestchanceforatrialtosucceed.

References1Mueller&Szegedi.(2002).EffectsofInterraterReliabilityofPsychopathologicAssessmentonPowerandSampleSizeCalculationsinClinicalTrials.JournalofClinicalPsychopharmacology.22(3),318-325.Calculatedwheren=100subjectsperarmandeffectsize=.402PerkinsDOetal.(2000).Penny-wiseandpoundfoolish:Theimpactofmeasurementerroronsamplesizerequirementsinclinicaltrials.BiologicalPsychiatry.47,762-766.Calculatedwheren=100subjectsperarm3KobakKAetal.(2009).SourcesofUnreliabilityinDepressionRatings..JournalofClinicalPsychopharmacology.29(1),82-85.4TargumSD,PollackMH,FavaM.(2008).Redefiningaffectivedisorders:Relevancefordrugdevelopment.CNSNeuroscience&Therapeutics.14,2-9;doi:10.1111/j.1527-3458.2008.00038.x.5GreenblattWetal.(2012).TheValueofRiskReductioninCNSDrugDevelopment:UseofExpectedNetPresentValue(eNPV)asaModel.PresentedatAmericanCollegeofNeuropsychopharmacology51stAnnualMeeting,HollywoodFL.

Remote Independent Central Ratings• Theuseofwell-trainedandblinded,calibratedexpertcentralratersminimizesperceptionbiasesandreducesvariability,whichenhancesseparationbetweenplaceboandactivetreatment.

•RemoteindependentcentralratersscreeningsubjectsviaevaluationofsymptomseverityandtheSAFERInterviewhelpensuretheunbiasedselectionofappropriatesubjectswhileavoidingbaselinescoreinflation.(Figure 1)

Figure 1.Sample of Baseline Score Distributions in Completed Studies MDD GAD SCHIZOPHRENIA PSYCHOSIS

•Centralratingsensureaccuratepost-baselineratingsandincreasestudypowerviahighICCSfurtherenhancingtheimpactoftheSPCDidentificationofplacebonon-respondersforrandomizationinStage2.

•Centralratersreducevariabilitybylimitingthenumberofratersneededforanyonetrial.(Figure 2)

•Higherinterraterreliabilitymeanshigherstudy

power.Experiencedandcalibratedratersensurethehighestlevelofreliability.(Figure 3)

•Ongoingcalibrationofcentralraterspreventsraterdriftthroughoutthecourseofthetrial.(Figure 4)

Figure 3.Impact of Experience vs. Calibration on Interrater Reliability

Figure 4.Impact of Continuous Calibration on Interrater Reliability

Figure 2.Centralized vs. Decentralized Study: Number of Raters

Figure 5.SAFER Interview Pass/Fail Rate

Sequential Parallel Comparison Design (SPCD)• GoalsoftheSequentialParallelComparisonDesign(SPCD)aretoreducetheimpactofhighplaceboresponseanddecreasestudysamplesize.

• TheSPCDmodel(Figure6)incorporatestworelativelyshortphasesoftreatmentofequalduration:

Phase 1:allsubjectsrandomizedtoplacebooractivetreatment;moresubjectsthanusualassignedtoplaceboarmtoenrichphase2;

Phase 2:subjectsonactivetreatmentandplacebonon-respondersinPhase1arerandomizedtoplacebooractivetreatment.

Figure 6.Sequential Parallel Comparison Design

• InPhase2theblindismaintainedforallsubjectsasthoseonactivedruginPhase1remainonactivetreatmentandplacebonon-respondersarerandomizedtoplacebooractivetreatment.

• StandardanalysesofSPCDstudiescombinedatainPhase1fromallsubjectsandinPhase2onlyfromplacebonon-respondersduringPhase1.

• Poolingthedatawithapre-specifiedweightedaveragesignificantlyreducestheoverallplaceboresponse,asitistheaverageoftheplaceboresponseinPhase1(standardresponse)andinPhase2(lowerresponseamongsubjectswhofailedtorespondtoplaceboprospectively).

• InfivecompletedSPCDtrials,thismethodhasreducedtheoverallplaceboresponserateonaverageby53percentcomparedtothePhase1placeboresponse.(Figure 7)

Figure 7.Percent Reduction in Placebo Response Rates in Stage 2 SPCD Trials (Chronological Order)

Passes

Fails

ZiprasadoneMonotherapyStudy:Papakostasetal,JClinPsychiatry.2012Dec;73(12):1541-7;TRD-1andTRD-2:Papakostasetal,AmJPsychiatry.2012Dec1;169(12):1267-74;ADAPT-A:Favaetal,PsychotherPsychosom.2012;81(2):87-97;AlkermesALK5461:Poster,ISCTM,2013OctNote:ALKresponseratesrelatetoMADRS(secondary)outcomes.

Ziprasadone Monotherapy Study: Papakostas et al, J Clin Psychiatry. 2012 Dec; 73(12): 1541-7; TRD-1 and TRD-2: Papakostas et al, Am J Psychiatry. 2012 Dec 1; 169(12): 1267-74; ADAPT-A: Fava et al, Psychother Psychosom. 2012; 81(2): 87-97; Alkermes ALK 5461: Poster, ISCTM, 2013 Oct Note: ALK response rates relate to MADRS (secondary) outcomes.

19%

81%Zip Mono TRD-1 ADAPT-A TRD-2 ALK Average

12

68

54 52

81

53

100

80

60

40

20

0

% Reduction

Randomize

PlaceboActive

Treatment (Dose X)

Active Treatment

(Dose X)

Active Treatment

(Dose X)Placebo

ResponderResponderNon

ResponderNon

Responder

PlaceboActive

Treatment (Dose X)

Randomize

Randomize

TreatmentPhase 1

Results

TreatmentPhase 2

Rater Cohort Characteristics 1.00 80% 100

Experienced, calibrated 0.933 78% 108

0.80 71% 125

Inexperienced, calibrated 0.773 70% 130 0.60 59% 167

Experienced, calibrated 0.553 55% 181

Reliability (ICC) Study Power1 Sample Size per Arm2

Q1 Q2 Q3 Q4

HAM-A (N=100) (Study 1)* 0.90 0.96 0.95 0.97

HAM-A (N=68) (Study 2)* 0.98 0.97 0.97 0.96

PANSS (N=131) (Study 3)* 0.90 0.90 0.96 0.88

PANSS (N=67) (Study 4)* 0.98 0.97 0.98 0.98

*MedAvante study data on file and made available to sponsor during study.