testing for bioequivalence of highly variable drugs from

22
Testing for bioequivalence of highly variable drugs from TR-RT crossover designs with heterogeneous residual variances Christopher I. Vahl, PhD Department of Statistics Kansas State University Qing Kang, PhD Chief Statistical Scientist The Statistical Intelligence Group, LLC

Upload: others

Post on 10-Feb-2022

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Testing for bioequivalence of highly variable drugs from

Testing for bioequivalence of highly variable drugs from TR-RT crossover designs

with heterogeneous residual variances

Christopher I. Vahl, PhDDepartment of StatisticsKansas State University

Qing Kang, PhDChief Statistical Scientist

The Statistical Intelligence Group, LLC

Page 2: Testing for bioequivalence of highly variable drugs from

Traditional bioavailability (BA) studies assess average bioequivalence (ABE)between the test (T) and reference (R) products under the TR-RT crossover design.

With highly variable (HV) drugs whose within-subject coefficient of variation (CV) inpharmacokinetic measures is 30%, assertion of ABE becomes difficult.

In 2011, the FDA adopted a BE criterion with mixed-scaling. For HV drugs, theequivalence limits for the geometric mean ratio (GMR) are scaled to within-subjectvariance of the reference product. This is known as reference scaled ABE (RSABE).

Introduction

R

GMR

1H

0H

2 2 2

0

2 2 2

1

2 2

2 2

: ( ) max(0.25 , ) vs.

: ( ) max(0.25 , )

ln(T:R GMR);

ln( 1);

[ln(1.25)] 0.25 .

T R R

T R R

T R

R R

H k

H k

CV

k

Page 3: Testing for bioequivalence of highly variable drugs from

Introduction

The FDA extended the statistical methods for assessing individual BE (IBE) to testingRSABE. The recommended procedure operates exclusively under TRR-RTR-RRT andTRTR-RTRT designs.

Testing IBE calls for separate estimation of subjectformulation variance and within-subject variances, which could only be achieved by replicate crossover designs. In2003, the FDA discontinued the IBE criterion due to the lack of evidence confirmingthe existence of subjectformulation.

Designs with more than 2 periods are not always feasible.

• The volume of blood taken from each subject may exceed the acceptable limit.

• They tend to have a large amount of missing data and a high dropout rate.

• To avoid carryover effect, the washout period lasts for 5 half lives. A 2-perioddesign is more practical for drugs with a long half life.

• Subject’s physiological changes affect the variability of systemic drugconcentration. A lengthy study could not be conducted on growing animals orthose susceptible to stress under prolonged confinement and repeated dosing.

Goal: To investigate how to evaluate HV drugs under the TR-RT design.

Page 4: Testing for bioequivalence of highly variable drugs from

Consider a TR-RT design where the washout time between periods is sufficient toeliminate any carryover effect.

Subject j in sequence i provides a vector of ln-transformed responses (Yij1 ,Yij2).

Alternative BE criteria under the TR-RT design were assessed using the model withheterogeneous residual variance.

Model

1

21

2 1

2

11

1

12

1

: su: sequence effect;

: period effect;

& : product effects;

ijTT

i ij

ijRRij

ij ijRR

i ij

ijTT

ij

i

j

T R

iY

Yi

τ

τ

τ2

2 2

0 1 1bject effect, ~ ( , );

0 1 1

& : residual errors, ~ (0, ), ~ (0, );

, & are mutually indept.

ij S

ijT ijR ijT T ijR R

ij ijT ijR

N

N N

τ

τ

Sequence (i) No. of Subjects*

Treatment

Period 1 Period 2

1 n1 T R

2 n2 R T1 2*n n N

Page 5: Testing for bioequivalence of highly variable drugs from

The FDA’s model for replicate crossover designs allows ij to have a covariancematrix of any positive definite structure.

The variance of subjectformulation is

The TR-RT design confounds Int2 with within-subject variances. But when there is

no subjectformulation,

It is then plausible to assess RSABE from the TR-RT design.

Model

2 2

1 22 2

0 0~ N( , ), ~ N( , )

0 0

ST ST SR SR ST SR

j j

ST SR SR ST SR ST

τ τ

2 2 2

2 20 1 1

0 ~ ( , )0 1 11

ST SR S

Int ij SN

τ

2 2( ) 2(1 )Int ST SR ST SR

Page 6: Testing for bioequivalence of highly variable drugs from

Convert (Yij1 , Yij2 ) into within-subject sum and T-R difference.

Define the summary statistics with respect to (Yij , Yij ) as

Note :• and S/[2(N-2)] correspond the LS mean difference and MSE in the classical

ANOVA model;• B and S| represent the slope and the SSE when regressing Yij on Yij with the

same slope but different intercept for i=1,2.

Summary Statistics

1 2

1 2

2 1

2 2 2

2 2

2 2 2 2 2 2 2

1

2

4( )

( ) ( )

ij ij

ij ij ij ij

ij ij

ij S TR TR

ij TR TR

T R R T TR T R

Y Y iY Y Y Y

Y Y i

YVar

Y

1 21 1

2 2 22 2

1 1 1 1 1 1

2

|

( ) 2

( ) ( ) ( )( )

i i

i i i

n n

i ij i i ij ij j

n n n

ij i ij i ij i ij ii j i j i j

Y Y n Y Y n D Y Y

S Y Y S Y Y S Y Y Y Y

B S S S S S S

D

Page 7: Testing for bioequivalence of highly variable drugs from

By Rao(1973), these summary statistics have the distributional properties of

Note:• Z, Z, U, and U| are mutually independent;

• B is not independent of S and its marginal distribution is not normal;

• The estimator for TR , and TR2 are , B and S/(N-2);

• The estimator for is

whose distribution does not follow any classic form.

Summary Statistics

1 2 2

22

| 2

|2

| |

2 2 2 2

|

4[ ( )] ~ (0,1) ~ ( 2)

1( ) ~ (0,1) ~ ( 3)

4 (1 ) : conditional variance of given

T R

TRTR

S TR ij ij

n n SD Z N U N

N

SB S Z N U N

Y Y

2 20.5(1 )R TR

0.5(1 ) ( 2)B S N

D

Page 8: Testing for bioequivalence of highly variable drugs from

The current test of RSABE employs the modified large sample (MLS) method whichis applicable under replicate crossover designs.

The MLS method approximates the confidence limits (CLs) of a linear combinationof parameters by restricting it to be exact when only one parameter is unknown. Itcalls for independent summary statistics with known distributions.

Testing Procedures

Let b, s and be the values of B, S and observed from the TR-RT design.

The estimate of R2 is

and the CLs for TR is

The MLS method is inappropriate under the TR-RT design because R2 is not

estimated from independent summary statistics with classic marginal distributions.

0.95, 2

1 24 ( 2)N

Nd t s

n n N

2 1(1 )

2 4R b s

N

d D

Page 9: Testing for bioequivalence of highly variable drugs from

The generalized pivotal quantity (GPQ) method is used to test RSABE under the TR-RT design. The distribution of the GPQ produces a fiducial-type inference which, inmany situations, meets frequentists’ standards.

The distributional properties of the summary statistics suggest that the GPQ forTR , TR

2, and are

The GPQ for is then assembled as

Testing Procedures

2

1 2 30.5 (1 )T kT T

2 2 2 2( ) ( ) 0.5 (1 )T R R T R TRk k

1 2 2

12

1 2 1 2

2

2

2

| | |

32

| ||

4( )

4 4

1( )

T R TR

TR

TR

n n N s N sT d D d Z

n n S n n UN

s sT

S U

s sT b B S b Z

S s s U

Page 10: Testing for bioequivalence of highly variable drugs from

The distribution of this GPQ is obtained via a resampling algorithm.

Step 1. Independently sample z, u, z and u| and from N(0,1), 2 (N-2),N(0,1) and 2 (N-3).

Step 2. Replace Z, U, Z, and U| in T1, T2 and T3 with z, u, z and u| . Thisyields t1, t2 and t3.

Step 3. Calculate t120.5kt2

2(1-t3) and accumulate its value by repeating steps1and 2 many times.

The 95% upper CL of (TR )2kR2 is given by the 95th percentile of the

resampling distribution. The test product passes RSABE when this percentile is 0.

Testing Procedures

Page 11: Testing for bioequivalence of highly variable drugs from

Analysis of Example Dataset

Source: Dataset #7 in FDA’s databank for BA studies.

Data: Cmax measured in the first two periods of a TRTR-RTRT-RTTR-TRRT design.

1TR ( 12)n 2RT ( 10)n

|

0.1724 19.5285

10.1230 1.4336

0.1416 19.3255

d s

s s

b s

Parameter Point Estimate (PE) Conf. Level Conf. Interval (CI)

R 0.5375 -- --

exp(TR ) 1.1882 90% (0.9148, 1.5434)

(TR )2kR2 -0.2004 95% (-, -0.0113)

Page 12: Testing for bioequivalence of highly variable drugs from

Analysis of Example Dataset

Data from theTR-RT design

0.294R

2 2

95% UCL of ( ) 0

T R Rk

Pass

Yes

Yes

Test RSABE via the GPQ method

Yes

PE constaint: 0.8 ln( ) 1.25d

TOST-GPQ Procedure W/ PE constraint

The FDA’s decision rule for testing RSABE under replicate crossover designs appliesTOST whenTo ensure public confidence and harmonize with various regulatory agencies, the FDArequires PE of the estimated T:R geometric mean ratio to lie within 0.8~1.25.

0.296.R

Data from thereplicate crossover design

0.294R

2 2

95% UCL of ( ) 0

T R Rk

Test ABE via TOST

Pass

NoYes

Yes

No

90% CI of exp( )

[0.8,1.25]T R

Test RSABE via the MLS method

Yes

Fail

No

PE constaint: 0.8 ln( ) 1.25d

NoYes

TOST-MLS Procedure W/ PE constraint

Page 13: Testing for bioequivalence of highly variable drugs from

Simulation Study

Simulation parameters were chosen based on historical publications on BA studies.

Total Sample sizeTR-RT designs: N =24, …, 72 with n1=n2=N/2;TRR-RTR-RRT designs: N=18,…,48 with n1=n2=n3=N/3;

Variation of the reference productHigh: CVR= 40%, …,80%; Borderline high: CVR= 25% (the changeover point) and 30%;Regular: CVR= 20% .

Within-subject varianceHeterogeneous: T/R=0.5 and 2;Homogeneous: T/R=1.

S =0.2, … ,1.

Zero sequence and period effects.

10,000 simulations at each sample size and parameter setting.

5, 000 resamples within each simulated dataset.

Page 14: Testing for bioequivalence of highly variable drugs from

Simulation Study

Type I error rate at (TR )2=kmax(0.252,R2). Each schematic boxplot summarizes

results of 75 combinations of simulation setting.

Page 15: Testing for bioequivalence of highly variable drugs from

Simulation Study

Note:

• The TOST-GPQ procedure was moderately liberal at CVR =25% but held adesirable type I error rate at all other examined values of CVR.

• The TOST-MLS procedure preserved its nominal level when CVR ≠25%. Incomparison to the TOST-GPQ procedure, it had a slightly lower inflation oftype I error at CVR =25% .

• The PE constraint led to ultra conservativeness when CVR40% but had littleimpact when CVR25% .

Page 16: Testing for bioequivalence of highly variable drugs from

Simulation Study

S

Pow

er

24N 48N 72N

RCV RCV RCV

Power for the TOST-GPQ procedure with PE constraint at TR =0 & T/R=1.

Note:

• Power increased as S decreased and as N increased.

• There was no monotonic relationship between power and CVR. For a given S

& N, power was always the highest when CVR is around 20% and dipped to thebottom when CVR is around 50%.

Page 17: Testing for bioequivalence of highly variable drugs from

Simulation Study

Passing rate of the TOST-GPQ procedure under the TR-RT design of size 54 (―) and

the TOST-MLS procedure under the TRR-RTR-RRT design of size 36 (- - -).

Note:• These hybrid procedures performed similarly well at S=0.2 & 0.6.• The TOST-GPQ procedure underperforms at S=1.

Passin

g R

ate

S

exp( )T R

CVR=CVT=30% CVR=CVT=60%

exp( )T R

Page 18: Testing for bioequivalence of highly variable drugs from

Sample Size Estimation

† Provided by Tothfalusi & Endrenyi 2012. ‡ The ratio of sample sizes.

Note:• The two designs have a trivial difference at S=0.4 .• The TR-RT design is more expensive at S=0.6 & 0.8.

80% power

TRR-TRT-RRT DesignTR-RT Design

CVT=CVR S=0.4 S=0.6 S=0.8

30% 21† 28 (1.3‡) 32 (1.5) 38 (1.8)

40% 22 34 (1.5) 44 ( 2 ) 56 (2.5)

50% 22 34 (1.5) 44 ( 2 ) 56 (2.5)

60% 23 34 (1.5) 42 (1.8) 52 (2.3)

70% 25 36 (1.4) 42 (1.7) 50 ( 2 )

Total sample size needed to establish BE with mixed scaling at TR =0 .

Page 19: Testing for bioequivalence of highly variable drugs from

Discussion

So far, BA studies on HV drugs have all been carried out under replicate crossoverdesigns with 3 or 4 periods. In cases that these designs are infeasible, paralleldesigns have been considered as a backup choice. The setbacks include

• prohibitive sample sizes;• difficulty in translating RSABE when the between- and within-subject

variability are confounded.

The proposed approach• utilizes the fact that subjectformulation is negligible;• operates under the traditional TR-RT design;• allows continued exercise of the existing BE criterion.

When S0.4, the resulting inference is at least as good as that derived fromreplicate crossover designs. Larger S weakens the inference.

Recommendation: Replicate crossover designs are preferred for BA studies on HVdrugs. Under practical restrictions, producers are encouraged to implement the TR-RT design and the proposed TOST-GPQ procedure.

Page 20: Testing for bioequivalence of highly variable drugs from

Discussion

The FDA requires producers to determine, a priori, whether the reference drug isHV. Variability classification is a known problem for human and animal drugs.

Dispute arises when producers classify the same reference drug differently.

Data from thereplicate crossover design

0.294R

2 2

95% UCL of ( ) 0

T R Rk

Test ABE via TOST

Pass

NoYes

Yes

No

90% CI of exp( )

[0.8,1.25]T R

Test RSABE via the MLS method

Yes

FailNo PE constaint:

0.8 exp( ) 1.25d

No

Yes

The protocol specifies to use RSABE for a HV drug

Data from the TR-RT design

NoYes

Page 21: Testing for bioequivalence of highly variable drugs from

Discussion

The proposed approach promotes coherence in regulatory evaluation.

Data from acrossover design

0.294R

2 2

95% UCL of ( ) 0

T R Rk

Test ABE via TOST

Pass

NoYes

Yes

No

90% CI of exp( )

[0.8,1.25]T R

Yes

FailNo PE constaint:

0.8 exp( ) 1.25d No

Yes

The protocol specifies to test BE with mixed scaling

Test RSABE via theMLS or GPQ method

Page 22: Testing for bioequivalence of highly variable drugs from

References

Claxton R et al. Estimating product bioequivalence for highly variable veterinarydrugs. J. Vet. Pharmaco. Ther. 2012; 35 (Suppl. 1):11–16.

Davit BM et al. Implementation of a reference-scaled average bioequivalenceapproach for highly variable generic drug products by the US Food and DrugAdministration. AAPS J. 2012; 14:915–924.

Draft guidance on progesterone, 2011. http://www.fda.gov/downloads/Drugs/GuidanceComplianceRegulatoryInformation/Guidances/UCM209294.pdf.

Guilbaud O. Exact inference about the within-subject variability in 22 crossovertrials. JASA 1993; 88:939-946.

Hannig J et al. Fiducial generalized confidence intervals. JASA 2006; 101:254-269.

Haidar et al. Evaluation of a scaling approach for the bioequivalence of highlyVariable drugs. AAPS J. 2008; 10:450–454.

Rao CR. Linear statistical inference and its applications (2nd ed.). Wiley: New York;1973.

Tothfalusi L & Endrenyi L. Sample sizes for designing bioequivalence studies forhighly variable drugs. J. Pharm. Pharm. Sci. 2012; 15:73 – 84.

Weerahandi S. Generalized confidence Intervals. JASA 1993; 88: 899-905.