mixture ratio estimators using multi-auxiliary variables and … · 2014-10-28 · the history of...

14
Open Journal of Statistics, 2014, 4, 776-788 Published Online October 2014 in SciRes. http://www.scirp.org/journal/ojs http://dx.doi.org/10.4236/ojs.2014.49073 How to cite this paper: Waweru, P.M., Kung’u, J. and Kahiri, J. (2014) Mixture Ratio Estimators Using Multi-Auxiliary Va- riables and Attributes for Two-Phase Sampling. Open Journal of Statistics, 4, 776-788. http://dx.doi.org/10.4236/ojs.2014.49073 Mixture Ratio Estimators Using Multi-Auxiliary Variables and Attributes for Two-Phase Sampling Paul Mwangi Waweru, John Kung’u, James Kahiri Department of Statistics and Actuarial Science, Kenyatta University, Nairobi, Kenya Email: [email protected] , [email protected] Received 25 July 2014; revised 27 August 2014; accepted 12 September 2014 Copyright © 2014 by authors and Scientific Research Publishing Inc. This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/ Abstract In this paper, we have proposed three classes of mixture ratio estimators for estimating popula- tion mean by using information on auxiliary variables and attributes simultaneously in two-phase sampling under full, partial and no information cases and analyzed the properties of the estima- tors. A simulated study was carried out to compare the performance of the proposed estimators with the existing estimators of finite population mean. It has been found that the mixture ratio es- timator in full information case using multiple auxiliary variables and attributes is more efficient than mean per unit, ratio estimator using one auxiliary variable and one attribute, ratio estimator using multiple auxiliary variable and multiple auxiliary attributes and mixture ratio estimators in both partial and no information case in two-phase sampling. A mixture ratio estimator in partial information case is more efficient than mixture ratio estimators in no information case. Keywords Ratio Estimator, Multiple Auxiliary Variables, Multiple Auxiliary Attributes, Two-Phase Sampling, Bi-Serial Correlation Coefficient 1. Introduction The history of using auxiliary information in survey sampling is as old as history of the survey sampling. The work of Neyman [1] may be referred to as the initial works where auxiliary information has been used. Cochran [2] used auxiliary information in single-phase sampling to develop the ratio estimator for estimation of popula- tion mean. In the ratio estimator, the study variable and the auxiliary variable had a high positive correlation and the regression line was passing through the origin. Hansen and Hurwitz [3] also suggested the use of auxiliary

Upload: others

Post on 15-Mar-2020

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Mixture Ratio Estimators Using Multi-Auxiliary Variables and … · 2014-10-28 · The history of using auxiliary information in survey is as old as history of the survey sampling

Open Journal of Statistics, 2014, 4, 776-788 Published Online October 2014 in SciRes. http://www.scirp.org/journal/ojs http://dx.doi.org/10.4236/ojs.2014.49073

How to cite this paper: Waweru, P.M., Kung’u, J. and Kahiri, J. (2014) Mixture Ratio Estimators Using Multi-Auxiliary Va-riables and Attributes for Two-Phase Sampling. Open Journal of Statistics, 4, 776-788. http://dx.doi.org/10.4236/ojs.2014.49073

Mixture Ratio Estimators Using Multi-Auxiliary Variables and Attributes for Two-Phase Sampling Paul Mwangi Waweru, John Kung’u, James Kahiri Department of Statistics and Actuarial Science, Kenyatta University, Nairobi, Kenya Email: [email protected], [email protected] Received 25 July 2014; revised 27 August 2014; accepted 12 September 2014

Copyright © 2014 by authors and Scientific Research Publishing Inc. This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/

Abstract In this paper, we have proposed three classes of mixture ratio estimators for estimating popula-tion mean by using information on auxiliary variables and attributes simultaneously in two-phase sampling under full, partial and no information cases and analyzed the properties of the estima-tors. A simulated study was carried out to compare the performance of the proposed estimators with the existing estimators of finite population mean. It has been found that the mixture ratio es-timator in full information case using multiple auxiliary variables and attributes is more efficient than mean per unit, ratio estimator using one auxiliary variable and one attribute, ratio estimator using multiple auxiliary variable and multiple auxiliary attributes and mixture ratio estimators in both partial and no information case in two-phase sampling. A mixture ratio estimator in partial information case is more efficient than mixture ratio estimators in no information case.

Keywords Ratio Estimator, Multiple Auxiliary Variables, Multiple Auxiliary Attributes, Two-Phase Sampling, Bi-Serial Correlation Coefficient

1. Introduction The history of using auxiliary information in survey sampling is as old as history of the survey sampling. The work of Neyman [1] may be referred to as the initial works where auxiliary information has been used. Cochran [2] used auxiliary information in single-phase sampling to develop the ratio estimator for estimation of popula-tion mean. In the ratio estimator, the study variable and the auxiliary variable had a high positive correlation and the regression line was passing through the origin. Hansen and Hurwitz [3] also suggested the use of auxiliary

Page 2: Mixture Ratio Estimators Using Multi-Auxiliary Variables and … · 2014-10-28 · The history of using auxiliary information in survey is as old as history of the survey sampling

P. M. Waweru et al.

777

information in selecting the sample with varying probabilities. If regression line is still linear but does not pass through the origin the regression estimator is used. Watson [4] used the regression estimator of leaf area on leaf weight to estimate the average area of the leaves on a plant.

Olkin [5] was the first person to use information on more than one auxiliary variable, which was positively correlated with the variable under study, using a linear combination of ratio estimator based on each auxiliary variable. Raj [6] suggested a method of using multi-auxiliary information in sample survey. Singh [7] proposed a ratio-cum-product estimator and its multi-variable expression.

The concept of double sampling was first proposed by Neyman [1] in sampling human populations when the mean of auxiliary variable was unknown. It was later extended to multi-phase by Robson [8]. Ahmad [9] pro-posed a generalized multivariate ratio and regression estimators for multi-phase sampling while Zahoor, Muh-hamad and Munir [10] suggested a generalized regression-cum-ratio estimator for two-phase sampling using multiple auxiliary variables.

Jhajj, Sharma and Grover [11] proposed a family of estimators using information on auxiliary attribute. They used known information of population proportion possessing an attribute (highly correlated with study variable Y). The attribute are normally used when the auxiliary variables are not available e.g. amount of milk produced and a particular breed of cow or amount of yield of wheat and a particular variety of wheat. Jhajj, Sharma and Grover [11] used the information on auxiliary attributes in ratio estimator in estimating population mean of the variable of interest using known attributes such as coefficient of variation, coefficient kurtosis and point bi-serial correlation coefficient. The estimator performed better than the usual sample mean and Naik and Gupta [12] es-timator. Jhajj, Sharma and Grover [11] also used the auxiliary attribute in regression, product and ratio type ex-ponential estimator following the work of Bahl and Tuteja [13].

Hanif, Haq and Shahbaz [14] proposed a general family of estimators using multiple auxiliary attribute in sin-gle- and double-phase sampling. The estimator had a smaller MSE compared to that of Jhajj, Sharma and Grover [11]. They also extended their work to ratio estimator which was generalization of Naik and Gupta [12] estima-tor in single- and double-phase samplings with full information, partial information and no information. Kung’u and Odongo [15] and [16] proposed ratio-cum-product estimators using multiple auxiliary attributes in single- and two-phase sampling. Moeen, Shahbaz and Hanif [17] proposed a class of mixture ratio and regression esti-mators for single-phase sampling for estimating population mean by using information on auxiliary variables and attributes simultaneously.

In this paper, we will extend the mixture ratio estimator proposed by Moeen, Shahbaz and Hanif [17] in sin-gle-phase sampling to two-phase sampling under full, partial and no information case strategies introduced by Samiuddin and Hanif [18] and also incorporate Arora and Bansi [19] approach in writing down the mean squared error.

2. Preliminaries 2.1. Notation and Assumption Consider a population of N units. Let Y be the variable for which we want to estimate the population mean and

1 2, , , kX X X are k auxiliary variables. For two-phase sampling design let 1n and 2n ( )2 1n n< are sample sizes for first and second phase respectively. ( )1jx and ( )2jx denote the thj auxiliary variables form first and second phase samples respectively and 2y denote the variable of interest from second phase. jX

and

jxC denote the population means and coefficient of variation of thj auxiliary variables respectively and

jyxρ de-notes the population correlation coefficient of Y and jX .

Further, let ( )1 2 1 21 2

1 1 1 1 n N n N

θ θ θ θ

= − = − <

( ) ( ) ( ) ( )

( ) ( )( )( )

2 1

2

2 1

2

,

and 1, 2, ,j

j

y j xj

j xj

y Y e x X e

x X e j k

= + = +

= + =

(1.0)

where 2ye ,

( )1jxe and ( )2jxe are sampling error and are very small. We assume that

( ) ( )( ) ( )( )2 1 20.

j jy x xE e E e E e= = = (1.1)

Page 3: Mixture Ratio Estimators Using Multi-Auxiliary Variables and … · 2014-10-28 · The history of using auxiliary information in survey is as old as history of the survey sampling

P. M. Waweru et al.

778

Consider a sample of size n drawn by simple random sampling without replacement from a population of size N . Let jy and jτ denotes the observations on variable y and r respectively for the thj unit where

1, 2, , j k k q= + + . In defining the attributes we assume complete dichotomy so that,

1, if unit of population possess auxiliary attribute,0, otherwise.

th th

iji j

τ

=

(1.2)

Let 1

N

j ijj

A τ=

= ∑ and 1

n

j iji

a τ=

= ∑ be the total number of units in the population and sample respectively pos-

sessing attribute jτ . Let jj

AP

N= and ( )2

jj

ap

n= be the corresponding proportion of units possessing a specific

attributes jτ and y is the mean of the main variable at second phase. Let ( )1jp and ( )2jp denote the thjauxiliary attribute form first and second phase samples respectively and 2y denote the variable of interest from second phase. The mean of main variable of interest at second phase will be denoted by 2y . Also let us define

( ) ( ) ( ) ( )1 21 2, j jj jj je p P e p Pτ τ= − = − (1.3)

where 2ye ,

( )1jeτ and

( )2jeτ are sampling error and are very small. We assume that

( ) ( )( ) ( )( )2 1 20.

j jyE e E e E eτ τ= = =

Let ( )2 jj

j

xv

X= , then ( ) ( ) ( )22 21 1 jxjj j

jj j j

ex x Xv

X X X

−− = − = = . Therefore ( )21 jx

jj

ev

X= + . Similarly, ( )21 j

jj

ew

= + .

Also ( )

( )

2

1

jj

j

xz

x= , then

( ) ( )

( )

( ) ( )

( )

( ) ( )

( )

( ) ( )

( )

( ) ( ) ( )2 1 2 1 2 1 2 1 1

11

1

2 1

1 1

1 1 .

1

j j j j j j j j j

jj

x x x x x x x x xj jj

xx j j jj jj

j

e e e e e e e e ex xv

x x e X X XeX

X

−− − − − − − = = = = = + + +

We shall take 1iv − to term of order 1n

as ( ) ( )2 11 j i jx x

jj

e ev

X

−− = hence,

( ) ( ) ( ) ( )2 1 1 21 1 .j j j jx x x x

jj j

e e e ez

X X

− −= + = −

Similarly,

( ) ( )1 21 .j jj

j

e eu

Pτ τ−

= − (1.4)

The coefficient of variation and correlation coefficient are given by

2 1 11 1

2 2 22 2 2

2 2 21 1

, and

, , , ,

, .i i

y x x yxy x yx

y x

yz y yyz Pb Pb

y z y y

S S S SC C C

S SY X PS S S

S S S S S S

τ

τ τ

τ τ

ρ

ρ ρ ρ

= = = =

= = =

(1.5)

Then for simple random sampling without replacement for both first and second phases we write by using phase wise operation of expectations as:

Page 4: Mixture Ratio Estimators Using Multi-Auxiliary Variables and … · 2014-10-28 · The history of using auxiliary information in survey is as old as history of the survey sampling

P. M. Waweru et al.

779

( ) ( ) ( )( ) ( )( ) ( )( ) ( )

( )( ) ( )( )( ) ( ) ( )( )( ) ( )

( ) ( ) ( )( )( ) ( )

( )

2 2 1 2 1 2

2 2 2 22 2

2 1 2 2 1 2

2

2 22 2 2 2 2 2 22 2 1 2 1

2 2

2 2

2 21 2 1 2

, , ,

, ,

, ,

j jj j j j

j j jj j j

j j jj j j j j

j

y y j x x j x

y x j y x yx y j y Pb

y x x j y x yx x x x j x

E e Y C E e e P C E e e X C

E e e YX C C E e e YP C C

E e e e YX C C E e e e X C

E e e

τ

τ τ τ

τ τ

τ

θ θ θ θ θ

θ ρ θ ρ

θ θ ρ θ θ

= − = − − = −

= =

− = − − = −

( ) ( )( )( ) ( )( ) ( )( ) ( )

( ) ( )( ) ( )( ) ( ) ( )( )( ) ( )

( ) ( )( ) ( ) ( )( )( ) ( ) ( )

( ) ( )( ) ( ) ( )( )( ) ( )

1 2 2 2

2 2 2 1 2

1 2 1 2

1 2 1 2

2 21 2 2

2 1 2

2 1

2 1

,

, ,

,

;

j i j i jj j i j

i j i j j ji j j j

j i j ii i j j

i i j j

j x x i j x x x x

i j Pb y x x j y x yx

i j

x x x x i

e P C E e e X X C C i j

E e e PP C C i j E e e e YX C C

E e e e e PP C C i j

E e e e e X

τ τ τ

τ τ τ τ τ τ

τ τ τ τ τ τ τ τ

θ θ θ ρ

θ ρ θ θ ρ

θ θ ρ

θ θ

− = − = ≠

= ≠ − = −

− − = − ≠

− − = − ( )

( ) ( ) ( )( )( ) ( ) ( )

( ) ( ) ( )( )( ) ( ) ( )

2 1 2

2 1 2

1 2

1 2

;

;

;

j i j i

j i j ij i i

j i j ii i i

j x x x x

x x x i j x x x x

i j

X C C i j

E e e e X X C C i j

E e e e PP C C i jτ τ τ τ τ τ τ

ρ

θ θ ρ

θ θ ρ

− = − ≠

− = − ≠

(1.6)

( ) ( )1 T1ij

Adj AA C

A A− = = (1.7)

( ) [ ]2.1 Arora and Lai 19q

q

q

yxy x

x

R

Rρ= −

(1.8)

The following notations will be used in deriving the mean square errors of proposed estimators.

qyxR

: Determinant of population correlation matrix of variables 1 2 1, , , , and q qy x x x x− .

i qyx yx

R

: Determinant of thi minor of pyxR

corresponding to the thi element of iyxρ .

2ryxρ : Denotes the multiple coefficient of determination of y on 1 2 1, , , and r rx x x x− .

2

qyxρ

: Denotes the multiple coefficient of determination of y on 1 2 1, , , , and q qy x x x x− .

rxR

: Determinant of population correlation matrix of variables 1 2 1, , , and r rx x x x− .

qxR

: Determinant of population correlation matrix of variables 1 2 1 , , , and q qx x x x− .

i ry xR

: Determinant of the correlation matrix of 1 2 1, , , , and i r ry x x x x− .

.i qy xR

: Determinant of the correlation matrix of 1 2 1, , , , and i q qy x x x x− .

. .i j ry y xR

: Determinant of the minor corresponding to i jy yρ of the correlation matrix of

( )1 2 1, , , , , and i j r ry y x x x x i j− ≠ .

. .i j qy y xR

: Determinant of the minor corresponding to i jy yρ of the correlation matrix of

( )1 2 1, , , , , and .i j q qy y x x x x i j− ≠ (1.9)

2.2. Mean per Unit in Two-Phase Sampling The sample mean 2y using simple random sampling without replacement in two-phase sampling is given by is given by,

2

212

1 n

ii

y yn =

= ∑ (2.0)

Page 5: Mixture Ratio Estimators Using Multi-Auxiliary Variables and … · 2014-10-28 · The history of using auxiliary information in survey is as old as history of the survey sampling

P. M. Waweru et al.

780

while its variance is given, ( ) 2 2

2 2Var .yy Y Cθ= (2.1)

2.3. Ratio Estimator Using Auxiliary Variable in Two-Phase Sampling

Let 2

212

1 n

ii

x xn =

= ∑ be the sample mean of the auxiliary variable in two-phase sampling. The ratio estimator when

information on one auxiliary variables is available for population (full information case) is: 1

1RV 2

2

.Xt yx

α

=

(2.2)

The mean square error of RVt can be written as:

( ) ( )2

2 2 2 2RV 2 1 1MSE 2y x x y yxt Y C C C Cθ α α ρ= + − (2.3)

where 1y yx

x

CCρ

α = and yxρ are the optimum value and the correlation coefficient respectively.

2.4. Ratio Estimator Using Multiple Auxiliary Variables in Two-Phase Sampling The ratio estimator by Haq [9] when information on k auxiliary variables is available for population (full in-formation case) is:

( ) ( ) ( )

1 2

1 2RMV 2

2 1 2 2 2

.k

k

k

XX Xt yx x x

β β β =

(2.4)

The optimum values of unknown constants are,

( ) 11 .j

k

j k

yxj yxyj

x x

RCC R

β += −

(2.5)

The mean square error of RMVt can be written as:

( ) ( )2 2 2RMV 2MSE 1 .

ky yxt Y Cθ ρ= −

(2.6)

2.5. Ratio Estimator Using Auxiliary Attribute in Two-Phase Sampling In order to have an estimate of the population mean Y of the study variable y, assuming the knowledge of the population proportion P, Naik and Gupta [12] defined ratio estimator of population mean when the prior infor-mation of population proportion of units possessing the same attribute is variable. Naik and Gupta [12] proposed the following estimator:

1

RA 22

.Pt yp

α

=

(2.7)

The MSE of RAt up to the first order of approximation are given respectively by,

( ) ( )2 2 2 2RA 2 1 1MSE 2

jy P P y Pbt Y C C C C τθ α α ρ= + − (2.8)

where 1y yx

x

CCρ

α = and Pbτρ are the optimum value and the bi-serial correlation coefficient respectively.

2.6. Ratio Estimator Using Multiple Auxiliary Attributes in Two-Phase Sampling The ratio estimators by Hanif, Haq and Shahbaz [14] for two-phase sampling using information on multiple

Page 6: Mixture Ratio Estimators Using Multi-Auxiliary Variables and … · 2014-10-28 · The history of using auxiliary information in survey is as old as history of the survey sampling

P. M. Waweru et al.

781

auxiliary attributes is given by,

( ) ( ) ( )

1 2

1 2MRA 2

2 1 2 2 2

.q

q

q

PP Pt yp p p

γ γ γ =

(2.9)

The optimum values of unknown constants are,

( ) 11 .j

q

j q

y yj yj

RCC R

ττ

τ τ

γ += −

(2.10)

The MSE of the MRAt up to the first order of approximation is given by,

( ) ( )2

2 2 2MRA 2MSE 1 .

qy Pbt Y C τθ ρ= −

(2.11)

In general these estimators have a bias of order 1n

. Since the standard error of the estimates is of order 1n

,

the quantity bias .s e is of order 1n

and becomes negligible as n becomes large.

3. Methodology 3.1. Mixture Ratio Estimators Using Multi-Auxiliary Variable and Attributes for

Two-Phase Sampling (Full Information Case) If we estimate a study variable when information on all auxiliary variables is available from population, it is uti-lized in the form of their means. By taking the advantage of mixture ratio estimators technique for two-phase sampling, a generalized estimator for estimating population mean of study variable Y with the use of multi auxiliary variables and attributes are suggested as:

( )( ) ( ) ( ) ( ) ( ) ( )

1 2 1 2

1 21 22MRVAF 3.0

2 1 2 2 2 2 1 2 2 2

.k k k t

k k k t

k k k t

X P P PX Xt yx x x p p p

α α α β β β+ +

+ +

+ +

=

(3.0)

Using (1.0), (1.3) and (1.4) in (3.0) and ignoring the second and higher terms for each expansion of product and after simplification, we write,

( )( ) ( )2 2

2MRVAF 3.01 1

.j jk h k tx

y i jj j kj j

e et e Y Y Y

X Pτ

α β+ =

= = +

= + − −∑ ∑ (3.1)

The mean squared error of ( )MRVAF 3.0t is given by,

( )( ) ( )( ) ( ) ( )2 2

2

22

MRVAF 3.0 MRVAF 3.01 1

MSE .j jk h k tx

y j jj j kj j

e et E t Y E e Y Y

X Pτ

α β+ =

= = +

= − = − −

∑ ∑ (3.2)

We differentiate the Equation (3.2) partially with respect to jα ( )1,2, ,i k= and jβ ( )1, 2, ,i k k t= + + then equate to zero, using (1.6), (1.7) and (1.9), we get

( ) 11 j

tj

j t

yxj xy

x x

RCC R

α += − (3.3)

( ) 11 .j

t

j t

yj yj

RCC R

ττ

τ τ

β += −

(3.4)

Using normal equation that is used to find the optimum values given (3.2) we can write,

Page 7: Mixture Ratio Estimators Using Multi-Auxiliary Variables and … · 2014-10-28 · The history of using auxiliary information in survey is as old as history of the survey sampling

P. M. Waweru et al.

782

( )( ) ( ) ( )2 2

2 2MRVAF 3.01 1

MSE .j jk h k tx

y y i jjj j k j

e et E e e Y Y

PXτ

α β+ =

= = +

= − −

∑ ∑ (3.5)

Taking expectation and using (1.6) in (3.5), we get,

( )( ) 2 22MRVAF 3.0

1 1MSE .

j j j j

k k h t

y i j y x yx j j y Pbj j kj j

Y Yt Y C X C C P C CX P τθ α ρ β ρ

+ =

= = +

= − −

∑ ∑ (3.6)

Substituting the optimum (3.3) and (3.4) in (3.6), we get,

( )( ) ( ) ( )2 22MRVAF 3.0

1 1MSE 1 1 .

j jj j

j j j jj jj j

yx yk k h txj jy yy y x yx y Pb

j jx x

R RC Ct Y C C C C C

C CR R

ττ

ττ τ

θ ρ ρ+ =

= =

= + − + −

∑ ∑ (3.7)

Or

( )( ) ( ) ( )2 22MRVAF 3.0

1 1MSE 1 1 1 .

ijj j

j j

j j

yyxk k h txj jy yx Pb

j j kx

RRt Y C

R R

τ τ

τ

θ ρ ρ+ =

= = +

= + − + −

∑ ∑ (3.8)

Or

( )( ) ( )

( )

,2 22MRVAF 3.0

,

MSE .t

t

y xy

x

Rt Y C

τ

θ=

(3.9)

Using (1.8), we get,

( )( ) ( )( )2 2 22MRVA 3.0 . ,MSE 1 .

ty y xt Y C τθ ρ= −

(3.10)

3.2. Mixture Ratio Estimator Using Multi-Auxiliary Variable and Attributes in Two-Phase Sampling (Partial Information Case)

In this case suppose we have no information on all s and t auxiliary variables but only for r and g auxiliary va-riables from population. Considering mixture ratio technique of estimating technique, the population mean of study variable Y can be estimated for two-phase sampling using multi-auxiliary variables and attributes as:

( )( )

( ) ( )

( )

( ) ( )

( )

( ) ( )

( )

( )

( )

( )

( )

( )

( )

( ) ( )

1 1 2 2 1 2

1

1 1 1 2 1 1 1 1 2 11 22MRVAP 3.1

2 1 2 1 2 2 2 2 2 2 2 1 2 2 2

1 1 1

12 2 1

s s r r k

k

r s s kr

r r s s k

k k

k k

x x x x x xX X Xt yx x x x x x x x x

p Pp p

α β α β α β α α α

γ

+ +

+

+ +

+ +

+ +

+ +

=

( )

( ) ( )

( )

( ) ( )

( )

( )

( )

( )

( )

( )

1 2 2 1 2

1 2 1 1 1 1 2 12

22 2 2 2 2 2 1 2 2 2

.k k k h h h h q

k h h h tk h

k hk h h h t

p p p p pP Pp p p p p p p

λ γ λ γ λ γ γ γ+ + + + +

+ + ++

+ + + +

(3.11)

Using (1.0), (1.3) and (1.4) in (3.1) and ignoring the second and higher terms for each expansion of product and after simplification, we write,

( )( ) ( ) ( ) ( ) ( ) ( ) ( )

( ) ( ) ( )

1 2 2 1 2 1 2

2 1 2

2MRVAP 3.11 1 1 1

1 1 .

jj j j j j j

j j

j j j

r r s r k k t hx x x x x

j ij j j ij j j i

h q qk t h

j jj j hj j

e e e e e e et y Y Y Y Y

X X X P

e e eY Y

P P

τ τ

τ τ τ

α β α γ

λ γ

+ = + =

= = = =

+ =+ =

= = +

− − −= + − + +

−− +

∑ ∑ ∑ ∑

∑ ∑ (3.12)

Mean squared error of ( )MRVAP 3.1t estimator is given by,

Page 8: Mixture Ratio Estimators Using Multi-Auxiliary Variables and … · 2014-10-28 · The history of using auxiliary information in survey is as old as history of the survey sampling

P. M. Waweru et al.

783

( )

( ) ( ) ( ) ( ) ( ) ( ) ( )

( ) ( ) ( )

1 2 2 1 2 1 2

2

2 1 2

MRVAP 3.1

1 2 11 1 1 1

2

1 1 .

jj j j j j j

j j

j j j

r r s r k k t hx x x x x

y j ij j j ij j j i

h q qk t h

j jj j hj j

t

e e e e e e eE E e Y Y Y Y

X X X P

e e eY Y

P P

τ τ

τ τ τ

α β α γ

λ γ

+ = + =

= = = =

+ =+ =

= = +

− − −= + − + +

− − +

∑ ∑ ∑ ∑

∑ ∑

(3.13)

We differentiate the Equation (3.13) with respect to ( )1,2, ,i i rα = , ( )1,2, ,i i rβ = , ( )1, 2, ,i i r r kα = + + , ( )1, 2, ,i i k k hγ = + + , ( )1, 2, ,i i k k hλ = + + , ( )1, 2, ,i i h h qγ = + + and equate

to zero and use (1.6), (1.7) and (1.9). The optimum values are as follows,

( ) ( )

( )

( ) ( )

( )

1

1

1

1

1 1, 2, , ;

1 1, 2, , ;

1 1, 2, , ;

1 1

j jm w

j m jw

j jp w

j wm

jr

j w

jw

yx yxj x xyj

x x yx x

y yxj yj

yxj xyj

x x

jyj y

R RCj r

C R R

R RCj k k h

C R R

RCj r

C R

CR

CR

τ ττ

τ ττ

ττ

α

γ

β

λ

+

+

+

+

= − − =

= − − = + +

= − =

= −

( )

( )

1

1

1, 2, , ;

1 1, 2, , ;

1 1, 2, , .

jw

jis

j s

jw

jp

yxj xyj

x x

j yj y

x

j k k h

RCj r r k

C R

CR j h h q

R C

ττ

ψτ

τ

α

γ

+

+

= + +

= − = + +

= − = + +

(3.14)

Using normal equation that are used to find the optimum values given (3.13) we can write

( )( )( ) ( ) ( ) ( ) ( )

( ) ( ) ( ) ( ) ( )

1 2 2 1 2

2

1 2 2 1 2

MRVAP 3.1

1 2 11 1 1

1 1 1

MSE

.

jj j j j

j j

jj j j j

r r s r kx x x x x

y y jj j jj j j

h q mk t h k t h

i j ji j j hi j j

t

e e e e eE E e e Y Y Y

X X X

e e e e eY Y Y

P P Pτ τ τ τ τ

α β α

γ λ γ

+ =

= = =

+ =+ = + =

= = = +

− − = + − +

− −

+ − +

∑ ∑ ∑

∑ ∑ ∑

(3.15)

Taking expectation and using (1.6) in (3.15), we get,

( )( )( ) ( )

( ) ( )

3.1

2 2 22 2

2 1 2 2 1 21 1 1

2 2 2

1 2 2 1 21 1 1

+

j j j j j j j j j

j j j j j j j j

MRVAP

r r r s k

j j jy y x yx y x yx y x yxj j jj j j r

t k q qt k h

j y Pb j y Pb j yi k i k i hj j j

MSE t

Y Y YY C X C C X C C X C CX X X

Y Y YP C C P C C P C CP P Pτ τ τ

θ θ θ α ρ θ β ρ θ θ α ρ

θ θ γ ρ θ λ ρ θ θ γ

+ =

= = = +

+ = + =

= + = + = +

= + − − + −

− − + −

∑ ∑ ∑

∑ ∑ ∑ .jPbρ

(3.16)

Using the optimum value (3.14) in (3.16), we get,

Page 9: Mixture Ratio Estimators Using Multi-Auxiliary Variables and … · 2014-10-28 · The history of using auxiliary information in survey is as old as history of the survey sampling

P. M. Waweru et al.

784

( )( )

( ) ( ) ( )

( ) ( ) ( ) ( )

MRVAP 3.1

.1 1. .2 22 1 2 2

1 1

. .1 1 .1 2 1 2

1

MSE

1 1

1 1

j j jq w wj j

q w w

ji jq q wi

q q w

yxr ryx yxy xj jy x y xy yx yx

j jx

yr s k yx yy x y xj j yyx

j r x x

t

R R RY C

R R R

RR R

R R R

τ τ

τ ττ

τ

θ θ θ ρ θ ρ

θ θ ρ θ θ

+ +

= =

+ =+ +

= +

= + − − − − −

+ − − + − − −

∑ ∑

( ) ( ) ( )

1

.1 1.2 1 2

1 1 1 1 .

j

jj qwj j j

w q

t k h

Pbj k

yqt k h y yj jyPb Pb

j k j h

RR

R R

ττ ττ

τ τ

ρ

θ ρ θ θ γ ρ

+ =

= +

+ =+ +

= + = +

− − + − −

∑ ∑

(3.17)

Or

( )( ) ( ) ( ) ( )

( ) ( ) ( ) ( )

.1 1 .2 22 1 2 1MRVAP 3.1

1 1

. .1 11 2 1 2

1 1

MSE 1 1

1 1

j jq wj j

q w

ji q q

i j

q q

yxr r yxy xj j y xy yx yx

j jx x

yr s k t k hyx y x yj jyx Pb

j r j kx

R Rt Y C

R R

RR

R R

ττ

τ

θ θ θ ρ θ ρ

θ θ ρ θ θ ρ

+ +

= =

+ = + =+ +

= + = +

= + − − − −

+ − − + − −

∑ ∑

∑ ∑

( ) ( ) ( ) .1 1.1 1 2

1 1 1 1 .

jj qwj j j

w q

yqt k h y yj jyPb Pb

j k j hx

RR

R R

ττ ττ

τ

θ ρ θ θ γ ρ+ =

+ +

= + = +

− − + − −

∑ ∑

(3.18)

Or

( )( )

( ) ( ) ( ) ( )

( ) ( ) ( )

MRVAP 3.1

. .2 22 1

1 1 1

. .1

1 1

MSE

1 1 1 1

1 1 1 1 j ji q q q

j j j

q q q

j iq wi j i

q w

yx yr r s k t k hyx x y x yj j jy yx yx Pb

j j r j kx x

yxq r yxyj j jy xPb yx

j h j x

t

R RRY C

R R R

R RR

R R

ττ

τ

τ

τ

θ θ ρ ρ ρ

γ ρ θ ρ

+ = + =

= = + = +

= + =

= − + − + − + −

+ − + + − + −

∑ ∑ ∑

∑ ∑

.

1.

jw

j

w

t k h yx y xPb

j k xRρ

+ =

= +

(3.19)

Or

( )( ) ( )( )

( ) ( )( )

( )( )

( )( )

( )( )

( )

,

,, ,

, , ,

. .2 12 2

MRVAP 3.1 ,1

1

1 1

MSE 1

1 1 1

j jq

j jqq q

ji wrj j

w w w

y xq xjy x y x

jx x

r t k h yxyxj jxyx Pb

j j kx x x

Rt Y C R

R R

RR

R R R

ττ

τ ττ τ

τ

τ τ τ

θ θρ

θρ ρ

=

+ =

= = +

− = + −

+ + − + −

∑ ∑

.

(3.20)

Or

Page 10: Mixture Ratio Estimators Using Multi-Auxiliary Variables and … · 2014-10-28 · The history of using auxiliary information in survey is as old as history of the survey sampling

P. M. Waweru et al.

785

( )( ) ( )( )

( )

( )

( )

. , . ,2 22 1 1MRVAP 3.1

, ,

MSE .q w

q w

y x y xy

x x

R Rt Y C

R Rτ τ

τ τ

θ θ θ = − +

(3.21)

Using (1.8) in (3.21), we get,

( )( ) ( ) ( )( ) ( )( )( )2 2 2 22 1 1MRVAP 3.1 , ,MSE 1 1 .

q wy y x y xt Y C τ τθ θ ρ θ ρ= − − + −

(3.22)

Simplifying (3.22) we get,

( )( ) ( )( ) ( ) ( )( )( )2 2 2 2 22 1MRVAP 3.1 , , ,MSE 1 .

q q wy y x y x y xt Y C τ τ τθ ρ θ ρ ρ= − + −

(3.23)

3.3. Mixture Ratio Estimator Using Multi-Auxiliary Variable and Attributes in Two-Phase Sampling (No Information Case)

If we estimate a study variable when information on all auxiliary variables is unavailable from population, it is utilized in the form of their means. By taking the advantage of mixture ratio technique for two-phase sampling, a generalized estimator for estimating population mean of study variable Y with the use of multi auxiliary va-riables and attributes are suggested as:

( )( )

( )

( )

( )

( )

( )

( )

( )

( )

( )

( )

( )

1 2 1 2

1 1 1 2 1 1 1 1 2 12MRVAN 3.2

12 1 2 2 2 2 2 2 2

.s k k t

k k k t

kk k t

x x x p p pt y

x x x p p p

α α α γ γ γ+ +

+ +

+ +

=

(3.24)

Using (1.0), (1.3) and (1.4) in (3.24) and ignoring the second and higher terms for each expansion of product and after simplification, we write,

( )( ) ( ) ( ) ( )1 2 1 2

2MRVAN 3.21 1

.j j j j

i

k k h tx x x

jj jj j

e e e et y Y Y

X Pτ τ

α γ+ =

= =

− −= + +∑ ∑ (3.25)

Mean squared error of ( )MRVAN 3.2t estimator is given by

( )( ) ( ) ( ) ( ) ( )1 2 1 2

2

2

MRVAN 3.21 1

MSE .j j j j

j

k k h tx x

y jj jj j

e e e et E e Y Y

X Pτ τ

α γ+ =

= =

− − = + +

∑ ∑ (3.26)

We differentiate the Equation (3.27) partially with respect to iα ( )1,2, ,i k= and iβ ( )1, 2, ,i k k t= + + then equate to zero, using (1.6), (1.7) and (1.9), we get

( ) 1= 1j

k

j j

yxj xyj

x x

RCC R

α +− (3.27)

( ) 1= 1 .i j

j j

yj yj

RCC R

τ τ

τ τ

β +− (3.28)

Using normal equation that are used to find the optimum values given (3.26) we can write,

( )( ) ( ) ( ) ( ) ( )1 2 1 2

2 2MRVAN 3.21 1

MSE .j j j jk k h tx x x x

y y j jj j ki j

e e e et E e e Y Y

X Pα γ

+ =

= = +

− − = + +

∑ ∑ (3.29)

Taking expectation and using (1.6) in (3.29), we get

( )( ) ( ) ( )2 22 1 2 1 2MRVAN 3.2

1 1MSE .

j j j j

k k h t

y j y x yx j y Pbj j k

t Y C C C C Cτθ θ θ α ρ θ θ β ρ+ =

= = +

= + − + −

∑ ∑ (3.30)

Or

Page 11: Mixture Ratio Estimators Using Multi-Auxiliary Variables and … · 2014-10-28 · The history of using auxiliary information in survey is as old as history of the survey sampling

P. M. Waweru et al.

786

( )( ) ( ) ( )( )

( ) ( )( )

. .2 22 2 2 1 2 1MRVAN 3.2

1 1, ,

MSE 1 1 .j j

q q

j j

q q

yx yk k h ty x y xj jy yx Pb

j jx x

R Rt Y C

R R

τ

τ τ

θ θ θ θ ρ θ θ ρ+ =

= =

= + − − + − −

∑ ∑

(3.31)

Or

( )( ) ( ) ( )( )

( )( )

. . . .2 22 2 1 1 1MRVAN 3.2 . .

1 .

MSE 1 1 .i j

q

i j

q

y xq y xjy y x

j x

Rt Y C

R

ττ

ττ

θ θ θ ρ θ θ=

= − + − + +

(3.32)

Or

( )( ) ( )( )

( ) ( )( ) ( )

2 12 2. . 1MRVAN 3.2 . . .. .1.

MSE 1 .i j j jq q

q

t jy y xx y xy xix

t Y C R RR ττ ττ

τ

θ θρ θ

=

− = + − + ∑

(3.33)

Or

( )( ) ( )( )

( )

. ,2 22 1 1MRVAN 3.2

,

MSE .q

q

y x

yx

Rt Y C

τ

θ θ θ = − +

(3.34)

Using (1.8) in (3.34), we get,

( )( ) ( ) ( )( )( )2 2 22 1 1MRVAN 3.2 ,MSE 1 .y y xt Y C τθ θ ρ θ= − − +

(3.35)

Simplifying (3.35), we get,

( )( ) ( )( ) ( )( )2 2 2 22 1MRVAN 3.2 , ,MSE 1 .y y x y xt Y C τ τθ ρ θ ρ= − +

(3.36)

3.4. Bias and Consistency of Mixture Ratio Estimators These mixture ratio estimators using multiple auxiliary variables and attributes in two-phase sampling are biased. However, these biases are negligible for large samples that is ( )30n ≥ . It’s easily shown that the mixture ratio estimators are consistent estimators using multiple auxiliary variables since they are linear combinations of con-sistent estimators it follows that they are also consistent.

4. Simulation, Result and Discussion We carried out data simulation experiments to compare the performance of mixture ratio estimators using mul-tiple auxiliary variables and attributes in two-phase sampling with ratio estimator using one auxiliary variable and one auxiliary attribute or ratio estimator using multiple auxiliary variable or multiple auxiliary attributes in two-phase sampling estimators for finite population.

All the results were obtained after carrying out two hundred simulations and taking their average. 1) Study variable 2 800, 96, 34, mean 50, standard deviation 6.N n n= = = = = 2) For ratio estimator the auxiliary variable is positively correlated with the study variable and the line passes

through the origin.

2 500, 96, 34, mean 29, standard deviation 6.712.N n n= = = = =

2 500, 96, 34, mean 38, standard deviation 8.612.N n n= = = = =

Correlation coefficients 1 2

0.6820, 0.5788.yx yxρ ρ= = 3) For ratio estimator the auxiliary attributes is positively correlated with the study variable and the line

passes through the origin.

2 500, 96, 34, mean 0.5612.N n n= = = =

Page 12: Mixture Ratio Estimators Using Multi-Auxiliary Variables and … · 2014-10-28 · The history of using auxiliary information in survey is as old as history of the survey sampling

P. M. Waweru et al.

787

500, 96, mean 0.4826.N n= = =

Correlation coefficients 2

0.7031yτρ = , 1

0.6614.yτρ = In order to evaluate the efficiency gain we could achieve by using the proposed estimators, we have calcu-

lated the variance of mean per unit and the mean squared error of all estimators we have considered. We have then calculated percent relative efficiency of each estimator in relation to variance of mean per unit. We have then compared the percent relative efficiency of each estimator, the estimator with the highest percent relative efficiency is considered to be the more efficient than the other estimators. The percent relative efficiency is cal-culated using the following formulae.

( ) ( )( )ˆVarˆeff 100ˆMSE

yY

Y= ∗ (4.0)

Table 1 shows percent relative efficiency of proposed estimator with respect to mean per unit estimator for single-phase sampling. It is very clear from Table 1 that our proposed mixture ratio estimator using multiple auxiliary variables and multiple auxiliary attributes simultaneously is the most efficient compared to ratio esti-mator using one auxiliary variable and one auxiliary attribute or ratio estimator using multiple auxiliary variable or multiple auxiliary attributes in two-phase sampling.

Table 2 compares the efficiency of full information case and partial case to no information case and full to partial information case of proposed mixture ratio estimators. It is observed that the full information case and partial information case are more efficient than no information case because they have higher percent relative efficiency than no information case. In addition, the full information case is more efficient than the partial in-formation case because it has a higher percent relative efficiency than partial information case.

5. Conclusion The proposed mixture ratio estimator under full information case is recommended for estimating the finite pop-ulation mean since it is the most efficient estimator compared to mean per unit, ratio estimator using one aux-iliary variable, ratio estimator using one auxiliary attribute, ratio estimator using multiple auxiliary variable and ratio estimator using multiple auxiliary attributes in two-phase sampling. In case some auxiliary variables or attributes are unknown, we recommend mixture ratio estimator under partial information case since it is more

Table 1. Relative efficiency of existing and proposed estimator with respect to mean per unit estimator for two-phase sampling.

Estimator Relative percent efficiency with respect to

mean per unit in two-phase sampling

( )var y 100

RVt 146

RAt 185

MRVt 212

MRAt 233

( )MRVAF 3.0t (proposed) 285

Table 2. Comparisons of full, partial and no information cases for proposed ratio-cum-product estimator using multiple auxi- liary variables.

Population Percent relative efficiency of full and partial to no information Percent relative efficiency of full to partial in formation case

Estimator ( )MRVAN 3.2t ( )MRVAP 3.1t ( )MRVAF 3.0t ( )MRVAP 3.1t ( )MRVAF 3.0t

Relative percent efficiency 100 139 172 100 127

Page 13: Mixture Ratio Estimators Using Multi-Auxiliary Variables and … · 2014-10-28 · The history of using auxiliary information in survey is as old as history of the survey sampling

P. M. Waweru et al.

788

efficient than the mixture ratio estimator under no information case and if all are unknown, we recommend the mixture ratio estimator under no information case to estimate finite population mean.

References [1] Neyman, J. (1938) Contribution to the Theory of Sampling Human Populations. Journal of the American Statistical

Association, 33, 101-116. http://dx.doi.org/10.1080/01621459.1938.10503378 [2] Cochran, W.G. (1940) The Estimation of the Yields of the Cereal Experiments by Sampling for the Ratio of Grain to

Total Produce. Journal of Agricultural Science, 30, 262-275. http://dx.doi.org/10.1017/S0021859600048012 [3] Hansen, M.H. and Hurwitz, W.N. (1943) On the Theory of Sampling from Finite Populations. Annals of Mathematical

Statistics, 14, 333-362. http://dx.doi.org/10.1214/aoms/1177731356 [4] Watson, D.J. (1937) The Estimation of Leaf Areas. Journal of Agricultural Science, 27, 474.

http://dx.doi.org/10.1017/S002185960005173X [5] Olikin, I. (1958) Multivariate Ratio Estimation for Finite Population. Biometrika, 45, 154-165.

http://dx.doi.org/10.1093/biomet/45.1-2.154 [6] Raj, D. (1965) On a Method of Using Multi-Auxiliary Information in Sample Surveys. Journals of the American Sta-

tistical Association, 60, 154-165. http://dx.doi.org/10.1080/01621459.1965.10480789 [7] Singh, M.P. (1967) Ratio-Cum-Product Method of Estimation. Metrika, 12, 34-42.

http://dx.doi.org/10.1007/BF02613481 [8] Robson, D.S. (1952) Multiple Sampling of Attributes. Journal of the American Statistical Association, 47, 203-215.

http://dx.doi.org/10.1080/01621459.1952.10501164 [9] Ahmad, Z. (2008) Generalized Multivariate Ratio and Regression Estimators for Multi-Phase Sampling. Unpublished

Ph.D. Thesis, National College of Business Administration and Economics, Lahore. [10] Zahoor, A., Muhhamad, H. and Munir, A. (2009) Generalized Regression-Cum-Ratio Estimators for Two Phase

Sampling Using Multiple Auxiliary Variables. Pakistan Journal of Statistics, 25, 93-106. [11] Jhajj, H.S., Sharma, M.K. and Grover, L.K. (2006) A Family of Estimator of Population Mean Using Information on

Auxiliary Attributes. Pakistan Journal of Statistics, 22, 43-50. [12] Naik, V.D. and Gupta, P.C. (1996) A Note on Estimation of Mean with Known Population of Auxiliary Character.

Journal of the Indian Society of Agricultural Statistics, 48, 151-158. [13] Bahl, S. and Tuteja, R.K. (1991) Ratio and Product Type Estimator. Information and Optimization Sciences, 12, 159-

163. [14] Hanif, M., Haq, I.U. and Shahbaz, M.Q. (2009) On a New Family of Estimator Using Multiple Auxiliary Attributes.

World Applied Science Journal, 11, 1419-1422. [15] Kung’u, J. and Odongo, L. (2014) Ratio-Cum-Product Estimator Using Multiple Auxiliary Attributes in Single Phase

Sampling. Open Journal of Statistics, 4, 239-245. http://dx.doi.org/10.4236/ojs.2014.44023 [16] Kung’u, J. and Odongo, L. (2014) Ratio-Cum-Product Estimator Using Multiple Auxiliary Attributes in Two-Phase

Sampling. Open Journal of Statistics, 4, 246-257. http://dx.doi.org/10.4236/ojs.2014.44024 [17] Moeen, M., Shahbaz, Q. and Hanif, M. (2012) Mixture Ratio and Regression Estimators Using Multi-Auxiliary Varia-

ble and Attributes in Single Phase Sampling. World Applied Sciences Journal, 18, 1518-1526. [18] Simiuddin, M. and Hanif, M. (2007) Estimation of Population Mean in Single and Two Phase Sampling with or with-

out Additional Information. Pakistan Journal of Statistics, 23, 99-118. [19] Arora, S. and Lal, B. (1989) New Mathematical Statistics. Satya Prakashan, New Delhi.

Page 14: Mixture Ratio Estimators Using Multi-Auxiliary Variables and … · 2014-10-28 · The history of using auxiliary information in survey is as old as history of the survey sampling