essays on capability indices for autocorrelated data169913/fulltext01.pdf · for an industrial...
TRANSCRIPT
ACTAUNIVERSITATISUPSALIENSISUPPSALA2007
Digital Comprehensive Summaries of Uppsala Dissertationsfrom the Faculty of Social Sciences 25
Essays on Capability Indices forAutocorrelated Data
ERIK WALLGREN
ISSN 1652-9030ISBN 978-91-554-6835-4urn:nbn:se:uu:diva-7761
Papers summarized in the dissertation
Wallgren, E. (1998). Properties of the Taguchi Capability Index for Markov Dependent Quality Characteristics.
Wallgren, E. (2001). Confidence Limits for the Process Capability Index Cpk. for Autocorrelated Quality Characteristics.
Wallgren, E. (2001). A generalization of the Taguchi capability index for data generated by a first order moving average process.
Wallgren, E. (2000). Confidence limits for a generalized Cpk index for data. from an MA(1) process.
Wallgren, E. (2002). Confidence limits for Process Capability Indices Cpkand Cpm for Stationary Time-series Data.
Contents
Introduction .......................................................................................... 7
Summary of the papers ....................................................................... 18
Reports using an AR(1) model................................................................18 Report 1. Properties of the Taguchi Capability Index for Markov Dependent Quality Characteristics..................................19 Report 2. Confidence Limits for the Process Capability Index Cpk. for Autocorrelated Quality Characteristics ............................21
Reports using an MA(1) model...............................................................23 Report 3. A generalization of the Taguchi capability index for data generated by a first order moving average process ...............24 Report 4. Confidence limits for a generalized Cpk index for data from an MA(1) process..................................................................26
Report 5. Confidence limits for Process Capability Indices Cpk and. Cpm for Stationary Time-series Data...................................................27
Acknowledgements…………………..………………………………….... 31
References…………………..………………………………….…….........32
7
Introduction
The history of process capability indices, PCIs, or process capability ratios,
PCRs, originates in the 1920s when the statistician Walter A. Shewhart first
used lot sampling and significance theory in process control. In his book
from 1931, Economic Control of Quality of Manufactured Products, She-
whart introduced a strategy for monitoring and controlling variation which
‘brought together the disciplines of statistics, engineering and economics’
(Deleryd 1996).
In the works of Shewhart, control charts play a most important role. Using
charts is a graphical method to monitor the variability of a process, and con-
trol charts for sample statistics, such as means, standard deviations or ranges,
are used as well as control charts for individual observations. Originally, the
charts were constructed in such a way that all personnel involved in the
process were able to do the measurements and to make notations in the
charts. Today, however, charts are commonly presented with control limits
formulated from concepts based on statistical theory. Measurements are of-
ten made automatically and the charts are often printed by special quality
software, such as quality tool packs included in e.g. SAS or MINITAB.
For a single quality characteristic, QC, a control chart for the sample mean,
x , is constructed as
nLCL
lineCenternUCL
/3:
:/3:
8
where UCL and LCL are the upper and lower control limits, and are
the mean and standard deviation and n is the sample size used. Commonly,
the parameters and are unknown and many alternative estimates are
used, often based on a series of samples from the process. The constant 3 is a
traditional choice and is probably explained by the fact that, if the sample
mean is an observation of a normally distributed variable and if the actual
process mean and standard deviation equal the anticipated and , almost
all (99.7 %) sample means will fall inside the control limits.
If the sample mean was included in the control limit interval the process was
considered to be under ‘statistical control’, a situation defined by Shewhart
(1931) as:
A phenomenon will be said to be controlled when, through the use
of past experience, we can predict, at least within limits, how the
phenomenon may be expected to vary in the future.
However, a control chart for x only monitors one part of the process vari-
ability. For the same process, the sample means may stay almost constant
over time while the sample standard deviations, monitored in another control
chart, indicate an increased variability. Hence, a simultaneous registration of
the sample mean and standard deviation undoubted
9
ly gives a more efficient monitoring of the process compared to a control
chart for a single statistic. As an example of a combined sx, control chart,
Fig. 1 is based on a data set from Montgomery (2001) where each sample
point is x or s in a sample of size n=5 from diameter measurements in an
automobile manufacturing. In both charts, x and s stay inside their respec-
tive control limits and hence indicate a process under control.
To be able to predict the future the process must behave in such a way that it
is possible to assign a probability model to the QC, and a process with this
property is then said to be stable. Deming (1986) strengthens this further and
also requires that the quality characteristic, at least with respect to its mean
and standard deviation, must stay constant over time. Deming also discusses
the advantages of a stable process and says, ‘it gives an identity to the proc-
ess, it has a measurable, communicable capability.’ In the language of time
series analysis, Deming’s requirements correspond to weak stationarity of a
time series.
10
Figure 1 An sx / , control chart for a data set from an automobile manufac-
turer. (Chart constructed in MINITAB).
In Juran’s Quality Control Handbook, Juran(1988), process capability is
defined as
6capabilityProcess
where is the process standard deviation ‘under a state of statistical con-
trol’. Juran further defines the capability ratio as
capabilityprocesswidthtoleranceratioCapability
where the tolerance width refers to the maximal variability accepted by the
product customer. With the commonly used notation Cp, and upper and
lower specification limits for the tolerance, USL and LSL respectively, a
basic capability ratio is
252015105Subgroup 0
74.015
74.005
73.995
73.985
Sam
ple
Mea
n
Mean=74.00
UCL=74.01
LCL=73.99
0.02
0.01
0.00
Sam
ple
StD
ev
S=0.009443
UCL=0.01973
LCL=0
Xbar/S Chart for x1-x5
11
36dLSLUSLC p
where d is the half-width of the specification interval, (USL-LSL)/2. With the
process capability in the denominator, a large Cp-value is desirable, and val-
ues of Cp 3/4 are sometimes regarded as a requisite for a well-behaved
process. The capability ratio Cp, today commonly referred to as a capability
index, is sometimes defined in reciprocal form, which gives Cp-values near 0
as favourable.
After the introduction of the fundamental capability index Cp, many alterna-
tive indices have been suggested. The two most common indices in today’s
industry are Cpk, introduced by Kane (1986) but used in Japan already in the
70s, Kane (1986), and Cpm, introduced by Hsiang and Taguchi (1985), and
independently by Chan et al. (1988)
Cpk is defined by Kane as
33),min( MdLSLUSLC pk
where is the process mean and M is the midpoint of the specification
interval. While Cp, for a fixed specification interval only varies with , Cpk
is also sensitive to the deviation of from M. This can be seen in Figure 2,
where the two lines (a) combine points ( dM / , )/ d where Cpk= Cp
=1 and the two lines (b) combine points where Cpk= Cp =2. When differs
from M, i.e. 3./ dM , the standard deviation d/ must be reduced
from approximately .33 to .23 to keep an index value Cpk=1 and from ap-
proximately .17 to .12 to keep an index value Cpk=2.
12
Figure 2 Loci of points ddM /,/ such that: (a) 1pkp CC ;
(b) 2pkp CC . For each pair, pC = solid line and pkC = dashed line.
In this form of the index, positive and negative deviations of from M are
treated equivalently but by writing the index
plpupk CCLSLUSLC ,min3
,3
min
i.e by defining two indices, one upper, Cpu, and one lower, Cpl, deviations
from M can be evaluated individually.
The index Cpm is defined as
2222 36 T
d
T
LSLUSLC pm
where T is the process target, most commonly equal to M. Compared to pkC ,
a deviation from T has a larger impact on Cpm. For example, for a given de-
viation, except for small values, an index value Cpm=1 demands a smaller
relative variance than pkC =1, see Figure 3.
0.40.30.20.10.0
0.5
0.4
0.3
0.2
0.1
0.0-M|/d
/d
(a)
(b)
|
13
Figure 3 Loci of points ddM /,/ such that: (a) 1pmpk CC ;
(b) 2pmpk CC . For each pair, pkC = solid line and pmC = dashed line.
In addition to the three indices above, many index modifications have been
proposed but here we only mention the superstructure of the PCIs introduced
by Vännman (1995):
223),(
Tv
MudvuC p
from which the three indices above can be formulated as special cases:
pp CC )0,0( , pkp CC )0,1( and pmp CC )1,0( .
The PCIs mentioned above, and many others, have been studied in many
aspects and by many authors. Kotz & Johnson (1993) and Kotz & Lovelace
(1998) give comprehensive expositions and Kotz & Johnson (1999) study
‘delicate relations’ between the three indices. In a recent report, Kotz &
Johnson (2002) give a review of PCIs for the period 1992-2000.
0.40.30.20.10.0
0.5
0.4
0.3
0.2
0.1
0.0| -M|/d
/d
(a)
(b)
14
For an industrial process, many different QCs can be of interest, and for each
QC, any of the three indices mentioned above may be used. Whatever PCI is
chosen, the index includes at least one of the parameters and , and in
general, the parameters are unknown. Hence the PCIs have to be estimated.
Consider one specific QC. Let n successive observations on this specific QC
be denoted X1, . . ., Xn. In studies of the properties of estimated PCIs, the
following three assumptions, together notated IIN, are commonly used:
1. Identically distributed random variables. To make the PCIs interpretable,
the process has to be stable, with at least a constant mean and a constant
standard deviation, Deming (1931). This can be strengthened and most
commonly, the variables X1, . . ., Xn are assumed to be identically distributed.
2. Independently distributed random variables. The variables X1, . . ., Xn are
considered to be a random sample from the process, i.e. the variables are
assumed to be independently distributed.
3. Normally distributed random variables. To facilitate inference concerning
the true PCIs, a commonly used assumption is that the variables X1, . . ., Xn
are normally distributed, in particular when the inference is based on small
samples. Traditionally, a sample of five observations is used which makes
the normal distribution assumption important. For larger sample sizes, this
assumption is of course less important.
If and are estimated respectively by the sample mean n
i i nXX1
/
and by the sample standard deviation 2/1
12 )1/(n
i i nXXS , or
with an n in the denominator, straightforward estimators for the three indices
are obtained by replacing and by X and S respectively. Using the IIN
assumptions above, exact or approximate sampling distributions for the es-
15
timators can more or less easily be found. In addition, confidence intervals,
or lower confidence limits for the indices can be deduced.
Confidence limits for the pC index are easily obtained by observing that,
222 /ˆ/ SCC pp which is distributed as )1/(21 nn . See e.g. Kane
(1986), Chou et al. (1990).
For pmC , we have, see Boyles (1991),
22222 /ˆ/ TTXSCC pmpm
This ratio follows a scaled non-central 2 distribution and hence confidence
limits can be found, either directly, or via an approximation to a central 2 variable.
For the third PCI, pkC , a compact analytic form for the probability distribu-
tion of pkC can not be derived, and useful confidence limits have to be
found from suitable approximations. Heavlin (1988), Zhang et al. (1990) and
Bissel (1990), among others, have given results in this area and Kushler and
Hurley (1992) give an expose over these results. In a simulation study they
compare the empirical coverage rates for different approximate confidence
limits by means of empirical coverage rates and they conclude that the re-
sults of Bissel are to be recommended.
Even if not all of the three IIN assumptions are fulfilled, it may still be pos-
sible to formulate probability models. Several reports published during the
last decade deal with problems in the area, and several alternative indices are
proposed.
The case of non-normality is dealt with in different ways. Franklin and
Wasserman (1992) use a bootstrap methodology for estimating pC , pkC
16
and pmC . They conclude that the actual distribution of the QC is crucial, but
from large samples, reliable confidence limits can be found by means of the
bootstrap technique.
A common statistical technique for non-normal data is the use of a suitable
transformation, e.g. using logarithms, to achieve approximate normality,
Kane (1986) and Gunther (1989). On the other hand, if the QC can be mod-
eled by some non-normal distribution, e.g. Gamma or Weibull, an estimated
PCI can be found from quantile estimation, where the sample is used to es-
timate the parameters of the actual distribution, Kotz & Lovelace (1998). For
example, the estimate of the index pC can be written
00135.99865.ˆˆ
ˆXXLSLUSLC p
where 99865.X and 00135.X are the estimated 99.865 % and .135 % quantiles
of the QC which leaves 99.7 % of the distribution between the quantiles.
The requirement of identically distributed variables is highly important for a
process to be stable, and violations of this requirement are often difficult to
deal with. In a recent report, Carlsson (2002) studies a case with randomly
occurring shifts in the process level, a state that increases the process vari-
ability and reduces the PCI values.
This dissertation deals with the case when the assumption of independent
variables is not fulfilled. Though this situation is well known in the industry
and recognized by many authors, Kotz and Lovelace (1998), reports in the
area are not frequent. The various effects of ignoring autocorrelation on es-
timates of the process mean and standard deviation, are discussed by Shore
(1997), while Christoffersson (1999) estimates the asymptotic distribution of
),( vuC p for dependent data by means of bootstrap methods. Zhang (1998)
17
derives approximate confidence limits for pkC for samples from a process
with a known autocorrelation function.
In the present dissertation, the indices pmC and pkC are studied in situations
where the variables X1, . . ., Xn are dependent and observations from the pro-
duction process can be regarded as observations on a stationary time series
ntX t ,...,2,1, , with an unknown autocorrelation function.
The indices pkC and pmC are studied in five reports. In reports 1-4, the time
series modeling is restricted to an AR(1) or an MA(1), but in the last report,
weak stationarity is the only restriction. See Table 1
Table 1 Classification of the reports according to PCI
and model assumptions.
Model
restrictionpmC pkC
AR(1) Report 1 Report 2
MA(1) Report 3 Report 4
Weak
stationarity Report 5
For each model assumption, generalizations of pkC and pmC are proposed
and confidence limits are derived.
All reports include extensive Monte Carlo simulations used to evaluate the
confidence limits. Random samples are generated from an AR(1) or an
MA(1) process, and, in the fifth report, from more complex ARMA-models.
The confidence limits are calculated in two ways:
1. The time series structure is recognized and the confidence limits are calcu-
lated according to the derived results.
18
2. The time series structure is not recognized (ignored) and the confidence
limits are calculated according to earlier results derived for independent
samples.
For all cases the empirical percentage coverage rates are calculated and
compared to a pre-determined nominal confidence level.
Note that the five reports included in this thesis are written over a period of
years and the notations have changed. In the last report, Report 5, notations
are adapted to modern time series literature and these notations are also used
in the following summary.
Summary of the papers
Reports using an AR(1) model
For many industrial processes, like oil refinery, paper production etc, it is
well known that the level of individual quality characteristics often varies
with a wave-like pattern. Observations on such a characteristic, made at
equal time intervals, are then supposed to be dependent and the outcome of
such a process can be modeled in many ways.
In this report we choose the well-known AR(1) model,
ttt ZXX 1
where tX is the time series with parameters and , and tZ is white
noise with variance 2 . An AR(1) model, sometimes known as a low pass
filter or Markov dependency, is probably the most commonly used single
ARIMA(p,q)-model in industry. With the commonly accepted ‘keep it sim-
ple’-attitude to time series analysis, the AR(1) is often a reasonable model
and furthermore easy to estimate.
19
Report 1. Properties of the Taguchi Capability Index for Markov Dependent
Quality Characteristics.
For independent samples, confidence limits, and intervals for Cpm were given
by Boyles (1991). Using the technique discussed in the introduction, an
)%1(100 lower confidence limit for Cpm is
ˆ)ˆ(ˆ
2
pmC
where ˆ is the estimated number of degrees of freedom for a central 2 -
distribution given by
2
22
ˆ21ˆ1
ˆn
where 222 /ˆ STX is a non-centrality parameter estimate.
For a sample from an AR(1) process we propose an index denoted CpmAR.
Following the derivations made by Boyles, and using X and 2S as estima-
tors for the process level and the total process variance respectively, we
show that an )%1(100 lower confidence limit for CpmAR can be calcu-
lated as for an independent sample if ˆ is estimated by
ˆ1ˆ1ˆ2ˆ1
ˆ1ˆ1
ˆ2
2
2
22n
where 2ˆ is the same as for an independent sample and where ˆ is the esti-
mated first order sample autocorrelation coefficient.
20
For dependent samples, as well as independent samples, and for large values
of ˆ , the central 2 -distribution can be approximated by a normal distribu-
tion, which gives the )%1(100 lower confidence limit
))ˆ2/(1(ˆ 2/11zC pmAR
where z is the lower % percentile of a standard normally distributed
variable. Of course, ˆ , being the estimated number of degrees of freedom,
strongly depends on the sample size n. But it can also be noted that a strong
positive autocorrelation reduces the value of ˆ which probably makes a
normal approximation impossible.
In extensive Monte Carlo simulations samples are generated from an AR(1)
process and the samples are used to estimate lower confidence limits for
CpmAR . Estimates are calculated in two ways: By taking account of the auto-
correlation, using the confidence limits developed in this report, and
by ignoring the autocorrelation, using the confidence limits developed for
independent samples by Boyles (1991). Since, from an industrial point of
view, the higher capability index the better, only a lower confidence limit for
the index is calculated.
For both estimation methods, empirical coverage rates are calculated and
compared to nominal confidence levels.
The simulations are made for a range of and -values and for autocor-
relations 1 . From the simulations it was found that, except for -
values close to 1, the coverage rates given by the lower confidence limits
given in the report are always closer to a chosen nominal confidence level
than the coverage rates given by confidence limits when autocorrelations are
ignored.
21
Report 2. Confidence Limits for the Process Capability Index Cpk for Auto-
correlated Quality Characteristics.
From a statistical point of view, Cpk, with parameters to be estimated in the
numerator as well as in the denominator, is less tractable than Cpm. Further-
more, Cpm =(tolerance width)/(process capability), has a more natural inter-
pretation than Cpk. Nevertheless Cpk is more commonly used.
For independent samples, a compact analytic expression for the sampling
distribution of pkC has not been found and only approximate results for
)ˆ( pkCE and )ˆ( pkCVar are available. As an approximate lower confidence
limit for Cpk estimated from an independent sample, Bissel (1990) suggests
12
ˆ
91ˆ
2
1 nC
nzC pk
pk
which is also recommended by Franklin and Wasserman (1992).
This confidence limit, however, is not valid for observations on dependent
variables and in this second report, observations from an AR(1) process are
used to derive point and interval estimates of Cpk .The index estimator is the
same for independent and dependent data but the variance of the estimator
depends on the strength of the dependence. For the variance, we make a
Taylor linearization of pkC and, for large samples, a compact formula for
the variance is derived.
A Monte Carlo simulation indicates that pkC for dependent data is approxi-
mately normally distributed, at least for large samples. In this report, the
proposed lower confidence limit for pkC based on data from an AR(1) proc-
ess is
22
2
22
11 ˆ1ˆ1
2
ˆˆ1ˆ1
91ˆ)ˆ(ˆˆ
nC
nzCCarVzC pk
pkpkpk
where ˆ is the estimated first order autocorrelation coefficient, commonly
denoted ˆ . It can be noted that the estimated variance of pkC increases
with increasing ˆ .
The actual coverage rate of the proposed lower confidence limit for pkC is
estimated from Monte Carlo simulations. The empirical coverage rates for
the proposed confidence limits are compared to a nominal confidence level
and also to the empirical coverage rates for the confidence limits proposed
by Bissel, which assume independent observations.
Samples are generated from an AR(1) process with process mean and
noise variance 2 and confidence limits are calculated in two ways:
1. By taking account of the autocorrelation, using the confidence limits
developed in this report and
2. By ignoring the autocorrelation, using the confidence limits developed
for independent samples by Bissel.
3. The simulations are made for a range of and -values and for auto-
correlations 1 .
The results from the simulation study indicate that, if the assumption of an
AR(1) process is adequate,
(i) when the process mean equals the target, Cpk can be used as a
PCI for 5. , approximately, and
(ii) when the process mean does not equal the target, Cpk can be used
as a PCI for for 5. .
23
(iii) furthermore, the study indicates that large samples are necessary
to achieve coverage rates close to a nominal confidence level.
Finally, by comparing the results of the first two reports, it is found that Cpm
can be used in a wider interval of autocorrelations than Cpk. It is also found
that the empirical coverage rates for Cpm are generally closer to a nominal
confidence level than Cpk. Together with earlier discussed differences be-
tween the two indices, it is concluded that the Cpm index is preferable to Cpk.
Reports using an MA(1) model
In reports 3 and 4, an MA(1) model is used, i.e.
1ttt ZZX
which is as simple as the AR(1) and can be of interest for some applications,
although AR(1) is more often used in the industry.
Here and are the time series parameters, while tZ and 1tZ are white
noise with variance 2 .
In time series analysis, parameters of ARMA models are often estimated
through a recursive procedure, e.g. the Innovation Algorithm, see Brockwell
and Davis (1988). But with an additional assumption of invertibility, the
parameter of an MA(1) model can be estimated through the estimation of
the first order autocorrelation coefficient .
24
Report 3. A generalization of the Taguchi capability index for data generated
by a first order moving average process
In this report an index Cpm is proposed to be used for data from an MA(1)
process. Approximate confidence limits for the index are derived in a similar
manner as for an AR(1) model in the first report. A generalization of Boyles’
(1991) results gives as a )%1(100 lower confidence limit for CpmMA
ˆ)ˆ(ˆ
2
pmMAC
where ˆ is the estimated number of degrees of freedom for a central 2 -
distribution and given by
ˆ21ˆ2ˆ21ˆ1
ˆ 22
22n
and where, as before, 222 /ˆ STX is an estimated non-centrality
parameter.
For large values of ˆ , an )%1(100 lower confidence limit for Cpm is
))ˆ2/(1(ˆ 2/11zC pm
Monte Carlo simulations are made to compare the empirical coverage rates
of these approximate lower confidence limits to the coverage rates based on
lower confidence limits when autocorrelations are ignored, or not recog-
nized.
The samples, from which the Cpm are estimated, are generated and treated in
two ways:
1. Samples are generated from an MA(1)-process and the confidence limits
are based on the assumptions of an MA(1)-process, i.e. a correct model
assumption.
25
2. Samples are generated from an AR(1) process but the confidence limits
are based on the assumptions of an MA(1) process, i.e. an incorrect
model assumption.
The simulations are made for a range of and -values and for autocorre-
lations 5. .
From the simulation study the following conclusions are made:
1. Autocorrelations are included in the calculations.
If samples are generated from an MA(1)-process, the empirical coverage
rates of the confidence limits proposed in this report are very close to the
nominal confidence level for all parameter settings. If the model is misspeci-
fied, samples are actually generated from an AR(1)-process, the coverage
rates can differ substantially from the nominal confidence level, especially
when the process level is out of target.
2. Autocorrelations are ignored in the calculations.
Irrespective of generating process, an MA(1) or an AR(1)-process, the em-
pirical coverage rates of the confidence limits calculated in this report differ
considerably from a nominal confidence level. The size of the difference
depends on both the size of the autocorrelation and on whether the process
level is on target or not.
Finally, it is observed that increasing the sample size from 50 to 200 does
not give any substantial increase in the coverage rates.
26
Report 4. Confidence limits for a generalized Cpk index for data from an
MA(1) process.
This fourth report completes the generalization of PCIs for simple ARMA-
models. The index Cpk, is estimated from MA(1) process data in almost the
same way as in report 2. With an assumption of an MA (1)-process, and as-
ymptotic normality of pkC , the following )%1(100 lower confidence
limit for CpA is proposed:
22
1 ˆ212
ˆˆ21
91ˆ
nC
nzC pk
pk
Monte Carlo simulations are made to compare the empirical coverage rates
of the approximate lower confidence limits to the coverage rates based on
lower confidence limits for independent samples given by Bissel.
The samples from which the Cpk are generated are treated in two different
ways:
1. Samples are generated from an MA(1) process and the confidence limits
are based on the assumptions of an MA(1) process, i.e. a correct model
assumption.
2. Samples are generated from an AR(1) process but the confidence limits
are based on the assumptions of an MA(1) process, i.e. an incorrect
model assumption.
The simulations are made for a range of and -values and for autocorre-
lations 5. .
The analyses of the simulation results are based on four-way ANOVAs with
the four factors n, , and . To distinguish different parts of the cover-
age rate errors, three response variables are defined:
27
1. The coverage rate error defined as the difference between the empirical
coverage rate and the nominal confidence level when the model assump-
tion is correct and autocorrelations are taken into account.
2. The corresponding coverage rate error that results from ignoring the
autocorrelations, i.e. samples are generated from an MA(1) process but
treated as simple random samples.
3. The corresponding coverage rate error that results from an erroneous
time series assumption, i.e. samples are generated from an AR(1) process
but treated as samples from an MA(1) process.
From the simulations it can be concluded that the estimation approach pro-
posed in the report gives coverage rates close to nominal confidence levels
when the time series is correctly specified. If the time series structure is not
recognized and the samples are regarded as simple random samples, the cov-
erage rate errors increase with increasing autocorrelation. But for the models
under study, the error sizes are not notably large.
On the other hand, if the time series is misspecified, the coverage rate errors
may be very large and it is reasonable to believe that, for time series with a
strong serial dependency, the characteristics of the time series must be ac-
knowledged, either by finding a suitable time series model or by studying the
complete autocorrelation function.
Report 5. Confidence limits for Process Capability Indices Cpk and Cpm for Stationary Time-series Data
In the last report, the earlier assumption of a simple AR- or MA-model is
relaxed and weak stationarity of the time series is now the only restriction.
To be used for data from a stationary time series, the use of the indices Cpk
and Cpm, are generalized and two indices, CpkS and CpmS, are proposed. As in
28
the earlier reports, the proposed indices are identical to Cpk and Cpm for inde-
pendent data.
The variance of pkSC is approximated through a Taylor expansion, which
leads to the proposed )%1(100 lower confidence limit for CpkS
hh
pkS
hhpkS n
Cn
zC ˆ2
ˆˆ
91ˆ
22
1
where hˆ is the sample autocorrelation at lag h. This confidence limit cov-
ers the corresponding confidence limits in reports 2 and 4 as special cases.
For pmSC , the procedure adopted in reports 1 and 3, used by Boyles (1991),
is generalized and an )%1(100 lower confidence limit for pmSC is
ˆ)ˆ(ˆ
2
pmC
where ˆ is the estimated number of degrees of freedom for a central 2 -
distribution and given by
h hh h
n
ˆˆ2ˆ
ˆ1ˆ
22
22
and where 222 /ˆ STX is a non-centrality parameter estimator.
For large values of ˆ , an )%1(100 lower confidence limit for Cpm is
))ˆ2/(1(ˆ 2/11zC pmS
This confidence limit covers the corresponding confidence limits in reports 1
and 3 as special cases.
For both indices, the approximate confidence limits include 1 ˆh h and
12ˆh h , and each h is estimated by the traditional sample autocorrela-
tion coefficient. For each sum, the number of autocorrelation coefficients
29
included is determined by sequential tests and a general test procedure for
stationary data is proposed.
In a limited Monte Carlo simulation the reliability of the proposed limits is
exemplified. As examples of stationary time series, two contrasting
ARMA(p,q) are used: an ARMA(2,1) and an ARMA(1,2) with parameters
chosen to achieve both causality and invertibility, i.e. for both models, both
parameter polynomials have roots well outside the unit circle. The only pa-
rameters allowed to vary in the simulations are the process mean and the
noise variance 2 .
Samples are generated from each process and lower confidence limits for
CpkS and CpmS are estimated in two ways:
1. By taking account of the autocorrelations, using formulas proposed in
the report.
2. By ignoring the autocorrelations, using formulas proposed by Bissel
(1990) and Boyles (1991) respectively.
For both CpkS and CpmS the results of the simulations indicate that, when the
autocorrelations are taken into account, the coverage rates are generally
closer to the nominal confidence level than when the autocorrelations are
ignored. Comparing the indices, the simulations also indicate that CpmS is less
influenced by changes in 2 and than CpkS .
One noteworthy result is found in the estimation of Cpk: if the autocorrela-
tions are ignored, the coverage rates strongly depend on , but if autocorre-
lations are included, the coverage rates do not vary with .
Finally in this report, a data set from an oil refinery is studied. Observations
on a gas concentration are made each minute during a 10-hour period and the
30
series shows an apparent time series behavior. Lower confidence limits for
CpkS and CpmS are estimated: both for the frequency one observation each
minute and one observation each three or five minutes. The confidence lim-
its for CpkS and CpmS diverge and clearly depend on the observation fre-
quency. The results make it obvious that if a passage regarding a capability
index is included in a contract, the estimation procedure must be clearly
specified.
31
Acknowledgements.
This thesis has been written rather late in my career as a university teacher;
for many years the plans of a dissertation was not on my agenda. But when
professor Olle Carlsson was engaged at the staff of statisticians of Örebro
University, he introduced to me the problem of capability index calculations
for dependent observations and slowly the possibility of producing a thesis
became realistic. I owe him a dept of gratitude for all the time he spent with
me, discussing my problems and reading my papers.
At Uppsala University, professor Anders Christoffersson and the board of
supervisors at the department of statistics have read and commented on my
thesis and I am very grateful for their help. Valuable suggestions on the Eng-
lish text were made by Mr Gray Gatehouse. The financial support provided
by Örebro University is gratefully acknowledged.
I am also indebted to my colleagues at the university for their friendly atti-
tude to my work and last but not least, I am grateful to my wife Lotta for
here support and patience.
Örebro, February 2007
Erik Wallgren
32
REFERENCES:
Bissel, A.F. (1990). How Reliable is Your Capability Index? Applied Statistics, 39(3), pp 331-340.
Boyles, R.A. (1991). The Taguchi Capability Index, Journal of Quality Technology, 23, pp 17-26.
Box, G.E.P., Jenkins, G.M. and Reinsel, G.C. (1994). Time Series Analysis. Prentice Hall, New Jersey.
Brockwell, P.J. and Davis, R.A. (1988). Time Series. Theory and Methods. 2nd ed. Springer, New York.
Carlsson, O. (2002). A Generalization of the Cpk Index for Production Processes with Varying Process Levels. Private Communications.
Chan, L.K., Cheng, S.W. and Spiring, F.A. (1988). A New Measure of Process Capability: Cpm . Journal of Quality Technology, 20(3), pp 163-175.
Chou, Y-M. and Owen, D.B. (1989). On the Distribution of the Estimated Process Capability Indices. Communications in Statistics-Theory and Methods, 18(12), pp 4549-4560.
Chou, Y-M., Owen, D.B. and Borrego, S.A. (1990). Lower Confidence Limits on Process Capability Indices. Journal of Quality Technology,
22(3), pp 223-229.
Christoffersson, J. (1999). Process Capability Indices and Block Bootstrap for Serially Correlated Data. Research Report 1999:4. Luleå University of Technology
Deleryd, M. (1996). Process Capability Studies in Theory and Practice. Licentiate Thesis. Luleå University of Technology.
Deming, W.E. (1986). Out of the Crisis. Cambridge University Press, Cambridge.
Franklin, L.A. and Wasserman, G.S. (1992). Bootstrap Lower Confidence Limits for Capability Indices. Journal of Quality Technology, 24, pp 196-210.
33
Franklin, L.A. and Wasserman, G.S. (1992). A Note on the Conservative Nature of the Tables of Lower Confidence Limits for Cpk, with a Suggested Correction. Communications in Statistics-Simulation and Computation, 23, pp 1165-1169.
Gunther, B. (1989). The Use and Abuse of Cpk, Parts 1-4. Quality Progress, 22(1), pp 72-73.
Heavlin, W.D. (1988). Statistical properties of capability indices. Technical Report No. 320, Tech. Library, Advanced Micro Devices, Inc., Sunnyvale, California.
Hsiang, T.C. and Taguchi, G. (1985). A Tutorial on Quality Control and Assurance- The Taguchi Methods. ASA, Annual Meeting, Las Vegas, USA.
Juran’s Quality Control Handbook, (1988). Juran, J.M. (Ed). McGraw- Hill, New York.
Kane, V.E. (1986). Process Capability Indices. Journal of Quality Technology, 18, pp 41-52.
Kotz, S. and Johnson, N.L. (1993). Process Capability Indices. Chapman & Hall, London.
Kotz, S. and Johnson, N.L. (1999). Delicate Relations Among the Basic Process Capability Indices Cp, Cpk and Cpm, and their Modifications.
Communications in Statistics-Simulation and Computation., 28(3), pp 849-866.
Kotz, S. and Johnson, N.L. (2002). Process Capability Indices- A Review, 1992-2000. Journal of Quality Technology, 34, pp 2-19.
Kotz, S. and Lovelace,C.R. (1998). Process Capability Indices in Theory and Practice. Arnold, London.
Kushler, R.H and Hurley, P. (1992). Confidence Bounds for Capability Indices. Journal of Quality Technology, 24, pp 188-195.
Montgomery, C.D. (2001). Introduction to Statistical Quality Control. Wiley, New York.
Priestley, M.B. (1981). Spectral Analysis and Time Series, Volume 1. Academic Press, London
34
Shewhart, W.A. (1931). Economic Control of Quality of Manufactured Product. Van Nostrand, New York.
Shore, H. (1997). Process Capability Indices when Data are Autocorrelated. Quality Engineering 9, pp 615-626.
Wallgren, E. (1996). Properties of the Taguchi Capability Index for Markov Dependent Quality Characteristics. Lic.Thesis, Uppsala University.
Wallgren, E. (2000). Confidence Limits for a Generalized Cpk Index from an MA(1) Process. Working Paper No 4, ESA, Örebro University.
Wallgren, E. (2001). Confidence limits for the process capability index Cpk for autocorrelated quality Characteristics. Frontiers in Statistical Quality Control. Ed Lenz, H.J. and Wilrich, P.Th. Physica-Verlag, Heidelberg.
Wallgren, E. (2001). A Generalization of the Taguchi Capability Index for Data Generated by a First Order Moving Average Process. Statistical Methods, 3(1) pp 1-15.
Vännman, K. (1995). A Unified Approach to Capability Indices. Statistica Sinica 5(2), pp 805-822.
Zhang, N.F., Stenback, G.A. and Wardrop, D.M. (1990). Interval Estimation of Process Capability Index Cp.. Communications in Statistics-Theory and Methods, 19, pp 4455-4470.
Zhang, N.(1998). Estimating Capability Indices for Autocorrelated Data.Journal of Applied Statistics, 25(4), pp 559-574.
Acta Universitatis UpsaliensisDigital Comprehensive Summaries of Uppsala Dissertationsfrom the Faculty of Social Sciences 25
Editor: The Dean of the Faculty of Social Sciences
A doctoral dissertation from the Faculty of Social Sciences,Uppsala University, is usually a summary of a number ofpapers. A few copies of the complete dissertation are kept atmajor Swedish research libraries, while the summary alone isdistributed internationally through the series DigitalComprehensive Summaries of Uppsala Dissertations from theFaculty of Social Sciences. (Prior to January, 2005, the serieswas published under the title “Comprehensive Summaries ofUppsala Dissertations from the Faculty of Social Sciences”.)
Distribution: publications.uu.seurn:nbn:se:uu:diva-7761
ACTAUNIVERSITATISUPSALIENSISUPPSALA2007