mean and standard deviation for the facilities design problem

5
Computersind. Engng Vol. 19, Nos 1-4, pp. 313-317, 1990 0360-8352/90 $3.00 +0.00 Printed in Great Britain. All fights reserved Copyright © 1990 Pergamon Press plc MEAN AND STANDARD DEVIATION FOR THE FACILITIES DESIGN PROBLEM S. Allen Broughton and Veeravudhi Charumongkol Department of Mathematics and Department of Industrial Engineering Cleveland State University Cleveland, Ohio 44115 ABSTRACT An important problem in facilities design to find an assignment of n facilities to n locations so that total materials handling cost is minimized. For problems of moderate size, suboptimal solutions must be accepted since optimal algorithms are computationally infeasible. If the mean and standard deviation of the layout cost distribution is known, then statistical methods may be used to measure and compare the efficiencies of various suboptimal solutions as well as to monitor the efficiency of the same assignment under changing production environments. In this paper a new, simple algorithm to calculate the exact value of the standard deviation of the layout cost distribution is presented (the mean is easy). This algorithm has a computational efficiency of O(n 2) arithmetic operations for a problem of size n × n, an improvement over previous methods which are either inexact or have a computational efficiency of O(n4). Results of tests verifying the accuracy and claimed efficiency of this algorithm, as implemented on a microcomputer, are also presented (about 0.85 second for a 30 x 30 problem). INTRODUCTION A problem confronting the industrial plant designer is to assign n facilities to n locations in a plant so that the total material handling cost is minimized. Since there are n! possible assignments to consider, algorithms yielding optimal solutions, are computationally infeasible even for moderately large values of n. A number of computationally feasible algorithms, yielding only suboptimal solutions, have been devised, e.g., [AB], [BAV], [HC], [G]. The software package CRAFT ([AB], [BAV]), which is commonly uscd in facilities design, is based on some of these algorithms. Since we have to consider suboptimal solutions it is important to have a measure of the quality of a proposed solution. A commonly used statistical measurc is the efficiency of a proposed assignment which was introduced by Wallace et. al. in [WHT]. We now describe this measure as well as the cost distribution associated to the facilities design problem. Let ~r denote a typical assignment of n facilities to n locations in a plant, i.e., for each i = 1,..., n facility i is assigned to location It(i). There is an associated materials handling cost which is denoted by C~, we defer the explicit definition of C, to formula (5). As we vary through all possible assignments we get a data set consisting of n! C~'s. Let p and a be the mean and standard deviation of this data set, they are defined by: 1 (1) P ----~.W E g'~' 1 f where the summations are over all n! possible assignments 7r. If the layout cost distribution is, for example, normal then about 95.4% of the layout costs C~ are with two standard deviations of the mean aand about 99.7% of the layout costs are within three standard deviations of the mean. Thus, it is statistically unlikely that many layout costs smaller than p - 2a or/~ - 3a will be found. This justifies the following measure of the quality of a proposed facility assignment. The efficiency e,~, of the assignment v is given by: ° 313

Upload: sallen-broughton

Post on 21-Jun-2016

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Mean and standard deviation for the facilities design problem

Computers ind. Engng Vol. 19, Nos 1-4, pp. 313-317, 1990 0360-8352/90 $3.00 +0.00 Printed in Great Britain. All fights reserved Copyright © 1990 Pergamon Press plc

M E A N A N D S T A N D A R D D E V I A T I O N

F O R T H E F A C I L I T I E S D E S I G N P R O B L E M

S. Allen Broughton and Veeravudhi Charumongkol

Department of Mathematics and

Department of Industrial Engineering Cleveland State University

Cleveland, Ohio 44115

A B S T R A C T

An important problem in facilities design to find an assignment of n facilities to n locations so that total materials handling cost is minimized. For problems of moderate size, suboptimal solutions must be accepted since optimal algorithms are computationally infeasible. If the mean and standard deviation of the layout cost distribution is known, then statistical methods may be used to measure and compare the efficiencies of various suboptimal solutions as well as to monitor the efficiency of the same assignment under changing production environments. In this paper a new, simple algorithm to calculate the exact value of the standard deviation of the layout cost distribution is presented (the mean is easy). This algorithm has a computational efficiency of O(n 2) arithmetic operations for a problem of size n × n, an improvement over previous methods which are either inexact or have a computational efficiency of O(n4). Results of tests verifying the accuracy and claimed efficiency of this algorithm, as implemented on a microcomputer, are also presented (about 0.85 second for a 30 x 30 problem).

I N T R O D U C T I O N

A problem confronting the industrial plant designer is to assign n facilities to n locations in a plant so that the total material handling cost is minimized. Since there are n! possible assignments to consider, algorithms yielding optimal solutions, are computationally infeasible even for moderately large values of n. A number of computationally feasible algorithms, yielding only suboptimal solutions, have been devised, e.g., [AB], [BAV], [HC], [G]. The software package CRAFT ([AB], [BAV]), which is commonly uscd in facilities design, is based on some of these algorithms. Since we have to consider suboptimal solutions it is important to have a measure of the quality of a proposed solution. A commonly used statistical measurc is the efficiency of a proposed assignment which was introduced by Wallace et. al. in [WHT]. We now describe this measure as well as the cost distribution associated to the facilities design problem.

Let ~r denote a typical assignment of n facilities to n locations in a plant, i.e., for each i = 1 , . . . , n facility i is assigned to location It(i). There is an associated materials handling cost which is denoted by C~, we defer the explicit definition of C , to formula (5). As we vary through all possible assignments we get a data set consisting of n! C~'s. Let p and a be the mean and standard deviation of this data set, they are defined by:

1 (1) P ---- ~.W E g'~'

1

f

where the summations are over all n! possible assignments 7r. If the layout cost distribution is, for example, normal then about 95.4% of the layout costs C~ are with two standard deviations of the mean aand about 99.7% of the layout costs are within three standard deviations of the mean. Thus, it is statistically unlikely that many layout costs smaller than p - 2a or/~ - 3a will be found. This justifies the following measure of the quality of a proposed facility assignment. The efficiency e,~, of the assignment v is given by: °

313

Page 2: Mean and standard deviation for the facilities design problem

314 Proceedings of the 12th Annual Conference on Computers & Industrial Engineering

(3) e~ = p + ra - C,~ 2ra × 100%,

where r , p, a are as above and r = 1 ,2 ,3 . . . is a selected level of standard, usually r = 2 or 3. The quantity en is just a linear measure of efficiency in which the "statistically worst case" # + ra is rated at 0% and the "statistically best case" p - ra is rated at 100%.

This measure of efficiency is only useful if we have good algorithms to compute the distribution parameters p and a. A formula for p, formula (6), was first given by R.E. Gomory and reported in fG]. The first algorithm to compute a was developed by Wallace et. al. [WHT], subsequent refinements have been given in [DS], [SS], [KKN1], [KKN2] and [MB]. In [WHT], exact formulae for n < 5 were obtained by algebraic manipulation of the formula for the variance and an approximate formula was proposed for larger values of n. The general equation of the variance, which has a form similar to formula (7) below, was also given in [WHT] but no exact algorithm was developed. From the form of the equations in [WHT] it is clear that about n ~ multiplications and additions were required for exact evaluation of the standard deviation. Therefore, Wallace et. al. had theoretically succeeded in reducing the complexity of calculation from worse than O(n!) arithmetic operations to O(n 4) arithmetic operations. The subsequent papers, cited above, offer refinements of the formulae given in [WHT], but the formulae are either inexact ([SS], [MB]) or exact algorithms are given, but without a substantial reduction of the computational complexity ([DS], [KKN1], [KKN2]). Moreover, in those papers which do present exact algorithms, no formal derivation of the general case is given.

In this paper we give simple formulae for the exact calculation of a, which require only O(n 2 ) arithmetic operations (Theorem 1). The formulae may be derived by using the mathematical theory of group actions on networks. Due limitations of space we only give the formulae for computing the mean and standard deviation in this paper; the complete development of the formula and results, their statistical verification and a pseudocode algorithm for the computation of p and a are given in [B-C].

F O R M U L A E F O R p A N D a

In the usual formulation of the facilities assignment problem we are given two n × n matrices F = (fij) and D = (dij) where flj is the rate of fiow of materials between locations i and j and di j is either the rectilinear distance or straightline distance between locations i and j. The fij often have a built-in cost factor so they actually represent per unit length materials handling cost rather than actual flow. The matrix D is generally assumed to be symmetric since the path travelled between location i and j is generally the same backwards and forwards. We do not assume that F is symmetric since backward and forward flow between facilities need not be equal. If we start from an initial layout in which facility i is put in location i, then the per unit time materials handling cost of the layout is clearly given by:

C = y ~ f~,~d~,~. i-~-I j ~ - I

Since there is no transport cost for material handling at location j we have assumed fi,i = di,i -= 0 in this equation. The cost equation may be rewritten:

C = ~ ~ (fi,j + fj,i)di,j. i-~l j-~i+l

The quantity ¢i,j = fi,j + f£i is the total forward and backward flow between facilities i and j and is the ( i , j ) -entry of the symmetric matrix F + F t (t = transpose). Let us call • = (¢i,j) = F + F ~ the total flow matrix. In this notation, the layout cost is given by:

(4) C=~ ~ ¢~,jd~, i. i l l j~i-bl

Now suppose that we make a reassignment of the facilities to the locations, say that facility i is moved to location lr(i). Consider the total cost C~ for the new assignment. Since facilities i and j will be moved to location ~r(i) and ~r(j) respectively there will now be a total flow of ¢i,j between locations 7r(i) and 7r(j). Thus, the contribution fbi,jdl,j in equation (4) should be replaced by ¢i,jd~(o,~(j ) since the distance

Page 3: Mean and standard deviation for the facilities design problem

Broughton and Charumongkol: Facilities Design Problem 315

between locations ~r(i) and is ~r(j) is d~(0,r(j ). The formula for C,r is:

(5) C~- = '~"~ ~ ¢,,.id..(O,,(./). i ff i l ,./----i+ l

Let E . denote the collection of all n! possible assignments. The collection {C~ : 7r E En} is the data set of layout costs for which we seek the mean and standard deviation. We state our results as a series of formulae in Theorem 1 below. A computer program or spread sheet program may be easily written from the formulae (cf. [B-C] for proofs and computer algorithm).

T h e o r e m 1. Let the n x n flow and distance matrices, F = (fi t) and D = (dii) be given. Define the total flow matrix ¢ by: q~ = (¢i i ) = (fli + fji). Let {C,t : ~" E E ,} be the cost distribution defined by (5) and let IJ and a be the mean and standard deviation of this distribution, defined by (1) and (2). Define the following quantities:

No - n(n - 1) n(n - 1)(n - 2)(n - 3) 2 ' g , , , = n ( n - 1)(n - 2), Na = 4 '

i----1 j - - - - i+ l

i ff i l j f f i i + l

i f1

rt u2¢ = ~ ( v , ~ ) 2 ,

i----1

T,, ,¢ = U2 ¢ - 2 T , ¢ ,

Ta¢ = ( s ¢ ) 2 - Toe - T,.¢,

S d = ~ d i i , i= l j f f i i+l

i----1 j -~ i+l

Uid = E d~j, .iffil

n

V'd =

i----1

Trod = U2 d - 2Tod,

Tdd = (Sd) 2 - Tod - Trod.

Then, the distribution parameters p and a are ~ven by:

(6) /z = ~.-~S~Sd,

and

(7) ~2 = T.~,T,d T . . ¢ T m d T~¢Tdd ~2. N- Y + +

V E R I F I C A T I O N O F F O R M U L A E A N D C O M P U T A T I O N A L C O M P L E X I T Y

A computer program based on the formula above has been developed to compute the standard deviation of layout cost distributions. The program was coded in TURBO BASIC using the pseudocode algorithm given in [B-C] as a model. As a verification that the computer program gives the correct standard de- viation, all the sample problems appearing in the literature for which the standard deviation had been calculated exactly were sought out and the standard deviations recalculated. The trials were run on an 8 Mhz. IBM compatible microcomputer with an Intel 80286 processor and 80287 numeric coprocessor. The data obtained in these trials are given in Table 1. Most of the discrepancies between the results in the literature and the recalculated standard deviations, in relative terms, are much less than 0.1%. The worst case is the 20 x 20 sample problem where the relative difference is about 0.15%.

By inspection of the formulae in Theorem 1 it appears that the number of calculations for a problem of size n x n has been reduced from O(n 4) to O(n2). To check the validity of this and also to extrapolate running times to larger problem sizes, the running times for the trials were recorded and a regression model of Y (running time) on X (number of facilities) was developed. A quadratic model Y = aX 2 + bX + c

Page 4: Mean and standard deviation for the facilities design problem

316 Proceedings of the 12th Annual Conference on Computers & Industrial Engineering

was assumed because of the theoretical result on computational efficiency. The statistical analysis in [B-CJ shows that the data almost perfectly fits the quadratic model:

Y = 0.000884X 2 + 0.00145X + 0.001278.

By using the regression model above, running time for a 40-facility problem (the maximum size considered by CRAFT) is predicted to be 1.4736 seconds, as shown in the table. An O(n 4) algorithm would presumably take about 1600 times as long.

S U M M A R Y AND DISCUSSION

A new algorithm for calculating the standard deviation of the of the cost distribution associated to the facilities design problem has been developed. The algorithm calculates the exact value of the standard deviation and is more efficient than previous algorithms, O(n 2) vs. O(n4). Algorithms, of a lower order of computational efficiency are not possible, since the data in this problem has O(n 2) degrees of freedom. A computer program, written to test the algorithm, and running on a commonly available microcomputer is calculates the distribution parameters for a 30 × 30 problem in about 0.85 second and is predicted to calculate the parameters for 40 × 40 problem (the largest size considered by CRAFT) in less than 1.5 seconds. The algorithms are easily implemented on a microcomputer as a compiled program or a as spreadsheet program.

The distribution parameters can be used in calculating the efficiency of proposed solutions in facilities design (formula (3)). The use of this measure generally assumes a normal or nearly normal distribution. Some experiments on the appraximate form of the cost distribution are reported in [H], [WHT] and [KKN2]. These experiments yielded distributions that were mainly normal, lognormal, two parameter gamma distri- butions and, alternatively, three parameter Weibull distributions. For these distributions a choice of level of standard r = 2 or 3 is appropriate. However, as noted in [WHT] there are reported cases of solutions lying as much as 9 (d. [NVR D and 6.13 (cf. [H]) standard deviations below average cost. Even with these examples, the efficiency using the level of standard r = 2 or 3 is still particularly useful in comparison of the efficiency of an assignment before and after plant design parameters have been altered or as a control variable in situations where there is a frequent change in production flows but it is difficult to seek or implement a new high--e~ciency layout after every production change. For example the second author [C] has used our method of calculating efficiency to compare the quality of a solution provided by CRAFT and the quality of the same solution after the plant layout provided by CRAFT had been "smoothed out", using software developed by him.

Table 1: Standard Deviations and Running Times

Number of Computed Standard Deviation Running Time Reference Facilities from literature current algorithm (seconds)

3 2.494446 2.494438 0.0109 [KKN2] 4 5.497447 5.497474 0.0219 [KKN2] 5 6.980947 6.980926 0.0329 [KKN2] 6 11.809590 11.809601 0.0439 [KKN2] 7 10.793770 10.793884 0.0549 [KKN2] 8 11.877200 11.877149 0.0659 [KKN2]

12 22.885840 22.886221 0.1482 [KKN2] 15 108.717400 108.782082 0.2197 [K K N2] 20 146.182000 146.390600 0.3844 [KKN2] 30 N/A 212.564703 0.8403 [NVR l 40 N/A N/A 1.4736 (predicted)

REFERENCES

lAB] G. C. A r m o u r &~ E. S. Buffa A Heuristic Algorithm and Simulation Approach to the Relative Location of Facilities, Mgmt. Sci. Vol. 9 (1963), pp. 294-309.

[B-C] A. Broughton £r V. Charumongkol, Mean and Standard Deviation of the Layout Cost Dis- tribution in F~cilities Design Problems, preprint, Cleveland State Univ.

[C] V. Charumongkol, Interactive Microcomputer Graphic Methods for Smoothing CRAFT Lay- outs, Ph.D. thesis, Cleveland State Univ. (1990).

Page 5: Mean and standard deviation for the facilities design problem

Broughton and Charumongkol: Facilities Design Problem 317

[BAV]

[DS]

[C]

[HI

[HC]

[KKNI]

[KKN2]

[MB]

[NVR]

[ss]

[WHT]

E. S. Buffa, G. C. Armour ~r T. E. Vollmann, Allocating F~ilities with CRAFT, Harvard Business Rev., Vol. 42 (1964), pp. 136-158.

K.N. Du t t a & K. Sahu, Some Studies on Distribution Parameters for Facilities Design Prob- lems, Int. J. Prod. Res., Vol. 19, No. 2 (1981), pp. 725-736.

P.C. Gilmore, A Solution to the Module Placement Problem IBM Research Report RC--430, April 26, 1961.

G.G. Hitehings, Analysis and Development of Techniques for Improving the Layout of Plant and Equipment, Ph.D. thesis, Univ. of Wales, Cardiff (1973).

F.S. Hillier & M.M. Connors, Quadratic Assignment Problem Algorithms and the Location of Indivisible Facilities, Mgmt. Sci., Vol. 13 (1966), pp. 42-57.

V.K. Khare , M.K. Khare , & M.L. Neema, Computer Aided Estimation of Distribution Parameters for Facilities Design Problems, Proc. XXVIII National Convention, IIIE.

V.K. Khare , M.K. Khare , & M.L. Neema, Estimation of Distribution Parameters Associated with Facilities Design Problems Involving Forward and Backtracking of Materials, Computers Ind. Engng., Vol. 14, No. 1 (1988), pp. 63-75.

P.B. Mahapatra & D. S. Bedi, FALSA--Facilities Allocation by Statistical Analysis Int. J. Prod. Res. Vol. 22 (1984), pp. 173-181.

C.E. Nugent , T.E. Vollmann, & J. Rural, An Experimental Comparison of Techniques for the Assignment of Facilities to Locations, Ops. Res., Vol. 16 (1968), pp. 150-173.

S. Sahu ~r K. Sahu, On the Estimation of Parameters for Distributions Associated with the Facilities Design Problem, Int. J. Prod. Res., Vol. 17, No. 2 (1979), pp. 137-142.

H. Wallace, G.G. Hitchlngs & D.R. Towill, Parameter Estimation for Distributions Asso- ciated with the Facilities Design Problem, Int. J. Prod. Res., Vol. 14, No. 2 (1976), pp. 263-274.