ecological investigation of the ganges river using principal component analysis
TRANSCRIPT
Environmental Quality Management / DOI 10.1002/tqem / Autumn 2009 / 61
© 2009 Wiley Periodicals, Inc.Published online in Wiley InterScience (www.interscience.wiley.com).DOI: 10.1002/tqem.20236
The study discussed
here used multi-
variate statistical
approaches to in-
terpret a large and
complex data ma-
trix obtained dur-
ing monitoring of
the Ganges River in
Varanasi, India. The study analyzed 18 physi-
cochemical and bacteriological variables in water
samples collected every three months for two
years from six sampling sites at locations where
the river has been affected by human-induced
and seasonal influences.
Principal component analysis (PCA) was ap-
plied to the data set to extract the parameters
that are most important for assessing variation in
water quality. Four principal factors were identi-
fied as being responsible for the data structure,
collectively explaining over 80 percent of the
total variance in the water-quality data. These
factors were nutrients (39.3 percent), sewage
and fecal contamination (29.4 percent), physico-
chemical sources of variability (6.3 percent), and
wastewater pollution from industrial and organic
loads (5.8 percent).
The study discussed here suggests that PCA
techniques offer a useful tool for identifying
important surface-
water-quality pa-
rameters.
Background: Understanding Surface-Water Quality
Impairment of Surface-Water QualityThe quality of surface water is an issue of sig-
nificant environmental concern worldwide. Nat-
ural processes (such as changes in precipitation
inputs, soil erosion, and weathering of crustal
materials) as well as anthropogenic influences
(including urban, industrial, and agricultural
activities and increasing consumption of water
resources) degrade surface waters and impair their
suitability for drinking.
Industrial wastewater discharges and runoff
from agricultural land contribute significantly to
water pollution (Singh, Malik, & Sinha, 2005).
Urban and industrial effluents are major sources
of the chemicals and nutrients found in aquatic
Archana Mishra and
Brahma Dutt Tripathi
Ecological Investigation of the Ganges River Using Principal Component Analysis
Identifying the principal factors
responsible for water-quality
variance
Archana Mishra and Brahma Dutt Tripathi62 / Autumn 2009 / Environmental Quality Management / DOI 10.1002/tqem
Kaufman, 1988; Wenning & Erickson, 1994), such
as PCA. Papers published in Analytical Chemistry
indicate the importance of multivariate statistical
tools in the treatment of analytical and environ-
mental data (Brown, Blank, Sum, & Weyer, 1994;
Brown, Sum, Despagne, & Lavine, 1996).
Principal component analysis can be used for
dimensionality reduction in a data set by retain-
ing those characteristics of the data that con-
tribute most to its variance, keeping lower-order
principal components and ignoring higher-order
ones. It is very useful in the analysis of data cor-
responding to a large number of variables.
PCA has been widely used because it offers
an unbiased method that can indicate associa-
tions between samples and variables (Wenning
& Erickson, 1994). It reduces the dimensionality
of data by explaining correlations among a large
group of parameters in terms of a smaller number
of underlying factors or “principal components”
(Jackson, 1991; Meglen, 1992).
Using PCA to Interpret Water-Quality Parameters
In recent years, many studies have been done
using principal component analysis to interpret
water-quality parameters (Lohani, 1984).
A number of researchers have applied PCA
techniques to the study of groundwater quality
(Gangopadhyay, Das Gupta, & Nachabe, 2001;
Winter, Mallory, Allen, & Rosenberry, 2000). PCA
has been successfully applied to sort out hydro-
geological and hydrogeochemical processes from
commonly collected groundwater-quality data
(Jayakumar & Siraz, 1997; Praus, 2005; Salman &
Abu Ruka’h, 1999).
Iyer, Sindhu, Kulkarni, Tambe, and Kulkarni
(2003) have constructed a PCA-based statistical
model for coastal water-quality data using infor-
mation from the Cochin coast in southwest India.
The model explains the relationship among the
various physicochemical variables that have been
ecosystems. High concentrations of toxic chemi-
cals and excessive levels of biologically available
nutrients can create a range of problems, includ-
ing toxic algal blooms and fish kills from lack
of oxygen, loss of aquatic plant beds and coral
reefs, and an overall diminishment of biodiver-
sity (Voutsa, Manoli, Samara, Sofoniou, & Stratis,
2001).
Effective Monitoring of Surface-Water Quality Prevention of river pollution requires effective
monitoring of physicochemical and microbiolog-
ical parameters (Bonde, 1977; Ramteke, Pathak,
Bhattacherjee, Gopal, & Mathur, 1994). Levels of
dissolved oxygen (DO)
and biological oxygen
demand (BOD) typi-
cally are used to indi-
cate the pollution sta-
tus of aquatic systems.
These parameters are
not always sufficient
(Voznaya, 1981). The
overall quality of water
can only be indicated by its physical, chemical,
and biological properties, along with analysis of a
large number of measured variables (Boyacioglu,
2006).
Long-term surveys and monitoring programs
for water quality provide better knowledge re-
garding river hydrochemistry and pollution, but
they produce large sets of data that are often dif-
ficult to interpret (Dixon & Chiswell, 1996).
Principal Component Analysis
Making Data More Manageable The challenges of data reduction and inter-
preting multiconstituent chemical and physical
measurements can be addressed through the ap-
plication of multivariate statistical analysis meth-
ods (Massart, Vandeginste, Deming, Michotte, &
The challenges of data reduction and interpreting multiconstituent chemical and physical measurements can be addressed through the application of multivariate statistical analysis methods, such as PCA.
Environmental Quality Management / DOI 10.1002/tqem / Autumn 2009 / 63Ecological Investigation of the Ganges River
liters per day (MLD) of domestic sewage and 80
MLD of untreated sewage and industrial effluents
(along with excreta from humans and various
warm-blooded animals) are directly or indirectly
discharged into the Ganges River, adversely af-
fecting the physicochemical and biological qual-
ity of the river’s water.
Monitoring SitesThe following six sites were selected for river-
water-quality monitoring:
Samne Ghat (Site 1), •
Assi Ghat (Site 2), •
Shiwala Ghat (Site 3), •
Harischandra Ghat (Site 4), •
Dashwamedh Ghat (Site 5), and •
Raj Ghat (Site 6). •
Each site was con-
sidered to be reason-
ably representative
of the river system’s
water quality.
The first site, which
receives much of the
sewage from the town,
is the most polluted.
Sites 2, 3, 4, and 5 fall within the midstream re-
gion. Site 6 is located upstream of Varanasi city,
in an area of relatively low river-water pollution.
Sampling and AnalysisSamples were taken at the selected sites every
three months for a period of two years (2005–
2007). Samples were collected across the width of
the river at all six sites with a view toward moni-
toring changes caused by anthropogenic sources.
Water sampling, sample preservation, and trans-
portation of samples to the laboratory were all
carried out in accordance with standard methods
(Eaton, Clesceri, & Greenberg, 1998).
monitored and the environmental conditions
that affect coastal water quality.
PCA techniques have been used to estimate
spatial and temporal patterns of heavy-metal
contamination (Shine et al., 1995) and to inves-
tigate nutrient gradients within a eutrophic res-
ervoir (Perkins & Underwood, 2000). Tauler, Bar-
celo, and Thurman (2000) used PCA to identify
the composition of the major herbicides causing
observed data variations in a water body.
Study Materials and MethodsThe research discussed here studied water
quality in the Ganges River using multivariate
principal component analysis to interpret and
extract the parameters that are most important
to assessing variations in the quality of the river’s
water.
Study AreaThe particular area studied in this research
covered the urban fringe of Varanasi, a city
situated in the eastern Gangetic plain of northern
India (from 82°15’E to 84°30’E and from 24°35’N
to 25°30’N).
The Ganges and its tributaries drain a large
(about one-million square kilometers) and fertile
basin that supports one of the world’s highest-
density human populations. Almost half the
population of India proper lives on one-third of
the land within 500 kilometers of the Himalayan
range along the Gangetic plains.
The Ganges, one of the most sacred rivers in
India, is being polluted by many sources. There
are 29 cities, 70 towns, and thousands of villages
along the banks of the river. All the sewage from
these municipalities (over 1.3 billion liters per
day) goes directly into the Ganges River.
The main sources of river pollution at Vara-
nasi are industrial effluents, domestic sewage,
and cremation of dead bodies (Tripathi, Sikan-
dar, & Shukla, 1991). At Varanasi, 190 million
The Ganges and its tributaries drain a large (about one-million square kilometers) and fertile basin that
supports one of the world’s highest-density human populations.
Archana Mishra and Brahma Dutt Tripathi64 / Autumn 2009 / Environmental Quality Management / DOI 10.1002/tqem
Coliform were detected by presumptive inocula-
tion into tubes of MacConkey broth, and their
incubation at 37±2°C for 48-hour gram charac-
teristics was also observed by gram staining. The
most probable number (MPN) of coliform per 100
milliliters (mL) of water was determined using
standards tubes.
For confirmation of indicator bacterial species,
other tests—such as IMViC, fermentation, Voges-
Proskauer, nitrate reductase, oxidase, catalase,
citrate, and hydrogen sulfide (H2S) testing—were
performed using specific media and indicators
(Eaton et al., 1998; Sirockin & Cullimore, 1969).
Data TreatmentExhibit 1 shows the parameters used in the
present study, along with their abbreviations and
measurement units.
Because water-quality parameters involve dif-
ferent magnitudes and scales of measurement,
the data must be standardized to produce a nor-
mal distribution of all variables (Davis & Samp-
son, 1973). In the present study, we reduced the
All samples were transported to the labora-
tory in cold packs and were analyzed within
seven hours of collection. The pH of the water
samples was determined by a portable pH meter
at the collection site immediately after sampling
since biological and chemical reactions between
the atmosphere and samples can readily alter pH
(Hutton, 1983).
Eighteen physicochemical and bacteriologi-
cal parameters were determined using prescribed
standard methods. A total of 480 analyses were
carried out (18 variables in 16 samples).
Enumerating the Bacterial PopulationFor bacterial analysis, samples were collected
in sterile bottles at each site and were kept packed
in cold ice in cooler boxes in the field. The sam-
ples were returned to the laboratory for analysis
as soon as possible. We used Himedia Laborato-
ries for bacterial analysis.
For members of the coliform group, qualita-
tive analysis was carried out with multiple-tube
fermentation techniques (Eaton et al., 1998).
Exhibit 1. Water-Quality Parameters Used in This Study, with Associated Abbreviations and Units
Parameter Abbreviation UnitpH pHTemperature Temp °CElectricity Conductivity EC µmho/cmDissolved Oxygen DO mg/LTransparency Trans CmChloride Cl mg/LAcidity Aci mg/LAlkalinity Alk mg/LNitrate NO3 mg/LPhosphate PO4 mg/LBiological Oxygen Demand BOD mg/LChemical Oxygen Demand COD mg/LTotal Bacterial Density TBD ×103 LTotal Coliform TC ×103/100 mLFecal Coliform FC ×103/100 mLFecal streptococci FS 100 mLEscherichia coli EC ×103/100 mLClostridium perfringens CP 100 mL
Environmental Quality Management / DOI 10.1002/tqem / Autumn 2009 / 65Ecological Investigation of the Ganges River
components and their initial eigenvalues, along
with the percentage of variance contributed by
each component.
Exhibit 5 shows the Scree plot of the eigen-
values for each component. Four principal com-
ponents cumulatively account for over 80 percent
of the total variance in the water-quality data set.
The Scree plot shows a pronounced change of
slope after the third eigenvalue (Cattell & Jaspers,
1967).
The first four components clearly are the most
significant, representing most of the water-qual-
ity variance in the Ganges River at Varanasi. The
variance percentages contributed by these four
principal components
are 39.3 percent (PC1),
29.4 percent (PC2), 6.3
percent (PC3), and 5.8
percent (PC4).
Component load-
ings (correlation coef-
ficients) measure the
degree of closeness
between the variables
and the PC. The largest loading, either positive
or negative, “suggests the meaning of the dimen-
sion: positive loading indicates that the contribu-
tion of the variables increases with the increasing
loading in a dimension; and negative loading
indicates a decrease” (Jayakumar & Siraz, 1997).
In general, component loadings larger than
0.45 may be taken into consideration. In this
case, the most significant variables (represented
by high loadings) have been considered in evalu-
ating the components (Mazlum, Ozer, & Mazlum,
1999).
Component loading and communalities for
each variable included in the four selected com-
ponents are shown in Exhibit 6 (before varimax
rotation) and Exhibit 7 (after varimax rotation).
Communalities provide an index of the efficiency
of the reduced set of components and show each
dimensionality of the data set while minimizing
the loss of information.
We converted the raw data collected into a
unitless form with a zero mean and a variance
of one. This was accomplished by subtracting
the mean of the data set from each variable and
dividing by the standard deviation. We extracted
the initial factor solution from the standardized
covariance or correlation matrix of the data with
multivariate principal components extraction.
Diagonalization of the correlation matrix
transforms the original p-correlated variables
into p-uncorrelated (orthogonal) variables called
principal components (PCs), which are weighted
linear combinations of the original variables
(Meglen, 1992; Mellinger, 1987; Wenning & Er-
ickson, 1994). The characteristic roots (eigenval-
ues) of the PCs are a measure of their associated
variances, and the sum of eigenvalues coincides
with the total number of variables (Vega, Pardo,
Barrado, & Deban, 1998).
Study Results and Discussion Exhibit 2 shows the correlation matrix for
the water-quality parameters obtained with PCA.
Only a few parameters had significant correlation
relationships. High and positive correlations (r =
0.55 to 0.942) can be observed between pH, BOD,
chemical oxygen demand, temperature, and vari-
ous bacterial populations that are responsible for
fecal contamination in the river. BOD is strongly
correlated with nitrate (0.751) and phosphate
(0.552), which indicates contamination with or-
ganic matter.
DO shows a negative correlation with tem-
perature and pH because the solubility of oxygen
decreases as the water temperature rises and
organic matter is partially oxidized. A seasonal
fluctuation seems to be responsible for this type
of correlation.
Exhibit 3 summarizes the descriptive statis-
tics of the study’s data set. Exhibit 4 shows the
Component loadings (correlation coefficients) measure the degree
of closeness between the variables and the PC.
Archana Mishra and Brahma Dutt Tripathi66 / Autumn 2009 / Environmental Quality Management / DOI 10.1002/tqem
Exhi
bit 2
. Cor
rela
tion
Mat
rix
pHTe
mp
ECDO
Tran
sCl
Aci
Alk
NO3
PO4
BOD
COD
TBD
TCFC
FSEC
CP
pH1.
000
Tem
p.4
271.
000
EC
.154
–.20
91.
000
DO
–.54
7–.
261
–.30
01.
000
Tra
ns–.
441
–.67
4–.
170
.508
1.00
0
Cl
.308
–.07
4.6
68–.
605
–.16
71.
000
Aci
–.44
8.2
61.6
35–.
603
–.60
5.6
541.
000
Alk
.544
.192
.647
–.49
0–.
472
.730
–.83
81.
000
NO
3.4
54.2
62.6
25–.
717
–.70
3.7
11.8
02.7
871.
000
PO
4.3
85.1
86.6
03–.
580
–.40
9.8
46.7
39.8
07.7
521.
000
BO
D–.
581
–.24
7–.
425
–.79
4–.
480
–.57
5–.
724
–.64
1–.
751
–.55
21.
000
CO
D–.
014
–.03
4–.
060
–.09
7–.
133
–.12
5–.
061
–.15
1–.
025
–.06
1–.
101
1.00
0
TB
D.3
27.7
57–.
239
–.38
6–.
831
–.09
6.2
73.1
34.4
18.1
21.3
50.1
431.
000
TC
.053
.194
.028
–.15
4–.
306
–.12
2.0
19–.
017
.123
–.04
4.1
07.0
15.2
701.
000
FC
.037
.596
–.47
9–.
261
–.43
3–.
094
–.02
9–.
168
.108
–.03
5.1
00.1
93.7
18.1
661.
000
FS
.221
.707
–.36
9–.
333
–.60
9–.
096
.107
–.01
7.2
45.0
58.2
44.2
50.8
35.2
20.8
891.
000
EC
.186
.701
–.41
2–.
354
–.52
7–.
060
.071
–.04
9.1
76.0
67.2
08.2
23.7
78.2
12.9
22.9
421.
000
CP
.070
.631
–.45
0–.
257
–.57
0–.
142
.042
–.09
5.1
93.0
10.1
17.2
54.7
95.1
69.9
41.9
22.8
841.
000
Environmental Quality Management / DOI 10.1002/tqem / Autumn 2009 / 67Ecological Investigation of the Ganges River
counted for 39.3 percent of the total variance.
It was positively correlated with electrical con-
ductivity (EC), chloride (Cl), acidity, alkalinity,
nitrate, phosphate, and BOD.
By contrast, DO showed a negative contribu-
tion to this variance. This can be explained by the
fact that a large amount of dissolved organic mat-
ter consumes significant amounts of oxygen. Or-
ganic matter in urban wastewater consists mainly
of carbohydrates, proteins, and lipids. As the
amount of available dissolved oxygen decreases,
these constituents undergo anaerobic fermenta-
tion processes, leading to creation of ammonia
and organic acids.
Principal Component 2The second PC was highly loaded with total
bacterial density (TBD), fecal coliform (FC),
fecal streptococci (FS), Escherichia coli (EC), and
Clostridium perfringens (CP), which show the sew-
age discharge and fecal contamination.
Principal Component 3The third PC was weighted with pH and DO,
representing the physicochemical source of the
variability in water quality.
variable’s level of contribution to the four se-
lected components.
Principal Component 1The first PC (nutrients) represents influences
from nonpoint sources, such as agricultural run-
off and atmospheric deposition. This factor ac-
Exhibit 3. Descriptive Statistics
Mean Std. Deviation Analysis NpH 7.6366 .2109 216Temperature 27.8556 3.3657 216Electric Conductivity 449.1574 155.2798 216Dissolved Oxygen 3.9083 2.0174 216Transparency 23.8963 7.6336 216Chloride 33.8541 10.1813 216Acidity 16.3036 6.4029 216Alkalinity 117.9032 59.9250 216Nitrate .273505 .170281 216Phosphate .5850 .3666 216Biological Oxygen Demand 19.6032 11.1427 216Chemical Oxygen Demand 297.9977 1712.0381 216Total Bacterial Density 11492.3148 6724.4868 216Total Coliform 120890.8796 156003.7307 216Fecal Coliform 36368.7963 50166.8307 216Fecal Streptococci 1085.0620 1132.6995 216Escherichia coli 9116.2269 9144.6253 216Clostridium perfringens 8012.1343 10096.9555 216
Exhibit 4. Total Variance Explained
Component Initial Eigenvalues Total % of Variance Cumulative %1 7.074 39.298 39.2982 5.285 29.362 68.6593 1.127 6.260 74.9204 1.047 5.817 80.7365 .905 5.028 85.7656 .796 4.423 90.1877 .460 2.556 92.7438 .275 1.528 94.2719 .243 1.349 95.62010 .223 1.241 96.86111 .151 .837 97.69812 .110 .609 98.30713 8.431E-02 .468 98.77514 8.105E-02 .450 99.22515 5.080E-02 .282 99.50716 4.036E-02 .224 99.73217 3.487E-02 .194 99.92518 1.343E-02 7.462E-02 100.000Extraction Method: Principal Component Analysis. Note: When components are correlated, sums of squared loadings cannot be added to obtain a total variance.
Archana Mishra and Brahma Dutt Tripathi68 / Autumn 2009 / Environmental Quality Management / DOI 10.1002/tqem
Correlation Components Matrix Exhibit 8 shows the correlation components
matrix (component score covariance matrix) of the
four varimax-rotated PCs. It indicates that there are
no correlations among the four principal compo-
Principal Component 4The fourth PC was loaded with BOD, which
represents the organic load in wastewater from
domestic and industrial sources disposed in the
river at Varanasi.
Exhibit 5. Scree Plot
Exhibit 6. Component Matrix
Component 1 2 3 4
pH .592 –.188 .657 –.156Temperature .647 .473 –.277 –.216Electric Conductivity .270 –.818 –.025 .212Dissolved Oxygen –.774 .218 .873 –.097Transparency –.848 –.160 .256 –.123Chloride .515 –.679 .301 –.161Acidity .742 –.511 .016 .065Alkalinity .652 –.633 –.099 –.152Nitrate .833 –.421 .015 .087Phosphate .658 –.559 .159 –.160Biological Oxygen Demand .754 –.351 .053 .681Chemical Oxygen Demand .127 .224 .517 .133Total Bacterial Density .735 .557 –.169 .005Total Coliform .212 .188 –.591 .564Fecal Coliform .506 .763 .228 –.105Fecal Streptococci .650 .706 .101 –.030Escherichia coli .615 .711 .147 –.081Clostridium perfringens .565 .752 .180 –.045
Extraction Method: Principal Component Analysis. Note: Four components extracted.
Environmental Quality Management / DOI 10.1002/tqem / Autumn 2009 / 69Ecological Investigation of the Ganges River
extracted in this case represent four different
processes or sources of contamination affecting
the study area:
nutrients,•
sewage and fecal contamination,•
physicochemical variability, and •
wastewater pollution from domestic and in-•
dustrial sources, and its organic load.
Sources Bonde, G. J. (1977). Bacterial indication of water pollution. In M. R. Droop & H. W. Jannasch (Eds.), Advances in aquatic microbi-ology (pp. 273–364). London & New York: Academic Press.
Boyacioglu, H. (2006). Surface water quality assessment using factor analysis. Water SA, 32, 389–394.
Brown, S. D., Blank, T. B., Sum, S. T., & Weyer, L. G. (1994). Chemometrics. Analytical Chemistry, 66(12), 315R–359R.
Brown, S. D., Sum, S. T., Despagne, F., & Lavine, B. K. (1996). Chemometrics. Analytical Chemistry, 68(12), 21–62.
Cattell, R. B., & Jaspers, J. (1967). A general plasmode (No. 30-10-5-2) for factor analytic exercises and research. Multivariate Behavioral Research Monographs, No. 67.
Davis, J. C., & Sampson, R. J. (1973). Statistics and data analy-sis in geology. New York: Wiley.
Dixon, W., & Chiswell, B. (1996). Review of aquatic monitor-ing program design. Water Research, 30, 1935–1948.
Eaton, A. D., Clesceri, L. S., & Greenberg, A. E. (1998). Standard
nents, meaning that each component represents a
discrete unit that is separate from the others.
ConclusionPrincipal component analysis is a powerful
“pattern recognition” approach that seeks to
explain the variance in a large set of intercorre-
lated variables by using a smaller set of indepen-
dent variables, or principal components (Hopke,
1985).
The study described here used PCA to inves-
tigate water quality in the Ganges River. Our re-
search identified the principal physical, chemical,
and bacteriological parameters that are important
in predicting surface-water quality. The four PCs
Exhibit 7. Rotated Component Matrix
Component 1 2 3 4pH .481 .189 .630 –.358Temperature .100 .731 .354 –.310Electric Conductivity .706 –.515 .155 –.032Dissolved Oxygen –.737 –.289 .765 –.073Transparency –.446 –.531 –.578 .099Chloride .886 –.112 –.195 –.082Acidity .867 .033 .240 –.074Alkalinity .852 –.077 .151 –.324Nitrate .876 .148 .297 –.049Phosphate .877 .033 –.029 –.160Biological Oxygen Demand .783 .150 .275 .025Chemical Oxygen Demand .077 .170 .098 .868Total Bacterial Density .137 .810 .447 –.066Total Coliform –.128 .007 .852 .083Fecal Coliform –.055 .936 .036 .142Fecal Streptococci .052 .934 .217 .102Escherichia coli .035 .939 .142 .094Clostridium perfringens –.018 .938 .126 .152Extraction Method: Principal Component Analysis. Rotation Method: Equamax with Kaiser Normalization. Note: Rotation converged in seven iterations.
Exhibit 8. Component Score Covariance Matrix
Component 1 2 3 41 1.000 .000 .000 .0002 .000 1.000 .000 .0003 .000 .000 1.000 .0004 .000 .000 .000 1.000
Extraction Method: Principal Component Analysis. Rotation Method: Equamax with Kaiser Normalization.
Archana Mishra and Brahma Dutt Tripathi70 / Autumn 2009 / Environmental Quality Management / DOI 10.1002/tqem
principal component analysis of hydrological data. Water SA, 31, 417–422.
Ramteke, P. W., Pathak, S. P., Bhattacherjee, J. W., Gopal, K., & Mathur, N. (1994). Evaluation of the presence-absence (P-A) test: A simplified bacteriological test for detecting coliforms in rural drinking water of India. Environmental Monitoring and Assessment, 33(1), 53–59.
Salman, S. R., & Abu Ruka’h, Y. H. (1999). Multivariate and principal component statistical analysis of contamination in urban and agricultural soils from north Jordan. Environmen-tal Geology, 38, 265–270.
Shine, J. P., Ika, R. V., & Ford, T. E. (1995). Multivariate statis-tical examination of spatial and temporal patterns of heavy metal contamination in New Bedford Harbor marine sedi-ments. Environmental Science & Technology, 29, 1781–1788.
Singh, K. P., Malik, A., & Sinha, S. (2005). Water quality as-sessment and apportionment of pollution sources of Gomti river (India) using multivariate statistical techniques—A case study. Analytica Chimica Acta, 538(1–2), 355–374.
Sirockin, G., & Cullimore, S. (1969). Practical microbiology. London: McGraw-Hill.
Tauler, R., Barcelo, D., & Thurman, E. M. (2000). Multivariate correlation between concentrations of selected herbicides and derivatives in outflows from selected US midwestern reser-voirs. Environmental Science & Technology, 34, 3307–3314.
Tripathi, B. D., Sikandar, M., & Shukla, S. C. (1991). Physico-chemical characterization of city sewage discharged into river Ganga at Varanasi, India. Environment International, 17, 469–478.
Vega, M., Pardo, R., Barrado, E., & Deban, L. (1998). Assessment of seasonal and polluting effects on the quality of river water by exploratory data analysis. Water Research, 32, 3581–3592.
Voutsa, D., Manoli, E., Samara, C., Sofoniou, M., & Stratis, I. (2001). A study of surface water quality in Macedonia, Greece: Speciation of nitrogen and phosphorus. Water, Air, & Soil Pol-lution, 129(1–4), 13–32.
Voznaya, N. F. (1981). Chemistry of water and microbiology. Moscow: Mir Publishers.
Water Research Commission. (1998). Quality of domestic water supplies, volume 1: Assessment guide (2nd ed.). Preto-ria, South Africa: Department of Water Affairs & Forestry.
Wenning, R. J., & Erickson, G. A. (1994). Interpretation and analysis of complex environmental data using chemometric methods. Trends in Analytical Chemistry, 13, 446–457.
Winter, T. C., Mallory, S. E., Allen, T. R., & Rosenberry, D. O. (2000). The use of principal component analysis for interpret-ing ground water hydrographs. Ground Water, 38, 234–246.
methods for the examination of water and wastewater (19th ed.). Washington, DC: American Public Health Association.
Gangopadhyay, S., Das Gupta, A., & Nachabe, M. H. (2001). Evaluation of ground water monitoring network by principal component analysis. Ground Water, 39(2), 181–191.
Hopke, P. K. (1985). Receptor modeling in environmental chemistry. New York: Wiley.
Hutton, L. G. (1983). Field testing of water in developing countries. Buckinghamshire, UK: Water Research Centre.
Iyer, C. S., Sindhu, M., Kulkarni, S. G., Tambe, S. S., & Kulkarni, B. D. (2003). Statistical analysis of the physico-chemical data on the coastal waters of Cochin. Journal of Environmental Monitoring, 5, 324–327.
Jackson, J. E. (1991). A user’s guide to principal components. New York: Wiley.
Jayakumar, R., & Siraz, L. (1997). Factor analysis in hydrogeo-chemistry of coastal aquifers—A preliminary study. Environ-mental Geology, 31(3-4), 174–177.
Khattab, M. F. (2003). Factor analysis of water quality I Hit-Cubaisa area, west of Iraq. Iraqi Journal for Earth Science, 4(1), 110–198.
Lohani, B. N. (1984). Environmental quality management (pp. 376–387). New Delhi: South Asian Publishers.
Massart, D. L., & Kaufman, L. (1983). The interpretation of analytical chemical data by the use of cluster analysis. New York: Wiley.
Massart, D. L., Vandeginste, B. G. M., Deming, S. M., Mi-chotte, Y., & Kaufman, L. (1988). Chemometrics: A textbook. Amsterdam: Elsevier.
Mazlum, N., Ozer, A., & Mazlum, S. (1999). Interpretation of water quality data by principal component analysis. Turkish Jour-nal of Engineering and Environmental Sciences, No. 23, 19–26.
Meglen, R. R. (1992). Examining large databases: A chemo-metric approach using principal component analysis. Marine Chemistry, 39(1–3), 217–237.
Mellinger, M. (1987). Multivariate data analysis: Its methods. Chemometrics and Intelligent Laboratory Systems, 2(1), 29–36.
Ouyang, Y., Higman, J., Thompson, J., O’Toole, T., & Camp-bell, D. (2002). Characterization and spatial distribution of heavy metals in sediment from Cedar and Ortega rivers sub-basin. Journal of Contaminant Hydrology, 54(1-2), 19–35.
Perkins, R. G., & Underwood, G. J. C. (2000). Gradients of chlorophyll a and water chemistry along an eutrophic res-ervoir with determination of the limiting nutrient by in situ nutrient addition. Water Research, 34, 713–724.
Praus, P. (2005). Water quality assessment using SVD-based
Archana Mishra pursues environmental research at the Department of Botany, Center of Advanced Study, Banaras Hindu University, in Varanasi, India. She can be reached by e-mail at [email protected].
Brahma Dutt Tripathi, PhD, is a professor in the Department of Botany and coordinator of the Center of Environmental Science Technology at the Center of Advanced Study, Banaras Hindu University, Varanasi, India. He can be reached by e-mail at [email protected].