Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Research
Author(s): Donna L. Hoffman and George R. Franke
Source: Journal of Marketing Research , Aug., 1986, Vol. 23, No. 3 (Aug., 1986), pp. 213-
227
Published by: Sage Publications, Inc. on behalf of American Marketing Association
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide
range of content in a trusted digital archive. We use information technology and tools to increase productivity and
facilitate new forms of scholarship. For more information about JSTOR, please contact support@jstor.org.
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
https://about.jstor.org/terms
American Marketing Association and Sage Publications, Inc. are collaborating with JSTOR to
digitize, preserve and extend access to Journal of Marketing Research
Marketing researchers often need to detect and inter- corresponding units, allowing all the variables to be
pret relationships among the variables in a rectangularplotted in the same space for ease of interpretation. This
data matrix. To facilitate this task, multidimensional representation then can be used to reveal the structure
scaling and unfolding, discriminant analysis, canonical and patterns inherent in the data. In this sense, corre-
correlation analysis, factor analysis, and principal com-
spondence analysis is in that class of methods known as
ponents analysis all have been used to represent graph-"exploratory data analysis" (cf. de Leeuw 1973; Heiser
ically the rows and/or columns of a data matrix. How-1981; Tukey 1977).
ever, these methods have little applicability to the Correspondence analysis has several features that con-
categorical data that arise in many marketing researchtribute to its usefulness to marketing researchers. Much
applications. The purpose of our article is to direct theof its value relates to its multivariate treatment of the
attention of the marketing community to correspondence data through the simultaneous consideration of multiple
analysis, a multivariate descriptive statistical method thatcategorical variables. The multivariate nature of corre-
represents graphically the rows and columns of a cate-spondence analysis can reveal relationships that would
gorical data matrix in the same low-dimensional space.not be detected in a series of pairwise comparisons of
In correspondence analysis, numerical scores are as- variables. Correspondence analysis also helps to show
signed to the rows and columns of a data matrix so ashow variables are related, not just that a relationship ex-
to maximize their interrelationship. The scores are inists. The joint graphical display obtained from a corre-
spondence analysis can help in detecting structural re-
lationships among the variable categories. Finally,
correspondence analysis has highly flexible data require-
*Donna L. Hoffman is Assistant Professor, Graduate School of ments. The only strict data requirement for a correspon-
Business, Columbia University. George R. Franke is Assistant Pro-
dence analysis is a rectangular data matrix with non-neg-
fessor, Department of Advertising, The University of Texas, Austin.
The authors thank William Moore, Donald Morrison, Thomas No-ative entries. Thus, the researcher can gather suitable data
vak, William Perreault, and two anonymous JMR reviewers for theirquickly and easily.
helpful comments on a draft of this article. A longer version of the A distinct advantage of correspondence analysis over
article with a technical appendix is available from the first author.other methods yielding joint graphical displays is that it
Professor Hoffman gratefully acknowledges support from the Faculty
Research Fund of the Graduate School of Business, Columbia Uni- produces two dual displays whose row and column ge-
versity. ometries have similar interpretations, facilitating analy-
sis and detection of relationships. In other multivariate
213
approaches to graphical data representation, this duality fering. Correspondence analysis of this consumers by
is not present. product features matrix affords guidelines for appropri-
Correspondence analysis as a geometric approach to ate segmentation bases and potential marketing mix
multivariate descriptive data analysis originated in France; strategies. The method can be applied also in the con-
Benzecri (1969, 1973a, b) and his colleagues have done cept-testing phase when several concepts are competing
much to popularize the technique. The term "correspon- for developmental funds. Analysis of the concepts by
dence analysis" is a translation of the French "analyse attributes matrix can indicate those concepts that have
factorielle des correspondances." The technique has re- the most favorable profiles and, consequently, should be
ceived considerable attention in the statistical and psy- developed further.
chometric literature under a variety of names, including In the next two sections we use an artificial example
dual scaling, method of reciprocal averages, optimal to describe the theory behind the method of correspon-
scaling, canonical analysis of contingency tables, cate- dence analysis. Appropriate types of data for its use and
gorical discriminant analysis, homogeneity analysis, guidelines for interpretations also are discussed. We then
quantification of qualitative data, and simultaneous lin- illustrate correspondence analysis with an example that
ear regression. Complete histories of correspondence empirically demonstrates practical data considerations and
analysis are given by de Leeuw (1973), Greenacre (1984), issues of interpretation. The relationship of correspon-
and Nishisato (1980). dence analysis to other multivariate methods is exam-
Though very few applications of correspondence anal- ined. In the concluding section we discuss the issues of
ysis have been reported in the marketing literature, in- supplementary variables and outliers, provide some cau-
terest is increasing. Levine's (1979) procedure for the tions to the researcher, and comment on implementation.
analysis of "pick-any" data, which is related closely to
THE METHOD OF CORRESPONDENCE ANALYSIS
correspondence analysis, has been discussed by Hol-
brook, Moore, and Winer (1982). Green et al. (1983) An Artificial Example
use correspondence analysis in a cross-national exami-
In many marketing research applications, the data col-
nation of family purchasing roles. Franke (1983) illus-
lected are categorical, mainly because of the limitatio
trates the use of "dual scaling" with a reanalysis of data
and constraints imposed on the data collection proces
from a study by Belk, Painter, and Semenik (1981) on
For example, a researcher may be interested in the r
perceived causes of and preferred solutions to the energy
lationship between several brands in a product class a
crisis. Franke (1985) also discusses the use of dual scal-
a variety of attributes believed to describe the brand
ing in examining measurement-level assumptions and in-
Frequently the researcher gives consumers a list of brand
terpreting responses to a measure. Additionally, Benzecri and asks them to check off the attributes that describe
(1973b) describes two marketing-oriented applications
the brands, rather than asking them to rate each brand
of correspondence analysis, one evaluating competing
on a scale. The advantages of this common data collec-
cigarette brands and the other selecting a name for a new
tion process are that it is quicker, easier, and less ex-
brand of cigarettes.
pensive than obtaining rating scale (i.e., interval-level)
There is virtually no limit to the number of marketing data.
applications for correspondence analysis. In the devel-
As an example, suppose data were collected from 100
opment of market segments, for example, correspon-
consumers on three brands, and six attributes were hy-
dence analysis could be used to detect relatively ho-
pothesized to describe those brands. For each brand, re-
mogeneous groupings of individuals. Correspondence
spondents indicate whether the attribute describes the
analysis also can aid in product positioning studies. For
brand. The data generated from such a procedure might
example, suppose interest centers on consumer percep-
be arrayed as in Table 1. We have calculated, by sub-
tions of brands as a basis for positioning a particular brand.
traction, the "no" category for each attribute (we explain
Correspondence analysis of the categorical brands by at-
why subsequently). Twenty-nine percent of the respon-
tributes matrix gives information on the positioning of
dents indicated that attribute 1 described brand A, 20%
each brand vis-a-vis the attributes selected to describe
said that attribute 2 described brand A, and so on. These
them.
data, originally zeros and ones, have been aggregated
Correspondence analysis has been used to monitor the
over individuals and proportions calculated.
efficiency of advertising campaigns in France (Marc
Suppose we are interested in the following questions.
1973). Before the ad campaign, a study is carried out to
monitor advertising efficiency. After the campaign, an- 1. What are the similarities and differences among the three
other study is conducted. Together, the results of these brands with respect to the six attributes?
2. What are the similarities and differences among the six
studies reveal movement in product positioning attrib-
attributes with respect to the three brands?
utable to the advertising campaign.
3. What is the relationship among the brands and attributes?
The method also may prove useful in the design phase
4. Can these relationships be represented graphically in a
of the new-product development process. Suppose a new- joint low-dimensional space?
product manager gathers (binary) endorsements of con-
sumers on a variety of proposed features of a new of- To answer these questions, we present the method of
Table 1
ARTIFICIAL DATA ON THREE BRANDS AND SIX ATTRIBUTES FROM 100 INDIVIDUALS
Attribute
1 2 3 4 5 6 Row
Brand Yes No Yes No Yes No Yes No Yes No Yes No mass
A 29a 71 20 80 18 82 24 76 20 80 13 87
(016b 039 011 044 010 045 013 042 011 044 007 048) 333
B 26 74 15 85 25 75 30 70 10 90 34 66
(014 041 008 047 014 041 016 039 005 050 019 036) 333
C 25 75 26 74 31 69 21 79 15 85 24 76
(014 041 014 041 017 038 011 044 008 047 013 042) 333
Column
mass 044 122 034 132 042 124 041 125 024 142 040 126
gravity of each set of points are both at the origin of thethe columns of G are the eigenvectors of CR.
principal axes.
(14) (D,-PD-'P')F = FDx
The principal coordinates (cf. Gower 1966) of the brand
and attribute category profiles, with respect to their prin-and
cipal axes, are written in the rows of F and G, respec-
(15) (D 'P'D1'P)G = GDx
tively.
(7) F = (D'P - lc')DN'lN The eigenvalues, X,, are the weighted variances of each
principal axis (the weighted sums of squares of the points'
and coordinates along the tt principal axis in each set) and
are equal to the corresponding squared basic values from
(8) G = (D'P' - lr')Dr'M
the SVD in equation 11.
The set of points defined in equation 7 are the n row
(16) F'D,F = D2 = DA
profiles in weighted Euclidean k-dimensional space, with
masses defined by the n elements of r and principal axis and
weights defined by the inverses of the elements of c, that
G'DCG = D2= = Dk
is, Dc'. A similar definition holds for the column set of (17)
points defined in equation 8. These are the p column The axes are orthogonal, though the metric is 'chi square"
profiles in weighted Euclidean k-dimensional space, with and not ordinary Euclidean as in principal components
masses defined by the p elements of c and principal axis analysis.
weights defined by the inverses of the elements of r, that The transition formulas relate the brand and attribute
is, D;1. Thus, the principal axes are weighted inversely category coordinates to each other.
by the elements of the average profile.
Each set of points can be related to the principal axes (18) F = DT'PGD,1
r IL= ~
RGD-'
of the other set of profile points through rescalings by and
the basic values.
(19) G = D-'P'FD-'
c rJ
= CFD-'
L
(9) F = Dr-'MD,
Hill (1974) co
and
corresponden
(10) G = D'lNDD, The transitio
portant becau
In practice, the correspondence analysis problem is ing one set of
restated in an equivalent form in terms of the SVD for geometric im
computational convenience. row of F, f'.
(11) Dr'/2(P - rc')D'1/2 = MD,N' (20) f;= (frG)D,'l
where M'M = N'N = I, with M n x k, Np x k, D
where r' is the i throw profile from R in equation 4.
k x k, and Il > ... -> L, t ... J'k > 0. Then,
Equation 20 defines a "barycenter," actually a center of
(12) F = Dr1/2MD,, where M = Dr'M, mass, of the p column profile points in G, because the
and
sum of the elements of ri equals unity. Postmultiplica-
tion by D', divides the coordinates of the centroid by
(13) G = D-1/2NDs, where N = D-'N, the singular values. Geometrically, a particular row pro-
file will be "attracted" to a position in its subspace that
and plotting the rows of F and G in the same space re- corresponds to the column variable categories prominent
sults in a k-dimensional correspondence analysis. in that row profile. A corresponding definition and inter-
Correspondence analysis can be considered a dual pretation holds for the rows of G.
generalized principal components analysis (Greenacre Distances (squared) between points in the same set are
1984). The columns of F are the eigenvectors of RC and given by
Euclidean distances except that each squared term is Correspondence analysis is appropriate for such data,
weighted by the inverse of the relative frequency (mass) whereas standard multidimensional scaling methods are
corresponding to the term. not (Holbrook, Moore, and Winer 1982).
These distances, approximated in the k-dimensional Though correspondence analysis is ideally suited to
subspaces by those research situations in which categorical measure-
ments are the most reasonably obtained, it also can be
(23) d2, - (fi - fi,)'(fi - fi)
applied to ordered categories and "discretized" quanti-
and tative variables (see Jambu and Lebeaux 1983), but the
original ordering may not be maintained after scaling un-
(24) d>, - (gj - gj,)'(gj - gj,) less the solution is constrained (Nishisato and Sheu 1984).
for row and column points, respectively, are defined as This type of application allows investigation of possible
chi square distances. This distance measure is chosen nonlinearities among the categories with respect to the
because it guarantees invariance according to the prop- principal axes. It can lead to the discovery of relation-
erty of distributional equivalence: ships between scale value categories that are obscured if
the data are dichotomized, or if methods are used that
-If two rows having identical column profiles are aggre-
recognize only the metric properties of the data. Thus,
gated, the distances between columns remain unchanged.
a "loss of information" in ignoring the ordered or inter-
-If two columns having identical row profiles are aggre-
gated, the distances between rows remain unchanged. val nature of the data yields a meaningful gain in un-
derstanding (Lebart, Morineau, Warwick 1984).
Clearly, identical profiles imply equal or proportional raw Correspondence analysis of other forms of data, such
data. as rank-order data, sorting data, paired comparison data,
and successive categories data, is discussed by Jambu
The Correspondence Analysis Model and Lebeaux (1983), Nishisato (1980), Nishisato and
The correspondence analysis "model" on P in k di- Nishisato (1983), and Nishisato and Sheu (1984). Ap-
mensions reveals how an element of P is approximated plications of correspondence analysis are virtually un-
in the k-dimensional weighted Euclidean subspace. limited, but Lebart, Morineau, and Warwick (1984)
P - re' + DrFD,'G'Dc suggest three conditions that should be satisfied if corre-
(25)
spondence analysis is to be most effective.
From equation 25 it is clear that the model treats rows 1. The data matrix must be large enough that visual in-
and columns symmetrically, as nothing changes if we spection or simple statistical analysis cannot reveal its
begin with X' instead of X. structure.
Because of the inherent symmetry of correspondence columns and so that distances can be interpreted mean-
ingfully.
analysis, the implied data matrix for analysis is a con-
3. The data matrix must be "amorphous, a priori." In other
tingency table. However, the method can be applied to words, the method is most fruitfully applied to data whose
almost any matrix of categorical data, as long as the en- structure is either unknown or only poorly understood.
tries are non-negative. Excellent data classifications for
correspondence analysis are given by Benzecri (1973b), INTERPRETING A CORRESPONDENCE ANALYSIS
Nishisato (1980), and Greenacre (1984).
The principal coordinates of the brand and attribute
Many situations in marketing research lead to datacategory
at profile points from the correspondence analysis
the nominal or ordinal level of measurement (Perreault in two dimensions of the artificial data of Table 1 are
and Young 1980). Such data are often intractable with plotted in Figure 1. The plots are merged into one joint
traditional analytical methods. A common source of this display for ease of interpretation.
type of data is the evaluation of objects (e.g., retail out- The overall spatial variation in each set of points can
lets, competing products, individuals) on attributes (e.g.,be quantified and assists in interpretation. This variation,
product features, attitude statements) with binary judg- the total inertia, is defined as the weighted sum of squared
ments rather than 5- or 7-point rating scales. Binary
distances from the points to their respective centroids and
judgments are useful when the researcher has many ob-is equivalent for both sets of points.
jects or attributes to measure, when respondent coop-
eration is difficult to obtain, when it is difficult to make
(26) Inertia (Total)= E (P- ri )2
fine distinctions between objects on the attributes, and i rici
whenever rating scales are difficult to use.
Another source of data common in marketing research (27) Inertia (rows) = 3 ri 3 l
is the open-ended elicitation of attributes, brands, stores, i -
(A2'.00426)
(28.4%)
The first dimension separates attri
from attribute 5+ on the left, and
4+
B on the right from brand A on
dimension separates attributes 2+ a
Brand A o
3-
2- Brand B
from attribute 4+ on the top. This
O 0
tion, differentiates brand C on the b
5+ 6- 1+
I 0 5-
II a&~l A and B on the top.
4- D
o 1- The transition formulas in equati
clear that, geometrically, a particular
Brand C a position in its space corresponding
egories prominent in that brand pro
3+
2+
(28) Inertia (columns) =
I the display of brand profiles, a parti
egory will tend along the principal a
of the brands that are relatively sub
gory. For example, the attribute cate
the negative side of the second princ
C, which is relatively high on att
(28) Inertia (columns) = E cj l/ri(Pi/cj - ri)2 1), is on the negative side of its se
_ - i
Points near the center of the displ
tiated profile distributions as a conse
= cj(c - r)'Dr-'(j - r).
placed at the center of gravity. Notic
careful not to interpret between-set
It is because of the geometric Thecorrespondence
interpretation of of thethe
corresp
two sets of points, in position and inertia, that we can
not yet complete. The two-dimension
merge the two displays into 1 one
shows joint display. The
the projections ad- poi
of the
vantage of this merger is that a concise
plane, graphical
but does display
not indicate which
representing varied features of the
most data
impact in is obtained the
determining in aorie
single picture. The geometric For display of each
a complete and set of points
correct interpreta
reveals the nature of similarities andwe
display, variation
must use within the inf
additional
set, and the joint display shows the the
Because correspondence
total inertia ofbe- each s
tween sets. However, distances between
composed points
along from dif-
the principal axes and
ferent sets cannot be interpreted because
in similar these distances
and symmetric fashion, th
do not approximate any defined quantity. Distances be-
of points can be decomposed in a m
tween points in the same set the are decomposition
equal to the relevant chi
of variance. Th
square distances in equations 23 and are
positions 24, used
whereas thein
to assist be-
the i
tween-set correspondence is graphical influenced by the barycen-
display.
tric nature of the transition formulas Table 2 is inthe equations
numerical 18 and
represent
19.
spondence analysis depicted in Fig
The total inertia also can be decomposed along the represents a particular decomposition
principal axes. Each eigenvalue, X,, indicates the weighted each set of points and is discussed in
variance (inertia) explained by the tth principal axis of umns headed "Coordinate" contain th
the display. Summed over all k principal axes, these ei- points on the first and second pri
genvalues represent the total inertia of the spatial rep- tively. The weights for each poi
resentation.
"Mass") are repeated from Table 1
The first principal axis in the artificial example ac-
counts for 71.6% of the spatial variation in the data (,X Inertia of the Points
= .01074). The second principal axis accounts for the The inertia of the ih brand point
remaining 28.4% (h2 = .00426). In this artificial ex-
ample, two dimensions recover exactly the original data
matrix because with three brands there are at most two (29) ri ( l/ci(pi/ri - c)2 = ri fi.
.- j - t
mutually exclusive dimensions. Real applications in-
volving reduced dimensionalities of larger data matrices Equation
will necessarily be approximations. to the tot
Table 2
NUMERICAL RESULTS OF CORRESPONDENCE ANALYSIS OF TABLE 1 DATAa
Axis 1 Axis 2
Squared Squared
Name Quality Mass Inertiab Coordinate correlation Contributionb Coordinate correlation Contributionb
Brand A 1000 333 405 -130 883 500 50 117 167
Brand B 1000 333 405 130 883 500 50 117 167
Brand C 1000 333 190 0 0 0 -90 1000 666
quantity in (31)
(31) rif i/XA,,
brackets the square
the brand profile to the center
the absolute contribution of the i" brand to the th prin-
space (i.e., 2,f,). A similar de
cipal axis is obtained. The absolute contributions quan-
attribute category point. The
tify the importance of each point in determining the di-
over all brands (or attribute c
inertia.
rection of the principal axes and serve as guides to
interpretation of each axis. They are interpreted as the
The inertias for each point a
percentage of (weighted) variance explained by each point
"Inertia" in Table 2. Brand A's inertia in the set of brand
in relation to each axis.
points is 40.5% of the total inertia, as is brand B's. Brand
It is clear from the decomposition that a point can con-
C accounts for 19% of the total inertia in this set. At-
tribute to a principal axis (i.e., make a high contribution
tribute category 6+ has an inertia that is 34% of the total
to the inertia of that axis) in two ways: when it has a
inertia in the attribute set of points and accounts for by
large mass and/or when it is a large distance from the
far the largest proportion.
centroid, even if it has relatively low mass.
Absolute Contributions to Inertia Because all the brands have equal mass, it is their dis-
tance from the centroid that determines their contribu-
The inertia along the th axis, A,, consists of the weightedtions to the inertia of each axis. The absolute contribu-
sum of squared distances to the origin of the displayedtions, in the columns headed "Contribution" in Table 2,
row (or column) profiles, where the weights are the massesindicate that brands A and B contribute equally and solely
for each row (or column) point. For the brand profiles,to the direction of axis 1 and brand C contributes pri-
this inertia can be expressed as marily to axis 2.
Similarly for the attributes, categories 6+, 5+, and
(30)
6- define the first principal axis whereas categories 3 +,
, = > rif i.
/ 2+, 4+, and 3- define the second principal axis. At-
tribute categories 1-, 5-, and to a lesser extent 1 + con-
A similar definition holds for the attribute category pro- tribute essentially nothing to the inertia of each axis and
files. Thus, each eigenvalue also represents the inertia consequently are near the origin (note that their profiles
of the projections of the brand set (or attribute category are virtually identical to the average column profile r).
set) of points on each axis.
Relative Contributions to Inertia
If each term in the summation is expressed as a per-
centage relative to the inertia "explained" by each axis, After the dimensional interpretation, the next step in
that is, a correspondence analysis is to determine the "quality"
row masses equal 1/n. The row profiles define n points Pepsi, separates the colas on the top from the non
in 2q-dimensional space, but q linear dependencies are on the bottom. We label this a "cola/noncola" dim
present among the columns so the dimensionality of the sion. The soft drink locations are based on the pro
rows is really equal to q. of purchase frequency by individuals and provide inf
Figure 2 is the joint display of individuals and soft mation on market segments and market structure.
drinks in the plane defined by the first two principal axes. The relative contributions indicate that the first p
These two axes account for 63.3% of the total inertia in cipal axis explains nearly all the variance in Coke (squ
the data. We chose two axes for display on the basis of correlation = .785) and Tab (squared correlation = .
the interpretability of the dimensions, the desire for par- whereas the second principal axis explains the most v
simony, and a scree plot of the eigenvalues which in- ance in Sprite (squared correlation = .533). Diet Pe
dicated a clear elbow at t = 2. Though the third principal quality in two dimensions (quality = .405) indicat
axis contributes enough to the total inertia to suggest its has the worst fit in the plane defined by the first
interpretation might be worthwhile, we omit plotting it principal axes. However, Coke, Pepsi, and Tab are
to save space. The numerical results are reported in Ta- well by two dimensions (quality = .803, .777, .712,
ble 4 for the soft drinks and Table 5 for the individuals. spectively).
We discuss first the soft drink points. The display in Figure 2 reveals, by their proximitie
The display is interpreted with respect to the "posi- those soft drinks having similar profiles of purchase
tive" (purchase and consumption at least every other week) consumption by the 34 individuals. Similarly, ind
soft drink points. Because each pair of soft drink points uals close together in the display share similar pat
(+ and -) has the same mass (.125), each pair has its of purchase and consumption and thus constitute lik
centroid at the origin of the display and consequently market segments based on purchase behavior. Ind
each is balanced there with the lighter mass point being uals who share the same point in the display have ide
proportionately farther from the origin. For example, tical profiles of soft drink usage.
7Up+ with mass equal to .03 is farther than 7Up- with Two segments of consumers are seen on the "d
mass of .09. side of the first principal axis, those who drink diet
The absolute contributions in Table 4 indicate that the (individuals 12, 27, 15, 30, and 32) and those who d
first principal axis is defined by Tab, Diet 7Up, and Coke diet cola and diet noncola soft drinks (individuals 7, 3
and separates the diet soft drinks (both colas and non- 4, 23, 26, and 25). The absence of any individual p
colas) on the right from the nondiet soft drinks on the in the lower right region of the space indicates that
left. We label this a "diet/nondiet" dimension. The sec- one drinks only diet noncolas.
ond principal axis, defined primarily by Sprite, 7Up, and On the left side of the space, we can identify segm
of individuals who drink both diet and nondiet colas
24, and 20), who drink only nondiet colas (2, 3, 5,
and 22), and who drink both colas and noncolas, sk
Figure 2
either toward colas (e.g., 6, 11, 10, 31, and 33) or
TWO-DIMENSIONAL CORRESPONDENCE ANALYSIS OF
ward noncolas (e.g., 1, 8, 14, 29, and 19).
THE DOUBLED DATA MATRIX CONSTRUCTED FROM The absolute contributions indicate that individuals 13
TABLE 3
and 28 are the primary contributors to the second prin-
cipal axis (contribution = .219 and .116, respectively).
Axis It These contributions are "too large" in the sense that these
(X21-51)
(15.1%) points define, almost solely, the direction of the second
principal axis. Such a situation occurs often in corre-
spondence analysis. A remedy is presented in the Dis-
Pepsi+
18,24 20 Sprite- cussion section.
2,3,5,21,22 7Up-
l
Relationship to Other Multivariate Methods
Coke+ Diet 7Up-
I O
4' 15,30,32
4 7 It is relatively easy to show that correspondence anal-
10,31,33.
13 a
Diet Coke+ Tab+
Tab-
4,2326 734 ysis is related directly
DietPepsi-
Axis I to several familiar multivariate
_-i.-- .-_I l (X, f.482)
6.11
*16
Zs5 -
Diet Pepsi +
(48.2%)
(48.2%) methods, including principal components analysis,4 the
Diet Coke-
Coke- biplot (Gabriel 1971, 1981), canonical correlation anal-
e19
17
.14,29 *
Pepsi- Diet 7Up+
O *
9< 4In fact, the row geometry of the correspondence analysis of a dou-
028
bled binary data matrix is identical to the row geometry of the prin-
cipal components analysis of the column-standardized (to unit vari-
7Up+
13 ance) undoubled data matrix. The row masses and relative distances
Sprite+ between row profiles are equivalent in both (Greenacre 1984; Lebart,
Morineau, and Warwick 1984).
Table 4
DECOMPOSITION OF INERTIA AMONG THE SOFT DRINKS FOR THE FIRST TWO PRINCIPAL AXESa
Axis I Axis 2
Squared Squared
Soft drink Qualityb Mass Inertiac correlation Contributionc correlation ContributionC
Coke+ 809 70 55 785 84 18 6
Coke- 809 55 70 785 120 18 9
Table 5
DECOMPOSITION OF INERTIA AMONG THE INDIVIDUALS FOR THE FIRST TWO PRINCIPAL AXESa
Axis 1 Axis 2
Squared Squared
Individual Qualityb Mass Inertia' correlation Contribution' correlation Contribution'
1 911 30 27 701 47 198 42
2,3,5,21,22 909 30 18 520 19 384 44
4,23,26 702 30 40 690 55 0 0
6,11 716 30 18 623 30 1 1
7,34 955 30 49 888 90 1 1
19 428 30 27 245 13 85 14
20 444 30 27 0 0 301 55
25 715 30 40 543 49 1 1
28 908 30 27 347 23 560 116
ysis, and discriminant analysis, through the generalized It is also useful to contrast correspondence analysis
singular value decomposition (see Greenacre 1984, Ap- with multidimensional unfolding, another approach for
pendix A.2). The differences among methods are deter- the joint display of a data matrix. Correspondence anal-
mined by the type of transformation applied to the orig- ysis displays the positions of the rows (or columns) of
inal data matrix, the metrics in which the principal axes the data matrix relative to the set of rows (or columns)
are defined, and how the basic values are assigned to the included in the analysis. This is a consequence of using
left and right basic vectors. In correspondence analysis, profiles, rather than absolute frequencies. Multidimen-
the transformation is defined by equation 11, the metrics sional unfolding methods, however, directly approxi-
are the chi square metrics defined by the inverses of mate the entries in the data matrix, which are assumed
equations 2 and 3, and the basic values are assigned ac- to be row-to-column distances (dissimilarities). In this
cording to equations 12 and 13. case, there is a direct interpretation of the graphical rep-
A distinct advantage of correspondence analysis over resentation in terms of interpoint distances. If the data
other methods, in terms of obtaining a joint graphical can be considered as row-to-column distances, multidi-
display, is that correspondence analysis produces two dual mensional unfolding is an appropriate technique. If the
displays whose row and column geometries have similar data cannot be considered as such, correspondence anal-
interpretations. In other multivariate approaches as they ysis may be the more appropriate method for construct-
are commonly employed, this duality does not exist. ing joint representations.
Tenenhaus and Young (1985) have shown that four
broad data analytic approaches lead to the equations of DISCUSSION
display and imply validation of the variables being in- of dimensionality." There is no method for conclusively
vestigated. determining the appropriate number of and what com-
binations of dimensions to plot and inspect. As with other
Handling Outliers multivariate methods, the researcher must balance par-
Outlier points plague correspondence analysis solu- simony against interpretability in determining the num-
tions. Occasionally, a row (or column) profile point is ber of dimensions to use.
so "rare" (in profile) in its set of points that it has a Finally, it must be recognized that in many ways cor-
major role in determining the higher order principal axes. respondence analysis is a subjective technique. Many
This situation is easily discerned by examining the points' different portrayals of a data set often are possible, lead-
contributions to the axes. When a point has a very large ing to different analysis categories and solutions. By its
absolute contribution and a large principal coordinate on flexibility, correspondence analysis can lead to greater
a major principal axis, it can be considered an outlier. insight into the phenomena being studied because it af-
Two such points are individuals 13 and 28 in the em- fords several different views of the same data set. Sub-
pirical example. These points consume nearly 34% of jectivity of analysis is part of the price of this flexibility.
the inertia on the second principal axis, determining its
orientation to a large degree. The solution lies in rede- Implementation
fining these points as supplementary and performing the A variety of computer programs are available for car-
analysis again without them, permitting them no influ- rying out a correspondence analysis. The SPAD system
ence on the direction of the principal axes. Then the points of FORTRAN programs written by Lebart and Morineau
can be fit a posteriori on the axes calculated for the re- (1982) for mainframe computer systems is particularly
maining points with transition formula 18. applicable to large data sets. A specialized version of
Inspection of the data matrix provides information about this program is described by Lebart, Morineau, and
the nature of the "rarity" of an outlying point. Treating Warwick (1984). Nishisato and Nishisato (1983) have
the point as supplementary allows more detailed study prepared a program that performs correspondence anal-
of the structure of the remaining points whose multi- ysis ("dual scaling") on the IBM PC. The program ac-
variate association is not as readily determined by in- cepts as input up to six different types of data. Greenacre
spection. (1984) presents a simple program to do correspondence
A Caveat
analysis using the high-level programming language
GENSTAT. An extensive collection of computer pro-
Correspondence analysis does have limitations. It is a
grams for correspondence analysis and related tech-
niques is provided by Jambu and Lebeaux (1983). If the
multivariate descriptive statistical method and is not ap-
propriate for hypothesis testing. Other approaches researcher
are has access to a matrix subroutine that per-
better suited to searching for parsimonious models that forms a singular value decomposition, he or she has the
can account for most of the variance in the data, such tools necessary to implement the method. For example,
as weighted least squares (Grizzle, Starmer, and Koch correspondence analysis is programmed easily with the
1969) and loglinear modeling (Bishop, Fienberg, and MATRIX procedure in SAS (SAS Institute 1982).
Holland 1975). Recently, van der Heijden and de Leeuw
(1985) showed that, under certain conditions, corre- Concluding Remarks
spondence analysis can be interpreted in terms of spe- As we present it, correspondence analysis is a method
cific loglinear models. However, statistical tests for cor-
of exploratory data analysis that (1) quantifies multi-
respondence analysis are still being developed; earlier variate categorical data, (2) affords a graphical repre-
tests were shown either theoretically or through simu- sentation of the structure in the data, and (3) does not
lations to be unjustified (Lebart 1976). Nonetheless, cor-
pose stringent measurement requirements. For many ap-
respondence analysis may be helpful in detecting models plications, its use is straightforward and unambiguous.
that merit further consideration by other methods. When complex multivariate relationships are examined,
As discussed before, an important caveat for inter- correspondence analysis is limited only by the research-
preting correspondence analysis results is that the er's be- ingenuity in interpreting the derived spatial map. As
tween-set distances cannot be interpreted. The joint dis-a graphical method of data analysis, correspondence
play of coordinates shows the relationship between a point
analysis is applied best as a multivariate descriptive sta-
from one set and all the points of the other set, not be-tistical technique supplemental to other forms of analy-
sis.
tween individual points from each set. (See Carroll, Green,
and Schaffer 1986 for an alternative scaling of the co-
Correspondence analysis is very flexible. Not only is
ordinates that provides for comparability of all within-
it flexible in terms of data requirements, but it also al-
set and between-set distances.) When the correspon- lows for the incorporation of marketing knowledge. In
dence analysis solution has more than two dimensions,studying a product class, say, the researcher can set masses
of brands equal to the market share or dollar sales of
proximity with one pair of axes may disappear when other
pairs are plotted. each, or perhaps to the percentage of consumers who use
Correspondence analysis also suffers from the "curse
the product in the population. The technique of fitting
supplementary points in the display is an interesting and Gabriel, K. R. (1971), "The Biplot Graphic Display of Ma-
virtually limitless way to incorporate external informa- trices with Application to Principal Component Analysis,"
tion into the analysis. It is also useful as a check on data Biometrika, 58 (December), 453-67.
validity and as a tool for handling troublesome outliers. (1981), "Biplot Display of Multivariate Matrices for
Inspection of Data and Diagnosis," in Interpreting Multi-
Though correspondence analysis has limitations, the most
variate Data, V. Barnett, ed. Chichester: John Wiley & Sons,
important being that between-set distances in the graph- Inc., 147-73.
ical display are not interpretable, its flexibility may ren- Gower, J. C. (1966), "Some Distance Properties of Latent Root
der it more suitable than other methods for marketing and Vector Methods Used in Multivariate Analysis," Bio-
research applications in many situations. metrika, 53 (December), 325-38.
Categorical data are common products of marketing Green, Paul E., with J. Douglas Carroll (1978), Mathematical
research. However, the analysis of such data often is Tools for Applied Multivariate Analysis. New York: Aca-
hindered by the requirements and limitations of many demic Press, Inc.
familiar research tools. Correspondence analysis is a , Vithala R. Rao, and Wayne S. DeSarbo (1978), "In-
versatile and easily implemented analytical method that corporating Group-Level Similarity Judgments in Conjoint
Analysis," Journal of Consumer Research, 5 (December),
can do much to assist researchers in detecting and ex- 187-93.
plaining relationships among complex marketing phe- Green, Robert T., Jean-Paul Leonardi, Jean-Louis Chandon,
nomena.
Isabella C. M. Cunningham, Bronis Verhage, and Alain
REFERENCES Strazzieri (1983), "Societal Development and Family Pur-
chasing Roles: A Cross-National Study," Journal of Con-
Belk, Russell, John Painter, and Richard Semenik (1981), sumer Research, 9 (March), 436-42.
"Preferred Solutions to the Energy Crisis as a Function of
Greenacre, Michael J. (1978), "Some Objective Methods of
Causal Attributions," Journal of Consumer Research, 8
Graphical Display of a Data Matrix" (English translation of
(December), 306-12. 1978 doctoral thesis), Department of Statistics and Opera-
Benzdcri, J. P. (1969), "Statistical Analysis as a Tool to tions MakeResearch, University of South Africa.
Patterns Emerge from Data," in Methodologies of Pattern(1984), Theory and Application of Correspondence
Recognition, S. Watanabe, ed. New York: Academic Press, Analysis. London: Academic Press, Inc.
Inc., 35-74. Grizzle, J. E., C. F. Starmer, and G. G. Koch (1969), "Anal-
et al. (1973a), L'Analyse des Donnees. Vol. I, La Tax- ysis of Categorical Data by Linear Models," Biometrics, 25
inomie. Paris: Dunod. (September), 489-504.
et al. (1973b), L'Analyse des Donnees. Vol. 1I, L'Ana- Guttman, Louis (1941), "The Quantification of a Class of At-
lyse des Correspondances. Paris: Dunod. tributes: A Theory and Method of Scale Construction," in
Bishop, Yvonne M. M., S. E. Fienberg, and P. W. Holland Prediction of Personal Adjustment, The Committee on So-
(1975), Discrete Multivariate Analysis: Theory and Prac- cial Adjustment, ed. New York: Social Science Research
tice. Cambridge, MA: MIT Press. Council, 319-48.
Bock, R. Darrell (1960), "Methods and Applications of Op-Hayashi, C. (1950), "On the Quantification of Qualitative Data
timal Scaling," Laboratory Report No. 25, L. L. Thurstone from the Mathematico-Statistical Point of View," Annals of
Psychometric Laboratory, University of North Carolina, the Institute of Statistical Mathematics, 2 (1), 35-47.
Chapel Hill. (1952), "On the Prediction of Phenomena from Qual-
Burt, C. (1950), "The Factorial Analysis of Qualitative Data," itative Data and the Quantification of Qualitative Data from
British Journal of Psychology (Statistical Section), 3 (No- the Mathematico-Statistical Point of View," Annals of the
vember), 166-85. Institute of Statistical Mathematics, 3 (2), 69-98.
Carroll, J. Douglas, Paul E. Green, and Catherine M. Schaffer (1954), "Multidimensional Quantification-with the
(1986), "Interpoint Distance Comparisons in Correspon- Applications to Analysis of Social Phenomena," Annals of
dence Analysis," Journal of Marketing Research, 23 (Au- the Institute of Statistical Mathematics, 5 (2), 121-43.
gust), 271-80. Heiser, Willem J. (1981), Unfolding Analysis of Proximity Data.
de Leeuw, Jan (1973), Canonical Analysis of Categorical Data, Leiden, The Netherlands: Department of Data Theory, Uni-
unpublished doctoral dissertation, Psychological Institute, versity of Leiden.
University of Leiden, The Netherlands. Hill, M, 0. (1974), "Correspondence Analysis: A Neglected
Eckart, C. and Gale Young (1936), "The Approximation of Multivariate Method," Applied Statistics, 23 (3), 340-54.
One Matrix by Another of Lower Rank," Psychometrika, 1 Hirschfeld, H. 0. (1935), "A Connection Between Correlation
(September), 211-18. and Contingency," Proceedings of the Cambridge Philo-
Fisher, Ronald A. (1940), "The Precision of Discriminant sophical Society, 31 (October), 520-4.
Functions," Annals of Eugenics, 10 (December), 422-9. Holbrook, Morris B., William L. Moore, and Russell S. Wi-
Franke, George R. (1983), "Dual Scaling: A Model for In- ner (1982), "Constructing Joint Spaces from Pick-Any Data:
terpreting and Quantifying Categorical Data," in Research A New Tool for Consumer Analysis," Journal of Consumer
Methods and Causal Modeling in Marketing, W. R. Dar- Research, 9 (June), 99-105.
den, K. B. Monroe, and W. R. Dillon, eds. Chicago: Amer- Horst, Paul (1935), "Measuring Complex Attitudes," Journal
ican Marketing Association, 111-4. of Social Psychology, 6 (3), 369-74.
(1985), "Evaluating Measures Through Data Quanti- Jambu, M. and M-O. Lebeaux (1983), Cluster Analysis and
fication: Applying Dual Scaling to an Advertising Copy- Data Analysis. Amsterdam: North Holland Publishing Com-
test," Journal of Business Research, 13 (February), 61-9. pany.
Kruskal, Joseph B. and Myron Wish (1978), Multidimensional and Wen-Jenn Sheu (1984), "A Note on Dual Scaling
Scaling, Sage University Paper Series on Quantitative Ap- of Successive Categories Data," Psychometrika, 49 (De-
plications in the Social Sciences, 07-011. Beverly Hills, CA: cember), 493-500.
Sage Publications, Inc. Perreault, William D., Jr., and Forrest W. Young (1980), "Al-
Lebart, Ludovic (1976), "The Significancy of Eigenvalues Is- ternating Least Squares Optimal Scaling: Analysis of Non-
sued from Correspondence Analysis," Proceedings in Com- metric Data in Marketing Research," Journal of Marketing
putational Statistics (COMPSTAT). Vienna: Physica Verlag, Research, 17 (February), 1-13.
38-45. Richardson, M. and G. F. Kuder (1933), "Making a Rating
and Alain Morineau (1982), "SPAD: A System of Scale That Measures," Personnel Journal, 12 (June), 36-
FORTRAN Programs for Correspondence Analysis," Jour- 40.
nal of Marketing Research, 19 (November), 608-9. SAS Institute (1982), SAS User's Guide: Statistics. Cary, NC:
, , and Kenneth M. Warwick (1984), Multi- SAS Institute Inc.
variate Descriptive Statistical Analysis: CorrespondenceSchiffman, Susan S., M. Lance Reynolds, and Forrest W.
Analysis and Related Techniques for Large Matrices. New Young (1981), Introduction to Multidimensional Scaling. New
York: John Wiley & Sons, Inc. York: Academic Press, Inc.
Levine, Joel H. (1979), "Joint-Space Analysis of 'Pick-Any' Seber, G. A. F. (1984), Multivariate Observations. New York:
Data: Analysis of Choices from an Unconstrained Set of Al- John Wiley & Sons, Inc.
ternatives," Psychometrika, 44 (March), 85-92. Tenenhaus, Michel and Forrest W. Young (1985), "An Anal-
Marc, Marcel (1973), "Some Practical Uses of 'The Factorialysis and Synthesis of Multiple Correspondence Analysis,
Analysis of Correspondence,'" European Research, 1 (July),Optimal Scaling, Dual Scaling, Homogeneity Analysis and
2-8. Other Methods for Quantifying Categorical Multivariate
McKeon, J. J. (1966), "Canonical Analysis: Some Relations Data," Psychometrika, 50 (March), 91-119.
Between Canonical Correlation, Factor Analysis, Discrim- Tukey, John W. (1977), Exploratory Data Analysis. Reading,
inant Function Analysis and Scaling Theory," Psychome- MA: Addison-Wesley Publishing Company, Inc.
trika, Monograph No. 13. van der Heijden, Peter G. M. and Jan de Leeuw (1985), "Cor-
Nishisato, Shizuhiko (1980), Analysis of Categorical Data: Dual respondence Analysis Used Complementary to Loglinear
Scaling and Its Applications. Toronto: University of Toronto Analysis," Psychometrika, 50 (December), 429-47.
Press. van Rijckevorsel, Jan and Jan de Leeuw (1978), "An Outline
and Ira Nishisato (1983), An Introduction to Dual to HOMALS-1," Department of Data Theory, Faculty of
Scaling, 1st ed. Islington, Ontario: MicroStats. Social Sciences, University of Leiden, The Netherlands.
All reprints are sold according to the following price schedule, in minim
50 copies.