This Content Downloaded From 168.176.5.118 On Wed, 13 Jan 2021 19:21:48 UTC

Correspondence Analysis: Graphical Representation of Categorical Data in Marketing
Research
Author(s): Donna L. Hoffman and George R. Franke
Source: Journal of Marketing Research , Aug., 1986, Vol. 23, No. 3 (Aug., 1986), pp. 213-
227
Published by: Sage Publications, Inc. on behalf of American Marketing Association
Stable URL: https://www.jstor.org/stable/3151480
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide
range of content in a trusted digital archive. We use information technology and tools to increase productivity and
facilitate new forms of scholarship. For more information about JSTOR, please contact support@jstor.org.
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
https://about.jstor.org/terms
American Marketing Association and Sage Publications, Inc. are collaborating with JSTOR to
digitize, preserve and extend access to Journal of Marketing Research
This content downloaded from

168.176.5.118 on Wed, 13 Jan 2021 19:21:48 UTC
All use subject to https://about.jstor.org/terms
DONNA L. HOFFMAN and GEORGE R. FRANKE*
Correspondence analysis is an exploratory data analysis technique for the graph-

ical display of contingency tables and multivariate categorical data. Its history can
be traced back at least 50 years under a variety of names, but it has received little
attention in the marketing literature. Correspondence analysis scales the rows and
columns of a rectangular data matrix in corresponding units so that each can be
displayed graphically in the same low-dimensional space. The authors present the
theory behind the method, illustrate its use and interpretation with an example rep-
resenting soft drink consumption, and discuss its relationship to other approaches
that jointly represent the rows and columns of a rectangular data matrix.
Correspondence Analysis: Graphical

Representation of Categorical Data in
Marketing Research
I
Marketing researchers often need to detect and inter- corresponding units, allowing all the variables to be
pret relationships among the variables in a rectangularplotted in the same space for ease of interpretation. This
data matrix. To facilitate this task, multidimensional representation then can be used to reveal the structure
scaling and unfolding, discriminant analysis, canonical and patterns inherent in the data. In this sense, corre-
correlation analysis, factor analysis, and principal com-
spondence analysis is in that class of methods known as
ponents analysis all have been used to represent graph-"exploratory data analysis" (cf. de Leeuw 1973; Heiser
ically the rows and/or columns of a data matrix. How-1981; Tukey 1977).
ever, these methods have little applicability to the Correspondence analysis has several features that con-
categorical data that arise in many marketing researchtribute to its usefulness to marketing researchers. Much
applications. The purpose of our article is to direct theof its value relates to its multivariate treatment of the
attention of the marketing community to correspondence data through the simultaneous consideration of multiple
analysis, a multivariate descriptive statistical method thatcategorical variables. The multivariate nature of corre-
represents graphically the rows and columns of a cate-spondence analysis can reveal relationships that would
gorical data matrix in the same low-dimensional space.not be detected in a series of pairwise comparisons of
In correspondence analysis, numerical scores are as- variables. Correspondence analysis also helps to show
signed to the rows and columns of a data matrix so ashow variables are related, not just that a relationship ex-
to maximize their interrelationship. The scores are inists. The joint graphical display obtained from a corre-
spondence analysis can help in detecting structural re-
lationships among the variable categories. Finally,
correspondence analysis has highly flexible data require-
*Donna L. Hoffman is Assistant Professor, Graduate School of ments. The only strict data requirement for a correspon-
Business, Columbia University. George R. Franke is Assistant Pro-
dence analysis is a rectangular data matrix with non-neg-
fessor, Department of Advertising, The University of Texas, Austin.
The authors thank William Moore, Donald Morrison, Thomas No-ative entries. Thus, the researcher can gather suitable data
vak, William Perreault, and two anonymous JMR reviewers for theirquickly and easily.
helpful comments on a draft of this article. A longer version of the A distinct advantage of correspondence analysis over
article with a technical appendix is available from the first author.other methods yielding joint graphical displays is that it
Professor Hoffman gratefully acknowledges support from the Faculty
Research Fund of the Graduate School of Business, Columbia Uni- produces two dual displays whose row and column ge-
versity. ometries have similar interpretations, facilitating analy-
sis and detection of relationships. In other multivariate
213
Journal of Marketing Research

Vol. XXIII (August 1986), 213-27

168.176.5.118 on Wed, 13 Jan 2021 19:21:48 UTC
214 JOURNAL OF MARKETING RESEARCH, AUGUST 1986
approaches to graphical data representation, this duality fering. Correspondence analysis of this consumers by
is not present. product features matrix affords guidelines for appropri-
Correspondence analysis as a geometric approach to ate segmentation bases and potential marketing mix
multivariate descriptive data analysis originated in France; strategies. The method can be applied also in the con-
Benzecri (1969, 1973a, b) and his colleagues have done cept-testing phase when several concepts are competing
much to popularize the technique. The term "correspon- for developmental funds. Analysis of the concepts by
dence analysis" is a translation of the French "analyse attributes matrix can indicate those concepts that have
factorielle des correspondances." The technique has re- the most favorable profiles and, consequently, should be
ceived considerable attention in the statistical and psy- developed further.
chometric literature under a variety of names, including In the next two sections we use an artificial example
dual scaling, method of reciprocal averages, optimal to describe the theory behind the method of correspon-
scaling, canonical analysis of contingency tables, cate- dence analysis. Appropriate types of data for its use and
gorical discriminant analysis, homogeneity analysis, guidelines for interpretations also are discussed. We then
quantification of qualitative data, and simultaneous lin- illustrate correspondence analysis with an example that
ear regression. Complete histories of correspondence empirically demonstrates practical data considerations and
analysis are given by de Leeuw (1973), Greenacre (1984), issues of interpretation. The relationship of correspon-
and Nishisato (1980). dence analysis to other multivariate methods is exam-
Though very few applications of correspondence anal- ined. In the concluding section we discuss the issues of
ysis have been reported in the marketing literature, in- supplementary variables and outliers, provide some cau-
terest is increasing. Levine's (1979) procedure for the tions to the researcher, and comment on implementation.
analysis of "pick-any" data, which is related closely to
THE METHOD OF CORRESPONDENCE ANALYSIS
correspondence analysis, has been discussed by Hol-
brook, Moore, and Winer (1982). Green et al. (1983) An Artificial Example
use correspondence analysis in a cross-national exami-
In many marketing research applications, the data col-
nation of family purchasing roles. Franke (1983) illus-
lected are categorical, mainly because of the limitatio
trates the use of "dual scaling" with a reanalysis of data
and constraints imposed on the data collection proces
from a study by Belk, Painter, and Semenik (1981) on
For example, a researcher may be interested in the r
perceived causes of and preferred solutions to the energy
lationship between several brands in a product class a
crisis. Franke (1985) also discusses the use of dual scal-
a variety of attributes believed to describe the brand
ing in examining measurement-level assumptions and in-
Frequently the researcher gives consumers a list of brand
terpreting responses to a measure. Additionally, Benzecri and asks them to check off the attributes that describe
(1973b) describes two marketing-oriented applications
the brands, rather than asking them to rate each brand
of correspondence analysis, one evaluating competing
on a scale. The advantages of this common data collec-
cigarette brands and the other selecting a name for a new
tion process are that it is quicker, easier, and less ex-
brand of cigarettes.
pensive than obtaining rating scale (i.e., interval-level)
There is virtually no limit to the number of marketing data.
applications for correspondence analysis. In the devel-
As an example, suppose data were collected from 100
opment of market segments, for example, correspon-
consumers on three brands, and six attributes were hy-
dence analysis could be used to detect relatively ho-
pothesized to describe those brands. For each brand, re-
mogeneous groupings of individuals. Correspondence
spondents indicate whether the attribute describes the
analysis also can aid in product positioning studies. For
brand. The data generated from such a procedure might
example, suppose interest centers on consumer percep-
be arrayed as in Table 1. We have calculated, by sub-
tions of brands as a basis for positioning a particular brand.
traction, the "no" category for each attribute (we explain
Correspondence analysis of the categorical brands by at-
why subsequently). Twenty-nine percent of the respon-
tributes matrix gives information on the positioning of
dents indicated that attribute 1 described brand A, 20%
each brand vis-a-vis the attributes selected to describe
said that attribute 2 described brand A, and so on. These
them.
data, originally zeros and ones, have been aggregated
Correspondence analysis has been used to monitor the
over individuals and proportions calculated.
efficiency of advertising campaigns in France (Marc
Suppose we are interested in the following questions.
1973). Before the ad campaign, a study is carried out to
monitor advertising efficiency. After the campaign, an- 1. What are the similarities and differences among the three
other study is conducted. Together, the results of these brands with respect to the six attributes?
2. What are the similarities and differences among the six
studies reveal movement in product positioning attrib-
attributes with respect to the three brands?
utable to the advertising campaign.
3. What is the relationship among the brands and attributes?
The method also may prove useful in the design phase
4. Can these relationships be represented graphically in a
of the new-product development process. Suppose a new- joint low-dimensional space?
product manager gathers (binary) endorsements of con-
sumers on a variety of proposed features of a new of- To answer these questions, we present the method of

168.176.5.118 on Wed, 13 Jan 2021 19:21:48 UTC
CORRESPONDENCE ANALYSIS 215
Table 1
ARTIFICIAL DATA ON THREE BRANDS AND SIX ATTRIBUTES FROM 100 INDIVIDUALS
Attribute
1 2 3 4 5 6 Row
Brand Yes No Yes No Yes No Yes No Yes No Yes No mass
A 29a 71 20 80 18 82 24 76 20 80 13 87
(016b 039 011 044 010 045 013 042 011 044 007 048) 333
B 26 74 15 85 25 75 30 70 10 90 34 66
(014 041 008 047 014 041 016 039 005 050 019 036) 333
C 25 75 26 74 31 69 21 79 15 85 24 76
(014 041 014 041 017 038 011 044 008 047 013 042) 333
Column
mass 044 122 034 132 042 124 041 125 024 142 040 126
'All entries are proportions. Decimal points are omitted.

bFigures in parentheses are rescaled so that their sum equals
correspondence analysisWhen usingthe q variables

ashave anonly two possible re-
example the
cial data of Table 1. Notation and
sponses (e.g., yes/no, endorse/dogeneral
not endorse, pur- data c
are introduced first. Correspondence analysis
chase/do not purchase, etc.), only two categories are in
terminology that maypossible
beforunfamiliar
each variable and Pr = 2 for allto market
r. In prac-
searchers. We maintaintice,this terminology
the researcher in our
typically obtains only the positive
tion for consistency with the
endorsements psychometric
and infers the negative by subtraction. In and st
literature. following In the
discussion,
applications of correspondence analysis to data otherboldface
than
letters represent matrices,
contingencyboldface lowercase
tables the data matrix can be "doubled" to lette
resent vectors, and lowercase italic
obtain this full set of responses. letters represen
Doubling creates a sym-
lars. metry between the two "poles" of each binary variable
and renders the correspondence analysis invariant with
Notation and Data Doubling
respect to the direction in which we choose to scale the
Let X represent the 3 x 12 brands by attribute cate- data (Greenacre 1984). The artificial example in this and
gories categorical data matrix displayed in Table 1. In the following section is based on such a doubled data
general, the matrix is "objects by variable categories." matrix.
The term "objects" is used to represent the extensive va-
riety of products, commodities, goods, and consumers Algebraic Considerations in Correspondence Analysis
investigated in marketing research studies. Hence, ob- A variety of approaches lead to the equations of cor-
jects may be brands, individuals, product classes, seg- respondence analysis (Tenenhaus and Young 1985). As
ments of consumers, etc. The term "variables" is used a theoretical basis for developing the logic of corre-
in the broadest sense possible and refers, in general, to spondence analysis, we use the notion of the singular
characteristics of the objects being studied. These char- value decomposition (SVD) of a matrix (Eckart and Young
acteristics may be attributes, store locations, marketing 1936; Green with Carroll 1978). This "principal com-
mix variables, attitude statements, etc. ponents analysis" approach, due largely to Greenacre
The general q-variate categorical data matrix X is (1978, 1984), is useful because it emphasizes the geo-
n x p, where the q variables (e.g., attributes) are rep- metric properties of correspondence analysis and illu-
resented by sets of columns and categorical measure- minates the practical implications of the data analysis.
ments of objects (e.g., brands) on these variables are The singular value decomposition embodies the idea of
represented by rows. Each variable has Pr categories the basic structure of a matrix, consisting of basic values
(columns), with r = 1, ..., q and PI + ... + Pr + ... and basic vectors. The eigenstructure (eigenvalues and
+ Pq = p. The general entry xij is some categorical mea- eigenvectors) of a symmetric matrix is a special case of
sure of the jth variable category,' j = 1, ..., p, on the the SVD.
ith object, i = 1, ..., n. The philosophy behind correspondence analysis is to
obtain a graphical representation of both the rows and
columns of the original data matrix in terms of as few
'Actually, j indexes the Ith category of the r* variable, I = 1, .... dimensions as possible. In correspondence analysis, each
P,, but this level of precision in notation is not required for the ex- row of X represents a point profile in p-dimensional space
position we present. and each column represents a point profile in n-dimen-

168.176.5.118 on Wed, 13 Jan 2021 19:21:48 UTC
umn profiles in n-dimensional space are written in the

sional space. Attention is directed to the profiles of the
frequency distributions rather than their raw occurrence,rows of C.
because the raw frequencies in Table 1 do not yield a R = D7'P
(4)
meaningful interpretation of distances between row points
and between column points. and
In terms of the n brands, say, in p-dimensional attri-
(5) C = D-'P'
bute space, it is clear that some brands will "occur" fre-
quently and consequently some attribute categories will Note that a profile (row or column) sums to unity. The
be endorsed frequently for those brands. Other brands correspondence analysis problem is to find a low-rank
will have small frequencies of occurrence and hence theapproximation to the original data matrix that optimally
attribute categories attributed to them will appear less represents both these row and column profiles in k-di-
frequently. The brand profiles are conditional frequen- mensional subspaces, where k is generally much smaller
cies of attribute category j given brand i. Similarly forthan either n or p.2 These two k-dimensional subspaces
the p attributes in n-dimensional brand space, the con- (one for the row profiles and one for the column profiles)
ditional frequencies of brand i given attribute categoryhavej a geometric correspondence, which we examine
are the quantities of interest. hereafter, that enables us to represent both in one joint
To perform a correspondence analysis, one rescales display.
the original data matrix X so that the sum of the elementsBecause we wish to represent graphically the distances
equals 1. between row (or column) profiles, we orient the config-
(1) P = X/1'Xl, with 1'Pl = 1, uration of points at the "center of gravity" of both sets.
The centroid of the set of row points in its space is c,
where 1' = (1 ... 1)', either 1 x n or 1 x p, depending the vector of column masses. This defines the "average"
on the context. P is the correspondence matrix whose row profile. The centroid of the set of column points in
elements are the relative frequencies and if X is a con- its space is r, the vector of row masses. This is the av-
tingency table, P is the probability density on the cells erage column profile. To perform the analysis relative
of X. The row sums of P are written into Dr, an n x n to the center of gravity, P is centered "symmetrically"
diagonal matrix, by rows and columns, that is, P - rc', so that the origin
corresponds to the average profile of both sets of points.
(2) Dr = diag(r) The solution to finding a representation of both row
and column profiles in a low-dimensional space involves
where r = P1, and the column sums of P are written the generalized singular value decomposition (GSVD)
into Dc, a p x p diagonal matrix, and low-rank matrix approximation theory (Seber 1984).
(3) Dc = diag(c) The GSVD of the symmetrically centered correspon-
dence matrix P defines the theoretical correspondence
where c = P'1. These row and column sums are referred
analysis problem.
to as masses in correspondence analysis. The masses en-
able us to weight each profile point in proportion to its (6) P - rc' = MD,N',
frequency. Again, if we are working with a contingency
table, r and c are the marginal densities. These densities where M'Dr'M = N'Dc'N = I, with M n x k, N p x
are only analogies when X is not a contingency table. k, and D , k x k, and , - ... - Lt ... ~k > 0.
The entries of P, with row and column masses r and c, The columns of M and N hold the first k left and right
respectively, are in parentheses in Table 1 below the cor- generalized basic vectors of P - rc', in the metrics
responding entries of X. Dr1 and Dc1, corresponding to the k largest basic values
Note that the brand masses are all equal (.33) and that and define the optimal weighted Euclidean k-dimen-
each attribute also has equal mass (.16), though this sional subspaces in terms of weighted sum of squared
quantity is distributed differently for each attribute be- distances. D, is a diagonal matrix holding the general-
tween the yes and no categories, depending on the fre-ized basic values il, ..., lk, in descending order, cor-
quency of responses in each category. In this example, responding to the generalized basic vectors. In other
the masses are equal because each row sums to the samewords, the principal axes of the attribute category (col-
constant value and each pair of columns sums to the same umn) set of points are defined by the columns of M and
constant value, by design. In other situations, such as in the principal axes of the brand (row) set of points are
contingency tables, the masses will not necessarily be defined by the columns of N. The weighted centers of
equal.
The row and column profiles of P are defined as the
vectors of row and column elements of P divided by
their respective masses. The n row profiles in p-dimen- 2Correspondence analysis optimizes several criteria simultaneously.
sional space are written in the rows of R and the p col- See Tenenhaus and Young (1985) for a detailed discussion.

168.176.5.118 on Wed, 13 Jan 2021 19:21:48 UTC
gravity of each set of points are both at the origin of thethe columns of G are the eigenvectors of CR.
principal axes.
(14) (D,-PD-'P')F = FDx
The principal coordinates (cf. Gower 1966) of the brand
and attribute category profiles, with respect to their prin-and
cipal axes, are written in the rows of F and G, respec-
(15) (D 'P'D1'P)G = GDx
tively.
(7) F = (D'P - lc')DN'lN The eigenvalues, X,, are the weighted variances of each
principal axis (the weighted sums of squares of the points'
and coordinates along the tt principal axis in each set) and
are equal to the corresponding squared basic values from
(8) G = (D'P' - lr')Dr'M
the SVD in equation 11.
The set of points defined in equation 7 are the n row
(16) F'D,F = D2 = DA
profiles in weighted Euclidean k-dimensional space, with
masses defined by the n elements of r and principal axis and
weights defined by the inverses of the elements of c, that
G'DCG = D2= = Dk
is, Dc'. A similar definition holds for the column set of (17)
points defined in equation 8. These are the p column The axes are orthogonal, though the metric is 'chi square"
profiles in weighted Euclidean k-dimensional space, with and not ordinary Euclidean as in principal components
masses defined by the p elements of c and principal axis analysis.
weights defined by the inverses of the elements of r, that The transition formulas relate the brand and attribute
is, D;1. Thus, the principal axes are weighted inversely category coordinates to each other.
by the elements of the average profile.
Each set of points can be related to the principal axes (18) F = DT'PGD,1
r IL= ~
RGD-'
of the other set of profile points through rescalings by and
the basic values.
(19) G = D-'P'FD-'
c rJ
= CFD-'
L
(9) F = Dr-'MD,
Hill (1974) co
and
corresponden
(10) G = D'lNDD, The transitio
portant becau
In practice, the correspondence analysis problem is ing one set of
restated in an equivalent form in terms of the SVD for geometric im
computational convenience. row of F, f'.
(11) Dr'/2(P - rc')D'1/2 = MD,N' (20) f;= (frG)D,'l
where M'M = N'N = I, with M n x k, Np x k, D
where r' is the i throw profile from R in equation 4.
k x k, and Il > ... -> L, t ... J'k > 0. Then,
Equation 20 defines a "barycenter," actually a center of
(12) F = Dr1/2MD,, where M = Dr'M, mass, of the p column profile points in G, because the
and
sum of the elements of ri equals unity. Postmultiplica-
tion by D', divides the coordinates of the centroid by
(13) G = D-1/2NDs, where N = D-'N, the singular values. Geometrically, a particular row pro-
file will be "attracted" to a position in its subspace that
and plotting the rows of F and G in the same space re- corresponds to the column variable categories prominent
sults in a k-dimensional correspondence analysis. in that row profile. A corresponding definition and inter-
Correspondence analysis can be considered a dual pretation holds for the rows of G.
generalized principal components analysis (Greenacre Distances (squared) between points in the same set are
1984). The columns of F are the eigenvectors of RC and given by
(21) dI, = E 1/cj(p/r' - pij/ri,)2

3The centering operation has the effect of removing the trivial axes
j
with corresponding basic values of unity. That is, without centering,

for row points i and i' and
the first axes (columns) extracted from the left and right generalized
basic vectors would correspond to r and c, respectively, and the first
diagonal element of D, would equal 1. In this case, the analysis is (22) 2' = E l/ri(pijcj -Pij/cj)2
performed relative to the origin, rather than from the "center of grav- i
ity." Because the GSVD of P - rc' is "contained" in the GSVD of

P (Greenacre 1978, 1984), attention is restricted to P - rc'. for column points j and j'. These are similar to ordinary

168.176.5.118 on Wed, 13 Jan 2021 19:21:48 UTC
Euclidean distances except that each squared term is Correspondence analysis is appropriate for such data,
weighted by the inverse of the relative frequency (mass) whereas standard multidimensional scaling methods are
corresponding to the term. not (Holbrook, Moore, and Winer 1982).
These distances, approximated in the k-dimensional Though correspondence analysis is ideally suited to
subspaces by those research situations in which categorical measure-
ments are the most reasonably obtained, it also can be
(23) d2, - (fi - fi,)'(fi - fi)
applied to ordered categories and "discretized" quanti-
and tative variables (see Jambu and Lebeaux 1983), but the
original ordering may not be maintained after scaling un-
(24) d>, - (gj - gj,)'(gj - gj,) less the solution is constrained (Nishisato and Sheu 1984).
for row and column points, respectively, are defined as This type of application allows investigation of possible
chi square distances. This distance measure is chosen nonlinearities among the categories with respect to the
because it guarantees invariance according to the prop- principal axes. It can lead to the discovery of relation-
erty of distributional equivalence: ships between scale value categories that are obscured if
the data are dichotomized, or if methods are used that
-If two rows having identical column profiles are aggre-
recognize only the metric properties of the data. Thus,
gated, the distances between columns remain unchanged.
a "loss of information" in ignoring the ordered or inter-
-If two columns having identical row profiles are aggre-
gated, the distances between rows remain unchanged. val nature of the data yields a meaningful gain in un-
derstanding (Lebart, Morineau, Warwick 1984).
Clearly, identical profiles imply equal or proportional raw Correspondence analysis of other forms of data, such
data. as rank-order data, sorting data, paired comparison data,
and successive categories data, is discussed by Jambu
The Correspondence Analysis Model and Lebeaux (1983), Nishisato (1980), Nishisato and
The correspondence analysis "model" on P in k di- Nishisato (1983), and Nishisato and Sheu (1984). Ap-
mensions reveals how an element of P is approximated plications of correspondence analysis are virtually un-
in the k-dimensional weighted Euclidean subspace. limited, but Lebart, Morineau, and Warwick (1984)
P - re' + DrFD,'G'Dc suggest three conditions that should be satisfied if corre-
(25)
spondence analysis is to be most effective.
From equation 25 it is clear that the model treats rows 1. The data matrix must be large enough that visual in-
and columns symmetrically, as nothing changes if we spection or simple statistical analysis cannot reveal its
begin with X' instead of X. structure.
2. The variables must be "homogeneous," so that it makes

Data Considerations sense to calculate a statistical distance between rows and
Because of the inherent symmetry of correspondence columns and so that distances can be interpreted mean-
ingfully.
analysis, the implied data matrix for analysis is a con-
3. The data matrix must be "amorphous, a priori." In other
tingency table. However, the method can be applied to words, the method is most fruitfully applied to data whose
almost any matrix of categorical data, as long as the en- structure is either unknown or only poorly understood.
tries are non-negative. Excellent data classifications for
correspondence analysis are given by Benzecri (1973b), INTERPRETING A CORRESPONDENCE ANALYSIS
Nishisato (1980), and Greenacre (1984).
The principal coordinates of the brand and attribute
Many situations in marketing research lead to datacategory
at profile points from the correspondence analysis
the nominal or ordinal level of measurement (Perreault in two dimensions of the artificial data of Table 1 are
and Young 1980). Such data are often intractable with plotted in Figure 1. The plots are merged into one joint
traditional analytical methods. A common source of this display for ease of interpretation.
type of data is the evaluation of objects (e.g., retail out- The overall spatial variation in each set of points can
lets, competing products, individuals) on attributes (e.g.,be quantified and assists in interpretation. This variation,
product features, attitude statements) with binary judg- the total inertia, is defined as the weighted sum of squared
ments rather than 5- or 7-point rating scales. Binary
distances from the points to their respective centroids and
judgments are useful when the researcher has many ob-is equivalent for both sets of points.
jects or attributes to measure, when respondent coop-
eration is difficult to obtain, when it is difficult to make
(26) Inertia (Total)= E (P- ri )2
fine distinctions between objects on the attributes, and i rici
whenever rating scales are difficult to use.
Another source of data common in marketing research (27) Inertia (rows) = 3 ri 3 l
is the open-ended elicitation of attributes, brands, stores, i -
and so on, from respondents (i.e., "pick-any" data). With

an unconstrained set of alternatives, failure to mention = E ri(i -
an alternative does not necessarily imply rejection of it. i

168.176.5.118 on Wed, 13 Jan 2021 19:21:48 UTC
Figure 1 From Figure 1 we see that brand

TWO-DIMENSIONAL CORRESPONDENCE ANALYSIS OF relatively far from each other in ter
THE DOUBLED DATA MATRIX IN TABLE 1 that describe them. Their relative p
the two-dimensional space indicate
differences among them with respec
Axis II
(A2'.00426)
(28.4%)
The first dimension separates attri
from attribute 5+ on the left, and
4+
B on the right from brand A on
dimension separates attributes 2+ a
Brand A o
3-
2- Brand B
from attribute 4+ on the top. This
O 0
tion, differentiates brand C on the b
5+ 6- 1+
I 0 5-
II a&~l A and B on the top.
4- D
o 1- The transition formulas in equati
clear that, geometrically, a particular
Brand C a position in its space corresponding
egories prominent in that brand pro
3+
2+
(28) Inertia (columns) =
I the display of brand profiles, a parti
egory will tend along the principal a
of the brands that are relatively sub
gory. For example, the attribute cate
the negative side of the second princ
C, which is relatively high on att
(28) Inertia (columns) = E cj l/ri(Pi/cj - ri)2 1), is on the negative side of its se
_ - i
Points near the center of the displ
tiated profile distributions as a conse
= cj(c - r)'Dr-'(j - r).
placed at the center of gravity. Notic
careful not to interpret between-set
It is because of the geometric Thecorrespondence
interpretation of of thethe
corresp
two sets of points, in position and inertia, that we can
not yet complete. The two-dimension
merge the two displays into 1 one
shows joint display. The
the projections ad- poi
of the
vantage of this merger is that a concise
plane, graphical
but does display
not indicate which
representing varied features of the
most data
impact in is obtained the
determining in aorie
single picture. The geometric For display of each
a complete and set of points
correct interpreta
reveals the nature of similarities andwe
display, variation
must use within the inf
additional
set, and the joint display shows the the
Because correspondence
total inertia ofbe- each s
tween sets. However, distances between
composed points
along from dif-
the principal axes and
ferent sets cannot be interpreted because
in similar these distances
and symmetric fashion, th
do not approximate any defined quantity. Distances be-
of points can be decomposed in a m
tween points in the same set the are decomposition
equal to the relevant chi
of variance. Th
square distances in equations 23 and are
positions 24, used
whereas thein
to assist be-
the i
tween-set correspondence is graphical influenced by the barycen-
display.
tric nature of the transition formulas Table 2 is inthe equations
numerical 18 and
represent
19.
spondence analysis depicted in Fig
The total inertia also can be decomposed along the represents a particular decomposition
principal axes. Each eigenvalue, X,, indicates the weighted each set of points and is discussed in
variance (inertia) explained by the tth principal axis of umns headed "Coordinate" contain th
the display. Summed over all k principal axes, these ei- points on the first and second pri
genvalues represent the total inertia of the spatial rep- tively. The weights for each poi
resentation.
"Mass") are repeated from Table 1
The first principal axis in the artificial example ac-
counts for 71.6% of the spatial variation in the data (,X Inertia of the Points
= .01074). The second principal axis accounts for the The inertia of the ih brand point
remaining 28.4% (h2 = .00426). In this artificial ex-
ample, two dimensions recover exactly the original data
matrix because with three brands there are at most two (29) ri ( l/ci(pi/ri - c)2 = ri fi.
.- j - t
mutually exclusive dimensions. Real applications in-
volving reduced dimensionalities of larger data matrices Equation
will necessarily be approximations. to the tot

168.176.5.118 on Wed, 13 Jan 2021 19:21:48 UTC
Table 2
NUMERICAL RESULTS OF CORRESPONDENCE ANALYSIS OF TABLE 1 DATAa
Axis 1 Axis 2
Squared Squared
Name Quality Mass Inertiab Coordinate correlation Contributionb Coordinate correlation Contributionb
Brand A 1000 333 405 -130 883 500 50 117 167
Brand B 1000 333 405 130 883 500 50 117 167
Brand C 1000 333 190 0 0 0 -90 1000 666
1+ (yes) 1000 44 10 -50 519 9 40 481 21

1- (no) 1000 122 1 20 519 3 -20 481 7
2+ 1000 34 110 -100 206 32 -20 794 307

2- 1000 132 31 30 206 8 50 794 78
3+ 1000 42 130 120 289 51 -180 711 316

3- 1000 124 44 -40 289 17 60 711 104
4+ 1000 41 60 100 429 37 110 571 124

4- 1000 125 20 -30 429 12 -40 571 41
5+ 1000 24 123 -270 1000 172 0 0 0

5- 1000 142 21 50 1000 30 0 0 0
6+ 1000 40 340 360 999 480 -10 1 1

6- 1000 126 110 -110 999 149 0 1 1
'All values are multiplied by 1000 an

bScaled (before multiplication by 100
quantity in (31)
(31) rif i/XA,,
brackets the square
the brand profile to the center
the absolute contribution of the i" brand to the th prin-
space (i.e., 2,f,). A similar de
cipal axis is obtained. The absolute contributions quan-
attribute category point. The
tify the importance of each point in determining the di-
over all brands (or attribute c
inertia.
rection of the principal axes and serve as guides to
interpretation of each axis. They are interpreted as the
The inertias for each point a
percentage of (weighted) variance explained by each point
"Inertia" in Table 2. Brand A's inertia in the set of brand
in relation to each axis.
points is 40.5% of the total inertia, as is brand B's. Brand
It is clear from the decomposition that a point can con-
C accounts for 19% of the total inertia in this set. At-
tribute to a principal axis (i.e., make a high contribution
tribute category 6+ has an inertia that is 34% of the total
to the inertia of that axis) in two ways: when it has a
inertia in the attribute set of points and accounts for by
large mass and/or when it is a large distance from the
far the largest proportion.
centroid, even if it has relatively low mass.
Absolute Contributions to Inertia Because all the brands have equal mass, it is their dis-
tance from the centroid that determines their contribu-
The inertia along the th axis, A,, consists of the weightedtions to the inertia of each axis. The absolute contribu-
sum of squared distances to the origin of the displayedtions, in the columns headed "Contribution" in Table 2,
row (or column) profiles, where the weights are the massesindicate that brands A and B contribute equally and solely
for each row (or column) point. For the brand profiles,to the direction of axis 1 and brand C contributes pri-
this inertia can be expressed as marily to axis 2.
Similarly for the attributes, categories 6+, 5+, and
(30)
6- define the first principal axis whereas categories 3 +,
, = > rif i.
/ 2+, 4+, and 3- define the second principal axis. At-
tribute categories 1-, 5-, and to a lesser extent 1 + con-
A similar definition holds for the attribute category pro- tribute essentially nothing to the inertia of each axis and
files. Thus, each eigenvalue also represents the inertia consequently are near the origin (note that their profiles
of the projections of the brand set (or attribute category are virtually identical to the average column profile r).
set) of points on each axis.
Relative Contributions to Inertia
If each term in the summation is expressed as a per-
centage relative to the inertia "explained" by each axis, After the dimensional interpretation, the next step in
that is, a correspondence analysis is to determine the "quality"

168.176.5.118 on Wed, 13 Jan 2021 19:21:48 UTC
CORRESPONDENCE ANALYSIS
0 221
of the representation of each point in the display. The

period. For illustrative purposes, the scale used to collec
quantity the information was coded 1 to indicate purchase a
consumption at least every other week and 0 to indicat
(32) f2 f2t purchase and consumption less than every other wee
The data about eight soft drinks from 34 of the studen
were used
gives the relative contribution of the t' principal forto
axis our example. The soft drinks are Cok
Diet Coke,
the inertia of the ith brand. A similar definition Diet
holds Pepsi, Diet 7Up, Pepsi, Sprite, Tab, a
for
the relative contributions of the attribute 7Up. The 34 These
categories. x 8 binary indicator matrix is displayed
Table 3.
values are independent of the point's mass and indicate
A correspondence analysis was performed on the 34
how well each point is "fit" by the representation.
x 16 matrix
A relative contribution is actually a squared obtained by doubling Table 3 about the col-
correla-
tion, because it is equal to the cos' of the angle 0 be- eigenvalues equal their correspond-
umns. The resulting
tween the point and the th principal axis.ing proportions
High values of inertia because the total inertia equals
of cos20 indicate that the axis explains theunity (in this
point's example). The eigenvalues for the first three
inertia
principal
very well; 0 is low and the profile point lies in axes
theare di-
.482, .151, and .099, with cumulative
proportions
rection of the axis and correlates highly with of inertia equaling .482, .633, and .732. Eight
it. Summed
dimensions
over all the axes of interest (in this case two), recover perfectly the 34 x 16 doubled data
the relative
matrix (i.e., ES,This
contributions give the quality of the representation. = 1), hence the other five axes account
is just the cos2 of the angle the point makesfor the remaining
with the 26.8% of the inertia.
In a that
subspace. Thus, the relative contribution gives doubled binary data matrix with q variables, the
part
of the variance of a point explained by anrow axis,
sums equal
and athe
constant value (8 in this case) and the
quality gives the goodness of fit of each point's repre-
sentation in the subspace. The sum of the relative con-
tributions over all axes (not just those used for the dis-Table 3
play) equals unity (as in Table 2). THE 34 X 8 BINARY INDICATOR MATRIX OF BEVERAGE
The relative contributions are in the columns headed PURCHASE AND CONSUMPTION
"Correlation" in Table 2. The first axis explains 88.3%
of the inertia of brands A and B and nothing of brand Soft drink
C, whereas the second axis explains 11.7% of the inertia Diet Diet Diet
of brands A and B and 100% of that of brand C. Sim- Individual Coke Coke Pepsi 7Up Pep. pi Sprite Tab 7Up
ilarly, the first axis explains 51.9% of attribute 1 and the 1 0 0 0 1 0
second axis explains the remaining 48.1%. The relative 2 0 0 0 1 00 0 0
contributions are equal for each attribute category pair 3 0 0 0 1 0 0 00

4 0 1 0 1 0 0 1 0
because the doubling procedure gives each pair of attri-
5 1 0 0 0 1 0 0 0
bute categories equal mass. The qualities of each point 6 1 0 0 0 1 1 0 0
in the two-dimensional space (all equal to unity) are in 7 00 1 1 0 0 1 0
the column headed "Quality." 8 1
0
1 0 0 1 0 1
The various decompositions of the total inertia, in 9 1 0 0 0 1

1
1
10 0 0 0 1 0 00 1
conjunction with the principal coordinate values for the 11 0 0 0 1 0 0
brands and attribute categories, make possible a com- 12 0 1 0 0 0 01 1 0
1
plete interpretation of the correspondence analysis of the 13 0 0 1 1 0 0 1
data in Table 1. As discussed hereafter, external infor- 14
1
0 0 0 0 0 1
1
0 0
15 01 1 0 0 0 0
mation can be fit into the display through the transition
16 00 0 0 1 1 0
formulas in equations 18 and 19 and also can be helpful 17 1
1
1 0 0 0 1 0 0
0
in interpreting correspondence analysis results. Another 18 0 0 1 0 0 0
aid is cluster analysis, which with large data matrices 19 1 0 0 0 0 0 0 1
may be useful in detecting homogeneous groups and in 20 1 1 1 0 1 0 0 0

21 0 0 0 1 0 0 0
presenting results (Jambu and Lebeaux 1983). Whatever 22 0 0 1
0 0 0 0
aids are used, we emphasize that visual inspection of the 1 1
23 01 1 0 1 0 0 1 0
graphical display is a key step in interpreting the results. 24 1 0 0 1 0 0 0
1
25 10 1 1 0 0 0 0
ILLUSTRATING CORRESPONDENCE ANALYSIS 1 1
26 0 1 0 0 1 0
27 0 0 0 0 0
Empirical Example: Beverage Purchase and 28 0 0 O O 0 1
1
Consumption 29 0 0 0 0 0 0
1
30 1 1 0 0 0 1 0
A group of male and female MBA students from31 Co- 1 0 0 0 1 0 0 1
lumbia University were asked to indicate, for a variety
32 1 1 0 0 0 1 0
0 1
of popular soft drinks, the frequency with which 33they 0 0 0 0 1
34 1 1 1 0 0 0
purchased and consumed the soft drinks in a 1-month

168.176.5.118 on Wed, 13 Jan 2021 19:21:48 UTC
row masses equal 1/n. The row profiles define n points Pepsi, separates the colas on the top from the non
in 2q-dimensional space, but q linear dependencies are on the bottom. We label this a "cola/noncola" dim
present among the columns so the dimensionality of the sion. The soft drink locations are based on the pro
rows is really equal to q. of purchase frequency by individuals and provide inf
Figure 2 is the joint display of individuals and soft mation on market segments and market structure.
drinks in the plane defined by the first two principal axes. The relative contributions indicate that the first p
These two axes account for 63.3% of the total inertia in cipal axis explains nearly all the variance in Coke (squ
the data. We chose two axes for display on the basis of correlation = .785) and Tab (squared correlation = .
the interpretability of the dimensions, the desire for par- whereas the second principal axis explains the most v
simony, and a scree plot of the eigenvalues which in- ance in Sprite (squared correlation = .533). Diet Pe
dicated a clear elbow at t = 2. Though the third principal quality in two dimensions (quality = .405) indicat
axis contributes enough to the total inertia to suggest its has the worst fit in the plane defined by the first
interpretation might be worthwhile, we omit plotting it principal axes. However, Coke, Pepsi, and Tab are
to save space. The numerical results are reported in Ta- well by two dimensions (quality = .803, .777, .712,
ble 4 for the soft drinks and Table 5 for the individuals. spectively).
We discuss first the soft drink points. The display in Figure 2 reveals, by their proximitie
The display is interpreted with respect to the "posi- those soft drinks having similar profiles of purchase
tive" (purchase and consumption at least every other week) consumption by the 34 individuals. Similarly, ind
soft drink points. Because each pair of soft drink points uals close together in the display share similar pat
(+ and -) has the same mass (.125), each pair has its of purchase and consumption and thus constitute lik
centroid at the origin of the display and consequently market segments based on purchase behavior. Ind
each is balanced there with the lighter mass point being uals who share the same point in the display have ide
proportionately farther from the origin. For example, tical profiles of soft drink usage.
7Up+ with mass equal to .03 is farther than 7Up- with Two segments of consumers are seen on the "d
mass of .09. side of the first principal axis, those who drink diet
The absolute contributions in Table 4 indicate that the (individuals 12, 27, 15, 30, and 32) and those who d
first principal axis is defined by Tab, Diet 7Up, and Coke diet cola and diet noncola soft drinks (individuals 7, 3
and separates the diet soft drinks (both colas and non- 4, 23, 26, and 25). The absence of any individual p
colas) on the right from the nondiet soft drinks on the in the lower right region of the space indicates that
left. We label this a "diet/nondiet" dimension. The sec- one drinks only diet noncolas.
ond principal axis, defined primarily by Sprite, 7Up, and On the left side of the space, we can identify segm
of individuals who drink both diet and nondiet colas
24, and 20), who drink only nondiet colas (2, 3, 5,
and 22), and who drink both colas and noncolas, sk
Figure 2
either toward colas (e.g., 6, 11, 10, 31, and 33) or
TWO-DIMENSIONAL CORRESPONDENCE ANALYSIS OF
ward noncolas (e.g., 1, 8, 14, 29, and 19).
THE DOUBLED DATA MATRIX CONSTRUCTED FROM The absolute contributions indicate that individuals 13
TABLE 3
and 28 are the primary contributors to the second prin-
cipal axis (contribution = .219 and .116, respectively).
Axis It These contributions are "too large" in the sense that these
(X21-51)
(15.1%) points define, almost solely, the direction of the second
principal axis. Such a situation occurs often in corre-
spondence analysis. A remedy is presented in the Dis-
Pepsi+
18,24 20 Sprite- cussion section.
2,3,5,21,22 7Up-
l
Relationship to Other Multivariate Methods
Coke+ Diet 7Up-
I O
4' 15,30,32
4 7 It is relatively easy to show that correspondence anal-
10,31,33.
13 a
Diet Coke+ Tab+
Tab-
4,2326 734 ysis is related directly
DietPepsi-
Axis I to several familiar multivariate
_-i.-- .-_I l (X, f.482)
6.11
*16
Zs5 -
Diet Pepsi +
(48.2%)
(48.2%) methods, including principal components analysis,4 the
Diet Coke-
Coke- biplot (Gabriel 1971, 1981), canonical correlation anal-
e19
17
.14,29 *
Pepsi- Diet 7Up+
O *
9< 4In fact, the row geometry of the correspondence analysis of a dou-
028
bled binary data matrix is identical to the row geometry of the prin-
cipal components analysis of the column-standardized (to unit vari-
7Up+
13 ance) undoubled data matrix. The row masses and relative distances
Sprite+ between row profiles are equivalent in both (Greenacre 1984; Lebart,
Morineau, and Warwick 1984).

168.176.5.118 on Wed, 13 Jan 2021 19:21:48 UTC
Table 4
DECOMPOSITION OF INERTIA AMONG THE SOFT DRINKS FOR THE FIRST TWO PRINCIPAL AXESa
Axis I Axis 2
Squared Squared
Soft drink Qualityb Mass Inertiac correlation Contributionc correlation ContributionC
Coke+ 809 70 55 785 84 18 6
Coke- 809 55 70 785 120 18 9
Diet Coke+ 664 62 62 614 80 19 8

Diet Coke- 664 62 62 614 80 19 8
Diet Pepsi+ 643 25 100 404 80 1 1

Diet Pepsi- 643 100 25 404 25 1 1
Diet 7Up+ 626 25 100 458 94 49 32

Diet 7Up- 626 100 25 458 24 49 8
Pepsi+ 811 62 62 550 76 227 98

Pepsi- 811 62 62 550 67 227 88
Sprite+ 826 40 85 149 26 533 298

Sprite- 826 85 40 149 13 533 142
Tab+ 751 40 85 712 125 0 0
Tab- 751 85 40 712 60 0 0
7Up+ 727 35 90 181 34 364 221

7Up- 727 90 35 181 12 364 80
'All values are multiplied by 1000 and deci
bMeasured over the first three principal ax
CScaled (before multiplication by 1000) to s
Table 5
DECOMPOSITION OF INERTIA AMONG THE INDIVIDUALS FOR THE FIRST TWO PRINCIPAL AXESa
Axis 1 Axis 2
Squared Squared
Individual Qualityb Mass Inertia' correlation Contribution' correlation Contribution'
1 911 30 27 701 47 198 42
2,3,5,21,22 909 30 18 520 19 384 44
4,23,26 702 30 40 690 55 0 0
6,11 716 30 18 623 30 1 1
7,34 955 30 49 888 90 1 1
8 453 30 27 322 21 130 27

9 463 30 40 0 0 371 91
10,31,33 808 30 27 593 32 1 1
12,27 776 30 27 490 25 34 6
13 864 30 60 42 5 572 219
14,29 621 30 18 248 12 145 21

15,30,32 704 30 40 688 51 16 3
16 300 30 27 162 9 14 2
17 611 30 27 35 2 129 21
18,24 678 30 18 124 4 541 61
19 428 30 27 245 13 85 14
20 444 30 27 0 0 301 55
25 715 30 40 543 49 1 1
28 908 30 27 347 23 560 116
'All values are multiplied b

bMeasured over the first th
'Scaled (before multiplicatio

168.176.5.118 on Wed, 13 Jan 2021 19:21:48 UTC
ysis, and discriminant analysis, through the generalized It is also useful to contrast correspondence analysis
singular value decomposition (see Greenacre 1984, Ap- with multidimensional unfolding, another approach for
pendix A.2). The differences among methods are deter- the joint display of a data matrix. Correspondence anal-
mined by the type of transformation applied to the orig- ysis displays the positions of the rows (or columns) of
inal data matrix, the metrics in which the principal axes the data matrix relative to the set of rows (or columns)
are defined, and how the basic values are assigned to the included in the analysis. This is a consequence of using
left and right basic vectors. In correspondence analysis, profiles, rather than absolute frequencies. Multidimen-
the transformation is defined by equation 11, the metrics sional unfolding methods, however, directly approxi-
are the chi square metrics defined by the inverses of mate the entries in the data matrix, which are assumed
equations 2 and 3, and the basic values are assigned ac- to be row-to-column distances (dissimilarities). In this
cording to equations 12 and 13. case, there is a direct interpretation of the graphical rep-
A distinct advantage of correspondence analysis over resentation in terms of interpoint distances. If the data
other methods, in terms of obtaining a joint graphical can be considered as row-to-column distances, multidi-
display, is that correspondence analysis produces two dual mensional unfolding is an appropriate technique. If the
displays whose row and column geometries have similar data cannot be considered as such, correspondence anal-
interpretations. In other multivariate approaches as they ysis may be the more appropriate method for construct-
are commonly employed, this duality does not exist. ing joint representations.
Tenenhaus and Young (1985) have shown that four
broad data analytic approaches lead to the equations of DISCUSSION
correspondence analysis, the "method of reciprocal av-

Supplementary Points: Fitting External Information
erages" (Fisher 1940; Hirschfeld 1935; Horst 1935;
Richardson and Kuder 1933), the analysis of variance Into the Display
approach (Bock 1960; de Leeuw 1973; Guttman 1941; The transition formulas in equations 18 and 19 pro
Hayashi 1950, 1952, 1954; Nishisato 1980; van Rijck- the means for fitting external information into the g
evorsel and de Leeuw 1978), the principal components ical display from a correspondence analysis. These
analysis (PCA) approach (Benzecri 1969, 1973a, b; Burt plementary points" enrich interpretation of the disp
1950; Greenacre 1978, 1984), and the generalized can- in much the same way that regression procedures ass
onical analysis approach (McKeon 1966). We use the in the interpretation of multidimensional scaling
PCA approach to demonstrate correspondence analysis tions (Kruskal and Wish 1978; Schiffman, Reynolds,
because it illustrates clearly the geometric aspects of the Young 1981).
method. However, the equivalences yield additional Suppose we have information about physical cha
interpretations of the results of a correspondence anal- teristics of the eight soft drinks in the preceding exa
ysis (involving the meaning of the eigenvalues), which and array these data in a characteristics by soft d
illuminate other aspects of the method. matrix. It is possible to consider each row of the ma
The method of reciprocal averages is defined by the as defining a point in the space of the row (indiv
transition formulas, where an individual's (row's) scale profiles of the individuals by soft drinks matrix. Thr
value (principal coordinate) is the mean of the scale val- the use of transition formula 18 we can make a transition
ues of the categories (columns) chosen by that individ- from columns (the soft drinks) to rows (the physical
ual, and the scale value of a category is the mean of the characteristics) to obtain point locations for each char-
scale values of the individuals in that category. This ren- acteristic. Each physical characteristic profile then can
ders more intuitive the "barycentric" nature of the tran- be projected onto the plane defined by the first two prin-
sition formulas. The internal consistency of the scale cipal axes to see which characteristics are associated with
values is maximized and each eigenvalue is a measure which soft drinks. If we had information on the individ-
of the internal consistency of each scaling (i.e., dimen- uals, such as demographic data, we could use transition
sion) induced on the rows and columns. formula 19 and go from rows to columns. In this case,
In the analysis of variance approach, the ratio of the each column of the individuals by demographics matrix
sum of squares between rows (columns) is maximized defines a column profile in the same space as the profiles
and the ratio of the sum of squares within rows (col- of the soft drinks across the individuals. The transition
umns) is minimized relative to the total sum of squares. from rows to columns yields a set of points that can be
The successive squared correlation ratios (the between displayed in the original space, thereby providing infor-
sum of squares relative to the total sum of squares) are mation on the demographic characteristics of the indi-
equivalent to the eigenvalues. viduals.
In the generalized canonical analysis approach, the sum The fitting of supplementary points also can serve as
of the squared correlations between the scaled individ- a validity check (Lebart, Morineau, and Warwick 1984,
uals and scaled variable categories is maximized. This p. 163). Because a supplementary variable makes no
maximized value equals the sum of the eigenvalues and contribution to the axis, its squared correlation (relative
each eigenvalue is the canonical correlation between each contribution) with each principal axis can be examined.
successive joint scaling of the rows and columns. High values indicate good fit into the previously defined

168.176.5.118 on Wed, 13 Jan 2021 19:21:48 UTC
display and imply validation of the variables being in- of dimensionality." There is no method for conclusively
vestigated. determining the appropriate number of and what com-
binations of dimensions to plot and inspect. As with other
Handling Outliers multivariate methods, the researcher must balance par-
Outlier points plague correspondence analysis solu- simony against interpretability in determining the num-
tions. Occasionally, a row (or column) profile point is ber of dimensions to use.
so "rare" (in profile) in its set of points that it has a Finally, it must be recognized that in many ways cor-
major role in determining the higher order principal axes. respondence analysis is a subjective technique. Many
This situation is easily discerned by examining the points' different portrayals of a data set often are possible, lead-
contributions to the axes. When a point has a very large ing to different analysis categories and solutions. By its
absolute contribution and a large principal coordinate on flexibility, correspondence analysis can lead to greater
a major principal axis, it can be considered an outlier. insight into the phenomena being studied because it af-
Two such points are individuals 13 and 28 in the em- fords several different views of the same data set. Sub-
pirical example. These points consume nearly 34% of jectivity of analysis is part of the price of this flexibility.
the inertia on the second principal axis, determining its
orientation to a large degree. The solution lies in rede- Implementation
fining these points as supplementary and performing the A variety of computer programs are available for car-
analysis again without them, permitting them no influ- rying out a correspondence analysis. The SPAD system
ence on the direction of the principal axes. Then the points of FORTRAN programs written by Lebart and Morineau
can be fit a posteriori on the axes calculated for the re- (1982) for mainframe computer systems is particularly
maining points with transition formula 18. applicable to large data sets. A specialized version of
Inspection of the data matrix provides information about this program is described by Lebart, Morineau, and
the nature of the "rarity" of an outlying point. Treating Warwick (1984). Nishisato and Nishisato (1983) have
the point as supplementary allows more detailed study prepared a program that performs correspondence anal-
of the structure of the remaining points whose multi- ysis ("dual scaling") on the IBM PC. The program ac-
variate association is not as readily determined by in- cepts as input up to six different types of data. Greenacre
spection. (1984) presents a simple program to do correspondence
A Caveat
analysis using the high-level programming language
GENSTAT. An extensive collection of computer pro-
Correspondence analysis does have limitations. It is a
grams for correspondence analysis and related tech-
niques is provided by Jambu and Lebeaux (1983). If the
multivariate descriptive statistical method and is not ap-
propriate for hypothesis testing. Other approaches researcher
are has access to a matrix subroutine that per-
better suited to searching for parsimonious models that forms a singular value decomposition, he or she has the
can account for most of the variance in the data, such tools necessary to implement the method. For example,
as weighted least squares (Grizzle, Starmer, and Koch correspondence analysis is programmed easily with the
1969) and loglinear modeling (Bishop, Fienberg, and MATRIX procedure in SAS (SAS Institute 1982).
Holland 1975). Recently, van der Heijden and de Leeuw
(1985) showed that, under certain conditions, corre- Concluding Remarks
spondence analysis can be interpreted in terms of spe- As we present it, correspondence analysis is a method
cific loglinear models. However, statistical tests for cor-
of exploratory data analysis that (1) quantifies multi-
respondence analysis are still being developed; earlier variate categorical data, (2) affords a graphical repre-
tests were shown either theoretically or through simu- sentation of the structure in the data, and (3) does not
lations to be unjustified (Lebart 1976). Nonetheless, cor-
pose stringent measurement requirements. For many ap-
respondence analysis may be helpful in detecting models plications, its use is straightforward and unambiguous.
that merit further consideration by other methods. When complex multivariate relationships are examined,
As discussed before, an important caveat for inter- correspondence analysis is limited only by the research-
preting correspondence analysis results is that the er's be- ingenuity in interpreting the derived spatial map. As
tween-set distances cannot be interpreted. The joint dis-a graphical method of data analysis, correspondence
play of coordinates shows the relationship between a point
analysis is applied best as a multivariate descriptive sta-
from one set and all the points of the other set, not be-tistical technique supplemental to other forms of analy-
sis.
tween individual points from each set. (See Carroll, Green,
and Schaffer 1986 for an alternative scaling of the co-
Correspondence analysis is very flexible. Not only is
ordinates that provides for comparability of all within-
it flexible in terms of data requirements, but it also al-
set and between-set distances.) When the correspon- lows for the incorporation of marketing knowledge. In
dence analysis solution has more than two dimensions,studying a product class, say, the researcher can set masses
of brands equal to the market share or dollar sales of
proximity with one pair of axes may disappear when other
pairs are plotted. each, or perhaps to the percentage of consumers who use
Correspondence analysis also suffers from the "curse
the product in the population. The technique of fitting

168.176.5.118 on Wed, 13 Jan 2021 19:21:48 UTC
supplementary points in the display is an interesting and Gabriel, K. R. (1971), "The Biplot Graphic Display of Ma-
virtually limitless way to incorporate external informa- trices with Application to Principal Component Analysis,"
tion into the analysis. It is also useful as a check on data Biometrika, 58 (December), 453-67.
validity and as a tool for handling troublesome outliers. (1981), "Biplot Display of Multivariate Matrices for
Inspection of Data and Diagnosis," in Interpreting Multi-
Though correspondence analysis has limitations, the most
variate Data, V. Barnett, ed. Chichester: John Wiley & Sons,
important being that between-set distances in the graph- Inc., 147-73.
ical display are not interpretable, its flexibility may ren- Gower, J. C. (1966), "Some Distance Properties of Latent Root
der it more suitable than other methods for marketing and Vector Methods Used in Multivariate Analysis," Bio-
research applications in many situations. metrika, 53 (December), 325-38.
Categorical data are common products of marketing Green, Paul E., with J. Douglas Carroll (1978), Mathematical
research. However, the analysis of such data often is Tools for Applied Multivariate Analysis. New York: Aca-
hindered by the requirements and limitations of many demic Press, Inc.
familiar research tools. Correspondence analysis is a , Vithala R. Rao, and Wayne S. DeSarbo (1978), "In-
versatile and easily implemented analytical method that corporating Group-Level Similarity Judgments in Conjoint
Analysis," Journal of Consumer Research, 5 (December),
can do much to assist researchers in detecting and ex- 187-93.
plaining relationships among complex marketing phe- Green, Robert T., Jean-Paul Leonardi, Jean-Louis Chandon,
nomena.
Isabella C. M. Cunningham, Bronis Verhage, and Alain
REFERENCES Strazzieri (1983), "Societal Development and Family Pur-
chasing Roles: A Cross-National Study," Journal of Con-
Belk, Russell, John Painter, and Richard Semenik (1981), sumer Research, 9 (March), 436-42.
"Preferred Solutions to the Energy Crisis as a Function of
Greenacre, Michael J. (1978), "Some Objective Methods of
Causal Attributions," Journal of Consumer Research, 8
Graphical Display of a Data Matrix" (English translation of
(December), 306-12. 1978 doctoral thesis), Department of Statistics and Opera-
Benzdcri, J. P. (1969), "Statistical Analysis as a Tool to tions MakeResearch, University of South Africa.
Patterns Emerge from Data," in Methodologies of Pattern(1984), Theory and Application of Correspondence
Recognition, S. Watanabe, ed. New York: Academic Press, Analysis. London: Academic Press, Inc.
Inc., 35-74. Grizzle, J. E., C. F. Starmer, and G. G. Koch (1969), "Anal-
et al. (1973a), L'Analyse des Donnees. Vol. I, La Tax- ysis of Categorical Data by Linear Models," Biometrics, 25
inomie. Paris: Dunod. (September), 489-504.
et al. (1973b), L'Analyse des Donnees. Vol. 1I, L'Ana- Guttman, Louis (1941), "The Quantification of a Class of At-
lyse des Correspondances. Paris: Dunod. tributes: A Theory and Method of Scale Construction," in
Bishop, Yvonne M. M., S. E. Fienberg, and P. W. Holland Prediction of Personal Adjustment, The Committee on So-
(1975), Discrete Multivariate Analysis: Theory and Prac- cial Adjustment, ed. New York: Social Science Research
tice. Cambridge, MA: MIT Press. Council, 319-48.
Bock, R. Darrell (1960), "Methods and Applications of Op-Hayashi, C. (1950), "On the Quantification of Qualitative Data
timal Scaling," Laboratory Report No. 25, L. L. Thurstone from the Mathematico-Statistical Point of View," Annals of
Psychometric Laboratory, University of North Carolina, the Institute of Statistical Mathematics, 2 (1), 35-47.
Chapel Hill. (1952), "On the Prediction of Phenomena from Qual-
Burt, C. (1950), "The Factorial Analysis of Qualitative Data," itative Data and the Quantification of Qualitative Data from
British Journal of Psychology (Statistical Section), 3 (No- the Mathematico-Statistical Point of View," Annals of the
vember), 166-85. Institute of Statistical Mathematics, 3 (2), 69-98.
Carroll, J. Douglas, Paul E. Green, and Catherine M. Schaffer (1954), "Multidimensional Quantification-with the
(1986), "Interpoint Distance Comparisons in Correspon- Applications to Analysis of Social Phenomena," Annals of
dence Analysis," Journal of Marketing Research, 23 (Au- the Institute of Statistical Mathematics, 5 (2), 121-43.
gust), 271-80. Heiser, Willem J. (1981), Unfolding Analysis of Proximity Data.
de Leeuw, Jan (1973), Canonical Analysis of Categorical Data, Leiden, The Netherlands: Department of Data Theory, Uni-
unpublished doctoral dissertation, Psychological Institute, versity of Leiden.
University of Leiden, The Netherlands. Hill, M, 0. (1974), "Correspondence Analysis: A Neglected
Eckart, C. and Gale Young (1936), "The Approximation of Multivariate Method," Applied Statistics, 23 (3), 340-54.
One Matrix by Another of Lower Rank," Psychometrika, 1 Hirschfeld, H. 0. (1935), "A Connection Between Correlation
(September), 211-18. and Contingency," Proceedings of the Cambridge Philo-
Fisher, Ronald A. (1940), "The Precision of Discriminant sophical Society, 31 (October), 520-4.
Functions," Annals of Eugenics, 10 (December), 422-9. Holbrook, Morris B., William L. Moore, and Russell S. Wi-
Franke, George R. (1983), "Dual Scaling: A Model for In- ner (1982), "Constructing Joint Spaces from Pick-Any Data:
terpreting and Quantifying Categorical Data," in Research A New Tool for Consumer Analysis," Journal of Consumer
Methods and Causal Modeling in Marketing, W. R. Dar- Research, 9 (June), 99-105.
den, K. B. Monroe, and W. R. Dillon, eds. Chicago: Amer- Horst, Paul (1935), "Measuring Complex Attitudes," Journal
ican Marketing Association, 111-4. of Social Psychology, 6 (3), 369-74.
(1985), "Evaluating Measures Through Data Quanti- Jambu, M. and M-O. Lebeaux (1983), Cluster Analysis and
fication: Applying Dual Scaling to an Advertising Copy- Data Analysis. Amsterdam: North Holland Publishing Com-
test," Journal of Business Research, 13 (February), 61-9. pany.

168.176.5.118 on Wed, 13 Jan 2021 19:21:48 UTC
Kruskal, Joseph B. and Myron Wish (1978), Multidimensional and Wen-Jenn Sheu (1984), "A Note on Dual Scaling
Scaling, Sage University Paper Series on Quantitative Ap- of Successive Categories Data," Psychometrika, 49 (De-
plications in the Social Sciences, 07-011. Beverly Hills, CA: cember), 493-500.
Sage Publications, Inc. Perreault, William D., Jr., and Forrest W. Young (1980), "Al-
Lebart, Ludovic (1976), "The Significancy of Eigenvalues Is- ternating Least Squares Optimal Scaling: Analysis of Non-
sued from Correspondence Analysis," Proceedings in Com- metric Data in Marketing Research," Journal of Marketing
putational Statistics (COMPSTAT). Vienna: Physica Verlag, Research, 17 (February), 1-13.
38-45. Richardson, M. and G. F. Kuder (1933), "Making a Rating
and Alain Morineau (1982), "SPAD: A System of Scale That Measures," Personnel Journal, 12 (June), 36-
FORTRAN Programs for Correspondence Analysis," Jour- 40.
nal of Marketing Research, 19 (November), 608-9. SAS Institute (1982), SAS User's Guide: Statistics. Cary, NC:
, , and Kenneth M. Warwick (1984), Multi- SAS Institute Inc.
variate Descriptive Statistical Analysis: CorrespondenceSchiffman, Susan S., M. Lance Reynolds, and Forrest W.
Analysis and Related Techniques for Large Matrices. New Young (1981), Introduction to Multidimensional Scaling. New
York: John Wiley & Sons, Inc. York: Academic Press, Inc.
Levine, Joel H. (1979), "Joint-Space Analysis of 'Pick-Any' Seber, G. A. F. (1984), Multivariate Observations. New York:
Data: Analysis of Choices from an Unconstrained Set of Al- John Wiley & Sons, Inc.
ternatives," Psychometrika, 44 (March), 85-92. Tenenhaus, Michel and Forrest W. Young (1985), "An Anal-
Marc, Marcel (1973), "Some Practical Uses of 'The Factorialysis and Synthesis of Multiple Correspondence Analysis,
Analysis of Correspondence,'" European Research, 1 (July),Optimal Scaling, Dual Scaling, Homogeneity Analysis and
2-8. Other Methods for Quantifying Categorical Multivariate
McKeon, J. J. (1966), "Canonical Analysis: Some Relations Data," Psychometrika, 50 (March), 91-119.
Between Canonical Correlation, Factor Analysis, Discrim- Tukey, John W. (1977), Exploratory Data Analysis. Reading,
inant Function Analysis and Scaling Theory," Psychome- MA: Addison-Wesley Publishing Company, Inc.
trika, Monograph No. 13. van der Heijden, Peter G. M. and Jan de Leeuw (1985), "Cor-
Nishisato, Shizuhiko (1980), Analysis of Categorical Data: Dual respondence Analysis Used Complementary to Loglinear
Scaling and Its Applications. Toronto: University of Toronto Analysis," Psychometrika, 50 (December), 429-47.
Press. van Rijckevorsel, Jan and Jan de Leeuw (1978), "An Outline
and Ira Nishisato (1983), An Introduction to Dual to HOMALS-1," Department of Data Theory, Faculty of
Scaling, 1st ed. Islington, Ontario: MicroStats. Social Sciences, University of Leiden, The Netherlands.
JMR REPRINT POLICY AND PROCEDURE
All reprints are sold according to the following price schedule, in minim
50 copies.
50 copies with covers $ 75.00

100 copies with covers 125.00
(multiples of 100 may be ordered)
There is NO RETURN and NO EXCHANGE on reprints.

Under the "fair use" provision of the new copyright law taking effect January 19
anyone may make a photocopy of a copyrighted article for his or her own use with
seeking permission. Also, a single copy reprint or an order of less than 50 copies m
be obtained from University Microfilms International, 300 N. Zeeb Road, Ann Arb
MI 48106. Articles are priced prepaid at $6.00 plus $1.00 for each additional copy of
the same article. Complete issues are obtainable at 10? per page, minimum order $10.00.
To obtain permission to reproduce one's own reprints in quantity, please contact th

Permissions Department, American Marketing Association, 250 S. Wacker Drive, Chica
IL 60606.

168.176.5.118 on Wed, 13 Jan 2021 19:21:48 UTC

This Content Downloaded From 168.176.5.118 On Wed, 13 Jan 2021 19:21:48 UTC

Caricato da

Informazioni sul documento

Titolo originale

Copyright

Formati disponibili

Condividi questo documento

Condividi o incorpora il documento

Opzioni di condivisione

Hai trovato utile questo documento?

Questo contenuto è inappropriato?

Copyright:

Formati disponibili

This Content Downloaded From 168.176.5.118 On Wed, 13 Jan 2021 19:21:48 UTC

Caricato da

Copyright:

Formati disponibili

Correspondence Analysis: Graphical Representation of Categorical Data in Marketing

Stable URL: https://www.jstor.org/stable/3151480

This content downloaded from

Correspondence analysis is an exploratory data analysis technique for the graph-

Correspondence Analysis: Graphical

Journal of Marketing Research

This content downloaded from

This content downloaded from

'All entries are proportions. Decimal points are omitted.

correspondence analysisWhen usingthe q variables

This content downloaded from

umn profiles in n-dimensional space are written in the

This content downloaded from

(21) dI, = E 1/cj(p/r' - pij/ri,)2

with corresponding basic values of unity. That is, without centering,

ity." Because the GSVD of P - rc' is "contained" in the GSVD of

This content downloaded from

2. The variables must be "homogeneous," so that it makes

and so on, from respondents (i.e., "pick-any" data). With

This content downloaded from

Figure 1 From Figure 1 we see that brand

This content downloaded from

1+ (yes) 1000 44 10 -50 519 9 40 481 21

2+ 1000 34 110 -100 206 32 -20 794 307

3+ 1000 42 130 120 289 51 -180 711 316

4+ 1000 41 60 100 429 37 110 571 124

5+ 1000 24 123 -270 1000 172 0 0 0

6+ 1000 40 340 360 999 480 -10 1 1

'All values are multiplied by 1000 an

This content downloaded from

of the representation of each point in the display. The

contributions are equal for each attribute category pair 3 0 0 0 1 0 0 00

The various decompositions of the total inertia, in 9 1 0 0 0 1

may be useful in detecting homogeneous groups and in 20 1 1 1 0 1 0 0 0

This content downloaded from

This content downloaded from

Diet Coke+ 664 62 62 614 80 19 8

Diet Pepsi+ 643 25 100 404 80 1 1

Diet 7Up+ 626 25 100 458 94 49 32

Pepsi+ 811 62 62 550 76 227 98

Sprite+ 826 40 85 149 26 533 298

7Up+ 727 35 90 181 34 364 221

8 453 30 27 322 21 130 27

14,29 621 30 18 248 12 145 21

'All values are multiplied b

This content downloaded from

correspondence analysis, the "method of reciprocal av-

This content downloaded from

This content downloaded from

This content downloaded from

JMR REPRINT POLICY AND PROCEDURE

50 copies with covers $ 75.00

There is NO RETURN and NO EXCHANGE on reprints.

To obtain permission to reproduce one's own reprints in quantity, please contact th

This content downloaded from

Potrebbero piacerti anche