Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Joel Michell*
Department of Psychology, University of Sydney, Sydney NSW 2006, Australia
(i) a = h,
(ii) there exists c m Q such that a = h-\-c,
(iii) there exists c \i\ Q such that h = a-\-c;
2. A magnitude entirely composed of two discrete parts is the same regardless ofthe
order of composition, i.e. for any a and b'\nQ, a-\-h = h-Va;
3. A magnitude which is a part of a part of another magnitude is also a part of that
same magnitude, the latter relation being unaffected in any way by the former, i.e.
for any a, b and c \n Q, a-\- {b -\- c) = {a-^ b)-\- c\
4. For each pair of different magnitudes of the same quantity there exists another
between them, i.e. for any a and b in Q such that a> b, there exists c in Q, such
that a> c > b; and
5. Given any two sets of magnitudes, an 'upper' set and a 'lower' set, such that each
magnitude belongs to either set but none to both and each magnitude in the upper
set is greater than any in the lower, there must exist a magnitude no greater than
any in the upper set and no less than any in the lower, i.e. every non-empty subset
oiQ that has an upper bound has a least upper bound.
Note that one magnitude is greater than another if and only if the latter is a part
of the former, i.e. for any a and b m Q, a > b \£ and only if (ii) above is true.
Conditions 4 and 5 ensure the density and continuity, respectively, of the quantity,
which intuitively may, thus, be thought of as containing no gaps in the sequence of
its magnitudes.
Some words of caution should be added about the use ofthe mathematical symbol,
' -f-', in the above conditions. Readers will be most familiar with the use of this
symbol in arithmetic contexts, where the terms added are numbers. My first warning,
then, is that in the above conditions the addition is not of numbers but of magnitudes
of a quantity (e.g. specific lengths, say). My second warning is this: ' + ' is often
understood as a mathematical operation and this interpretation, when applied to
magnitudes, has sometimes (e.g. by Campbell, 1920, 1928) been understood as
requiring an empirical operation of concatenation (i.e. an operation of putting
magnitudes together in some way). Such an interpretation is not intended here and
to forestall it I recommend the alternative of interpreting a-\-b = c as a relation
between the magnitudes a, b and c. The relation I have in mind is this: magnitude
c is entirely composed of discrete parts, magnitudes a and b. This interpretation is
suggested by Bostock (1979). The point of making this distinction is that just because
magnitudes stand in this relation, it does not follow that suitable operations of either
concatenation or division will obtain for objects possessing the magnitudes so
related. This may be so, as with length and other convenient quantities, or it may not,
as with density or temperature. That is, the additive relation between magnitudes is
a theoretical one and how we gain access to it may often be indirect.
358 Joel Michell
1.4 The two tasks of quantification: The scientific and the instrumental
Establishing a quantitative science involves two tasks. First, there is the logically
prior scientific one of experimentally investigating the hypothesis that the relevant
attribute is quantitative. Second, there is the instrumental task of devising procedures
to measure magnitudes of the attribute shown to be quantitative. Failure to
investigate the scientific task prior to working upon the instrumental one and failure
to confirm the hypothesis that the relevant attribute is quantitative means that
treating the proposed measurement procedures as if they really are measurement
procedures is at best speculation and, at worst, a pretence at science.
2. Measurerhent in psychology
2.1 The measurement of psychological attributes and the commitment to quantitative
structure
Even a superficial perusal of relevant psychological publications reveals that
psychologists believe that they are able to measure many distinctly psychological
attributes, such as cognitive abilities, personality traits, social attitudes and sensory
intensities. These attributes are distinctly psychological in the sense that they form
part of psychology's subject matter and, also, in the sense that they do not belong to
the network of quantitative attributes measurable using the methods of the physical
sciences (see, for example, Jerrard & McNeill, 1992; Sena, 1972). While these
psychological attributes do not form part of this network, it is clear that quantitative
psychology was first modelled upon quantitative physics (Fechner, 1860). That is, in
both disciplines alike, certain attributes are supposed to have quantitative structure.
intellectual task was due to a combination of the level of general ability and the level
ofthe ability specific to that task, conceived of general ability and the various specific
abilities as quantitative attributes and proposed a functional quantitative relation
between these quantities and test scores. Fechner's and Spearman's quantitative
speculations provided the model for many later developments, for example, those by
Thurstone (1938), Hull (1943), Stevens (1956), Cattell (1943) and others. In every
case, the above concept of continuous quantity was necessarily presumed, although
it must be said that this was not often explicitly acknowledged. It was presumed,
however, as a necessary concomitant of quantitative theorizing. Hence, presumed
along with it, as part of the same conceptual package, was the traditional concept of
scientific measurement.
3.3 Its relation to the concept of quantity and to the consequent scientific task of
quantification
ah ag f k j
IF ag - Eh — Hi -
AND af -
THEN af - ah = ai - ak
Figure 1. If the component distances are additive then, given the two antecedent conditions, the
consequent condition follows.
The fact that it is in principle possible for a participant's judgments to violate this
prediction and, hence, falsify a consequence of additivity, means that this is an
empirical issue. It must be investigated experimentally before any claim to scientific
measurement is justified. The logic of such tests was not known to Fechner and,
indeed, was not generally accessible in the psychological literature until Suppes &
Zinnes (1963). However, as already argued, Fechner's mind was effectively closed to
the possibility of such empirical tests of additivity and the extent of this closure is
revealed in his comment in relation to the Plateau-Delboeuf method that' We simply
call a total difference twice as large as each of two equal partial differences of which
it is composed' (Fechner, 1887, p. 214).
equivalence. For Stevens, 'measurement is possible in the first place only because
there is a kind of isomorphism between (1) empirical relations among objects and
events and (2) the properties of...' numerical systems (Stevens, 1951, p. 1). From this
starting point he developed his theory of the four possible types of measurement
scales (nominal, ordinal, interval and ratio) (he later, 1959, added the log-interval
scale) and the associated doctrine of permissible statistics (see Michell, 1986).
This layer of his reconstruction was a masterstroke because it at once disarmed
Campbell and his associates on the Ferguson Committee of their most powerful
weapon. Their criticism of psychophysical measurement, that it was not based upon
the demonstration of any relevant additive relation between sensory intensities, was
made to look as if it depended upon an unnecessarily restrictive version of the
representational theory of measurement. However, this variety of liberalized
representationalism also posed a threat to psychological measurement and, especially,
to Stevens' psychophysical methods. If measurement involves the numerical
representation of empirical relational structures and such structures are understood
realistically (i.e. as structures existing independently of the scientific observer), then
measurement still requires a logically prior scientific stage in which the hypothesis is
tested that relations of the required kind hold within a particular empirical domain.
Even the humble nominal scale would require the demonstration of a reflexive,
transitive, and symmetric empirical relation of equivalence (or sameness with respect
to some attribute) and for most putative instances of psychological measurement not
even this much scientific work had been done. Hence, the second layer of Stevens'
reconstruction required a repudiation of such a realist interpretation of the empirical
structures numerically represented in measurement.
If Stevens' first layer was a masterstroke, his second was audaciously bold, for he
replaced the natural scientific attitude of realism with a form of relativistic
subjectivism. The 1930s had been a time of ferment within the philosophy of science,
with the newer views of logical positivism and operationism challenging older
positions (see Passmore, 1957). Stevens adopted both operationism (Hardcastle,
1995) and logical positivism (Stevens, 1939). His representational theory of
measurement, with its emphasis upon the numerical representation of directly
observable empirical relations and its formalist conception of numerical systems was
essentially positivistic. His bolder vision, however, was to construct an operational
interpretation of the representational theory. He took Bridgman's (1927) opera-
tionism, according to which the meaning of a concept is synonymous with the
operations used to identify it and applied it to psychology (Stevens, 1935a, b, \9?)(>a, b,
1939), agreeing with Bridgman that 'the meanings ofour words can never transcend
the operations which went into their determination' (1936a, p. 93). Operationism has
more in common with Berkeley's ideahsm than with empirical realism (Michell,
1990) and this is not a judgment with which Bridgman would have disagreed (see
Bridgman, 1950) and Stevens (1936a) also endorsed its implied subjectivism. What
is of interest here, however, is the use to which Stevens put Bridgman's philosophy.
In the first place, operationism was woven by Stevens into an elaborate, self-
serving, philosophy. Across a number of papers in the 1930s, he argued (1) that
because operationism implies a relativity to the observer in all science, psychology is
the 'propaedeutic science', the study of the observer (1936a); (2) all scientific
Quantitative science and psychology 371
operations are reducible to the operation of sensory discrimination (1935a); (3) the
operational methods of psychophysics are central to the study of sensory
discrimination (1935^); and (4) the scaling methods advanced by Stevens allow the
central question of psychophysics to be decisively answered (1936^). Operationism
enabled Stevens to believe that his research was the rock upon which all science
stood.
Secondly, operationism enabled him to believe that his psychophysical methods
yielded ratio scale measurement of the intensity of sensations without research into
the scientific task of quantification being necessary. If the meaning of a scientific
concept is given by the operations (i.e. procedures) used to identify it, then it follows
that the empirical relations numerically represented in measurement must hkewise be
defined by the operations used to identify them (Bergmann & Spence, 1944). Thus,
for Stevens, it could just as truthfully be said that measurement is possible because
of an isomorphism between empirical relations and numerical ones, as because of an
'isomorphism between the formal system and empirical operations' (Stevens, 1951,
p. 23, my italics). The point here is that ordinarily a distinction is made between a
relation into which things enter (e.g. the relation of object x being heavier than
object j ) and an operation or procedure used to identify such a relation (e.g. by
placing X andj in the pans of a balance and observing that the arm of the balance
supporting X tilts down). The fact that x is heavier thanj would normally be thought
of as a necessary condition for this outcome of the operation. However, because of his
operationism, Stevens would confuse two such facts, asserting that the operation
with its outcome is all that is meant by asserting that the relation holds. If ordinal
numerical assignments were made to x andj they could, on Stevens' view, be taken
to represent the above operation as easily as the above relation. When the operation
itself involves making numerical assignments (as, say, with Stevens' psychophysical
methods or with mental testing), these assignments may be taken, according to this
operationist logic, to both define the relation represented and to represent it. It is
only if this point is grasped that one can appreciate the conceptual unity between
Stevens' definition of measurement as the assignment of numerals to objects or events
according to rule and his representationahsm.
Given operationism, any rule for assigning numerals to objects or events could be
taken as providing a numerical representation of at least the equivalence relation
operationally defined by the rule itself. Hence, any rule for making numerical
assignments always defines at least a nominal scale, according to Stevens' view. This
is why Stevens was able to rephrase his definition as ' the assignment of numerals to
objects or events according to rule—any rule' (1959, p. 19) without feeling that he
had shifted his ground one inch. Of course, to the realist, this way of thinking is
viciously circular and there appears to be an hiatus between Stevens' definition of and
theory of measurement (Michell, 1986). This is because the realist views the relations
represented in measurement as having an existence independent of human
observations or operations.
When this operational way of thinking was applied to his psychophysical methods
it gave Stevens what he wanted. In the first place, it enabled him to reject the concept
of' private or inner experience for the simple reason that an operation for penetrating
privacy is self-contradictory' (1936a, p. 95) and to conclude that what psycho-
372 foel Michell
physicists had hitherto thought of as 'a subjective scale is a scale of response' (1936^,
p. 407). Then, further applying operationism, the relation of one tone's sounding half
as loud, say, as another will be defined by the operation used to determine it, i.e. by
a subject judging it to be so. Thus, Stevens was able to believe that'... the response
ofthe observer who says "this is half as loud as that" is one which, for the purpose
of erecting a subjective scale, can be accepted at its face value' (1936^, p. 407). T"hen
it follows, he thought, that such a scale is additive because 'With such a scale the
operation of addition consists of changing the stimulus until the observer gives a
particular response which indicates that a given relation of magnitudes has been
achieved' (1936^, p. 407). According to Stevens, an additive (or ratio) scale is
obtained because the person is both instructed to judge and taken to be judging
additive or numerical relations. In this case, the operation by which the numerical
assignments are made was taken by Stevens to define the ratio scale that he believed
was produced.
engaging the scientific task, especially in psychophysics (e.g. Beck & Shaw, 1967;
Gage, 1934^,^; Garner, 1954; Gigerenzer & Strube, 1983; Levelt, Riemersma &
Bunt, 1972; Reese, 1943; Zwislocki, 1983). Here is not the place to review this work
but, in general, its thrust does not warrant its consistent neglect by those endorsing
Stevens' definition of measurement and advocating psychological methods as
measurement procedures.
(Russell, 1920; as quoted by Stevens, 1951, p. 36). Thus, the interested empirical
psychologist would have come to see that Stevens' definition of measurement is
nonsense and the neglect of quantitative structure a serious omission by quantitative
psychologists. Of course, some psychologists did follow something like this route
and periodically critiques of Stevens' definition were published (e.g. Ross, 1964;
Rozeboom, 1966). If some psychologists were able to travel this route, why not all?
The widespread acceptance of Stevens' definition within quantitative psychology
created an intellectual environment in which measurement seemed easily attainable
without travelling this route. This prevented the recognition of these otherwise
evident facts. That is, we are dealing with a case of thought disorder, rather than one
of simple ignorance or error and, in this instance, these states are sustained
systemically by the almost universal adherence to Stevens' definition and the almost
total neglect of any other in the relevant methodology textbooks and courses offered
to students.
The conclusion that follows from this history, especially that of the last five
decades, is that systemic structures within psychology prevent the vast majority of
quantitative psychologists from seeing the true nature of scientific measurement, in
particular the empirical conditions necessary for measurement. As a consequence,
number-generating procedures are consistently thought of as measurement pro-
cedures in the absence of any evidence that the relevant psychological attributes are
quantitative. Hence, within modern psychology a situation exists which is accurately
described as systemically sustained methodological thought disorder.
References
Adams, E. W. & Fagot, R. F. (1959). A model of riskless choice. Behavioral Science, 4, 1-10.
Adams, H. F. (1931). Measurement in psychology. Journal of Applied Psychology, 15, 545-554.
Adler, H. E. (1980). Vicissitudes of Fechnerian psychophysics in America. In R. W. Rieber & K.
Salzinger (Eds), Psychology: Theoretical-historical Perspectives, pp. 11-23. New York: Academic Press.
378 Joel Michell
Aiken, L. S., West, S. G., Sechrest, L. & Reno, R. R. (1990). Graduate training in statistics,
methodology, and measurement in psychology. American Psychologist, 45, 721-734.
Anderson, J. (1962). Studies in Empirical Philosophy. Sydney: Angus & Robertson.
Bartlett, R. J. (1940). Measurement in psychology. Advancement of Science, 1, 422-441.
Beck, J. & Shaw, W. A. (1967). Ratio-estimations of loudness-intervals. American Journal of Psychology,
80, 59-65.
Beckwith, T. G. & Buck, N. L. (1961). Mechanical Measurements. Reading, MA: Addison-Wesley.
Behrend, F. A. (1953). A system of independent axioms for magnitudes. Journal and Proceedings of the
Royal Society of NSW, 87, 27-30.
Behrend, F. A. (1956). A contribution to the theory of magnitudes and the foundations of analysis.
Mathematische Zeitschrift, 63, 345-362.
Bergmann, G. & Spence, K. W. (1944). The logic of psychophysical measurement. Psychological Review,
51, 1-24.
Birkhoff, G. & MacLane, S. (1965). A Survey of Modern Algebra. New York: MacmiUan.
Boring, E. G. (1920). The logic ofthe normal law of error in mental measurement. American Journal of
Psychology, 31, 1-33.
Boring, E. G. (1921). The stimulus-error. American Journal of Psychology, 32, 449-471.
Bostock, D. (1979). Logic and Arithmetic, vol. 2, Rational and Irrational Numbers. Oxford: Clarendon
Press.
Bridgman, P. W. (1927). The Logic of Modern Physics. New York: MacmiUan.
Bridgman, P. W. (1950). Refiections of a Physicist. New York: Philosophical Library.
Brower, D. (1949). The problem of quantification in psychological science. Psychological Review, 56
325-333.
Brown, J. (1992). The Definition of a Profession: The Authority of Metaphor in the History of Intelligence
Testing, 1890-1930. Princeton, NJ: Princeton University Press.
Brown, J, F. (1934). A methodological consideration ofthe problem of psychometrics. Erkenntnis, 4,
46-61.
Brown, W. (1913). Are the intensity differences of sensation quantitative? IV. British Journal of Psychology,
6, 184-189.
Burnet, J. (1955). Greek Philosophy: Thales to Plato. London: MacmiUan.
Campbell, N. R. (1920). Physics, The Elements. Cambridge: Cambridge University Press.
Campbell, N. R. (1928). An Account ofthe Principles of Measurement and Calculation. London: Longmans,
Green.
Cattell, J. McK. (1890). Mental tests and measurements. Mind, 15, 373-380.
Cattell, J. McK. (1893). Mental measurement. Philosophical Review, 2, 316-332.
Cattell, J. McK. (1904). The conceptions and methods of psychology. Popular Science Monthly, pp.
176-186.
Cattell, R. B. (1943). The measurement of adult intelligence. Psychological Bulletin, 40, 153-193.
Cliff, N. (1992). Abstract measurement theory and the revolution that never happened. Psychological
Science, 3, 186-190.
Clifford, G. J. (1968). Edward L. Thorndike: The Sane Positivist. Middletown, CT: Wesleyan University
Press.
Clifford, W. K. (1882). Mathematical Papers. London: MacmiUan.
Comrey, A. L. (1950). An operational approach to some problems in psychological measurement.
Psychological Review, 57, 217-228.
Cook, A. (1994). The Observational Foundations of Physics. Cambridge: Cambridge University Press.
Coombs, C. H. (1950). Psychological scaling without a unit of measurement. Psychological Review, 57,
145-158.
Coombs, C. H., Raiffa, H. & Thrall, R. M. (1954). Some views on mathematical models and
measurement theory. Psychological Review, 61, 132-144.
Crombie, A. C. (1994). Styles of Scientific Thinking in the European Tradition. London: Duckworth.
Cureton, E. E. (1946). Quantitative psychology as a rational science. Psychometrika, 11, 191-196.
Dawes Hicks, G. (1913). Are the intensity differences of sensation quantitative? IL British Journal of
Psychology, 6, 155-174.
Quantitative science and psychology 379
Dedekind, R. (1872). Stetigkeit und Irrational^ahlen. Braunschweig: Vieweg.
Debreu, G. (1960). Topological methods in cardinal utility theory. In K. J. Arrow, S. Karlin & P.
Suppes (Eds), Mathematical Methods in the Social Sciences, 1959, pp. 16-26. Stanford, CA: Stanford
University Press.
Ellis, B. (1966). Basic Concepts of Measurement. Cambridge: Cambridge University Press.
Fechner, G. T. (1860). Elemente derpsychophysik. Leipzig: Breitkopf & Hartel. (English translation by
H. E. Adler, Elements of Psychophysics, vol. 1, D. H. Howes & E. G. Boring (Eds). New York: Holt,
Rinehart & Winston.)
Fechner, G. T. (1887). Uber die psychischen Massprincipien und das Weber'sche Gesetz. Philosophische
Studien, 4, 161-230. (English translation of pp. 178-198 by S. Scheerer (1987). My own viewpoint on
mental measurement. Psychological Research, 49, 213-219.)
Ferguson, A., Myers, C. S., Bartlett, R. J., Banister, H., Bartlett, F. C, Brown, W., Campbell, N. R.,
Craik, K. J. W., Drever, J., Guild, J., Houstoun, R. A., Irwin, J. O., Kaye, G. W. C, Philpott, S.
J. F., Richardson, L. F., Shaxby, J. H., Smith, T., Thouless, R. H. & Tucker, W. S. (1940). Final
report of the committee appointed to consider and report upon the possibility of quantitative
estimates of sensory events. Report of the British Association for the Advancement of Science, 2, 331-349.
Feyerabend, P. (1975). Against Method: Outline of an Anarchistic Theory of Knowledge. London: New Left
Books.
Fraser, C. O. (1980). Measurement in psychology. British Journal of Psychology, 71, 23-34.
Freyd, M. (1926). What is applied psychology? Psychological Review, 33, 308-314.
Gage, F. H. (1934^). An experimental investigation of the measurability of auditory sensation.
Proceedings of the Royal Society, Series B, Biological Sciences, 116, 103-122.
Gage, F. H. (1934^). An experimental investigation ofthe measurability of visual sensation. Proceedings
ofthe Royal Society, Series B, Biological Sciences, 116, 123-138.
Garner, W. R. (1954). Context effects and the validity of loudness scales. Journal of Experimental
Psychology, 48, 218-224.
Gigerenzer, G. & Strube, G. (1983). Are there limits to binaural additivity of loudness? Journal of
Experimental Psychology, Human Perception and Performance, 9, 126—130.
Green, B. F. (1954). Attitude measurement. In G. Lindzey (Ed.), Handbook of Social Psychology, vol. 1,
pp. 335-369. Reading, MA: Addison-Wesley.
Guilford, J. P. (1954). Psychometric Methods. New York: McGraw-Hill.
Gulliksen, H. (1946). Paired comparisons and the logic of measurement. Psychological Review, 53,
199-213.
Hacking, I. (1983). Representing and Intervening. Cambridge: Cambridge University Press.
Hardcastle, G. L. (1995). S. S. Stevens and the origins of operationism. Philosophy of Science, 62, 404-424.
Herbart, J. F. (1816). Lehrbuch z'"' Psychologie. (English translation by M. K. Smith, 1897, A Textbook
in Psychology. New York: Appleton.)
Holder, O. (1901). Die Axiome der Quantitat und die Lehre vom Mass. Berichte iiber die Verhandlungen
der Koniglich Sachsischen Gesellschaft der Wissenschaften•s^uLeipzig, Mathematisch-Physische Klasse, 53, 1-46.
Hornstein, G. A. (1988). Quantifying psychological phenomena: Debates, dilemmas, and implications.
In J. G. Morawski (Ed.), The Rise of Experimentation in American Psychology. New Haven, CT: Yale
University Press.
Hull, C. L. (1943). Principles of Behavior. New York: Appleton-Century-Crofts.
James, W. (1890). The Principles of Psychology, vol. 1. London: MacmiUan.
Jerrard, H. G. & McNeill, D. B. (1992). Dictionary of Scientific Units. London: Chapman & Hall.
Johnson, H. M. (1936). Pseudo-mathematics in the mental and social sciences. American Journal of
Psychology, 48, 342-351.
Kant, I. (1786). Metaphysical Foundations of Natural Science. (J. EUington transl., 1970.) Indianapolis, IN:
Bobbs-Merrill.
Krantz, D. H. (1972). A theory of magnitude estimation and cross-modaUty matching. Journal of
Mathematical Psychology, 9, 168-199.
Krantz, D. H., Luce, R. D., Suppes, P. & Tversky, A. (1971). Foundations of Measurement, vol. 1. New
York: Academic Press.
Kuhn, T. (1970). The Structure of Scientific Revolutions. Chicago, IL: University of Chicago Press.
380 Joel Michell
Levelt, W. J. M.,. Riemersma, J. B. & Bunt, A. A. (1972). Binaural additivity in loudness. British Journal
of Mathematical and Statistical Psychology, 25, 1-68.
Lord, F. M. & Novick, M. R. (1968). Statistical Theories of Mental Test Scores. Reading, MA: Addison-
Wesley.
Lorge, I. (1951). The fundamental nature of measurement. In F. Lindquist (Ed.), Educational
Measurement, pp. 533-559. Washington, DC: American Council of Education.
Luce, R. D. (1987). Measurement structures with Archimedean ordered translation groups. Order 4
165-189.
Luce, R. D. (1990). 'On the possible psychophysical laws' revisited: Remarks on cross-modality
matching. Psychological Review, 97, 6()-ll.
Luce, R. D., Krantz, D. H., Suppes, P. & Tversky, A. (1990). Foundations of Measurement, vol. 3. San
Diego, CA: Academic Press.
Luce, R. D. & Tukey, J. W. (1964). Simultaneous conjoint measurement: A new type of fundamental
mc^suKment. Journal of Mathematical Psychology, 1, 1-27.
McCormack, T. J. (1922). A critique of mental measurements. School and Society, 15, 686-692.
Massey, B. S. (1986). Measures in Science and Engineering: Their Expression, Relation and Interpretation
Chichester: Ellis Horwood.
Maxwell, J. C. (1891). A Treatise on Electricity and Magnetism. London: Constable.
Meier, S. T. (1994). The Chronic Crisis in Psychological Measurement and Assessment: A Historical Survey. S
Diego, CA: Academic Press.
Michell, J. (1986). Measurement scales and statistics: A clash of paradigms. Psychological Bulletin 100
398-407.
Michell, J. (1990). An Introduction to the Logic of Psychological Measurement. HiUsdale, NJ: Erlbaum.
Michell, J. (1993). The origins ofthe representational theory of measurement: Helmholtz, Holder, and
Russell. Studies in History and Philosophy of Science, 24, 185-206.
Michell, J. (1994). Numbers as quantitative relations and the traditional theory of measurement. British
Journal for the Philosophy of Science, 45, 389-406.
Michell, J. (in press). Bertrand Russell's 1897 critique ofthe traditional theory of measurement. Synthese.
Mundy, B. (1987). The metaphysics of quantity. Philosophical Studies, 51, 29-54.
Myers, C. S. (1913). Are the intensity differences of sensation quantitative? I. British Journal of Psychology
6, 137-154.
Nafe, J. P. (1942). Toward the quantification of psychology. Psychological Review, 49, 1-18.
Nagel, E. (1932). Measurement. Erkenntnis, 2, 313-333.
Narens, L. (1985). Abstract Measurement Theory. Cambridge, MA: MIT Press.
Narens, L. (1996). A theory of ratio magnitude estimation. Journal of Mathematical Psychology 40, 109-129.
Narens, L. & Luce, R. D. (1993). Further comments on the "nonrevolution" arising from axiomatic
measurement theory. Psychological Science, 4, 127-130.
Newman, E. B. (1974). On the origin of 'scales of measurement'. In H. R. Moskowitz et at. (Eds)
Sensation and Measurement, pp. 137-145. Dordrecht, Holland: Reidel.
Passmore, J. (1957). A Hundred Years of Philosophy. London: Duckworth.
Perloff, R. (1950). A note on Brower's 'the problem of quantification in psychological science'.
Psychological Review, 57, 188-192.
Reese, T. W. (1943). The application of the theory of physical measurement to the measurement of
psychological magnitudes, with three experimental examples. Psychological Monographs, 55, 1-89.
Rosnow, R. L. & Rosenthal, R. (1989). Statistical procedures and the justification of knowledge in
psychological science. American Psychologist, 44, 1276-1284.
Ross, S. (1964). Logical Foundations of Psychological Measurement. Copenhagen: Munksgaard.
Rozeboom, W. W. (1966). Scaling theory and the nature of measurement. Synthese, 16, 170-233.
Russell, B. (1897). On the relations of number and quantity. Mind, 6, 326-341.
Russell, B. (1903). Principles of Mathematics. Cambridge: Cambridge University Press.
Russell, B. (1920). Introduction to Mathematical Philosophy. New York: MacmiUan.
Sena, L. A. (1972). Units of Physical Quantities and their Dimensions. Moscow: Mir.
South, E. B. (1937). An Index of Periodical Literature on Testing, 1921-1936. New York: The
Psychological Corporation.
Spearman, C. (1904). General intelligence, objectively determined and measured. American Journal of
Psychology, 15, 201-293.
Quantitative science and psychology 381
Appendix I
The survey of texts was done in the Psychology Library at the Katholieke Universiteit Leuven, Belgium
in June 1995. The books surveyed were:
Allen, M. J. & Yen, W. M. (1979). Introduction to Measurement Theory. Monterey, CA: Brooks/Cole.
Andreas, B. G. (1960). Experimental Psychology. New York: Wiley.
Black, J. A. & Champion, D. J. (1976). Methods and Issues in Social Research. New York: Wiley.
Borgatta, E. F. & Bohrnstedt, G. W. (1981). Levels of measurement once over again. In G. W.
Bohrnstedt & E. F. Borgatta (Eds), Social Measurement. Beverly Hills, CA: Sage.
Calfee, R. C. (1975). Human Experimental Psychology. New York: Holt, Rinehart & Winston.
Cliff, N. (1973). Psychometrics. In B. J. Wolman (Ed.), Handbook of General Psychology. Englewood Cliffs,
NJ: Prentice-Hall.
Coombs, C. H. (1953). Theory and methods of social measurement. In L. Festinger & D. Katz (Eds),
Research Methods in the Behavioral Sciences. New York: Holt, Rinehart & Winston.
Crano, W. D. & Brewer, M. B. (1973). Principles of Research in Social Psychology. New York: McGraw-
Hill.
Dominowski, R. L. (1980). Research Methods. Englewood Cliffs, NJ: Prentice-Hall.
Eagly, A. H. & Chaiken, S. (1993). The Psychology of Attitudes. Fort Worth, TX: Harcourt Brace
Jovanovich.
Eysenck, H. J., Wurzburg, W. A. & Berne, R. M. (1972). Encyclopedia of Psychology. London: Search
Press.
Francis, R. G. (1967). Scaling techniques. In J. T. Doby (Ed.), An Introduction to Social Research. New
York: Appleton-Century-Crofts.
Galtung, J. (l967). Theory and Methods of Social Research. London: George Allen & Unwin.
Green, B. F. (1954). Attitude measurement. In G. Lindzey (Ed.), Handbook of Social Psychology. Reading
MA: Addison-Wesley.
Guilford, J. P. (1956). Fundamental Statistics in Psychology and Education. New York: McGraw-Hill.
Haimson, B. R. & Elfenbein, M. H. (1985). Experimental Methods in Psychology. New York: McGraw-
Hill.
Harris, C. W. (1960). Encyclopedia of Educational Research. New York: MacmiUan.
Hays, W. L. (1967). Quantification in Psychology. Belmont: Brooks & Cole.
Hilgard, E. R. (1953). Introduction to Psychology. New York: Harcourt, Brace & World.
Kantowitz, B. H. & Roediger, H. L. (1978). Experimental Psychology. Chicago, IL: Rand McNally.
Kaplan, A. (1964). The Conduct of Inquiry. San Francisco, CA: Chandler.
Kerlinger, F. N. (1966). Foundations of Behavioral Research. New York: Holt, Rinehart & Winston.
Kurtz, K. H. (1965). Foundations of Psychological Research. Boston, MA: AUyn & Bacon.
Lemke, E. & Wiersma, W. (1976). Principles of Psychological Measurement. Chicago, IL: Rand McNally.
Lemon, N. (1973). Attitudes and their Measurement. London: Batsford.
Lord, F. M. & Novick, M. R. (1968). Statistical Theories of Mental Test Scores. Reading, MA: Addison-
Wesley.
Manning, S. A. & Rosenstock, E. H. (1968). Classical Psychophysics and Scaling. New York: McGraw-
HiU.
Marlowe, L. (1971). Social Psychology. Boston, MA: Holbrook Press.
Meyers, L. S. & Grossen, N. E. (1974). Behavioral Research. San Francisco, CA: Freeman.
Miller, G. A. (1962). Psychology: The Science of Mental Life. New York: Harper & Row.
Morgan, C. T. & King, R. A. (1966). Introduction to Psychology. New York: McGraw-Hill.
Newcomb, T. M., Turner, R. F. & Converse, P. E. (1965). Social Psychology. New York: Holt, Rinehart
& Winston.
Nunnally, J. C. (1967). Psychometric Theory. New York: McGraw-Hill.
Quantitative science and psychology 383
Peck, D. F. & Shapiro, C. M. (1990). Measuring Human Problems. New York: Wiley.
Phillips, B. S. (1971). Social Research. New York: MacmiUan.
Plutchik, R. (1968). Foundations of Experimental Research. New York: Harper & Row.
Sartain, A. Q., North, A. J., Strange, J. R. & Chapman, H. M. (1973). Psychology: Understanding Human
Behavior. New York: McGraw-Hill.
Selltiz, C , Jahoda, M., Deutsch, M. & Cook, S. W. (1959). Research Methods in Social Relations. New
York: Holt, Rinehart & Winston.
Shaw, M. E. & Wright, J. M. (1967). Scales for the Measurement of Attitudes. New York: McGraw-Hill.
Siegel, S. (1956). Nonparametric Statistics for the Behavioral Sciences. New York: McGraw-Hill.
Simon, J. L. (1969). Basic Research Methods in Social Science. New York: Random House.
Sjoberg, G. & Nett, R. (1968). A Methodology for Social Research. New York: Harper & Row.
Sommer, B. & Sommer, R. (1991). A Practical Guide to Behavioral Research. New York: Oxford
University Press.
Wiersma, W. & Jurs, S. G. (1985). Educational Measurement and Testing. Boston, M A : AUyn & Bacon.
Appendix II
Put as succinctly as possible, measurement is the numerical estimation of the ratio of a magnitude
of a quantitative attribute to a unit of the same attribute.
The concept of measurement is embedded within a matrix of closely related concepts. There is no
adequate definition of it that avoids implicating them, as well. The associated concepts used here are
briefly explained as follows.
Attribute. An attribute is a range of properties or relations that may vary from instance to instance. E.g.
length is an attribute of objects, different objects often having different lengths; so is sex, some creatures
being female, others male; nationality is another example, some people being British, some French, etc.
Quantitative attribute. A quantitative attribute (or quantity) is an attribute the instances of which are
related to one another both ordinally and additively. One version of (continuous) quantitative structure
is given by Holder's (1901) axioms, another is given in section 1.1 of this paper. Not aU attributes are
quantitative. E.g. length is quantitative, but neither sex nor nationality is.
Magnitude. A magnitude is a specific instance of a quantity. Thus, each instance of length (say, the
length of this page) is a magnitude of the quantity, length.
Ratio. As intended in this context, a ratio is a special kind of relation holding between magnitudes of
the same quantity. The ratio of one magnitude of a quantity to another is the size of the first relative
to the second. Thus, ratios are relative magnitudes. The most useful way to express a ratio is as a
number. E.g. the ratio of the length of a cricket pitch to one yard is 22.
Units. A unit is a more or less arbitrarily selected magnitude of a quantity, singled out to be the instance
against which any other is to be compared. If the unit is known, then expressing the magnitude of any
other instance of the quantity relative to the unit means that the magnitude of that instance is defined
and, so, known via the unit. E.g. one widely used unit of length is the metre.
Numerical estimation. In the first instance, a numerical estimation or number specifies how many? or how
much?. E.g. knowing that there is some whole number, say, three, books on the desk is knowing that
there is a book, another and one more at that location. The range of possible answers to the question,
how many?, is the sequence of whole (or natural) numbers, the meaning of each being definable via the
concepts of one and one more than. Using the natural numbers, the real numbers may be specified. They
answer the question, how much?. It foUows from the axioms for eontinuous quantity that some multiple
(say, «) of an instance of some quantity (call it a) is always less than, equal to, or greater than any other
multiple {m) of another instance of that quantity, b. If less than, then the ratio oi a to b is less than the
ratio oi m to n (i.e. a/b < m/n); if equal to, then a/b = m/n; and if greater than, then a/b > m/n. Thus,
each ratio of magnitudes has a location relative to each ratio of natural numbers and, so, is given by
a real number (i.e. the real number specified by the class of all ratios of natural numbers equal to or less
than it). Thus, the ratio of each instance of a quantity to the unit selected is expressed by a unique
numerical value. In general, measurement procedures only permit the (approximate) estimation of such
unique numerical values.
British journal of Psychology (1997), 88, 385-387 Printed in Great Britain 385
© 1997 The British Psychological Society
Paul Kline*
Department of Psychology, University of Exeter, Washington Singer Laboratories,
Exeter EX4 4QG, UK
Michell has developed some powerful arguments which correctly cast doubt on the scientific validity
of much ofthe measurement procedures in psychology .Jn this reply I shall restrict myself to answering
t;he objections relevant to psychometrics. The attack on psychophyslcal measurement I shall leave others
to rebut.
There are two key points in the paper which are entirely accepted: that scientific measurement may
be defined as the estimation or discovery of the ratio of some magnitude of quantitative attribute to a^
unit of the same attribute; and that quantitative science involves two tasks—the investigation of the
hypothesis that the relevant attribute is quantitative and the instrumental task of devising procedures
to measure magnitudes of the attribute shown to be quantitative. The burden of the paper is that in
psychometrics neither of these two tasks has been done. It is simply assumed that the variables are
quantitative and the measurement techniques fail by the criteria of scientific measurement. An example
will clarify his arguments. It is assumed, in the case of extraversion, despite the work of Jung, that this
is a quantitative variable, without prior proof, and the tests contain no clear unit of measurement, in
comparison with, to cite Michell, a cricket pitch where the ratio of the pitch to the unit is 22 (if yards
are still permissible). Finally, Michell claims, where measures depart from the scientific model, then
there is no justification for using them in the kinds of mathematical arguments which have played so
large a part in developing the natural sciences. All this leads Michell to conclude that psychometrics is
not scientific either in its theory or application.
While accepting these premises of scientific measurement, I shall argue that psychometrics is not as
entirely flawed as Michell claims although some aspects of the field should be abandoned or modified
considerably.
i^irstl shall discuss the psychometrics of intelligence, as typified by the factor analytic work of
Spearman, Burt and Cattell (Cattell, 1971). There are three points which need to be made. First the
original question, which the factor analysis of abilities was designed to answer, was essentially this: what
accounts for the positive correlations between human abilities? It turns out that two g factors,
crystallized and fluid intelligence, account for much of the covariance. The intelligence test score,
measuring these factors, may be regarded as a good index of the level of difficulty which an individual
has reached in cognitive problem solving, ^econdly, measures of these factors have construct validity,
that is they behave as one might expect of intelligence tests. They predict academic success, occupational
success and differentiate between groups, as is well documented. All this means that instead ofthe vague
term intelligence psychometrists can use these tests o( g, despite their admitted defects as scientific
measures, as operational measurements of intelligence which lead to useful predictions and theorizing.
Sternberg (1982) summarizes much ofthis work.
The third point and by far the most germane to this argument is not mentioned by Michell. This
concerns the fact that the leading psychometrists, such as Cattell and Eysenck, figures whom Cattell is
References
Barrett, B. (1996). Process models in individual differences research. In C. Cooper & V. Varma (Eds),
Processes in Individual Differences. London: Routledge.
Cattell, R. B. (1971). Abilities, their Structure, Growth and Action. New York: Houghton Mifflin.
Cattell, R. B. (1981). Personality and Learning Theory. New York: Springer.
Deary, I. & Caryl, P. (1993). Intelligence, EEG and evoked potentials. In P. A. Vernon (Ed.), 'Biological
Approaches to Human Intelligence. Norwood, NJ : Ablex.
Draycott, S. & Kline, P. (1994). Further investigations into the nature of BIP: A factor analysis ofthe
BIP with primary abilities. Personality and Individual Differences, 17, 201—209.
Eysenck, H. J. (1967). The Biological hasis of Personality. Springfield, IL: Thomas.
Eysenck, H. J. (1994). PersonaUty and intelligence: Psychometric and experimental approaches. In
R. J. Sternberg & P. Ruzgis (Eds), Personality and Intelligence. Cambridge: Cambridge University Press.
Gray, J. A. (1981). A critique of Eysenck's theory of personality. In H. J. Eysenck (Ed.), A Model for
Personality. Berlin: Springer-Verlag.
Kline, P. (1993). Handbook of Psychological Testing. London: Routledge.
Lehrl, S. & Fischer, P. (1990). A basic information psychological parameter (BIP) for the reconstruction
of the concepts of intelligence. European Journal of Personality, 4, 259-286.
Sternberg, R. J. (Ed.) (1982). Handbook of Human Intelligence. Cambridge: Cambridge University Press.
Weiss, V. (1995). Memory span as the quantum of action of thought. Cahiers de Psychologie Cognitive, 14,
387-408.
Zuckerman, M. (1991). The Psychobiology of Personality. Cambridge: Cambridge University Press.
British Journal of Psychology (1997), 88, 389-391 Printed in Great Britain 389
© 1997 The British Psychological Society
A critique of a measurement-theoretic
critique: Commentary on Michell,
Quantitative science and the definition of
measurement in psychology
Donald Laming*
Department of Experimental Psychology, University of Cambridge, Downing Street,
Cambridge CB2 3EB, UK
Over the past 40 years theire has been much theoretical progress in the understanding of what it means
to make measurements. If numbers are assigned to objects or events, the kinds of arithmetical operations
(such as averaging or calculating ratios) which it is thereafter meaningful to carry out on the numbers
depend on the rule of assignment. Measurement theory, roughly speaking, is concerned to identify what
conditions need to be satisfied to make this or that arithmetical operation meaningful. Measurement
theorists, generally, feel that psychologists have disregarded their work, to the detriment of the
development of psychology as a natural science. Michell's article is a polemic—a very scholarly and
well-argued polemic—addressing this issue. It would have helped his argument, however, to have
explained why measurement theory should matter to psychologists, and I endeavour, first of all, to
remedy that deficit.
Category judgment
MicheU, however, does his cause a disservice by placing as much as half of his emphasis on the 'proper'
use ofthe word 'measurement'. For Michell, 'measurement' only means ratio-scale measurement ofthe
kind met with in the physical sciences. This will seem quite beside the point to most psychologists
because so many psychological experiments yield merely rank order or categorical data. Would Michell
have us throw all those data away?
Twelve years ago I investigated the thesis that human judgment is little better than ordinal (Laming,
1984). That idea made sense of a variety of results from magnitude estimation and category judgment
experiments—the Umited transmission of information in category judgments (Garner, 1962, chapter 3);
the interaction of the order of stimulus presentation with resolving power (Luce, Nosofsky, Green &
Smith, 1982); the correlation between successive magnitude estimates (Baird, Green & Luce, 1980), and
other results besides—results which otherwise appeared quite baffling to theorists in that field. Much
psychological data comes from respondents who make judgments about stimuli, and the possibility must
seriously be considered that the analysis of categorical and rank order data is essential to psychology.
That does not mean that psychology cannot be scientific; nor that the treatment of such data is arbitrary.
It means that measurement theory must address the question how to treat that kind of data.
Michell does indeed take S. S. Stevens to task; and while he pays lip-service to the overriding role
of scientific theory (pp. 358-359)—real theory, not just measurement theory—seems not then to know
what to do with it. So far as sensory scales are concerned: we can know about internal sensations only
through the medium of participants' judgments of stimuU. If human judgment is no better than ordinal,
there can never be any empirical basis for establishing a subjective scale distinct from the natural
physical measure of the stimulus (Laming, 1977). Any theory of magnitude estimation or category
judgment will have to be formulated in those terms.
Fechner's law
One of the favourite targets of measurement theorists is Fechner's scheme for measuring sensation by
cumulating just noticeable differences, and Michell is no exception. The criticism originated with Luce
& Edwards (1958). The gist of their argument is that if the empirical data consist of no more than the
marking off of successive just noticeable differences along a continuum, many more schemes of
numerical assignment are possible besides Fechner's law. Fechner's procedure of replacing a finite
difference (the jnd) with a differential is unjustified and, except in the special case of Weber's law, actually
gives the wrong answer. But Michell's comments about Fechner's scheme will again seem beside the
point.
That point is that any scheme of measurement necessarily presupposes a theory relating the actual
observations to that which is to be measured (the scientific theory which Michell acknowledges on
Measurement-theoretic critique 391
p. 359). Since Fechner's day our understanding of sensory discrimination has developed beyond all
recognition, the crucial step in that development being the introduction of signal-detection theory by
Tanner & Swets (1954). Treating the signal-detection model purely as a vehicle for the analysis of data,
categorical responses in a two-alternative discrimination task can be used to estimate a continuous
index, d', which has all the properties of an interval measure. Given a discrimination between two
separate stimuli, d' increases in proportion to the stimulus difference (Laming, 1986, chapter 4; also
Mountcastle, Talbot, Sakata & Hyvarinen, 1969), so that the condition of additivity (Michell, p. 363) is
empirically satisfied. Luce & Edwards' objections to Fechner's passage to a differential do not apply to
d'. Experimental psychologists have substantially solved the problem which Michell addresses and he
writes in apparent ignorance of that solution. He is, as it were, directing the critical power of modern
measurement theory against the psychology of 1860. It will not do.
That is a pity because otherwise Michell has a powerfully written argument that has much to
contribute to present-day psychology. Measurement-theoretic critique—specifically the weeding out of
those artificial questions to which the data do not speak—is much needed in psychology today.
References
Baird, J. C, Green, D. M. & Luce, R. D. (1980). VariabiUty and sequential effects in cross-modaUty
matching of area and loudness. Journal of Experimental Psychology: Human Perception and Performance,
6, 277-289.
Burt, C. L. (1940). The Factors of the Mind. London: University of London Press.
Garner, W. R. (1962). Uncertainty and Structure as Psychological Concepts. New York: Wiley.
Guilford, J. P. (1967). The Nature of Human Intelligence. New York: McGraw-HiU.
Laming, D. (1984). The relativity of'absolute' judgements. British Journal of Mathematical and Statistical
Psychology, 37, 152-183.
Laming, D. (1986). Sensory Analysis. London: Academic Press.
Laming, D. (1997). The Measurement of Sensation. Oxford: Oxford University Press.
Luce, R. D. & Edwards, W. (1958). The derivation of subjective scales from just noticeable differences.
Psychological Review, 65, 222-237.
Luce, R. D., Nosofsky, R. M., Green, D. M. & Smith, A. F. (1982). The bow and sequential effects in
absolute identification. Perception <& Psychophysics, 32, 397-408.
Mountcastle, V. B., Talbot, W. H., Sakata, H. & Hyvarinen, J. (1969). Cortical neuronal mechanisms
in flutter-vibration studied in unanesthetized monkeys. Neuronal periodicity and frequency
discrimination. Journal of Neurophysiotogy, 32, 452-484.
Spearman, C. (1904). "General intelligence", objectively determined and measured. American Journal of
Psychology, 15, 201-293.
Spearman, C. (1927). The Abilities of Man. London: MacmiUan.
Sternberg, R. J. (1994). Intelligence and cognitive styles. In A. M. Colman (Ed.), Companion Encyclopedia
of Psychology, vol. 1, pp. 583-601. London: Routledge.
Tanner, W. P. Jr & Swets, J. A. (1954). A decision-making theory of visual detection. Psychological
Review, 61, 401-409.
Thomson, G. H. (1939). The Factorial Analysis of Human Ability. London: University of London Press.
Thurstone, L. L. (1938). Primary Mental Abilities. Chicago: University of Chicago Press.
Vernon, P. E. (1950). The Structure of Human Abilities. London: Methuen.
British Journal of Psychology (1997), 88, 393-394 Printed in Great Britain 393
© 1997 The British Psychological Society
A. D. Lovie*
Department of Psychology, University of Liverpool, Eleanor Rathbone Building,
PO Box 147, Liverpool L69 3BX, UK
\ completely reject the hard-nosed (and very outdated) positivist and empiricist/realist line adopted in
c the paper. It is as if the upheavals of the last 30 or more years in the philosophy, history and sociology
of science had not happened, and that the powerful new insights provided by this work amounted to
no more than a kind of crass anti-scientism (see the usual reaUst knee-jerk reactions to Kuhn and
Feyerabend at the end of the paper). By adopting such a reactionary approach to the history and
philosophy of science, the, author has reduced what could have been an insightful account of a crucial
and pivotal episode in i:heTiistory of psychological statistics into a stale and superficial morality tale, with
S. S. Stevens and others cast as the villains, and Holder and his modern followers as the heroes.
Although such a knockabout tale might please the groundlings, the serious, historical and conceptually
embedded business as to just what Stevens actually did in the late 1930s and early 1940s (and why he
did it) are all left essentially untouched because the author cannot bring himself to look at this episode
in its historical and intellectual context. Instead,,^what we get is a non-reflexive tour deforce on the sins
of the Popes written by one who is in possession of the final truth (note the strategic use of the words
'reality' 'truth' and 'true measurement' throughout the paper). But there are no absolute, ahistorical
mathematical truths or methods, only locally developed and locally maintained collective commitments
and practices: what the ethnomethodologist, Eric Livingston, has termed the 'Uved-work' of the
practising mathematician (Livingston, 1986). Starting from this general position (as most historians of
science now do) would have generated a very different kind of story from the one before us.
I am also both intrigued and unhappy about the rhetorical deployment ofthe so-called 'ideological
support system' which is used to account for Stevens's (and others') inability to see the obvious.
Intrigued because suddenly the author seems happy to embrace an explanatory device which smacks a
Uttle of the social constructivist approach that I have advocated earUer, only to notice unhappily that
this notion is never appUed to Holder et al. whose ideas are emphatically not generated by such a social
constructivist process: blatant ontological gerrymandering of this kind cannot go unchallenged. Would
it not be ultimately more insightful to argue that everybody works within such a support system (the
author and myself included!)?
Let me add something more about this matter of conclusions or points which are described as
'obvious' but which, in some mysterious way, people like Stevens were too blinkered or perhaps too
wilful to see. Nothing is obvious ^er se: it is rather a matter of which interests you are committed to
and which assumptions and presuppositions you buy. If you do not buy into the current author's (or
Holder's or whoever's) position and all that that entails, then such conclusions are not necessarily
obvious. So the term ' obvious' is used here as a contingently phrased rhetorical device to undermine the
opposition, as is the linked term 'methodological thought disorder'. Hence the crudely contingent
argument used to account for psychology's perverse and continuing attachment to measurement a la
Stevens: viz. if my conclusion (that is, that Stevens's approach to measurement is unscientific) is not
Reference
Livingston, E. (1986). The Ethnomethodological Foundations of Mathematics. London: Routledge & Kegan
Paul.
British Joumat of Psychology (1997), 88, 395-398 Printed in Great Britain 395
© 1997 The British Psychological Society
R. Duncan Luce*
Institute for Mathematical Behavioral Sciences, Social Science Plai^a, University of California at Irvine,
Irvine, CA 92697-5100, USA
Several of Michell's points are amplified and emphasized and the following
additional point is made. Most quantitative attributes can be measured in more than
one way, and there are interesting questions about how they relate. Among other
things, units of measurement and symmetries of the underlying structure may or
may not agree.
Because I agree with almost everything Michell says, my commentary is restricted to some amplification
and to an added observation.
2. Quantification of symmetry
One of the most significant discoveries of contemporary representational measurement theory is that a
very broad class of structures can be quantified indirectly (Alper, 1987; Cohen & Narens, 1979; Luce
& Narens, 1985; Narens, 1981a, b). An isomorphism of an ordered empirical structure with itself is
called a symmetry (or automorphism) of the structure. A symmetry with no fixed point is called a
translation. All familiar examples of measurement structures are rich in translations—they are homogeneous
in the sense that any point can be mapped into any other point by a translation. So the elements of the
structure are structurally identical. What is remarkable is that if the structure forms a continuum, then
the set of translations plus the identity forms a group under function composition, has an order induced
from the structural order, and together these form a quantitative system in the classical sense of Holder.
Moreover, the group of translations can be mapped onto the multiplicative positive real numbers. And
in the homogeneous case the structure itself can be mapped isomorphically into the translations and so
into a numerical representation in which the translations form a multiplicative ratio scale. This enlarges
enormously the potential domain of measurable structures.
For example, consider the class of structures involving an ordering and a binary operation (e.g.
receiving two things at once, placing two objects on a pan of a pan balance, etc.). For such structures
that are homogeneous and continuous, the possible numerical representing operations ® are of the
form
x®y=yf{xly), (1)
where/is strictly increasing and/(:^)/;j is strictly decreasing. This forms a ratio representation because
for /fe > 0,
kx@ky = kyf{ky/kx) = k{x ®y). ' (2)
The classical case of measurement in physics is the special case where y(;j) = 1-1-^.
This development opens many possibilities which simply have not been explored by empirical
scientists. It enlarges much beyond additive systems ones that, in a principled way, can be viewed as
quantifiable. Of course, it does not extend to the non-empirical generalizations of Stevens.
1 -I- uv/c'
where c denotes the velocity of light in the same units. Note that both the rapidity and v are invariant
under multiplication by positive constants. The former corresponds to the translations ofthe extensive
structure whereas the latter does not; it is just a change of units. Such changes of units, however, do
eorrespond to translations of the two components of the conjoint structure. It is simply the case that
the two groups of translations are not the same.
A somewhat similar example has arisen recently in utility theory. For half a century utility has been
studied using uncertain alternatives (gambles) for which the representation is a weighted average of
numerical utilities over pure consequences—the famous subjective expected utility model and its
modern generalizations. Luce & Fishburn (1991, 1995) introduced a second way to measure utility
based on a binary operation ® of joint receipt, i.e. receiving two valued objects at the same time. They
assumed for gains relative to a status quo that it forms an extensive structure. Assuming an interlock
called segregation, which is highly rational and has been sustained empirically (Chd & Luce, 1995; Cho,
Luce & von Winterfeldt, 1994), they show that the usual utility function U derived from gambling
behaviour has the following non-additive representation of ® :
7/.A
(4)
where C is the asymptotic maximum of the utility function. Note that this representation is invariant
under change of unit, but that Uke the velocity case these transformations do not correspond to
translations of the extensive structure although they do of the gambling structure.
I believe that these two cases suggest that if we examine sensory intensity measurement from both
the perspective of conjoint structures and extensive ones based on the physical operation of
concatenation, we may find an interlock leading to somewhat similar non-additive, bounded
representations.
References
Alper, T. M. (1987). A classification of all order-preserving homeomorphism groups ofthe reals that
satisfy finite uniqueness. Journal of Mathematical Psychology, 31, 135-154.
Cho, Y. & Luce, R. D. (1995). Tests of hypotheses about certainty equivalents and joint receipt of
gambles. Organisational Behavior and Human Decision Processes, 64, 229-248.
Cho, Y., Luce, R. D. & von Winterfeldt, D. (1994). Tests of assumptions about the joint receipt of
gambles in rank- and sign-dependent utility theory. Journal of Experimental Psychology: Human
Perception and Performance, 20, 931-943.
Cohen, M. & Narens, L. (1979). Fundamental unit structures: A theory of ratio scalability. Journal of
Mathematical Psychology, 20, 193-232.
Holder, O. (1901). Die Axiome der Quantitat und die Lehre vom Mass. Berichte iiber die Verhandlungen
der Koniglich Sachsischen Gesellschaft der Wissenschaften ^u Leipzig, Mathematisch-Physische Klasse, 53, 1-
Krantz, D. H., Luce, R. D., Suppes, P. & Tversky, A. (1971). Foundations of Measurement, vol. 1. San
Diego: Academic Press.
Luce, R. D. & Fishburn, P. C. (1991). Rank- and sign-dependent linear utility models for finite first-
order gambles. Journal of Risk and Uncertainty, 4, 29-59.
Luce, R. D. & Fishburn, P. C. (1995). A note on deriving rank-dependent utility using additive joint
receipts. Journal of Risk and Uncertainty, 11, 5-16.
Luce, R. D., Krantz, D. H., Suppes, P. & Tversky, A. (1990). Foundations of Measurement, vol. 2. San
Diego: Academic Press.
Luce, R. D. & Narens, L. (1985). Classification of concatenation measurement structures according to
scale type. Journal of Mathematical Psychology, 29, 1-72.
398 R. Duncan Luce
Narens, L. (1981a). A general theory of ratio scalability with remarks about the measurement-theoretic
concept of meaningfulness. Theory and Decision, 13, 1-70.
Narens, L. (198U). On the scales of measurement. Journal of Mathematical Psychology, 24, 249-275.
Narens, L. (1996). A theory of ratio magnitude estimation, lournal of Mathematical Psycholo£v 40
109-129. J &y' '
Stevens, S. S. (1946). On the theory of scales of measurement. Science, 103, 677-680.
Stevens, S. S. (1951). Mathematics, measurement and psychophysics. In S. S. Stevens (Ed.), Handbook
of Experimental Psychology, pp. 1-49. New York: Wiley.
Stevens, S. S. (1975). Psychophysics. New York: Wiley.
British Journal of Psychology (1997), 88, 399-400 Printed in Great Britain 399
© 1997 The British Psychological Society
Michael Morgan*
Institute of Ophthalmology and Department of Anatomy/Developmental Biology, University College London,
Gower Street, London WC1E 6BT, UK
Michell's argument is that the branches of psychology that use numbers are vitiated because they do so
without properly validated scales of measurement. In particular, they do not obey the axioms of Holder
(1901) which appear to describe the behaviour of continuous quantities like mass.
The two fields of quantitative psychology which come in for the greatest drubbing are psychometrics
(mental testing) and psychophysics. In the case of psychometrics I am not competent to judge whether
Michell's criticisms are damaging or not. In the case of psychophysics the criticisms seem to me to be
largely irrelevant to actual practice. If a mainstream journal of visual psychophysics is opened, for
example, it will be found that the measurement scales in use are visual angles (arcmin), luminances
(cd/m^), contrast (a ratio scale) and time. In other words, these are physical scales.
The fundamental dependent variable is the probability of detection/discrimination, which is a
continuous, well-behaved scale. Thus results are expressed as the physical magnitude at which some
criterion level of probability is met, often 75 or 84 per cent correct. The reason for this is that the
majority of psychophysicists have followed the tradition of E. H. Weber in measuring the limits of
sensory discrimination power in physical units. The main task of psychophysics is to determine the
relevant physical dimension that limits performance, which is not always obvious.
For example, the best vernier acuity is in the region of 5 arcsec. This is interesting because the
diameter of the foveal photoreceptor is ~ 20 arcsec. This may appear mysterious but when optical
blurring is taken into account, it is found that a 5 arcsec shift in the position of a line implies a 2 per
cent contrast shift in the photoreceptor best placed to detect the shift. This is similar to the eontrast shift
required to detect the presence of a very thin line. Vernier acuity for Unes is reduced when they move.
Is velocity the relevant measurement scale? No, because if sinewave gratings are used instead of lines
the loss of acuity is predictable not from their velocity but from their temporal frequency (Hz).
The Weberian tradition preceded that of Fechner, which Michell aptly criticizes. Fechner wanted to
measure the magnitude of sensation, and thought he could do so rationally by totting up jnds. There may
be people out there who still believe this, but I doubt it. The enterprise of measuring the magnitude
of a sensation appears to have been doomed because there is no such thing. Most psychophysicists
believe, on the other hand, that it is reasonable to rank order sensations by magnitude. Astronomers
had been doing this with the brightness of stars long before psychology had been invented. Michell has
little time for ordinal scales, and presents them as desperate inventions of S. S. Stevens to deflect
attention from the failure of ratio scales. I do not follow the reasons for this severity. Other sciences
have worked quite happily with scales that are not continuous, and which therefore fail to satisfy the
Holder axioms. For example, classical genetics used a unit called (as it happens) the morgan or
centimorgan. This was the distance apart between two genes measured by the probability of
recombination. The fact that centimorgans turned out in many cases to be additive was taken as
evidence for the Unear arrangement of genes on chromosomes. But it is clear that this scale never had
References
Lea, S. E. G. & Morgan, M. J. (1972). The measurement of rate-dependent changes in responding. In
J. MiUenson & R. Gilbert (Eds), Reinforcement: Behavioral Analyses. New York: Academic Press.
British journal of Psychology (1997), 88, 401-406 Printed in Great Britain 401
© 1997 The British Psychological Society
Joel Michell*
Department of Psychology, University of Sydney, Sydney, NSW 2006, Australia
My paper proposed first, that psychology is committed to the scientific task of testing
the hypothesis that its supposedly measurable attributes really are quantitative;
second, that from its inception, modern quantitative psychology has, with few
exceptions, ignored this task, concentrating instead upon the instrumental task of
quantification; and third, that modern psychology has adopted an intellectually
pathological defence mechanism against recognizing the existence of this scientific
task. The commentaries by Kline, Laming, Lovie, Luce and Morgan either amplify
or criticize my arguments for these theses. I will consider each thesis in turn,
presenting a sketch of my argument and then assessing the force of the criticisms.
where ^ is the inverse of the usual normal distribution function. If experimental psychologists have
discovered that for some set of quantitative stimulus attributes, d' increases in proportion to the physical
difference between stimuli, that is a significant discovery, but it is one which no more establishes
psychophysical measurement (or additivity) than does the fact that transformations or combinations of
test scores correlate systematically with some criterion. In the absence of a theory, it is just an
observation about the relationship between physical stimulus differences and a numerical function of hit
and false alarm rates, these rates being directly observable frequencies (not measures of anything). As
with test scores, results such as these raise scientific questions, they do not solve them. The scientific task
is to explain why the observed regularity occurs. A proposed explanation may involve the postulation
of distinctly psychological quantities, or it may not. If it does, then it may be possible to interpret d'
as a measure of something psychological, but such an interpretation will only be correct if the proposed
theory is true and that, in turn, will only be the case if the postulated psychological attributes really are
quantitative. The theory must be tested and the test must be sensitive to the presence of additivity. Only
when supported by such tests does the claim of measurement become more than speculation.
Laming knows this. For example, he (1986) refers to empirical conditions specified by Levine (1970)
and Falmagne (1971) for pair comparisons to be scalable in Fechner's sense. (Falmagne, 1985, provides
further empirical conditions relevant to psychophysical measurement.) In my paper I drew attention to
the psychophysical research tradition that has attempted to investigate the quantitative hypothesis.
However, Laming's judgment that this tradition has 'substantially solved the problem which Michell
addresses' (i.e. the question of quantity) is more sanguine than mine and not supported by other
commentaries mentioning psychophysics. Furthermore, Laming knows only too well that many
researchers in psychophysics neglect these questions, a point he has eloquently made for the general
reader elsewhere (Laming, 1987). Despite his throw-away hne, that I am simply 'directing the critical
power of modern measurement theory against the psychology of I860,' his true attitude to psychological
measurement is, I believe, more faithfully revealed in his other, deftly delivered, salvoes at modern
targets indistinguishable from mine.
The logic that applies to Laming's comments applies equally to Kline's. Kline accepts that ' it is true
that psychometric measurement is not scientific in the sense defined by Michell' and accepts the force
of my logical thesis in relation to social measurement, but with respect to personality measurement and,
more so, abihty measurement, cannot bring himself to swallow its full implications. As scientific
measures, he thinks, 'personality scales are far from worthless' and, better still, 'psychometric testing
is leading to a real scientific understanding ofthe phenomenon' of intelligence. Indeed, Kline appears
convinced that intelligence tests will be to some future scientific apparatus for measuring ability as
Galileo's telescope is to the modern apparatus at Jodrell Bank. On present evidence, I cannot say
whether this view is false or otherwise, of course, but I do know that the best defence against our
seemingly limitless capacity for self-delusion is to try to put our ideas to empirical test. If Kline's faith
in psychometrics as scientific measurement proves true, it will only ever be shown thus by researchers
who, taking a more critical view, find ways to test the hypothesis that abilities are quantitative, ways
specifically sensitive to additive structure. None of the evidence that Kline mentions bears upon the
scientific task of quantification. For example, factor analytic studies are insensitive to the falsity of this
hypothesis because they already presume that the relevant psychological attributes (i.e. abilities) are
quantitative.
Because all research makes assumptions, I have no quarrel with any particular researcher who
candidly assumes that the abilities postulated are quantitative, one at the same time aware of the extent
to which this assumption qualifies any conclusions drawn. My quarrel is with a discipline that while
invariably begging the same central assumption, rarely acknowledges this fact, never explores the
implications of this fact for conclusions drawn and evinces little interest in empirically investigating the
truth of this assumption. In believing that the evidence he adduces moderates the force of my critique.
404 /. Michell
when in fact that evidence is logically irrelevant to it, Kline betrays a resistance, typical of psychologists
in general, to facing the implication that psychological attributes may not be quantitative at all and that
psychological measurement, in the scientific sense, may thus never be able to be realized. Perhaps this
is because such psychologists cannot quite believe that if such measurement proves impossible,
psychology will be no less a science than if quantification is realized. In Galileo's words, 'We must not
ask nature to accommodate herself to what might seem to us the best disposition and order, but must
adapt our intellect to what she has made, certain that such is the best and not something else' (quoted
in Crombie, 1994, p. 45).
Luce's comments on units and symmetries reflect a slant towards measurement theory differing
slightly from mine, but the difference is not substantial. As I have discussed this complex matter
elsewhere (Michell, 1993, 1994), I will confine my reply. Luce is correct to note that any quantitative
attribute ' might be quantifiable in several ways' and that these might result in quite distinct relations
of additivity being identified within the same quantity, as his velocity/rapidity example shows. What
this implies is that for any two magnitudes there is no unique ratio (as measurement theorists from
Euclid to Holder seem to have thought). In order to define a ratio between two magnitudes and, thus,
in order to measure anything, a relation of addition between magnitudes of the relevant kind must also
be specified. At least in the first instance, such a relation can only be discovered empirically (via
extensive or conjoint measurement procedures, for example), although subsequently further additive
relations can always be defined via transformations of the resulting ratios (e.g. logarithmic
transformations). This observation reinforces the need to discover additive structures within an
attribute by empirical methods.
In summary, the force of my logical thesis remains: psychology is committed to the task of testing
the hypothesis that its supposedly measurable attributes are quantitative. Its force is not moderated by
the fact that, to some extent, this task has been pursued already in psychophysics and, of course, in some
other areas (e.g. Luce's research on utility).
2. Historical thesis
My historical thesis consists of the following components.
Part 1. Because oi practicalism and scientism, the founders of modern quantitative psychology simply
assumed that psychological attributes are quantitative.
Part 2. Consequently, they concentrated upon the instrumental task of quantification and ignored the
scientific task.
Part 3. The report of the Ferguson Committee displayed the fact that, as a result, psychological
measurement was anomalous relative to the logic of quantity and measurement.
Part 4. Institutional endorsement of Stevens' definition of measurement deflected this criticism and
satisfied most psychologists that measurement was attainable via the instrumental task alone.
Lovie expresses a global disagreement with this thesis. According to him, I have neglected even to
attempt 'the serious historical and conceptually embedded work of finding out what Stevens actually
did in the late 1930s and early 1940s (and why he did it)'. Lovie believes that there is only one level at
which the history of science can be undertaken (viz., that ofthe 'lived-work' ofthe scientist). This is
a reductionist fallacy. There is no ultimate level of inquiry in any scientific field, and the serious scholar
adapts his or her focus to suit the questions at hand. This inevitably means ignoring what one adjudges
to be irrelevant detail. The historical questions that interested me concerned certain broad conceptual
trajectories relating to quantification and measurement, passing through late 19th and early 20th century
thought, converging upon Stevens and thereafter affecting the way measurement was defined in
psychology. I was not interested in more microscopic details of Stevens' life. If I am mistaken in
identifying the lines of development that I think I have detected, it will be shown by historians studying
this history at the same level as I have done and not by historians pursuing Stevens' 'lived-work'.
3. Sociological thesis
My argument here is as follows.
Premise 1. Resistance to readily available facts is symptomatic of thought disorder.
Reply to Kline, Laming, Lovie, Luce and Morgan 405
Premise 2. The logic of quantity and measurement constitutes a set of readily available methodological
facts.
Premise 3. Modern psychology resists the incorporation of these methodological facts into its ideological
support systems (i.e. its methodology courses and texts, and its research programs).
Conclusion 3. Methodological thought disorder is systemic in modern psychology.
Here, premise 2 is the proposition that the body of knowledge constituting the logic of quantity and
measurement is readily available (or 'obvious') to psychologists. Lovie labours the obvious point that
the concept of the obvious is a relative one. I did not suggest otherwise. If this body of knowledge is
readily available in the psychological literature (and it is) and if quantitative psychologists are largely
unaware of this body of knowledge (and they are), then it is because it has been excluded from this
science's ideological support structures. Prima facie this is odd because this body of knowledge is highly
relevant to attempts at quantification. Drawing the reader's attention to this anomaly is hardly a
rhetorical device: something is radically amiss in a scientific discipline when crucial methodological
matters are available, but ignored.
Lovie was intrigued by my use of the sociological concept of an ideological support structure, but
unhappy because, in his eyes, it amounted to gerrymandering ontologicalty (or ' fatally undermining the
other's position by treating their givens as problematic, while not applying the same (honest or
consistent) analysis to my own' (Lovie, 1992, p. 32)). This accusation is false. I have attempted a
socio/historical explanation for the current situation in quantitative psychology, but not in order to
undermine anyone's position. The acceptance of any set of beliefs will always have its socio/historical
causes, but these causes can never undermine those beliefs because these causes are always logically
independent of the issue of the truth or falsity of those beliefs. I have argued that the current
understanding of measurement in psychology contradicts the logic of quantity and measurement and
is mistaken just for that reason. I have presented the socio/historical facts in an attempt to understand
the causal circumstances of his mistake.
Only when the role of social interests is made explicit, can the error be understood in context. It is
important to do this. I know that the message that most psychologists have erred about the concept of
measurement will be resisted. This is evident in Laming's claim that I am 'directing the critical power
of modern measurement theory against the psychology of I860' and Morgan's, that I am 'shutting the
stable doors after the flight of their occupants', as if the logical problems identified have been
miraculously removed by time alone. One way to deal with such resistance is to uncover the deeper
socio/historical dynamics. By exposing the ideological forces and social interests at work, the way is
opened for a more objective view of one's science.
Pertinent to this point is Lovie's rejection of my diagnosis of modern psychology's methodological
thought disorder. Lovie may have another name for it, but anyone who takes science seriously will
appreciate that when a particular discipline adopts its own, peculiar definition of a fundamental concept
and is thereby able to continue avoiding for another 50 years the same central empirical issue it had
already avoided for almost a century, that discipline is malfunctioning as a scientific enterprise. At first
sight, it might appear that relativism's intellectual scorched-earth policy offers exemption from this
diagnosis. However, a thoroughgoing relativism about truth is unsustainable and Lovie, inevitably,
makes claims (about my paper, for example) which I am encouraged to think he believes state the way
things"are objectively.
One such is his description of my paper as 'a stale and superficial morality tale, with S. S. Stevens
and his acolytes cast as the villains, and Holder and his modern followers as heroes'. However, he is
mistaken. My paper makes no moral judgments and it identifies no villains or heroes. It does identify
certain errors in thinking on the part of individuals and it identifies an instance of thought disorder in
modern psychology, but these are not moral judgments, nor are the defects identified the defects of
villains. Lovie's colourful categories trivialize the issue and only serve anti-intellectual interests. The
debate will be advanced only by taking the issue seriously and either showing where my argument is
factually or logically mistaken or, failing this, taking my conclusions as a basis for action. A necessary
condition for reversing the diagnosed situation is a strengthening of education in the conceptual
foundations of methods in psychology (i.e. in the mathematics, logic, and the history and philosophy
of science necessary for critical thinking about the methods used by psychologists). Because, at present,
I only see a weakening of the curriculum in our universities, I am not optimistic.
406 /. Michell
Conclusion
Have the criticisms of my paper, raised here, deflected its aim or diminished its force? No critic has
explained why the hypothesis that an attribute is quantitative, alone amongst scientific hypotheses, must
be exempt from empirical test. No critic has explained why psychology, alone amongst the sciences, is
entitled to its own definition of measurement, a definition that leaches the concept of all content. And
none has submitted a rational defence of the present, anomalous, situation in quantitative psychology.
Readers of this journal have been given no adequate reason, yet, to avoid the conclusion that
methodological thought disorder is systemic in modern psychology.
Acknowledgements
I am grateful to Benjamin Barnes, James Dalziel, Scott Gazzard, Fiona Hibberd, George Oliphant and
Andrew Rantzen of the Department of Psychology, University of Sydney for their comments upon
earlier versions of this reply.
References
Boring, E. G. (1921). The stimulus-error. American Journal of Psychology, 32, 449-471.
Carnap, R. (1950). Empiricism, semantics, and ontology. Kevue Internationale de Philosophie, 4, 20-40.
Cattell, J. McK. (1893). On errors of observation. American Journal of Psychology, 5, 285-293.
Crombie, A. C. (1994). Styles of Scientific Thinking in the European Tradition, vol. 1. London: Duckworth.
Earman, J. (1993). Carnap, Kuhn, and the philosophy of scientific methodology. In P. Horwich (Ed.),
World Changes: Thomas Kuhn and the Nature of Science, pp. 9-36. Cambridge, MA: MIT Press.
Falmagne, J.-C. (1971). The generalized Fechner problem and discrimination. Journal of Mathematical
Psychology, 8, 22-43.
Falmagne, J.-C. (1985). Elements of Psychophysical Theory. Oxford: Oxford University Press.
Holder, O. (1901). Die Axiome der Quantitat und die Lehre vom Mass. Berichte iiber die Verhandlungen
der Koniglich Sdchsischen Gesellschaft der Wissenschaften ^u Leip^^ig, Mathematisch-Physische Klasse, 5
(English translation of Part I by J. Michell & C. Ernst (1996), 'The axioms of quantity and the theory
of measurement'. Journal of Mathematical Psychology, 40, 235-252.)
Irzik, G. & Grunberg, T. (1995). Carnap and Kuhn: Arch enemies or close allies? British Journalfor the
Philosophy of Science, 46, 285-307.
Krueger, L. E. (1989). Reconciling Fechner and Stevens: Toward a unified psychophysical law.
Behavioral and Brain Sciences, 12, 251-320.
Kuhn, T. S. (1970). The Structure of Scientific devolutions. Chicago, IL: University of Chicago Press.
Laming, D. (1986). Sensory Analysis. London: Academic Press.
Laming, D. (1987). Psychophysics. In R. L. Gregory (Ed.), The Oxford Companion to the Mind, pp.
655-657. Oxford: Oxford University Press.
Levine, M. V. (1970). Transformations that render curves parallel. Journal of Mathematical Psychology 7
410-443. '
Lovie, A. D. (1992). Context and Commitment: A Psychology of Science. New York: Harvester-Wheatsheaf.
Luce, R. D. (1972). What sort of measurement is psychophysical measurement? American Psychologist
27, 96-106.
Michell, J. (1993). Numbers, ratios, and structural relations. Australasian lournal of Philosophy 71
325-332. J^J' '
Michell, J. (1994). Numbers as quantitative relations and the traditional theory of measurement. British
Journal for the Philosophy of Science, 45, 389-406.
Murray, D. J. (1993). A perspective for viewing the history of psychophysics. Behavioral and Brain
Sciences, 16, 115-186.
Narens, L. (1996). A theory of ratio estimation. Journal of Mathematical Psychology, 40, 109-129.
Stevens, S. S. (1936). Psychology: The propaedeutic science. Philosophy of Science, 3, 90-103.