>0 o <- " V -£-. <£ $\. £<% «•. ■ /■ ■ .tf 1 ■0 ^ *>■■ '/ ■3 ^ «. .A ,0 0. '"V ■"0 o X ^ -^ ' ^ .0 O ,v %. ; ,00. \ N V\
%
s - A o
'/
* <-
O rtO
O o ^
V '*
m:
v0<
,0
O'
'/
c^' :
-;-!•
&
Digitized by the Internet Archive
in 2011 with funding from
The Library of Congress
http://www.archive.org/details/mentaltestsOOball
(0 ,
\
MENTAL TESTS
MENTAL TESTS
BY
PHILIP BOSWOOD BALLARD
M.A., D.Lit.
AUTHOR OF " OBLIVISCENCE AND REMINISCENCE," AND "HANDWORK
AS AN EDUCATIONAL MEDIUM "
HODDER AND STOUGHTON, LTD.
LONDON NEW YORK TORONTO
1920
Ms%
'n
}
^
PREFACE
The aim of this little book (its achievement is
another matter) is to make the teacher his own
critic, and the teacher's critic a just and discriminat-
ing judge. By the teacher's critic I mean the
head master, the inspector or the examiner — whoever,
in fact, passes an authoritative judgment on his
work. The teacher and his judge do not always
see eye to eye ; and the judge, holding as he does
the position of authority, is prone to press his point
of view. If he cannot convince he commands.
And, indeed, if he is to justify his post what else
can he do ? A difference of opinion about the
temperature of the classroom may be composed
by an appeal to the thermometer ; a difference of
opinion about the average height of the pupils may
be settled by resort to the foot-rule. But in the
realm of mind there are, it is thought, no thermo-
meters and foot-rules : and we sorely need them.
We need objective measurements recognised by
all as final and unassailable. Indeed, we shall have
benefited but little from the new psychology if we
are not, critic and teacher alike, made aware of our
own complexes — our whims and grooves and fads —
v
vi PREFACE
our prejudices masquerading as principles, and our
personal standards laying claim to universality.
Even if we cannot identify these complexes, to
know that they are there is something : it makes us
rigorous in judging ourselves, cautious in judging
others, grateful for a means of escape from the
precariousness of individual judgment. To provide
a few such avenues of escape is one of the purposes
of the book.
A common malady among teachers, especially
women teachers, is over-anxiety about their work.
They worry about the progress of their pupils, and
are peculiarly sensitive to the opinion of those in
authority. And those who worry the most are often
those who have the least cause for worry. The best
cure for this infirmity is knowledge ; knowledge of
the pupils, their aptitudes and attainments ; know-
ledge of the best means of measuring progress and
assessing results ; knowledge of what can be done
by other children, and other teachers. And it is
hoped that a little wholesome medicine of this kind
will be found in these pages.
The solution offered depends ultimately on
measurement. There are some who instinctively
dislike the idea of bringing measurement into educa-
tion. They urge that the highest products of
education, being spiritual, are outside the realm of
time and space to which measurements properly
belong. A partial reply to this argument will be
found in Chapter I. Here I will merely observe
PREFACE vii
that it is possible to extend the notion of measure-
ment beyond the physical realm ; and that the
fact that some of the products of education cannot
be measured is no reason why we should measure
none. We do, in fact, measure, either well or ill,
whenever we examine. If we examine at all we
should examine well ; and to examine well is to
measure accurately.
No claim of novelty is made for the way in which
the subject is treated. Much space is devoted to
putting before the reader the attempts made in
England and abroad to arrive at a scientific system
of testing. The author's own modest contributions
to the science are to be found mainly in the chapters
on Reading and Arithmetic.
In a slightly different form some of the chapters
have already appeared in the Times Educational
Supplement and in the Journal of Experimental
Pedagogy ; and for permission to reprint them I
am indebted to the courtesy and kindness of the two
editors concerned. The chapter on the Develop-
ment of Mental Tests was read before the Educa-
tional Section of the British Psychological Society,
and the chapter on Practical Ability before the
Educational Handwork Association.
My personal obligations are many. To the
teachers who have so ungrudgingly helped me to
standardise the tests ; to Professor T. Percy Nunn for
his valuable criticism of the chapter on Distribution
and Dispersion ; to Dr. Emrys Jones for correcting
viti PREFACE
the proofs — to all these I am deeply grateful. But
of all my debts of gratitude by far the biggest is due
to my friend Mr. Cyril Burt, who has not only let
me use his revision of Binet's Tests, and include his
tests of Reasoning and of Spelling, but has carefully
read through the whole of the manuscript and given
me the benefit of abundant criticisms and sugges-
tions. And all this did he do spontaneously : he
volunteered his help when he heard that I was
writing on Mental Tests — an act of generosity made
all the more striking by the fact that he himself
is about to bring out an important book on the same
subject.
P. B. Ballard.
Chiswick, 1920.
CONTENTS
CHAP.
PREFACE .
I. THE DEVELOPMENT OF MENTAL TESTS
II. INTELLIGENCE AND KNOWLEDGE
III. THE MEASUREMENT OF INTELLIGENCE
IV. BINET'S TESTS OF INTELLIGENCE
V. BURT'S REASONING TESTS
VI. THE MEASUREMENT OF KNOWLEDGE .
VII. DISTRIBUTION AND DISPERSION
VIII. READING —
(a) AS A MECHANICAL ART .
(/;) AS A MEANS OF ACQUIRING IDEAS
IX. SPELLING .....
X. ARITHMETIC —
(a) THE FUNDAMENTAL PROCESSES
{b) SIMPLE ORAL ARITHMETIC
(f) ARITHMETICAL DEVICES
(d) APPLIED ARITHMETIC .
XI. PRACTICAL ABILITY
XII. COMPOSITION
APPENDIX. SOCRATES ON INTELLIGENCE
INDEX ......
PAGE
V
I
22
29
48
90
I06
115
134
145
155
l6o
186
I90
191
196
2IO
221
233
IX
MENTAL TESTS
CHAPTER I
THE DEVELOPMENT OF MENTAL TESTS
It is a truism to say that advance in any branch
of physical science is largely dependent on improve-
ment in its mathematics. As the science develops
it gets more and more precise. It begins by being
qualitative only : it ends by being quantitative as
well. Indeed, no generalising idea, however illumin-
ating it may seem, can establish its validity and
become recognised as a law of nature, unless it can
be proved by the exact measurement of the facts
it is designed to unify and explain. Jevons, in his
Principles of Science, points out that Newton held
back his theory of Gravitation for fifteen years
because he found that it did not fit the accepted
facts about the orbit of the moon. When it was
possible to measure the orbit more exactly it was
found that the revised figures, instead of refuting
his theory, confirmed it to the full. Wisely or
foolishly he rejected the undulatory theory of light
in favour of the corpuscular because he had no
means of measuring with sufficient precision the
way in which, and the extent to which, a ray of
light is bent out of its course by the edge of an
B
MENTAL TESTS
opaque body. A still more striking instance of the
importance of precise measurement is afforded by
the recent achievements of Einstein. Everybody,
indeed, admits the importance of measurement in
physical things, but when it comes to mental things
people become sceptical. They admit that measur-
ing would be admirable if it could be done ; but it
cannot be done. We can measure sticks and stones,
but we cannot measure ideas. We can fathom the
depth of a well, but we cannot fathom the depth of
an emotion. That is the opinion of the frank
opponents of mental measurements. The other
side is concisely put by Thorndike : " Everything that
exists exists in some amount, and if it exists in some
amount it can be measured." Here we have the
creed of the mental tester : the belief that in some
way or other, at some time or other, the most
subtle mental processes and the" most elusive mental
products will be made amenable to measurement.
It is not the test itself that is so difficult : it is the
evaluating of the product, giving to the result a
definite position in a graduated scale, assigning to it,
in fact, a number, a mark or a score ; and so
assigning it that it has objective value. It must
not be a guess, but a genuine measurement arrived
at by means of a technique which would in every-
body's hands yield, within assignable limits of error,
the same score.
Mental tests are as old as the human race. A
man never consciously and deliberately learns any-
thing without testing himself ; he never effectively
teaches anything without testing his pupils. All
competitive examinations and all school examina-
tions are series of mental tests. And we know that
DEVELOPMENT OF MENTAL TESTS 3
competitive examinations began in China nearly
4000 years ago, and that school examinations are
as old as schools. Then there are the mental tests
that figure so largely in legend and story, the
riddles that baffle the learning of the learned and
are solved by the wisdom of the simple. (Edipus and
the Sphinx, Samson and the lion, are types of stories
where the test is of mental prowess as distinct from
bodily prowess. But these were mere pass-or-fail
tests : they were not designed to graduate, but
merely to select or sift.
The earliest attempts at real mental measurements
were of the nature of indirect measurements. Just
as we cannot measure electricity directly, but by
measuring something else whose size varies con-
comitantly with the strength of the electric current
can get a result which serves our purpose just as
well, so it was thought that some kind of vicarious
measurement might be found for the mind. And
the most plausible concomitant was the head.
This was the idea of Gall, the father of Phrenology.
At the close of the eighteenth century he and his
disciple, Spurzheim, taught that the head was an
index of the brain, and the brain an index of the
mind. They did not go so far as to say that a big
head meant a big mind, but they did assert that the
relative proportions of the skull and the configura-
tion of its surface, would, when exactly measured,
yield an exact indication of the mental powers.
The palmy days of phrenology were the early
decades of the nineteenth century. Nearly every-
body then believed in bumps. My readers will
recall an account of a certain dinner-party where
an unaccustomed guest solemnly asked the company
MENTAL TESTS
whether they did not think Milton a great poet ;
and Charles Lamb, who was present, said he wanted
to feel the gentleman's bumps. Indeed, most of
us can remember the day when the cultured house-
hold was prone to lay great store by a china head
mapped out in plots. But we no longer display
these things in the home, nor use them in the study.
Phrenology has, in fact, failed to establish its claims.
If it is science at all it differs from all other sciences
by standing still. The charts that we see to-day
in O'Dell's window in Ludgate Circus are to all
intents and purposes the charts used by Gall and
Spurzheim a hundred years ago.
Older even than phrenology is physiognomy.
Nearly thirty years before Gall expounded his
theory of the skull Lavater put forward his theory
of the face. In the year 1772, when Gall was a
boy of fourteen, Lavater published his celebrated
essay on Physiognomy. His doctrine was that the
face, not the cranium, was the index of the mind.
A man's fighting qualities, for instance, resided in
his nose. A nose that protruded near the root like
a Roman nose meant an aggressive disposition. It
was the nose of attack. If it protruded in the
middle it meant a propensity to fight for others.
If it protruded at the point it meant an aptitude
for self-defence. The retrousse nose was neither
inquisitive nor self-assertive, but merely self-
defensive.
Shrewd as some of Lavater's observations were, he
made the fundamental mistake of confusing the
bony structure of the face with its fleshy covering.
He regarded the contours of the cheek-bone as
much an expression of character as the lines round
DEVELOPMENT OF MENTAL TESTS 5
the mouth. Many years later, after the science
had emerged somewhat from the eclipse into which
it had been thrown by phrenology, the study of the
face was taken up afresh by scientists like Bell and
Darwin. They, however, definitely discarded the
osteology of the face in favour of its sarcology, and
devoted their attention to the mobile and plastic
covering, the changes in which seemed to betoken
the prevailing modes of thought and feeling.
Physiognomy became, in fact, the science of expres-
sion, and only indirectly and suggestively the science
of character and intellect. If a man, for instance,
has crow's-feet round his eyes before he is forty,
it seems no very wild conjecture to assume that he
smiles much and is of an amiable disposition. But
as weak eyes, as well as a kind heart, may cause
crow's-feet, the inference becomes somewhat pre-
carious. Indeed, Darwin never pretended that
physiognomy was an exact science. We cannot
measure a smile or a sneer or a look of surprise.
Heine states in his Florentine Nights that he
does not know whether the mouths of Parisian
women are large or small, because one can never
tell where the mouth leaves off and the smile
begins.
The next attempt to physiologise the mind was
Lombroso's. He was a criminologist who tried to
find the bodily peculiarities that went with con-
firmed evil-doing. He searched for certain marks
or malformations which he called stigmata, and
which he thought would distinguish the criminal
type. A lack of symmetry, for instance, in form
or feature was supposed to be one of the common
signs of degeneration. We admit its commonness,
MENTAL TESTS
for we are all in some degree branded with it.
This line of research, however, has fallen into
discredit. It is now-a-days believed that if there
is a criminal type we must seek its characteristics
in the mind and its behaviour, rather than in the
body and its anatomy.
The man who gave the greatest impetus to mental
measurement in England was Sir Francis Galton.
But he was primarily an anthropologist, with a
bias towards physical measurements and com-
parative anatomy. He was imbued with a belief
that some sort of correspondence could be found
between intelligence and certain bodily traits, such
as the length of the middle finger, the character of
the finger-prints, and the span of the open arms as
compared with stature. And researches in this
direction have not proved altogether fruitless. If
they did nothing else they gave us the Bertillon
system of identification by finger-prints. But they
did not bring us appreciably nearer the object of
our quest. The physical correlate of intelligence
was still undiscovered. It was left to Professor
Karl Pearson to deliver the most crushing blow to
the belief that we can find a physical scale for
mental facts. In 1906 he published the result of
an elaborate investigation into the relationship
between intelligence and the size and shape of the
skull. And his verdict was that the connection
between them, if it existed at all, was so slight as
to be of no use for purposes of inference. Here
ends the first phase of the search for a scale to
measure mind. The conclusion reached was definite
enough, but entirely negative. We cannot tell
a criminal by looking at him ; we cannot tell a genius
DEVELOPMENT OF MENTAL TESTS 7
by the shape of his skull ; and we cannot tell a fool
by the length of his ears.
But this does not dispose of physical measurements.
For it was next argued that although we cannot
measure the mind by measuring the body, we may
be able/ to do so by measuring the powers of the
body. /A static measurement failed, but a dynamic
measurement might succeed. For as soon as the
body does something, unless that something is
purely automatic, the mind co-operates in the work.
Volition, at least, is brought into play. This view
led to experiments with the various instruments
for measuring muscular strength and skill. The
dynamometer, for instance, measures the strength
of one's grip, and the ergograph the strength and
endurance of the middle finger. But valuable as
these instruments were for testing fatigue, they
revealed no sort of relationship between mental and
muscular traits. The tapping test was more promis-
ing. In its simplest form it consists in seeing how
many taps per second the subject can make with a
pencil on paper. It is found that one can tap
faster with the right hand than with the left, and
it is a conceivable hypothesis that a very stupid
person is, if I may use an Irishism, left-handed all
over his body. But although tapping afforded some
indication of motor ability it did not signify the
presence of any form of intellectual ability.
Reaction-time experiments ran on the same line.
It was found that people differed considerably in
the rapidity with which they responded to a
stimulus ; and a delicate apparatus was devised to
measure the time that elapsed between the hearing
of a sound and the pressing of a button ; or between
8 MENTAL TESTS
some other signal and some other motor response.
This test is an excellent vocational test. Rapid
reaction is important to the boxer, whose aim is to
get his fist in first, and to the airman who needs,
at the sight of danger, to press the right lever at the
right time in response to the right signal.
It will be observed that the quest begins to lose
its singleness of purpose. The measurements began
by being physical, then they became psycho physical,
ultimately they became almost purely psychical.
As soon as the psychical element came in at all,
any measurement that was achieved was to a certain
extent a psychical measurement. But to measure
a particular mental function for its own sake is one
thing : to measure it in the hope of finding it an
index of general mental ability is quite another thing.
The latter has always been the bigger problem ;
but the former is by no means devoid of purpose and
value. To be able to measure any mental function
whatever, however limited its operation, is no mean
achievement, and in the types of experiments to
be now recounted definite measurements have been
attained which have proved valuable in develop-
ing psychological theory, and serviceable in their
application to various pursuits in life. It is only
in the larger quest that they have comparatively
failed.
In the nineteenth century attempts were made to
measure sensations. Weber put forward his cele-
brated law, which was afterwards interpreted and
elaborated by Fechner. It was supposed to establish
a relationship between the physical stimulus and the
sensation it produced, or, rather, between the way
in which an increase in the intensity of the stimulus
DEVELOPMENT OF MENTAL TESTS 9
was accompanied by an increase in the intensity
of the sensation. In the sixties and seventies of
last century the Weber-Fechner law gave rise to
numerous discussions. While some thought that it
revealed the connecting-link between mind and
body and was of immense metaphysical import,
others held that it was merely an instance of the
general law of relativity : it merely illustrated the
fact that we judge things not absolutely but
relatively. But although the Weber-Fechner law
is regarded as of little importance now-a-days — of
little importance to the educationist at least — the
researches to which it led have considerably advanced
the science of mental measurements. Means were
devised, for instance, for measuring the acuity of
the various forms of sense-perception, particularly
of seeing and hearing. More important still, it led
to the measurement of sensory discrimination — of
the fineness with which we can detect differences
in things. For it was suspected that here lurked
the clue for which the intelligence-hunters were
searching. Indeed, the comparison of lines and the
comparison of weights are widely used to-day as
tests of intelligence. It was sensory discrimination
in the skin, however, that gave rise to the most
sanguine hopes. Early in this century an eminent
psychologist expressed the belief that he had found
a trustworthy instrument for measuring general
intelligence. That instrument was the Eesthesio-
meter. The sesthesiometer consists of a pair of
sharp points, like the points of a pair of compasses,
one of which can be shifted back and fore on a
graduated scale. When the points are applied to the
skin of a blindfolded subject they are felt as one
io MENTAL TESTS
point unless they stand a certain distance apart.
The least distance apart at which they are felt as
two points is the discrimination threshold. It was
believed that the sensitivity of the skin, as indicated
by this threshold, was a key to the acuteness of the
mind ; that if a man was thick-skinned he was
thick-headed as well. But when the supporters of
this theory tried to prove it they found they
could not do so. It was found that if sensory dis-
crimination of the skin was an index of anything at
all, it was just as likely to be of stupidity as of
intelligence ; for McDougall and Rivers were able
to show that the savages on the shores of the Torres
Straits had more discriminative skins than Europeans.
So the sesthesiometer was, after all, merely an
sesthesiometer and not a phrenometer : it measured
sensitivity, but not sensibleness.
The scientific study of memory began with
Ebbinghaus, who was the first to measure memory
in its simplest form — in the form of retaining and
reproducing nonsense syllables — sounds devoid of
all sense and all associations. He laid the foundation
of modern laboratory methods of testing memory,
and of the distinction made in all pedagogical tests
between rote memory and substance memory.
Galton devised rough-and-ready means of estimating
the vividness of visual imagery. Later psychologists
have used ink-blots for measuring the rapidity with
which images emerge in the mind. A series of blots
are shown the subject, and a record made of the
time he takes to name the object suggested by
each.
It were both tedious and superfluous to take the
reader through the various means that have been
DEVELOPMENT OF MENTAL TESTS n
invented for measuring the different mental faculties
— processes which, in fact, are neither simple nor
separate — processes such as attention, perception,
apperception, memory, imagination and reasoning.
And whenever an instrument or piece of apparatus
is used for any of these purposes it is always impossible
to claim that it singles out and tests one simple
and unitary mental function. One of the most
useful pieces of apparatus, for instance, in the
psychological laboratory is McDougall's dotting
machine. A strip of paper marked with an irregular
zig-zag row of small circles passes at a rate regulated
by clock-work behind a slot which enables only a
small number of circles to be exposed at a time.
The subject has to mark with a pencil the centre
of each circle as it passes before him. This machine
is used to test fatigue, to test attention and to test
accuracy of aim ; and there may be other kinds of
ability that it tests as well.
In the meantime, while specific tests were being
rapidly devised and improved, what progress took
place in the measurement of general intelligence ?
This can be best illustrated by reference to the
experiments of Mr. Cyril Burt, who has done more
to solve this particular problem than any other
British psychologist. Mr. Burt began his investiga-
tions among school-children in Oxford about the
time that Binet published his first series of tests in
1905. His method was to select the group of
children in a school who fell within certain age
limits (say from twelve and a half to thirteen and a
half years old), and to get the head teacher and class
teachers to arrange these children in order of intel-
ligence, relying partly on their empirical judgment,
12 MENTAL TESTS
and partly on the results of the school examina-
tions. Then the children were given individually
twelve psychological tests, ranging from simple
sensory and motor tests to tests of voluntary atten-
tion, and it was calculated to what extent the
results tallied with the teachers' empirical estimate.
The higher the correlation between the two orders,
the more satisfactory was the test regarded as an
index of intelligence. Some five years later Mr. Burt
made a further investigation at Liverpool, using
tests of a higher and more complex kind — extending
them, in fact, so as to include various types of
reasoning. He was thus able to range in a sort of
hierarchy, on the basis of their value as criteria of
intelligence, a large series of tests involving mental
processes of widely varying levels. And the con-
clusion he arrived at was that when we have arranged
them in the order of their complexity we have already
roughly arranged them in their order as intelligence-
measurers. And of all the tests of intelligence those
that measure the power of thinking, that is, the
power to understand and to reason, are the best.
Thus is common sense vindicated by the psychologist-
Thus we have arrived at the conviction that if we
wish to test intelligence we must test it directly
and not indirectly : we must test those very mental
processes which the plain man regards as intelligent.
We must note whether the subject is " quick in the
uptake," whether he has " nous " or " gumption,"
whether he can " see beyond his nose." And it
will be observed, too, that instruments and machines
have been relegated to the psychological laboratory,
where they are of inestimable value both to pure
psychology and to certain branches of applied
DEVELOPMENT OF MENTAL TESTS 13
psychology. For educational purposes they are of
little use. At any rate, it is certain that no machine
has yet been invented which will measure intelli-
gence ; and faith in the possibility of such a machine
is growing fainter every day. We now regard
laboratory tests and school tests as distinct things —
a position which was taken up from the very first
by that pioneer of educational experiments, Mr.
Winch.
At this stage we have abandoned certain specious
by-paths which seemed to lead to the desired goal,
and we have found the true path, which has, after
all, proved to be the common high road which the
mass of humanity has been treading all along. But
in one sense we are as far from the goal as ever. We
know the kind of test to be applied, but we have no
scale. We can test, but we cannot measure. It
was left for Binet to discover the scale. And the
full significance of his discovery is rarely realised.
His critics — and they are very numerous — have bee a
so concerned in pointing out what he has not done
that they have neglected to give him credit for what
he has done. And Binet's crowning glory is, not
that he got together a medley of heterogeneous tests
for the detection of the feeble-minded, but that he
invented a scale. In this he resembles Saul, the
Son of Kish, who set out to look for asses and found
a kingdom. Binet's scale is his kingdom ; not the
individual tests — these may so change that there is
nothing recognisable left — but the scale itself.
Its principle is age-performance. He took a certain
test, such as, say, counting backwards from twenty to
nought, applied it to a large number of children,
found the lowest age at which between 60 and 70 per
1 4 MENTAL TESTS
cent, of the children passed, and allocated the test
to that age. Thus he has a series of about five
tests for each age ; and if a child of five passes the
tests for seven, his mental age is recorded as two years
in advance of his chronological age. Thus the unit
of his scale was one year of mental age. It was a
plan so simple and obvious that one wonders that it
never was used before. But it is certain that it
never was. We came near it in the old Standards
of Examination that were issued by the Board of
Education in the days of payment by results. But
they just missed it through being an arbitrary scale,
based on opinion and not on actual age-performance.
And in modern standardised tests the standard is
always actually, or ideally, an age-performance.
Binet regarded intelligence as a complex process
whose main characteristics were threefold : its
purposefulness, its capacity for making adaptation,
and its power of self-criticism. He therefore made
his tests as heterogeneous as possible. He looked
everywhere for tests, and showed much sagacity in
finding those that were simple, practical and effective.
They involve such ordinary questions as : " Are you
a little boy or a little girl ? " and such unusual
problems as : Put the following words in such order
that they make sense —
To Asked Spelling
My I Master
Correct My.
And he used no apparatus. The only material
required, beyond what is found in an ordinary school-
room, is a series of five pill-boxes similar in size
and appearance, but differently loaded.
DEVELOPMENT OF MENTAL TESTS 15
Binet's scheme led to abundant controversy.
Indeed, the literature of the controversy would fill a
small library. Everybody abused Binet, and every-
body used him. They said his tests were very bad,
but they were the best we had. If Binet had lived
he would have improved his tests, for he was con-
tinually revising them. He, indeed, issued three
series— the 1905, the 1908 and the 191 1 series — each
an improvement on the previous one ; and just
before he died in 191 3 he was engaged in making ^'
another revision. His line of research was en-
thusiastically taken up in America, where three
important revisions were issued, the Goddard
revision, the Yerkes point-scale revision (where the
method of marking was changed), and finally the
revision that should be best known in this country,
the Stanford revision, carried out by Terman. The
basis in each case was Binet, and the alterations
and extensions were matter of detail rather than of
principle. The tests applied to the recruits for
the American Army were partly based on Binet ;
the tests used for Matriculants at Columbia Uni-
versity were inspired by Binet's, and the tests used
in Berlin to select supernormal children are imitative
of Binet's. The only extension of principle is the
addition of group-tests. Binet's tests are indi-
vidual, and it takes from half-an-hour to an hour
to examine a single subject. A large number of
new tests have been devised, which may be applied
simultaneously to a large number of pupils ; but
they agree with Binet's in being age-performance
tests, and in being tests of intelligence rather than
of acquired knowledge.
The latest development of the Binet system is
1 6 MENTAL TESTS
to be found in Burt's system of Reasoning Tests ;
or rather, it is the logical outcome of Burt's own
theories, for the only part of Binet's system that he
retains is the age-performance basis of selection.
The tests themselves he rejects. If intelligence,
which Burt defines as " inborn all-round mental
efficiency " is mainly manifested in the higher
mental processes, the best intelligence tests would
embody various forms of reasoning, and would omit
all reference or appeal to the lower mental processes.
Burt has, accordingly, recently published a series of
fifty reasoning tests, which are the best of an initial
list of 250 that were tried on miscellaneous groups
of children and adults. The questions are not the
usual syllogistic questions of traditional logic, but
involve the application of thought to the ordinary
affairs of life. Where the purpose of the test is to
pick out the brightest children rather than to pick
out the blockheads, these tests will, in my opinion,
prove far more satisfactory than Binet's. They are
new as yet and have not been extensively used ;
but I believe there is a great future for them.
They display much ingenuity, and the more difficult
tests are far more likely to pick out what Terman
calls the superior adult than Ter man's own tests.
So much for intelligence tests. But this is only
one of the fields in which the technique of mental
testing has been improved, and in which it has been
put to practical purposes. Laboratory apparatus
and laboratory methods have been rapidly improving
within recent years, and psychological tests have
been applied with great success in the realms of
medicine, business and industry. To describe these
developments here is outside my scope ; I must
DEVELOPMENT OF MENTAL TESTS 17
limit myself to mental tests in the educational
realm, and so far I have not touched the measure-
ment of school attainments.
Our success in securing standardised tests and
standardised measurements depends mainly upon
three mathematical discoveries. One of these I
have already dealt with — that of age-performance.
Before Binet's day we tried to measure one unknown
by another unknown. We tried to test mental
processes by means of tests whose difficulty was
merely a matter of guess-work. We knew neither
their absolute nor their relative difficulty. Binet
gave us a method by which the difficulty of the tests
could be graded and standardised. But he himself
seems to have made no use of the other two mathe-
matical conceptions which have largely advanced
the science of mental measurements — the theory
of normal distribution, and the theory of correla-
tion. It was Quetelet, the Belgian mathematician,
who showed how widely applicable was the law
of normal distribution, what a large number of
social and physical phenomena were found to follow
the normal curve, and how the form of distribution
could be deduced from the laws of probability. But
it was Galton who suggested that mental traits
generally, and intelligence in particular, would be
found to follow the same law. And to-day this
curve is universally used to test tests and to certify
them. If the results do not conform to the normal
curve they at once become suspect. Goddard and
Terman have amply demonstrated that Binet's
tests triumphantly stand this criterion of validity.
To Galton, too, we are mainly indebted for the
doctrine of correlation, which has proved so valuable
c
1 8 MENTAL TESTS
an instrument in determining the extent to which
two orders of measurements tally with each other —
or, in other words, the extent to which two functions
vary concomitantly. The doctrine of Correlation
was developed and elaborated by Professor Karl
Pearson, to whom we owe the formula most com-
monly used. Professor Spearman discovered a
formula of correlation which, although not so
strictly accurate as Karl Pearson's, is much simpler
to use, and is quite satisfactory when the rank only
of the subjects is known. This correlation formula
was extensively used by Burt, and has figured
largely in the exposition and proof of the doctrine
of Intelligence as conceived by Spearman and his
school.
Another device which has proved of great service
is that of " equal grasps," invented by Mr. Winch.
Each child in one group is paired with another
child of equal merit in the other group. Such an
arrangement enables the experimenter to estimate
the efficacy of a new method of teaching, or to
compare the efficacy of two rival methods ; for one
group may be taught by one method and the
other group by the other method.
It remains for me to deal with the development
of mental tests in the realm of school studies — a
realm where mental tests in the form of examina-
tions have from time immemorial been regarded
as an essential part of the school machinery. But
the newer system of school tests differs essentially
from examinations. An examination, however care-
fully it is conducted, is a test of comparative ability.
It enables the examiner to arrange his examinees
in order of proficiency. But it does not enable
DEVELOPMENT OF MENTAL TESTS 19
him to compare the proficiency of the group as a
whole with the proficiency of any other group as
a whole, nor yet with the normal achievement of
other pupils under similar conditions of age and
schooling. For a public examination is like a
rocket : it can be used but once. It measures, it
is true ; but it measures the individuals of a group
one against the other : it does not measure them by
reference to a standard scale. The newer tests,
on the other hand, are standardised tests : they are
tests whose difficulty is already known in terms of
age-achievement. The separate tests, of which an
examination consists, are either regarded as carrying
equal marks, or as carrying marks which vary in
accordance with the examiner's opinion of their
difficulty or their importance. Standardised tests,
on the other hand, do not depend for the marks
they carry on anybody's mere opinion, but rather
on their difficulty as experimentally determined.
The tests are tested and standardised by a mathe-
matical analysis of the results obtained by applying
the tests to a large number of children. The sub-
jective element is reduced to a minimum, and the
objective element raised to a maximum.
The extent to which objective tests of school
studies are possible depends partly on the study
itself, and partly on what aspect of the study it is
desired to test. Proficiency in arithmetic, for
instance, is easier to test objectively than pro-
ficiency in English composition ; and proficiency in
the fundamental processes of arithmetic easier than
proficiency in the capacity to apply these processes.
This difference in the degrees of objectivity is still
more readily seen in the attempts that have been
2 o MENTAL TESTS
made to measure the various aspects of reading.
For proficiency in reading depends upon at least
three distinct types of ability — ability to translate
symbols into sounds, ability to absorb meaning,
and ability to read aloud so as to make the meaning
intelligible to others. These are concerned with
the mechanical aspect, the intellectual aspect and
the elocutionary aspect respectively. The first
ability is easy to measure, the second difficult, the
third almost impossible. Apart from a few abortive
experiments with the dictaphone nobody has ever
attempted to invent standardised tests of elocu-
tionary power.
In America alone have educational tests received
the attention they deserve. There a number of
tests have been devised and norms established in
arithmetic, reading, spelling, grammar, geometry
and algebra. All these rest on the same basic
principle as Binet's Intelligence tests, except that
grade-performance is substituted for age-perform-
ance. The standardised tests in composition, and
handwriting and drawing, however, involve an
entirely new principle — a principle which, though
not invented by Professor Thorndike, is mainly
associated with his name. It consists in devising
a scale of typical specimens, a scale arrived at by
collating the opinions of a number of independent
judges. When it is wished to mark a particular
paper, it is compared with the specimens of the
standard scale with a view to discovering to which it
is most nearly equal in merit. If the variability in
the marking of the same set of papers by a number
of independent teachers is less when the scale is
used than when it is not used, it is claimed that
DEVELOPMENT OF MENTAL TESTS 21
the use of the scale is justified. As a matter of fact
it is found that after some practice in the use of
the scale the variability is less ; but not so much
less as to give this type of measurement any very
high degree of objective validity. Even in America
the scales are not used much. In England they are
not used at all. But the other scales are to a certain
extent. We have not, in this country, gone far in
the invention of tests and the establishments of
norms ; but we have begun the work, and we are
slowly but steadily forging ahead.
CHAPTER II
INTELLIGENCE AND KNOWLEDGE
All who have given serious thought to education
have been wont to exalt intelligence at the expense
of knowledge. " Wisdom," says an ancient writer,
" is the principal thing ; therefore get wisdom ;
and with all thy getting get understanding." And
although by wisdom he doubtless meant something
more than intelligence, he at least meant intelligence.
On the other hand, " he that increaseth knowledge
increaseth sorrow." Here in these far-off times we
see recognised a distinction which has much exer-
cised the mind of the modern educationist — the
distinction between wisdom and learning, intelli-
gence and knowledge, mental power and mental
content.
The cry for intelligence was acutely clamant
in the first decade of the twentieth century. The
primary schools had just recovered from the cramp-
ing effect of annual examinations and had begun to
breathe more freely. Intelligence became a cult
and a quest and a watchword. Teachers aimed at it ;
inspectors looked for it ; administrators encour-
aged it. There was a " slump " in accuracy and
a " boom " in intelligence. And the educational
psychologist, seeing herein a promising field for his
labours, began to investigate intelligence and to
INTELLIGENCE AND KNOWLEDGE 23
cast about for means to measure it. Although
Binet's name was alone prominent before the public,
he was by no means the only investigator ; nor,
perhaps, the most original. Still, the work he
achieved was great. He had a genius for discerning
what was worthy in the work of others ; he cast his
net so wide that nothing valuable escaped him ;
and — most important point of all — he kept a steady
eye on practicability.
But while the teacher tried to cultivate intelli-
gence, and the psychologist tried to measure in-
telligence, nobody seemed to know precisely what
intelligence was. It was certain that the term
covered a wide field. When at a recent meeting
of an education committee a member asserted that
teachers as a body were highly trained and intelli-
gent, an opponent (advocatus diaboli) retorted that
the same was true of elephants. Clearly there was
need to sharpen and define the term. It is not
enough to say that it means ability as distinct from
knowledge, capacity as distinct from content, power
as distinct from product. For how can we gauge a
mind's capacity except by finding out what it can
contain ? But what it contains is knowledge. And
how can we measure a mind's power without
measuring its product ? But its product is, again,
in part at least, knowledge. Binet saw this diffi-
culty, and frankly accepted knowledge as one of the
many marks of intelligence. One of his earlier
intelligence tests, for instance, for a child of nine
is : name in order the days of the week. He would
probably argue that a child of nine who in his inter-
course with his fellow-creatures had not picked up
these names was necessarily somewhat stupid,
24 MENTAL TESTS
Binet was inclined to adopt the easy-going attitude
of the artist, who does not mind a notion being
nebulous so long as it is workable, rather than that
of the scientist, who strives for precision and con-
sistency at all costs.
There are other psychologists, however, who
have taken up the question in a different spirit ;
most notably Professor Spearman in England and
Professor Thorndike in America. Their mode of
research was mathematical. They submitted specific
mental abilities — the ability to add numbers, to
memorise words, to discriminate lengths, to sort
cards, and so forth — to certain rigid tests, and care-
fully marked the results. Then they compared the
various scorings and found the correlation between
them ; that is, the degree of concomitant variation,
or the extent to which the compared abilities tend
to rise and fall together. And from similar statistics
the two investigators arrived at entirely different
conclusions. Thorndike concluded that the indi-
vidual abilities were entirely independent ; that
there was, in fact, no such thing as general intelli-
gence, but only particular intelligences. For may
not a man be intelligent at geometry and stupid
at history, or brilliant as a poet and hopelessly bad
at figures ? Spearman, on the other hand, con-
cluded that there was a certain dominant factor
common to all the specific abilities, a central fund
of intellective energy, to which the term general
intelligence or general ability might fitly be applied.
To put it in another way, Thorndike held that a
man's intellectual wealth consisted entirely of
coupons ; Spearman that it consisted partly of
coupons, each of which may be expended in one
INTELLIGENCE AND KNOWLEDGE 25
direction only, and partly of cash, which could
be expended in any direction. Both views recog-
nised the obvious fact that individuals varied widely
in intellectual wealth ; the views clashed on the
question of the availability of that wealth.
Let us examine the ground of the belief in
general intelligence. It is a matter of common
observation that a man who is good at one thing is
good at most other things. At least it is so as a rule ;
cases of one-sided ability are very rare. Generally
speaking, a wise man is wise in all things, a fool is
a fool all round. Indeed, it can be proved mathe-
matically that there is a positive correlation between
all forms of native ability ; they always tend to
hang together ; the odds are always in favour of
high ability in any given function being accom-
panied by high ability in any other function. Why
should this be ? Why should mathematical ability
be positively correlated, as it is, with linguistic
ability ? Even if we make every allowance for
such operations as might be conceived to be common
to the two abilities, we still fail to account for the
whole relationship. There still remains an un-
explained nexus. We are forced, in fact, to assume
a general factor common to all the multifarious
operations of the mind, a factor with which each
specific ability is, in its own measure, charged and
energised. This common factor is intelligence.
Such is Spearman's reasoning ; and its cogency is
hard to dispute. But he further arrives at the dis-
concerting conclusion that this central factor cannot
be cultivated. It is born with one, and can neither
be improved by schooling nor dulled by neglect.
Intelligence is mother wit, and mother wit is a
26 MENTAL TESTS
matter of heredity. The ancient writer already
quoted holds a more hopeful view. He believes in
the possibility of cultivating wisdom. But even he
admits that it is sometimes difficult : " Though thou
shouldest bray a fool in a mortar among wheat,
with a pestle, yet will not his foolishness depart
from him."
It follows from this that the only practical thing
a psychologist can do with general intelligence, if
there is such a thing, is to measure it. Hence the
assiduous pursuit of mental tests. It is clear that
general intelligence cannot be directly tested, for
it can never be found alone : it is always embedded,
as it were, in specific abilities. But can it not be
tested indirectly ? We have in the thermometer
an excellent example of indirect measurement.
There are no known means of measuring temperature
directly ; we have to make use of the fact that the
expansion of a thin column of mercury is almost
perfectly correlated with temperature. So instead
of measuring the temperature, we measure the
mercury ; which does quite as well. Now is there
not some simple function of mind or body which
when measured will give us in the same way an exact
valuation of intelligence ? The quest of this key-
ability resembled the quest of the philosopher's
stone. It sometimes led to the discovery of un-
expected treasure ; it often led to the discovery of
mare's nests ; it always, as far as the essential search
was concerned, ended in disappointment. Every
device failed that assumed an essential corre-
spondence of soul with sense and sought to measure
mind through matter.
There was no help for it. The hope of finding
INTELLIGENCE AND KNOWLEDGE 27
a simple key-ability had to be abandoned ; and
investigators had to fall back on the laborious
expedient of testing as many abilities as possible and,
by a process of mathematical analysis, extracting
the common intellective element. Among the
pioneers in this particular field of research, Mr.
Cyril Burt takes the foremost place. After years
of patient labour he proved that while almost any
kind of ability was a presumptive sign of intelligence,
some abilities were much safer signs than others.
To use his own words : " Of all the tests proposed,
those involving higher mental processes, such as
reasoning, vary most closely with intelligence."
There are no tests of intelligence that are more
widely used than Binet's. And Binet's tests are
based on the principles I have just expounded.
Like Spearman and Burt, he discarded brass instru-
ments, and relied more on the higher mental pro-
cesses than on the lower/ His tests measure general
ability simply because they measure a large number
of specific abilities. So far as he is concerned, it
does not in the least matter whether Spearman is
right or Thorndike ; on either theory his tests are
a real, though rough, measure of individual mental
endowment.
William James, in advocating his pragmatic
method, contends that the best way to discover the
essential difference between two conflicting theories
is to find out the practical difference in the conse-
quences that flow from them. How, in fact, do
they affect practice ? We have seen that, whichever
of the rival theories of Spearman and Thorndike
be accepted, it makes no difference in the mode of
testing ; does it make any difference to the teacher ?
28 MENTAL TESTS
Again the answer is " No." On either view the
only cultivable thing is the multifarious group of
special abilities. It must not, however, be thought
that all this theorising and researching has had no
effect on school practice. It most distinctly has.
Its effect has been to broaden the outlook, to
multiply the school pursuits, to vary and amplify
the methods of study.
CHAPTER III
THE MEASUREMENT OF INTELLIGENCE
The British Press refers to mental tests as though
they were new things invented by Americans. In
point of fact they are neither new nor American.
They have been the common property of the race
since the dawn of history. They are no more
mysterious than the conundrums that delight
children at a Christmas party. They are, in fact,
merely questions or tasks that invite a trial of
intelligence. What is new is not the tests them-
selves, but the aptness with which they are chosen
and the scientific precision with which they are
applied.
A tacit distinction seems to be made between
examinations and mental tests. This distinction
is illegitimate ; for an examination is nothing but
a series of tests, which are just as mental or psycho-
logical as any that have ever been devised. They
are not non-mental tests, but simply a special kind
of mental test. The real distinction lies between
tests of knowledge and tests of ability ; tests of
school attainments and tests of natural intelligence ;
tests of book-learning and tests of mother-wit — a
distinction which is easy to make but difficult to
maintain. For it is impossible to devise a test of
ability which does not also test knowledge, and
29
30 MENTAL TESTS
impossible to devise a test of knowledge which does
not also test ability. Let us, for example, take one
of the most characteristic of Binet's intelligence
tests. The child is asked to say what is absurd in
the following : " If I should ever grow desperate
and kill myself, I will not choose Friday, because
Friday is an unlucky day and will bring me unhappi-
ness." To answer this correctly he must know,
among other things, the meanings of the words,
and he must know that a dead man is, in this world
at least, neither happy nor unhappy. If the ques-
tion were put in these words to a unilingual China-
man he could never answer it, however intelligent
he might be ; and to a race of immortals it would
be meaningless. Now let us consider this peda-
gogical test : Which are the four largest towns in
Scotland ? A correct answer would at least involve
ability to grasp a fact and to remember it, to under-
stand a question and to respond to it. The differ-
ence between the two types is one of proportion.
To answer the second we must acquire a special bit
of knowledge which casual experience will not
necessarily force upon us ; to answer the first
involves the application of such knowledge as no
sane mortal can fail to pick up in the ordinary
course of life.
The distinction, however, so far as it goes, is
perfectly sound, and indeed is constantly cropping
up in the folk-lore and legends of all races. The
point of the old English ballad of King John and the
Abbot depends entirely on this distinction. The
king sets the abbot three mental tests. At the
risk of forfeiting his life in case of failure, the abbot
has to say what the king is worth, how long it
MEASUREMENT OF INTELLIGENCE 31
would take him to ride round the world, and what
he is thinking of. A respite of three weeks is given.
The abbot cudgels his poor brains in vain. He rides
to " Cambridge and Oxenford, but never a doctor
there was so wise that could with his learning an
answer devise." In fact, the only man who could
rescue him from his plight was his own shepherd,
a man who could neither read nor write. The
shepherd, who happened to resemble his master in
appearance, disguised himself as the abbot, pre-
sented himself before the king, and spoke thus :
" How much are you worth ? Twenty-nine pieces
of silver, for you are worth at least one piece less
than our Saviour. How long will it take you to
ride round the world ? You must rise with the
sun, and ride with the sun, until he rises here again
next day ; then the journey will take you" exactly
twenty-four hours. What do you think ? You
think I'm the abbot ; but I'm not : I'm his shep-
herd." Here we have three tests, which all the
learning in the land could not satisfy, triumphantly
solved by simple mother-wit. And, indeed, all
through the records of folk-lore and mythology do
we find the learning of the learned put to confusion
by the wisdom of the simple. The supreme ordeal
which reveals the native nobility of the hero is just
as likely to be the solution of an enigma or the
interpretation of a dream as the slaying of a dragon
or a giant. While the dragon is the test of physical
prowess, the enigma is the test of intellectual
prowess. Perseus by his courage rescued Andro-
meda, but (Edipus by his intelligence rescued the
whole of Thebes. As the reader will remember,
the riddle of the Sphinx (" What animal walks in
32 MENTAL TESTS
the morning on four legs, at noon on two, and in
the evening on three ? ") was solved by OEdipus
alone, a king's son reared by peasants, a man in whom
nature triumphed over nurture.
The whole family of riddles, puzzles, charades
and conundrums belongs to a larger group of ability
tests. They require for their solution no special
knowledge, but a general aptitude for applying
knowledge — for the discernment of subtle analogies
and contrasts. We must not, however, fall into
the error of thinking that the modern application
of mental tests consists in the mere asking of riddles.
The tests are, in a sense, riddles, but they are riddles
of a special kind : they must have diagnostic value.
Let me illustrate by comparing an old riddle with a
new test. The riddle is : What relation does a
loaf of bread bear to a steam-engine ? And the
test : Complete the analogy : As a loaf of bread
is to a glass of water, so is eating to . The
answer to the riddle is " mother " ; for a loaf of
bread is a necessity, a steam-engine an invention,
and necessity is the mother of invention. This is
not a test of intelligence, but of a perverted ingenuity.
It is not intended to gauge one's reasoning powers,
but to raise a laugh. As a means of diagnosis it is
quite useless, unless, indeed, one gives the result a
negative interpretation ; for success in its solution
is, as a mark of sanity, no more significant than
failure. It is sophistry rather than logic, a jest
rather than a test. Very different is the analogies
test given above. Failure to answer this would, in
an adult, mean a serious defect of intellect. For
its solution lies not in some crooked by-path of
sophistry, but on the great highway of human
MEASUREMENT OF INTELLIGENCE 33
thought. That is why it serves for a diagnosis of
mental endowment and unfoldment. For there is
a certain stage of mental development below which
it cannot be answered, and above which it cannot
be mis-answered. Although, therefore, the enigmas
of the ancients and the catch-questions of to-day
are, in the broad sense of the term, mental tests,
they lack the special characteristics of those tests
which are now-a-days called mental or psychological.
The whole point and pungency of the riddle or the
catch lies in its being exceptional. To repeat it in
another form is to thwart its purpose. For its
purpose is to trip us up ; and once the trick of it is
known it ceases to trip us up. After we have got
the hang of such a sentence as " Pas de l'yeux Rhone
que nous," it is quite easy to read " Guy n'a beau
dit qui sabot dit nid a beau dit-elle ? " When
CEdipus, in pondering over the riddle of the Sphinx,
guessed that " the morning " meant " the morning
of life," the enigma was virtually solved ; and all
others based on the same metaphor. The riddle,
in fact, needs novelty for its success. The nut to
be cracked may be any nut but a chestnut.
As distinct from this spurious mental test, the
genuine mental test depends for its value upon its
universality. It is always a member of a family,
and the larger the family the better. The analogies
test given above may be multiplied indefinitely ;
indeed, as many as a hundred have been set at one
sitting. For example : As cat is to kitten, so is
dog to ? As sheep is to ox, so is flock to ?
As Paris is to France, so is London to ? In
testing ability, as in testing knowledge, it is always
best to set a fair number of questions. To base a
34 MENTAL TESTS
judgment on one test, however good that test may
be, is extremely precarious ; when the test itself
is suspect, no verdict at all can be reached. Each
test must itself be tested, and the results inter-
preted in accordance with the laws of probability.
I have, for instance, tried the following test on young
and old : If you are given a corked bottle half full
of wine, how can you get the wine out without
taking out the cork or breaking either the cork or
the bottle ? Most adults proclaim the task im-
possible, but many a small boy, who has solved
certain difficulties with his bottle of ginger-beer
or liquorice water, can answer the question promptly.
The test, in fact, proves to be too dependent on
an accident of experience to be a real gauge of
practical ability. Nor is it necessary to fall back
on such doubtful tests when we have so many
that are of demonstrable validity.
Another element of prime importance in scientific
testing is speed — an element which is clearly ex-
emplified in the cancellation test. The subject is
required to cancel as rapidly as possible all the ah
or all the rs, or any other letters, separately or
concurrently, in a page of printed matter. A
limited time is allowed — say, two minutes. In
rating the results the marker has to take into account
the number of letters rightly erased, the number
wrongly erased, and the omissions. It is obvious
that with unlimited time all who could distinguish
the letters at all would score full marks. To ignore
the time element is to nullify the test.
If, however, there is one single feature which
essentially distinguishes the modern test from the
gncient, it is that the modern test h standardised,
MEASUREMENT OF INTELLIGENCE 35
The man who first convinced the world of the
necessity for standardised tests, as distinct from
casual and haphazard tests, was Alfred Binet. And
he achieved it incidentally. What he really was
after was to find out which children in the schools
of Paris were mentally defective. To do this he
devised two carefully graduated scales, one to
measure school attainments and the other to
measure general intelligence. The first measured
roughly, the second more exactly ; the first sifted
out the suspects, the second found from among the
suspects the real blockheads. The assumption was
that all fools are dunces, but all dunces are not
fools — an assumption quite in accordance with
common sense. This general scheme, full of flaws
as it is in detail, has given birth to the whole modern
system of scientific testing. He has not only shown
us how to discover intellectual and moral weaklings,
but he has led us into a fertile land which it is
our duty to explore and subdue.
Binet avoided the mistake of trying to deduce a
scale from first principles. He based his scale on
fact. Before asserting what degree of intelligence a
child of ten ought to possess, he took the trouble
to ascertain what degree of intelligence a child of
ten actually does possess. Before testing children
with a test, he first tested the test with children.
By applying it to a large number of children of
different ages he was able to fix the lowest age at
which the majority were able to pass it. Let me
illustrate this with reference to a type of test which
was evidently a favourite of Binet's, for in his scale
of fifty-four intelligence tests it appears four times
— more frequently, in fact ? than any other. It
36 MENTAL TESTS
consists in the repetition of a series of digits. The
examiner says : " I am going to say x numbers.
Listen and repeat them after me : 5, 8, 2, . . .
etc." The numbers are uttered steadilv at intervals
of half a second. Binet found that at three years
of age few could repeat three digits, but consider-
ably more than half could repeat two. Thus he
fixed the repetition of two digits as a three-year-
old test ; and acting on the same principle he
assigned three, five and seven digits to the ages of
four, eight and fifteen respectively. This innocent
little test is not, in its implications, so simple as it
seems. It measures the span of primary memory —
the number of things one can hold in the conscious
memory at the same time. It depends on the fact
that consciousness is not a point but a patch ; and
the assumption is, the bigger the patch the bigger
the mind. When the mind ceases to attend to A
and passes on to B, A does not suddenly vanish from
consciousness leaving B in sole possession, but
fades gradually away, forming in the process what
William James calls a " fringe " to B. If by the
time the last digit is uttered by the examiner the
first has quite disappeared from the child's mind,
the child will fail in the test. His psychic fringe is
too small.
The importance of primary memory is more
apparent in dealing with words than in dealing
with figures. For if all the essential parts of a
spoken sentence do not in some sense reverberate
together in the mind, the sentence has no chance
of being understood. Fortunately the words in
connected discourse have greater cohesion than a
string of digits, and the number of syllables that
MEASUREMENT OF INTELLIGENCE 37
can, according to Binet, be repeated at the ages
of three, five and fifteen amount to six, ten and
twenty-six respectively. Roughly speaking, a child
should be able to repeat twice as many syllables as
the number of years he has lived. Binet's typical
example for fifteen is : " The other day I saw in
the street a pretty yellow dog. Little Maurice
has stained his nice new apron."
Much interest is attached to Binet's third test
for children of twelve years of age : " I am going
to allow you three minutes, and I want you to say
as many words as you can think of. Some children
have said more than two hundred ; let me hear
how many you can say. Ready ? Start." In
order to pass the child must say at least sixty words.
This is an ingenious way of finding the child's
association time — the time it takes one word to call
up another. The laboratory method is to deal
separately with each word and to measure precisely
by means of a specially constructed clock the interval
that elapses between the hearing of the word and
the emergence of an associated idea. And, indeed,
some such plan is necessary in scientific investigations
of the association process itself. But for measuring
mental activity in children, for finding roughly
the rate at which ideas march through the mind,
Binet's method is equally effective and much
simpler. Although he uses this test once only, it
has been found that the number of words a child
can utter in the given time increases steadily with
his age ; that norms or averages could consequently
be fixed, and the test used to mark different levels
of intelligence. It has also been found that one
minute will give results just as trustworthy as three
38 MENTAL TESTS
minutes. All this serves to illustrate how Binet's
tests are constantly being criticised and refined
upon ; how tests themselves are not exempt from
the testing of time and experience.
Interesting, however, as these tests are, they are
probably inferior in diagnostic value to the absurdity
tests to which I have already referred . And although
Binet sets them at one point only — ten years of
age — they are appropriate to all ages except the
very lowest. The following test, which I devised
some years ago to arrange rapidly (and of course
roughly) in order of intelligence a group of children
of eleven years or over, was found to be suitable for
all children who could read fluently and were
advanced enough to express their ideas in writing —
John Carew lived in a small cottage which stood
on the top of a barren hill and faced the east. From
the foot of the hill a grassy plain stretched in every
direction as far as the eye could see. On the even-
ing of John's thirtieth birthday, while he was sitting
on the front door-step looking towards the setting
sun and watching his shortening shadow on the
gravel path, he suddenly became aware that a
horseman was riding down to the cottage. The
intervening trees and foliage made it difficult for
him to see clearly, but he was able to perceive
that the horseman had only one arm. When,
however, he got a closer view he recognised that the
visitor was his son William who had left home to
join the army twenty years before and had not
been heard of since. On seeing his father William
immediately dismounted, ran towards him, threw
his arms round his neck, and burst into tears.
MEASUREMENT OF INTELLIGENCE 39
Each child in the class was given this passage in
print, and was allowed a quarter of an hour to read
it and to write out any absurdities he could find
therein. It contains about seven absurdities, and
it was found that it did actually differentiate the
children in close accordance with the teacher's
rating of their general intelligence. Elementary
school children of eleven years of age who discovered
at least five absurdities were, as a rule, those who
on other grounds had been selected as worthy of a
secondary school training.
It has been complained that most absurdity tests
are too easy for the older pupils. Given, however,
a time limit, it would not be difficult to devise a
test which would floor the majority of adults.
Let the reader, for instance, try to find within
fifteen seconds the absurdity (there is but one) in
the following : " John Jones, who had married his
widow's sister, used to say that if a man had a bad
sister it was his misfortune, but if he had a bad wife
it was his fault."
When Binet published his scale in 1908 he regarded
it as tentative. Indeed, he revised it himself three
years later ; and since his death many a patient
experimenter has laboured at separating still further
the gold from the dross. It has been found that
while some of the tests are almost worthless, others
are of no small value : they are reliable in practice
and suggestive in theory. Moreover, new tests
are daily being devised and put to the trial of rigid
experiment, so that we are gradually accumulating
a body of tests which bear the hall-mark of demon-
strated success.
One of the main defects of Binet's scale is its
4 o MENTAL TESTS
bias in favour of language — of words rather than
things ; another defect is the paucity of tests (there
are but five) prescribed for adults. When therefore
adults have to be tested for intelligence — and
especially such illiterate adults as were plentiful in
the American Army — recourse has to be had to
tests of a new type. One of the most useful of
these is the " construction board," a variation of
the jig-saw puzzle. The sections, however, are
generally rectangular, and the time always restricted.
A test which has gained considerable vogue
during recent years consists merely in the giving of
instructions to be obeyed by the subject. These
instructions may possess any degree of complexity
and may be used to mark the rapidity with which
the meaning is grasped. For instance : " Print
your Christian name (or names) in small letters
and your surname in capitals unless there are more
than six letters in your surname, in which case you
should print your surname in small letters and your
Christian name (or names) in capitals." It is quite
easy to see how these tests may be indefinitely
complicated either by making the provisos more
puzzling or by simply increasing their number.
As a sample : " If your grandfather's only child was
your uncle draw a square ; if not, draw a circle."
Here, as always, a time limit is essential.
So much for intelligence tests. Binet's other
scale (his bar erne d' instruction) consisted of ordinary
examination questions, with the difference — a highly
important difference — that they have been carefully
sifted and standardised. The development of this
scale is quite another story. Nor must the intelli-
gence tests with which we have been dealing be
MEASUREMENT OF INTELLIGENCE 41
confused with vocational tests, which are designed
to find out what sort of work a person is best fitted
for. These are just as often physical tests as mental
tests, and frequently involve the use of delicate and
complicated apparatus.
Having made a distinction between the two broad
types of mental tests (tests of knowledge and tests
of ability), and having shown that these types are
never quite pure, let us examine the bearing of this
theoretical distinction on the practical art of
examining. There are four possible methods of
procedure. First, we may dispense with the ordin-
ary examination altogether and substitute a series
of ability tests. This is no new proposal. It has
been made many a time before, but it has always
been rejected. And rightly so. For it cannot be
too strongly insisted on that education is directly
concerned not with natural ability but with culture.
For natural ability, as scientifically conceived, grows
with the growth of the brain, fluctuates with fresh-
ness and fatigue, and varies with varying states of
health and nutrition. The most that the school-
master can do is to take full advantage of what
natural ability his pupils may happen to possess.
His business is not to train intelligence but to use
it — to use it himself, and to see that his pupils
use it. And his success is measured by the degree
of culture he imparts. But culture means know-
ledge. True, it means other things as well ; but
it means knowledge at least. And the most effec-
tive way of testing knowledge is by examinations.
The second possibility is to leave things as they
are — to rely on examinations pure and simple,
examinations of the good old-fashioned sort. But
42 MENTAL TESTS
this policy implies a wilful blindness to certain grave
defects which have been pointed out repeatedly and
persistently for the last half -century.
The third possibility is to hold two distinct
examinations, one a pedagogical examination, and
the other a psychological examination — one a test
of acquired knowledge, the other a test of natural
aptitude. This is virtually Binet's plan for detect-
ing the feeble-minded. The teacher sifts first, the
psychologist afterwards. It is also the scheme that is
reported to have been recently adopted at Columbia
University for selecting candidates for entrance.
Those who pass the ordinary matriculation ex-
amination are further submitted to a series of tests
for general intelligence. It is not quite clear
whether any are knocked out in the second round.
It is not indeed clear whether the second round
is intended to knock them out at all. The prob-
ability is that the second examination is not selective
but experimental. It is intended to reveal the
degree of correspondence between the results of
the two types of examination. Rash indeed would
be the examiner who would lightly brush aside the
evidence of combined scholarship and intelligence
which a wisely conducted examination affords.
Besides, the virtue of matriculation lies in the fact
that it guarantees a certain minimum of culture.
The university starts no course of study from the
beginning ; it takes up the tale at the point where
the secondary school left off. It demands of its
alumni a determinate measure of literacy, and by
means of a matriculation examination it assures
itself that its demand is met. And if this demand
is met — if the candidate has sufficient intelligence
MEASUREMENT OF INTELLIGENCE 43
to carry him successfully to that point in the road
of learning — it would need more proof than our
modern mental tests can at present afford to demon-
strate that he can go no farther. If the purpose
of the second examination is to reduce the element
of chance, there seems to be more reason for re-
testing the failures than for re-testing the passes.
For failures may be due to illness or excessive
nervousness, or sheer bad luck. Moreover, a high
degree of intelligence in a candidate may well be
regarded as counterbalancing certain gaps in his
store of knowledge.
The last alternative is to combine both types
of test in one. examination. When this took place
in the past- it was mainly a matter of accident;
now it is done by design. It is the way in which
the new science of testing has affected English
examinations. The examiner is beginning to ask
himself what it is that he is really examining. Is it
parrot knowledge, or is it knowledge that has been
intelligently acquired and can be intelligently
applied ? Is it the application of common know-
ledge in an unfamiliar field, or is it the application
of exotic knowledge to familiar instances ? Is it
the capacity to acquire, or the capacity to express ?
In any case the wise examiner never neglects to test
the capacity of the candidate to apply knowledge
and to express it in new forms, for in so doing he
kills two birds with one stone : he tests both know-
ledge and intelligence. Speaking generally, the
younger the pupil the less the importance to be
attached to school attainments as distinct from
native ability. In other words, if intelligence tests
are important anywhere, they are important at that
44 MENTAL TESTS
stage in the pupil's career when we decide upon
the type of education for which he is best fitted.
This principle is clearly observed in the selection
of mentally defective children at the age of seven ;
less obviously in the selection of supernormal
children at the age of eleven. It is customary at
examinations for junior scholarships to set two
papers only, one in English and the other in arith-
metic. And the questions are so devised as to
frustrate, as far as may be, the designs of the
crammer. That this examination will radically
change in the near future is highly improbable,
for it is difficult at present to conceive a better
practical device for testing annually the intelligence
of a large number of children. The fact that it is
conducted every year, or even twice a year, in the
same schools renders known and standardised tests
almost useless. If we are to remove entirely the
possibility of special coaching, the element of
novelty is essential. And the scope for variety in
the realm of English and mathematics is virtually
unbounded. Here intelligent anticipation on the
part of parents and teachers can readily be defeated.
Again, the 8000 children who sit every six months
for the London junior scholarship examination can
all be examined quite easily in one morning with
no superintendents besides the ordinary teachers.
Under the Binet scheme, where the testing is oral
and individual, it would take a trained examiner
nearly four years to get through one half-yearly
batch. To test the whole lot in one morning
would need the services of 1400 examiners specially
trained for the purpose. As a matter of fact samples
of the candidates have been tested by both written
MEASUREMENT OF INTELLIGENCE 45
and oral methods, and the results obtained are so
similar that confidence in the present system is
amply justified.
There are two fairly distinct types of ability which
are tested by these junior examinations — mathe-
matical ability and literary ability. Proficiency in
problem arithmetic is in itself no poor criterion.
One of the most trustworthy of Binet's tests is the
second in his scale for eight-year-olds : Count
backwards from twenty to nothing in twenty seconds.
It is the departure from the beaten track that gives
it its value. The modern tendency, however, is to
attach more importance to the English paper than
to the arithmetic. Binet long ago remarked that
no child who could compose was mentally defective ;
and Mr. Cyril Burt, the psychologist to the London
County Council, has expressed the opinion that the
school subject most clearly symptomatic of intelli-
gence is composition, provided it is marked for its
power of thought. The only serious defect of the
present system is the absence of tests of the third
broad type of ability — manual ability. This defect
has been carefully considered by the London
authority, who have adopted the view that in
early years the ablest with their heads are also as
a general rule the ablest with their hands ; that it
is very difficult to discover at the age of eleven (that
is, before the children have begun to attend the
handicraft and . domestic centres) what practical
ability children actually do possess ; and, finally,
that those few children who have a special aptitude
for craftsmanship and escape the junior county
scholarship net are captured by the trade scholarships.
It were unwise to prophesy what developments
46 MENTAL TESTS
will take place in English examinations in the near
future. Already one English authority has included
in its junior examination a test of the " instruc-
tions " type given above, and it is not impossible
that a third paper will become the general rule — a
paper corresponding to the obsolete general know-
ledge paper, except that it will aim at discovering
not whether the candidate is well-informed, but
whether he is sharp-witted. The secondary schools
and universities show as yet no alarming symptoms
of infection.
Thoughtful people have recently been asking :
Why is it that America has been moving so rapidly
in the matter of mental tests while England has
almost stood still ? The answer is simple : Speaking
generally, Americans believe in psychology but
Englishmen do not. When America entered the
war one of the first things she did was to mobilise
her psychologists. The war was nearly over before
England discovered that psychologists were of any
use. And the discovery was due partly to the
witnessing of what was achieved in the American
Army, and partly to an appreciation of the wonder-
ful results that followed the psychological treat-
ment of shell-shock and war-strain by those numerous
psychologists (medical and non-medical) who volun-
teered their services early in the war. The fault
did not lie with the British psychologists. In the
pursuit of their particular science and in the inven-
tion of new tests they were in no way behind their
neighbours. And the Americans knew this ; they
consulted our professors and freely used their tests.
There is no psychological instrument which is more
generally useful than the dotting machine, an
MEASUREMENT OF INTELLIGENCE 47
invention of Mr. McDougall, of Oxford ; there is
no instrument more specifically useful than the
K tube, an invention of Dr. C. S. Myers, of Cam-
bridge. The new science of testing could never
have reached its present stage but for the discovery
of that potent means of mathematical analysis known
as " correlation." It was discovered by an English-
man, perfected by an Englishman, and simplified
for common use by an Englishman. In the investiga-
tion of general intelligence and in the critical
examination of the methods used nobody has done
better work than Mr. Cyril Burt. Like Binet, he
was concerned in testing tests as well as in testing
ability. But the methods he adopted were more
rigidly mathematical. He invented tests too ; among
others the analogies test mentioned above. Many
other Englishmen have successfully laboured in the
same field, such as Mr. W. H. Winch, Dr. W. Brown,
Dr. E. O. Lewis, Professor J. A. Green, Dr. J. L.
Mclntyre, Dr. N. Carey, Miss N. Taylor, Mr.
H. B. English, Miss May Smith, Dr. Bernard Hart,
and Dr. Edgar Schuster and other investigators in
the psychological laboratories of University College,
King's College and Oxford. Such records of their
work as have already been published may be found
distributed among the pages of the British Journal
of Psychology, the Journal of Experimental Pedagogy,
and the Annual Reports of the British Association.
CHAPTER IV
binet's tests of intelligence
The translation of Binet's Tests given below
was made by Mr. Cyril Burt in consultation with
Dr. Simon, who was Binet's collaborator in de-
vising and standardising the scale. The tests,
originally standardised for Parisian children, have
been tried by Mr. Burt on a large number of
London children, and modified to suit their char-
acteristic rate of development. The tests, in fact,
are here rearranged in the order of their freshly
ascertained difficulty, and the age-assignments are
given as they were determined afresh for children
of London Elementary Schools.
Each child has to be tested individually, and
under conditions most favourable to the removal
of shyness and nervousness. The examiner must
use his discretion respecting the point of the scale
at which he should begin : the usual rule is to
start with the group of tests just below the child's
chronological age. If, however, there is a failure
in any of those tests it is expedient to go back and
try all the tests in the previous group. The ex-
amination should then be carried up the scale until
the child fails in four or five consecutive tests,
4 8
BINET'S TESTS OF INTELLIGENCE 49
Terman gives as the lowest permissible limit of
thoroughness a range which starts at the year
which yields only one failure, and ends with the
year which yields only one success. To estimate
the child's mental age the examiner should regard
the age at which all tests are passed as the base
age, and should add one-fifth of a year for every
additional test belonging to any of the higher
ages ; or, where in the following revision there are
more or less than five tests in the higher year, the
corresponding reciprocal fraction.
It is now customary to state the final result in
the form of an Intelligence Quotient, a method
first suggested by the German psychologist, Stern.
The Intelligence Quotient is found by dividing the
mental age by the real age. If, for instance, a
child's real age is eight and his mental age six,
his intelligence quotient is '75. If his mental
age were ten his intelligence quotient would be
1-25.
Binet lays it down that the amount of retarda-
tion that determines a child as defective is two
years when he is under nine, and three years
when he is past his ninth birthday. Stated in
terms of the intelligence quotient the border-line
between normality and deficiency is somewhere
about '75. No cases, however, where the intelligence
quotient falls between "j and *8 are quite free from
doubt.
50 MENTAL TESTS
BINET TESTS (BURT'S TRANSLATION
AND REVISION)
Age Three
1. Understanding Simple Commands.
Instructions. — " Show me (point to, put your
finger on) —
(i) your nose.
(2) your eyes.
(3) your mouth."
Evaluation. — All should be correctly performed ;
but free encouragement may first be given.
2. Repeating Numbers.
Instructions. — " I am going to say some numbers.
Will you listen, and say them after me ? "
(For use only after failure in first set.)
5 8 9
37 64 72 (Age 3)
714 286 539 (Age 4)
3681 5749 8526 (Age 5)
52947 63852 973i8 (Age 6)
250634 5739 16 495827 (Age 9)
9647518 4829653 5928136 (Age 11)
Note : rate should be two per second ; utter-
ance should be without rhythm, emphasis or in-
flection. Do not tell the child if he is wrong.
Do not repeat the same series. Merely give him
BINET'S TESTS OF INTELLIGENCE 51
another chance with another series. Failure owing
to interruption does not count. (While uttering
the numbers or syllables, hold up the hand or
finger to prevent the child starting to repeat before
entire phrase or list has been completed. Drop
the hand as a signal to child that you have finished
and he is to repeat.)
Evaluation. — One correct repetition out of three
trials counts as success. Note, therefore, the largest
number the child can repeat. The age at which
series of different lengths can be repeated is given
in the last column above. The repetition of figures
in their natural order, e. g. 9645678, should be noted
as an instance of automatism.
3. Naming Own Sex.
Instructions. — " Are you a little boy or a little
girl ? " (for a boy). " Are you a little girl or a
little boy ? " (for a girl).
If child says " yes " or " no " or merely echoes
part of the phrase, ask the two questions separ-
ately : " Are you a little girl ? " " Are you a little
boy ? "
4. Giving Surname.
Instructions. — " What is your name ? " If child
merely gives Christian name, ask : " And what
else ? . . . Tommy what ? "
Evaluation. — If child gives surname he has some-
times been known by e.g. stepfather, or mother
(when illegitimate) record it as correct.
52 MENTAL TESTS
5. Naming Simple Objects.
Materials. — A penny, a closed knife, and a
common kind of key.
Instructions. — " What is that ? " or " What is
this called ? " showing each object successively.
Evaluation. — All three must be named, but slight
errors, such as " money," " pennies," for " a
penny," are allowable.
6. Describing Pictures.
Materials. — Binet's three pictures — chosen as con-
taining people, and suggesting a story, and having
a certain standardised difficulty.
There can, I think, be little doubt that pictures
(1) better printed, (2) larger, (3) coloured, (4) re-
presenting actions in progress, (5) showing children,
would be much more appropriate than Binet's
original engravings. But these alone have been
standardised.
Instructions. — " Look at this picture and tell me
all about it." Binet's instructions are : " What is
this ? " and if the child says, " A picture," " Tell
what you see there." It is better, however, to
avoid leading phrases like " What can you see in
it ? " (which rather suggests enumeration) and
" What are they doing ? " (which suggests inter-
pretation). Repeat instruction once for each pic-
ture, if there is no answer. Words of praise or
encouragement alone may be added : " Isn't it a
pretty picture ? . . . Do you like it ? " Or even
BINET'S TESTS OF INTELLIGENCE 53
" That's right ! " if the child is on the point of
saying something but is withheld by shyness.
Evaluation of replies. —
Record the type of response given to the first
picture ; if doubtful, use the second and third and
record the type of response most frequently given,
i. e. employed for two pictures out of the three.
Binet distinguishes three types of response corre-
sponding to three stages of development.
A. Enumeration (E.). (Age 3.) E.g. —
i. " A man, boy."
ii. " There's an old man and a lady,"
etc. (mere list of objects or details),
iii. " I can see a room with a chair, a
table, and a looking-glass, and there's
a man and a sofa."
Two items at least should be enumerated.
If the child only gives one, do not ask,
" Anything else ? " but proceed to another
picture.
B. Description (D.). (Age 6.) (Phrases indicat-
ing actions or characteristics.)
i. " They're pulling a cart."
ii. "A man and a woman sitting on a
seat." " An old man asleep."
iii. " A man standing on a bed and trying
to look out of the window." " A
man looking at himself in the
glass."
54 MENTAL TESTS
C. Interpretation (I.). (Age 12.) (Goes be-
yond what is actually visible in the picture
and mentions the situation or emotion it
suggests.)'
i. " They're moving," " they've a heavy
load," " they can't pay their rent."
ii. "Miserable," "poor," "have no
home," " the man is saying his
prayers," " his daughter " or
" wife " (looking after him, etc.).
iii. " A prisoner," " he wants to get out,"
" he's trying to see what's in the
yard," " he is lonely " or " think-
ing," " a rag-picker," " a man in
trouble," " a man on board ship."
Age Four
7. Repeating Syllables.
Instructions. — " Listen again, and say this after
me." (The phrases should be pronounced deliber-
ately and with expression. Begin with no. iii. ;
but if the child remains silent the examiner may
give him first a shorter sentence (i. or ii.), and then
apparently try iii. again.) —
i. (2 syllables). " Father."
ii. (4 syllables). " My hat and shoes."
iii. (6 syllables). " I am cold and hungry."
(Age 4.)
iv. (8 syllables). "Here is the cloth ; my hands
are clean."
v. (10 syllables). " His name is Jack : he's
such a naughty dog." (Age 5.)
BINET'S TESTS OF INTELLIGENCE 5$
vi. (12 syllables). " It is raining outside ; but
we can stay indoors."
vii. (14 syllables). " While Jack was doing his
lessons, I caught a little mouse."
viii. (16 syllables). "We are going for a walk:
Mary, let me see your pretty hat." (Age 7.)
xiii. (26 syllables). " The other morning I saw
in the street a little yellow dog. Little
Maurice has spoilt his new apron."
(Age 14.)
Evaluation. — Allow no error at all, except mis-
pronunciations due to speech defects. (Binet's
sentences appear to have been deliberately com-
posed of two clauses. This seems unfortunate, as
even an intelligent child may forget one. In trans-
lating them I have endeavoured to keep the general
sense of the original, while making the phraseology
more natural for a child.)
8. Repeating Numbers.
Instructions. — " Listen. Say these numbers after
me"—
(For use only after failure in first set.)
714 286 539
Evaluation. — (See Test 2.)
9. Counting Pennies.
A. Four Pennies. (Age 4.)
Materials. — Four pennies placed in a row.
Instructions. — " Do you see these pennies ?
Count them, and tell me how many there are."
56 MENTAL TESTS
If the child at first answers at random, add :
" Count them aloud, and point to each penny as
you count it " ; but do not demonstrate. (It may
be of interest to see if the child can do it when
shown : " Count like this : one, two " touching
the first two with the finger as each is counted.
But do not use his answer for strict comparisons.)
Evaluation. — The first random answer does not
count.
B. Thirteen Pennies. (Age 6.)
Instructions, etc., as before.
10. Comparing Two Lines.
Materials. — Two parallel horizontal lines, 5 cm.
and 6 cm. respectively, previously drawn in ink on
a card or paper, the longer 3 cm. below the shorter,
with its centre under that of the other.
Instructions. — " Do you see these lines ? Tell
me which is the longer ? "
Evaluation. — No hesitation is allowed. (Some
investigators allow the examiner to repeat the in-
struction : English children will often respond
more readily to the injunction : " Put your finger
on the long one." But Binet insists that the child
shall not only perceive the difference, but also
understand, without any further help, that the
phrase " the longer " implies making a comparison.)
11. Comparing Faces.
Materials. — Binet's six faces. Show only two at
a time.
BINET'S TESTS OF INTELLIGENCE 57
-O
705
(Lo
Us "^
r
%
%&
58 MENTAL TESTS
Instructions. — " Which is the prettier of these
two faces ? " (If " prettier " seems not to be under-
stood, "prettiest" or "Which do you like the best
of these two ladies ? " "Which is the nice one ? "
may evoke correct answers. But these should not
be counted for purposes of strict comparisons.)
Method. — It is better to cover the lower pairs or
pair while dealing with the first or second.
Evaluation. — All three comparisons must be cor-
rectly made. Repeat the questions once if necessary.
Age Five
12. Performing Three Commissions.
Materials. — Key, book, etc. Arrange the room
while the child is carrying out one of the drawing
or writing tests.
Instructions. — "Do you see this key? Go and
put it on the table there. Then shut the door.
And after that bring me the book on the chair near
the door. Do you understand ? First, put the
key on the table, then shut the door, then bring
me the book." (Note repetition of instructions.
Do not let the child commence until this is com-
pleted.)
Evaluation. — All three commissions must be per-
formed spontaneously without any further instruc-
tion or hint (" Well, and now ? " " What have you
forgotten ? ").
BINET'S TESTS OF INTELLIGENCE 59
13. Draws from Copy.
A. A Square. (Age 5.)
Materials. — A square, each side measuring about
3 to 4 cm., drawn beforehand in ink, preferably on
a card. Plain paper. Pen and ink (deliberately
advised by Binet, making the task more difficult.
Most American adapters recommend pencil.)
Instructions. — " I want you to copy this for me "
(pointing to square). " Draw it here " (handing
pen and paper). (If encouragement is needed :
" What do you think this shape (picture) is ? See if
you can draw it." Do not use the word " square "
yourself.)
Evaluation. — Passes if it can be recognised as an
attempt at a square. If one side is twice the other,
if the lines cross considerably at the corners or bend
round without any angles, the drawing fails. The
size does not matter. (Allow only one attempt.)
Should take about one minute.
B. A Rhombus or Diamond. (Age 6.)
Materials. — A " diamond," about 7 cm. long
and 4 cm. high, with sides 4 cm. long, drawn as
before on a card. Paper, pen and ink.
Instructions and Evaluation. — As before. Binet
apparently requires at least one pair of opposite
angles to be fairly equal, at least one pair of adjacent
sides to be fairly equal, and the vertical diameter to
be long. Absolute parallelism of the opposite sides
is not insisted upon. The pass-standard is thus con-
siderably below what an uninstructed teacher would
6o MENTAL TESTS
be apt to accept as a satisfactory reproduction.
(Reference should be made to his samples.)
14. Repeating Syllables.
(10 syllables.) " His name is Jack : he's such a
naughty dog."
Instructions and Evaluation. — (See Test 7, no. v.)
15. Giving Age.
Instruction. — " How old are you ? "
Evaluation. — Child should give his age in years,
last birthday. Note : children very often say
" seven " when they mean " getting on for seven."
Hence, if the first answer is wrong, ask specifically
" How old were you last birthday ? " Parents also
often give an infant and a child about to leave
school an age above the true one : and dull children
(except when about to leave) an age below the real
one. The child's answer should be accepted if it
corresponds with what it has commonly or recently
been told. Do not, therefore, insist too rigidly on
the age given by the birth certificate or the register.
16. Distinguishing Morning and Afternoon.
Instructions. — " Is it morning or afternoon now ? "
(in the morning). Or, " Is it afternoon or morning
now ? " (in the afternoon).
Evaluation. — Repeat the question if there is any
possibility of the child having merely echoed one
of the words thoughtlessly. (Asking " Have you
had your dinner yet ? " elicits answers interesting
to compare with the above.)
BINET'S TESTS OF INTELLIGENCE 61
17. Naming Four Primary Colours.
Materials. — Four oblong pieces of paper, 2x6
cm., coloured bright red, yellow, blue and green,
and gummed beneath one another on a card.
Instructions. — " What colour is this ? " pointing
to each in turn.
Evaluation. — No error is allowed. Should take
about 6 seconds. But the time limit does not appear
to be strictly enforced.
18. Repeating Numbers.
Instructions. — " Listen. Say these numbers after
me."
(For use only after failure in first set.)
3681 5749 8526
Evaluation. — (See Test 2.)
19. Comparing Two Weights.
Materials. — Four small similar boxes (about
1*5 x 2*5 x 3*5 cm.) weighing 3, 12, 6 and 15 grammes.
Instructions. — " You see these boxes (showing
first the pair weighing 3 and 12 grammes placed 5 or
6 cm. apart). Tell me which is the heavier."
If the child merely points, add, without any
gesture : " Take them in your hands and weigh
them." (English children respond better to the
instruction " Lift them " or " Feel them." " And
give me the heavy one." But do not use the
modification if strict comparability is required.
Kuhlman and the Stanford Revision allow a demon-
stration : Binet and Yerkes prohibit it.) In any
62 MENTAL TESTS
case do not put them in his hands. If he merely
lifts one, or both together, do not correct him. If
there is any doubt with the first pair, repeat the
experiment with the second pair (6 and 15 grammes) ;
and then with the first pair again. (If the child
fails to understand, it is interesting to put them
successively into his hand, and ask, " Which is the
heavier ? " But his response in this case does not
count.) For the second and third trials use the 6
and 15 gramme weights and then the first pair again.
Evaluation. — All three trials must be correct : if
any doubt continue repetitions.
20. Giving Number of Fingers.
Instructions. — " How many fingers have you on
your right hand ? " " And how many on your left
hand ?"..." How many does that make on both
hands altogether ? "
Evaluation. — The replies must be made without
stopping to count : and all three questions must be
correctly answered.
Age Six
21. Counting Pennies.
A. Thirteen Pennies.
Materials. — Thirteen pennies placed in a row.
Instructions and Evaluation. — (See Test 9, B.)
22. Drawing from Copy.
B. A Rhombus or Diamond.
Instructions and Evaluation. — (See Test 13, B.)
BINET'S TESTS OF INTELLIGENCE 63
23. Transcription.
Materials. — " See little Paul " written in a bold,
copy-book handwriting on a card or sheet of paper.
Paper, pen and ink.
Instructions. — " Will you copy that for me ? "
Evaluation. — The test is passed if the copy is
sufficiently legible to be read by a person who did
not know what was to be written.
24. Naming Days of the Week.
Instructions. — " Can you tell me what are the
days of the week ? "
Evaluation. — The days must be named in order
without error or hesitation, in 10 seconds.
25. Naming Coins.
Materials. — Nine coins, all placed in a row on the
table with the head upwards : similar coins should
not be adjacent. Order upon table : is., id., 6d.,
Id., 2s., \d., £1, 2s. 6d., 10s. Order of difficulty : id.,
Id., \d., is., 6d., 2s., £l, 10s., 2S. 6d. (3^., 5^., 4J.).
(One pound and ten-shilling notes must be allowed.)
Instructions. — Ask " What is this ? " pointing to
each in succession. Neither examiner nor child
should handle them or turn them over.
Evaluation.
A. Four Commonest Coins. (Age 6.)
(is., 6d., id., r^d.) No error allowed.
B. Nine Commonest Coins. (Age 9.)
All should be named correctly in 40 seconds. If an
error is attributable to passing confusion, Binet
64 MENTAL TESTS
allows a second trial of the whole series after a few
minutes. (An interesting variant is to ask what
coins there are larger than a shilling, before passing
to B.)
26. Reconstructing Divided Oblong.
Materials. — Two cards (4*5 x 7*5 cm.), one in-
tact, the other divided along one diagonal into two
equal triangles. Place the triangles so that the
longest sides are at right angles, but do not face
towards each other.
Instructions. — " One of my cards has been cut
in two ; can you put the pieces together again, to
make a whole one, like this ? "
If the child merely looks at the cards without
touching them, say : " Move them about and see
if you can fit them together," and, if necessary, place
one in his hand.
See that the child does not turn one triangle over.
(Before cutting the card, black one side all over.
This does not appear to alter the difficulty of the
test, but prevents turning over.) If the child
makes a wrong combination and appeals for judg-
ment, give no opinion. Remain silent, or say
merely, " What do you think ? "
27. Defining Concrete Terms.
Instructions. — " What is —
(1) a fork ?
(2) a table ?
(3) a chair ?
(4) a horse ?
(5) a mother ? "
BINET'S TESTS OF INTELLIGENCE 65
(A word commonly used by other investigators
is " kitten," or " cat " : " fork " is unfortunately
a difficult word to begin with.) For shy or silent
children : " You know what a fork is, don't you ?
Well, tell me what it is — what is a fork ? " " You
have seen a horse, haven't you ? Tell me what a
horse is." The instructions may be repeated, but
use no other form of words. (Give the child a
minute to reply in.)
Evaluation of Replies. — The character of three
replies out of five determines the value of the test.
The variations in the age assignments of definition
superior to use depend largely on the inclusion of
such replies as ii., (1), (3), (4) and (7) under i.
rather than under ii.
Note " U " or " G " according as child defines —
i. In terms of Use. (Age 6.)
(1) What you eat with.
(2) Something to have your dinner on :
where the plates are put.
(3) It draws a cart.
(4) She takes care of the babies.
ii. In terms superior to Use (by Genus with or
without Differentia, or by Description).
(Age 10.)
(1) A thing to sit on : something that you
eat with. (" Thing " and " some-
thing," however, are not accepted
for " horse " or " mother.")
(2) An instrument.
(3) It has four legs : it's silver.
66 MENTAL TESTS
(4) A piece of wood, part of the furniture.
(5) An animal.
(6) A lady.
(7) One who cooks our dinners.
(Note : if a child uses " thing " or " some-
thing " for " chair," I then give " mother " and
"horse," otherwise even a bright child, having
given "thing" for "chair," "table" and "fork"
without correction is apt from sheer inertia to
offer " thing " as the genus of " horse.")
iii. Merely repeating the word, or pointing to
the object is marked a failure, without
(apparently) giving the child a further
chance.
28. Repeating Numbers.
Instructions. — " Listen. Say these numbers after
me."
(For use only after failure in first set.)
52947 63852 973 18
Evaluation. — (See Test 2.)
29. Describing Pictures.
Description (D.) (Phrases indicating actions and
characteristics.)
Instructions and Evaluation. — (See Test 6, B.)
30. Distinguishing Right and Left.
Instructions. — " Which is your right hand ? "
..." Which is your left ear ? "
Evaluation. — The child must perform both cor-
rectly without any kind of help. Hesitation and
BINET'S TESTS OF INTELLIGENCE 67
self-correction (without any hint) are allowed : if
by a slip the child shows his left hand or right ear,
the experimenter waits a moment for a spontaneous
correction, which is allowed to pass, but his manner
of waiting should not suggest that the first action
was wrong.
Age Seven
31. Recognising Missing Features.
Materials. — Binet's four pictures of faces without
mouth, nose, eye, and of body without arms.
Instructions. — " Look at this lady's face. Can
68 MENTAL TESTS
you tell me what has been left out ? " (Begin with
face without mouth.)
If the child says, " Her body," reply, " Oh, I was
only trying to draw her face. What must I put in
to finish the drawing of her face ? " (" What have
I forgotten in drawing her face ? ")
Evaluation. — Three correct answers out of four
are required.
32. Adding Three Pennies and Three Halfpennies.
(Concrete.)
Materials. — Three pennies and three halfpennies
set out in a row. (American investigators commonly
use stamps, but to many children the value of these
is unfamiliar.)
Instructions. — " Will you count this money for
me, and tell me how much there is altogether ? "
Evaluation. — No error and no repetition of the
instruction is allowed. The test should be done
in 8 to 10 seconds. " It is useless to wait 15
seconds."
33. Stating Differences between Concrete Objects.
Instructions. — " You know what wood is, don't
you ? . . . And you know what glass is ? . . . They
are not the same, are they ? ... In what way are
they not the same ? " (I would add, if child
hesitates : " They are different, are they not ?
Well, do you think you can tell me what the dif-
ference is ? . . . How can you tell ' glass ' from
< wood ' ? ")
BINET'S TESTS OF INTELLIGENCE 69
The following words are suggested by Binet —
i. fly, butterfly;
ii. wood, glass ;
iii. paper, cardboard.
Evaluation. — Two out of three statements must
be correct. Any true difference will pass, though
trivial. But if the child repeats the same differ-
ence, e. g. " It is larger," it is insufficient. (Ask,
" In what other way are they not the same ? ")
Often a child takes a minute : but if he takes
longer than 2 minutes for all he fails.
34. Repeating Syllables.
(16 syllables.) "We are going for a walk: Mary,
let me see your pretty hat."
Instructions and Evaluation. — (See Test 7, no.
viii.)
35. Writing from Dictation.
Materials. — Pen, ink, paper.
Instructions. — " Will you write this down for me
on this piece of paper ? "
" The pretty little girls."
Evaluation. — The writing ( ? and spelling) must
be sufficiently legible and accurate to be read by a
person who did not know what was to be written.
jo MENTAL TESTS
Age Eight
36. Reading and Reproduction.
Material. — Translation of Binet's passage, printed
or typed, with English place-names and money-
values substituted for the French.
Three / Houses / on Fire./
London, / September 5th. / A huge fire / last night/
burnt down / three houses in the middle of the
city. / Seventeen families /now have no homes. /The
loss is more than 15,000 pounds. /A young barber,/
who saved /a baby / in its cradle, / was badly / hurt/
about the hands./
Instructions. — " Will you read this for me,
please ? " Two seconds after reading is finished,
remove the passage, and say : " Tell me what you
have been reading about."
Evaluation. — Each correct phrase or word as
indicated above constituted one item.
A. Recalls two items. (Age 8.)
B. Recalls six items. (Age 9.)
37. Answering Easy Questions.
Instructions. — " Tell me this —
(1) Suppose you are going somewhere by train.
What must you do if you miss your train ?
(2) Suppose one of the other boys (girls) hit you
by accident — without meaning to. What
should you do ?
(3) Tell me what you ought to do if you broke
something that belonged to somebody else I "
BINET'S TESTS OF INTELLIGENCE 71
If no answer is given, repeat the question as usual,
not sternly, but pleasantly, prefixing : " Did you
catch what I said ? " Do not vary the wording.
Evaluation. — Two out of three must be answered
satisfactorily.
(1) Satisfactory answers. — " Wait for another " —
" Take the next."
Unsatisfactory answers. — " Go home again "
— " Run after it " — " Try not to miss it."
(2) Satisfactory answers. — " Do nothing " — " For-
give him."
Unsatisfactory answers. — " Tell teacher " —
" Hit him back."
(3) Satisfactory Answers. — " Pay for it " — " Own
up "— " Buy another " — " Ask to be for-
given " — " Say I was sorry."
Unsatisfactory answers. — " I should cry " —
" Hide it."
38. Counting Backwards 20-0.
Instructions. — "You can count, can't you — 1, 2,
3, and so on ? Now do you think you could count
backwards ? Start at 20, and go on till you reach
I." (If the child does not understand :) " Count
like this : 20, 19, 18 " (proceed no further).
(Yerkes suggests that the experimenter always
count from 25 to 21, and then pause for the examinee
to continue.)
Evaluation. — One error (either of omission or
inversion) only is permitted (Binet allows 20 seconds).
The child who thinks out the numbers by counting
up from 1 each time fails.
72 MENTAL TESTS
39. Giving Full Date.
Instructions. — " What is the date to-day ? " (If
the word " date " is not understood, ask in detail :
" What day of the week is it to-day ? " " What
month is it ? " " Do you know what day of the
month?" [ist, 2nd, 3rd, or] "what number?"
" And what is the year — nineteen what ? ")
Evaluation. — All four items must be correctly
given : but an error of three days either way is
allowable for the day of the month (unless that
involves an error in naming the month).
40. Giving Change.
Materials. — The current coins \d., \d., id., 6d.,
2s., 2s. 6d., ios., £1 and is., and in addition three
pennies and the three halfpennies. The five
boxes used for the weights.
The shilling is kept by the experimenter to pay
for the box.
The rest, with the boxes, are placed near the child.
Instructions. — " Now, shall we play shop for a
change ? You shall be the shopkeeper. Here are
some boxes for you to sell ; and here is your money.
See how rich you are ! Now, will you sell me one of
your boxes, please ? How much are they each ?
Twopence, shall we say ? Well, here is a shilling.
Can you give me the right change, please ? " (The
examiner holds out his hand for the money.)
Evaluation. — The child must actually hand over
the right amount (sixpence and fourpence in pennies
or halfpennies) : merely stating it correctly does
not count.
BINET'S TESTS OF INTELLIGENCE 73
Age Nine
41. Repeating Numbers.
Instructions. — " Listen. Say these numbers after
me"-
(For use only after failure in first set.)
250634 5 7 3 9 ! ^ 495827
Evaluation. — (See Test 2.)
42. Naming Months.
Instructions. — " Can you tell me what are the
months of the year ? "
Evaluation. — Binet allows 15 seconds, and one
error.
43. Naming Coins.
Nine Commonest Coins.
Instructions and Evaluation. — (See Test 25, B.)
44. Reading and Reproduction.
Recalls Six Items.
Instructions and Evaluation. — (See Test 36, B.)
45. Defining Concrete Terms.
Superior to Use (G.).
Instructions and Evaluation. — (See Test 27, no. ii.).
46. Arranging Five Weights in Order.
Materials. — Five boxes, identical in colour, shape
and size (about 1*5 x 2*5 x 3*5 cm.), and loaded
with shot and cotton wool to weigh 3, 6, 9 and
74 MENTAL TESTS
15 grammes, without rattling. The key letters, B, I,
N, E, T, may be written in order on the bottom of
the boxes.
Instructions. — " Do you see these boxes ? They
all look the same, don't they ? But they don't
weigh the same. Some are heavy, and some are
light. I want you to find the heaviest of all, and
put it here. Then find the one which is nearly
as heavy, and place it next. Then the one which
is still less heavy ; then the one which is lighter still ;
and, last of all, the one which is lightest of all, here."
Allow three trials if necessary, mixing the boxes
up first.
Evaluation. — The arrangement must be abso-
lutely correct in two out of three trials, and the
whole accomplished in 3 minutes. It is of special
interest to record the subject's actual arrangement.
47. Sentence Building with Three Words.
Materials. — Paper, pen and ink: and a card
with " London, river, money " written on it.
Instructions. — " I want you to make up a sentence
for me with these three words in : London, river,
money."
Instead of " London " it is often customary to
employ the nearest town that is on a river. (Most
American investigators, following Goddard, conduct
this test orally.)
Evaluation.
A. Two distinct ideas or sentences given (indi-
cates mental age of 10). " London has
money and rivers."
BINET'S TESTS OF INTELLIGENCE 75
B. One idea or sentence given (indicates mental
age of 1 1), e. g. " In the river at London
I found some money." A set of sen-
tences in which the thought is well
co-ordinated into a unitary story or
description passes.
C. Three distinct ideas or sentences consti-
tuted a failure. E. g. " London is a
town. There is a big river. Some people
have money."
Enter "1," "2" or "3" according to number
of sentences given, and note time. At least three-
quarters of the test should be written within a
minute. Binet states that this is one of the rare
cases in which a child may succeed by having heard
of the test from another child. If there is any
likelihood of this, ask at the outset : " What do you
think I have been asking the others to do with these
words ? " and substitute others, if necessary. The
child may guess the test from school exercises.
48. Drawing Two Designs from Memory.
Materials.— Binet's two designs, drawn pre-
viously on a single card or sheet, kept out of sight
until required. A pencil and plain paper.
Binet's list No.4*S.
76 MENTAL TESTS
Instructions. — " I am going to show you two
easy drawings. I want you to look at them very
carefully until I take them away. Then, after I
have turned them over, see if you can draw them
both from memory on this paper. You will only
see them for a very few seconds. Now look at
them both carefully first of all. Ready ? Now ! "
The drawings are held steadily in front of the child,
the truncated prism on the left, for 10 seconds (see
that the child does not imagine he has to copy them
at once), and then taken away and turned face down-
wards. " Now try and draw them for me here."
Evaluation. — The whole of one and a half of the
other must be reproduced fairly exactly. No
second attempt is allowed. Neatness of drawing
does not count. The examiner must be careful
not to interpret " fair exactness " more strictly for
older than for younger children. The standard
accepted in the case of this test is thus far below
what the uninstructed teacher would accept as a
satisfactory reproduction.
Age Eleven
49. Explaining Absurdities.
Instructions. — " Listen carefully to what I am
going to say. There is something in it that is
really quite silly and impossible. See if you can
tell me what is wrong."
i. " ' One day, a man fell off his bicycle on to his
head and was killed instantly. He was taken
to the hospital and they say he will never
get better.' What is there silly in that ? "
BINET'S TESTS OF INTELLIGENCE yj
ii. " ' Once the body of a poor girl was found
in a wood, cut into eighteen pieces. They
say that she killed herself.' What is silly
in that ? "
iii. " ' Yesterday there was a railway accident.
But the newspaper says it is not a serious
one, as only forty-eight people were killed.'
What is silly in that ? "
iv. " * I have three brothers — Jack, Tom and
myself.' What is silly in that ? " (Female
examiners must preface this with, " A boy
said to me, etc.," or else substitute, " I have
three sisters, Jane, Mary and myself.")
v. " ' A man once said, " If I should ever grow
desperate and kill myself, I shall not choose
a Friday to do it on, for Friday is an un-
lucky day, and would bring me bad luck." '
What is foolish in what the man said ? "
Evaluation. — Three absurdities should be de-
tected out of five. If a child's first statement is
not clear (e. g. " myself is silly " in answer to iv.),
say, " Explain what you mean." Otherwise, give
him no second chance.
(i.) Correct : " He couldn't get well if he was
already dead." " First you said he was
dead, and then you said he wouldn't get
well again."
Incorrect ; " They ought to have taken him
to the mortuary." " If he fell off his
bicycle, he wouldn't fall on his head."
(ii.) Correct : " You can't cut yourself into
eighteen pieces." " If she killed herself
she couldn't cut herself up."
78 MENTAL TESTS
(iii.) Correct : " It must have been serious if
forty-eight were killed, or if anybody was
killed." " If it wasn't serious only one
or two would have been killed." " Forty-
eight isn't serious in war-time."
Incorrect : " Forty-eight people couldn't be
killed in a railway accident."
(iv.) Correct : " You have only two." " You are
not your own brother." " You shouldn't
count yourself."
Incorrect : " You should put yourself last."
(v.) Correct : " If he killed himself, the day
wouldn't matter." " He couldn't have
bad luck if he was dead." " If he was
desperate, he wouldn't wait till Friday."
Incorrect : " He is silly to believe in bad luck."
" Friday isn't different -fee any other day."
50. Answering Difficult Questions.
Instructions. — " Can you tell me this ? "
(1) " What should you do if you found you were
late on your way to school ? "
(2) " Suppose a boy does something that is un-
kind : why do we forgive him more readily
if he was angry than if he was not angry ? "
(3) "If some one asked what you thought of a
boy (or girl) whom you did not know very
well, what should you say ? "
(4) " Why should we judge a person by what he
does, and not by what he says ? "
(5) " Suppose you were going to undertake some-
thing very important : what should you
do first of all ? "
BINET'S TESTS OF INTELLIGENCE 79
Repeat a question once, if necessary, but do not
vary the wording.
Evaluation. — Allow 20 seconds for reflection on
each question. Three out of five must be answered
satisfactorily.
(1) Satisfactory: "Hurry" or "Run" ("Go
straight to school " may be accepted if it
appears that the child sometimes plays or
carries out errands on its way).
Unsatisfactory : " Get the stick." " Leave
earlier." " Get up sooner next time."
" Ring the bell." " Get a note to excuse
me." By convention, anything not em-
bodying the idea of hurrying.
(2) Satisfactory : " Because he didn't know what
he was doing." " Because he'd be sorry
afterwards." Anything suggesting that
anger may constitute an excuse, however
badly expressed.
Unsatisfactory : " He oughtn't to get angry."
Anything suggesting disapproval of anger.
(3) Satisfactory : " I could not say anything."
" I could not tell him without finding out."
" I should say, ' I do not know.' "
Unsatisfactory : " I should have to ask."
" Say I did not know his name." Usually
unintelligible.
(4) Satisfactory : " You can rely on his actions,
but not on what he says." " Because
he might not always speak the truth."
" Actions speak louder than words."
Unsatisfactory : Usually unintelligible. " Be-
cause you can't tell." " You ought to
speak the truth."
80 MENTAL TESTS
(5) Satisfactory : " Think it over." " Ask some
one about it." " Prepare for it."
Unsatisfactory : Usually unintelligible. " Not
do it."
(Some say " Tidy myself," " Put on a clean collar ; "
in that case make it clear that you mean doing some-
thing important, not going somewhere important.)
51. Repeating Numbers.
Instructions. — " Listen. Say these numbers after
me."
(For use only after failure in first set.)
9647518 4829653 5928136
Evaluation. — (See Test 2.)
52. Giving Sixty Words in Three Minutes.
Instructions. — " I want you to give me as many
words as you possibly can in 3 minutes. Keep
saying words like this till I stop you : school, teacher,
board, boy, girl, and so on. Some children can
give more than 200. Are you ready ? Now start."
When he stops encourage him immediately by say-
ing : " Very good. Keep on."
Evaluation. — Sixty words must be given, ex-
clusive of repetitions. If the child gives sentences,
start him again, saying: "You must give separate
words." Be careful (1) to note the time, if possible
with a second hand (2) to count the words, entering
a stroke or other mark for each, and calculating the
total. It is interesting to record the key-words of
the child's various topics ; it is seldom possible to
put down all words.
BINET'S TESTS OF INTELLIGENCE 81
53. Sentence Building with Three Words.
Materials. — Paper, pen and ink, and a card with
" London, river, money," written on it.
One Idea or Sentence.
Evaluation. — (See Test 47, B.)
Age Twelve
54. Giving Three Words to Rhyme.
Instructions. — " Do you know what a rhyme is ?
When two words end the same way, we call them
rhymes. ' Jill ' rhymes with ' hill ' because they
both end in ' ill.' Do you understand ? Now can
you give me three words which rhyme with
< obey ' ? "
Evaluation. — -The child must give three genuine
words that rhyme in 1 minute. Binet's instruc-
tions to the child ask for " other words " or " all
the words." It saves time to specify three to the
child. If the child gives nothing, or has not given
enough, urge him by saying " what (else) rhymes
with ' obey.' " Apparently " disobey " may be
accepted as one of the three.
55. Rearranging Mixed Sentences.
Materials. — Three cards containing the following
words —
i. a defends dog good his bravely master ;
ii. my have teacher I the correct asked paper to ;
iii. home we early our in country left visit the to
friends.
G
82 MENTAL TESTS
Instructions. — " Put these words in order, and
find out the sentence which they make."
Evaluation. — Two correct solutions must be given
out of three. Only I minute is allowed for each.
Correct solutions are —
i. " A good dog defends his master bravely."
" A dog defends his good master bravely."
(" A master defends his good dog bravely,"
is, according to Binet, " poor " and appar-
ently incorrect),
ii. " I have asked my (the) teacher to correct
the (my) paper."
(" I asked my teacher to have the paper
correct" is, presumably, incorrect.)
iii. " We left home early to visit our friends in
the country."
(" We left our friends in the country to visit
our home early," and other sensible variants
are presumably correct.)
56. Describing Pictures.
Interpretation. — (Goes beyond what is actually
visible in the picture, and mentions the situation or
emotion it suggests.)
(See Test 6, C.)
Age Thirteen
57. Resisting Suggestion.
Materials. — A book of six leaves with two lines
drawn in the same straight line on one page in
opening. The lengths must be as follows —
BINET'S TESTS OF INTELLIGENCE 83
1st line.
2nd line.
1st page
4 cm.
5 cm.
2nd page
5 cm.
6 cm.
3rd page . .
6 cm.
7 cm.
4th page . .
7 cm.
7 cm.
5th page . .
. - . 7 cm.
7 cm.
*
6th page
7 cm.
7 cm.
Instructions. — For the first three pages : " Which
is the longer of those two lines ? " for the last three,
without changing the tone : " And these ? "
Evaluation.- — Note whether child's judgments
are right or wrong in each case, especially with
reference to the last three. The child succeeds
if he judges two out of three equal lines to be
equal.
58. Solving Circumstantial Problems.
Instructions. — " Can you guess the answer to this
riddle ?
i. " One day a woman, walking in Epping Forest,
stopped still, terribly frightened. Then she
hurried to the nearest police station, and
told the policeman she had just seen, hang-
ing from the branch of a tree, a well,
what do you think it was she saw ? "
ii. " My next-door neighbour has had three
visitors. First, a doctor called ; then a
lawyer ; and then a clergyman. What do
you think has been happening there ? "
Evaluation. — Both questions must be correctly
answered.
84 MENTAL TESTS
i. Correct: Replies must contain the idea of
some one hanged. If the child answers
" a man," " a dead person," therefore, ask
" How did he get up in the tree ? "
Incorrect : " A bird." " Some one robbing a
nest."
ii. Correct: "Some one is dying," "is very
ill." (Severe illness can be inferred from
the visit of the doctor alone. But Binet
apparently accepts it, without inquiring
whether the child knows the object of the
other visitors.)
Age Fourteen
59. Repeating Syllables.
(26 syllables.) " The other morning I saw in the
street a little yellow dog." " Little Maurice has
spoilt his new apron."
Evaluation. — (See Test 7, no. xiii.)
60. Defining Abstract Terms.
Instructions. — " Can you tell me this ? What is
meant —
i. by kindness ?
ii. by justice ?
iii. by charity ? "
(For " charity " some investigators have substi-
tuted " obedience " : but this changes the difficulty
of the test.)
BINETS TESTS OF INTELLIGENCE 85
Evaluation. — Two must be correctly defined out
of three. Correct definitions —
For (i) contain the idea of an instance of affec-
tion, tenderness, politeness or consideration to
others. (" Being polite or good to others " is
correct. " Being kind," " doing something good,"
are inadequate.)
For (ii) contain the idea of treating people accord-
ing to their merits, or of protecting the innocent
and their interests, or of punishing the guilty.
E. g. " when you punish wicked people," " playing
fair."
For (iii) contain the ideas of (a) poor or un-
fortunate people, and (b) of showing kindness.
E. g. " when you give poor people some money,"
" giving alms."
Age Fifteen
61. Drawing from Imagination the Cuts in a
Folded Paper.
Materials. — Two sheets of paper about 6 inches
square. A pencil. One sheet is folded in four like
a letter ready for an envelope. In the middle of
the edge which presents but a single fold, a small
triangular notch, about 1 cm. deep, is drawn.
Instructions. — " Here is a sheet of paper that has
been folded across and then folded again. Suppose
now Icut out a notch just here. When the paper
is unfolded again, what would it look like ? Will
you show me on this piece how and where it would
be cut ? "
86 MENTAL TESTS
Evaluation. — Two diamond-shaped holes should
be drawn in a line with each other, each in the middle
of one half of the paper.
62. Giving Differences between Abstract Terms.
Instructions. — " What is the difference between —
i. pleasure and happiness ?
ii. poverty and misery ?
iii. evolution and revolution ? "
The words baggested by Binet {Bulletin) are —
i. paresse, oisivete ;
ii. evenement, avenement ;
iii. evolution, revolution : and in 1908 scale also —
iv. plaisir, bonheur ;
v. orgeuil, pretention.
Evaluation. — Two out of three must be cor-
rectly answered. Good replies should bring out
an opposition or antithesis between the differenti-
ating ideas. E. g. (i) " happiness " is superior to
or more general than " pleasure " ; (ii) should con-
trast having little money with being in misery or
pain ; (iii) should contrast slow change with sudden
change. But Binet admits mere differences ; e. g.
evolution is the movement of troops, revolution is
an insurrection.
63. Drawing the Displaced Triangle.
Materials. — Paper and pencil for drawing. A
card about 10 x 15 cm., cut across the diagonal,
as used for divided card test, The card is first laid
BINET'S TESTS OF INTELLIGENCE 87
on the table before the subject with the cut edges
touching.
Instructions. — " Look carefully at the lower piece
of this card. Suppose I turn it over and lay this
edge (A-C) along this edge (A-B of the upper
triangle), and suppose that this corner (C) is placed
just at this point (B), what would it all look like ?
Now I am going to take the piece away (remove
lower triangle from view). Imagine it placed as
I told you, and draw its shape in the proper position.
Begin by drawing the outline of the top triangle."
Evaluation. — The essential points are : (i) A C B
must be preserved as a right angle ; (ii) A C must
be made shorter than A B.
64. Summarising Hervieifs Reflections on Life.
Instructions. — " Attend carefully to what I am
going to read you. When I have finished I shall
want you to tell me the meaning of what I read in
88 MENTAL TESTS
your own words. The exact words that I use do
not matter. Listen —
' Many opinions have been given on the value of
life. Some say it is good, others say it is bad. It
would be truer to say that it is just medium. For,
on the one hand, the happiness it brings us is never
so great as we ourselves should like, and, on the
other hand, the misfortunes it brings us are never
so great as our enemies would want us to have.
It is the intermediate nature of life that makes it
fair, or, at least, prevents it from being altogether
unfair.'
Now see if you can give me, in your own
words, the sense of what I have just read to you."
Evaluation. — The central thought must be under-
stood and these three ideas reproduced : (i) Life
is neither good nor bad, but medium, for (2) it is
not so good as we wish, but (3) better than what
others wish for us. The terms and expressions
matter little.
65. Giving the Differences between President and
King.
Instructions. — " There are three chief differences*
between a King and a President of a Republic. Can
you tell me what they are ? " " Can you think of
any of them ? "
Evaluation. — Two of the following differences,
apparently, must be given. (Some consider Binet
required all of the first three.)
i. A King inherits his crown : a President is
elected.
BINET'S TESTS OF INTELLIGENCE 89
ii. A King is king for life : a President's term of
office is limited,
iii. The powers of a King are greater than those
of a President,
iv. A King is not directly responsible to the
people : a President is. (Added by Melville
in place of iii.)
(This test is obviously more suited to French and
American children than to English. The third
difference is hardly true of an English king.)
CHAPTER V
burt's reasoning tests
A teacher who has studied Binet's tests will at
once perceive that they are unsuitable for the upper
standards. They were not devised for the discovery
of the bright children, but for the detection of the
dull. If the aim is to select supernormal children
to whom scholarships might be awarded, a more
difficult series of tests — a series that appeals more
exclusively to the higher mental functions — is
necessary. The tests recently published by Mr.
Cyril Burt and printed below were designed to meet
that need. A full description of the circumstances
under which the tests were drawn up and stan-
dardised, together with an account of the develop-
ment of reasoning in school-children, will be found
in an article by Mr. Burt in the Journal of Experi-
mental Pedagogy for June 191 9 and December 1919.
To quote from that article : " The test-questions
are intended to be given to each child individually
and orally. In my own experiments each problem
was type-written upon a separate card ; a fresh
statement commenced on a fresh line ; and by
means of indentation and spacing, question and
premises were distinguished from each other. A
card is handed to the child with the following
instructions ; ' Will you read this little puzzle ?
90
BURT'S REASONING TESTS 91
There is an easy question at the end. When you
have read the question, read carefully again what
is printed above, and try whether you can think of
the answer.' The younger and duller children
should read the test-questions aloud ; and with
the youngest and dullest of all, the examiner should
read the questions with or to the child. Children
of higher levels (Standard III) need only read
aloud the first few questions. Any child who is
unable to read a particular word or to comprehend
its meaning should be freely helped. The graver
incongruities between difficulty of phrase and
difficulty of logic have been eliminated. In a per-
fectly revised list they should never occur. A bright
young child is occasionally puzzled by such words
as ' sub-tropical,' and ' emotion,' although com-
petent to follow the reasoning. When it is clear
that the child understands his task, he should be
left quietly with the card, forgetful, if possible, of
the examiner's presence. The emotional confusion,
the ' examination paralysis,' that so commonly
embarrasses an oral interview is by this means
largely avoided. When the child gives an answer,
it is invariably received with a word of praise,
whether right or wrong ; and the child is asked
to give his reason.
" One mark is given for each test correctly answered
and correctly reasoned. When necessary, the child
may be given additional trials, not exceeding three
in all for any one test. But for each unsuccessful
attempt a quarter of a mark is deducted. A frac-
tion — as a rule, a quarter, a half, or three-quarters
respectively — is also deducted for an ill-expressed
reason, an inadequate reason, or no reason at all,
92 MENTAL TESTS
In the cross-examination as to reasons lies the most
valuable part of the test. The examiner gleans
considerable information, not only about the know-
ledge and intellectual procedure of the child, but
also about its temperament and disposition so far
as they affect his intellectual efficiency. In the
final estimate of the child he would take into account
both these broader observations, and in particular
the speed with which the child has worked. In
the actual marking no allowance is made for such
latter factors ; nor is any time-limit assigned.
Could scores be corrected on the basis of the general
impressions incidentally gained, the correlations
with ability, high as they are, would be still further
raised.
" For a child to work steadily through a series of
fifty reasoning tests until he breaks down would,
at any rate for the brighter and older children, be
a slow and fatiguing process. A short series has
therefore been constructed by selecting every third
test in the full series. The short list thus contains
only seventeen questions, two for each age except
the first, which has three.
" In the full list appended these selected questions
are marked with asterisks. They have been more
carefully chosen, more extensively used, and more
thoroughly revised. For practical purposes, indeed,
the short list will be sufficient, since this allows a
rough and rapid determination of mental age.
Where, however, it is required to obtain a more
exact estimate of a child whose mental level is
already approximately known — for example, in
testing children within the same school standard —
the full list is indispensable, since with the short
BURT'S REASONING TESTS 93
list no member of a fairly homogeneous class could
be expected to differ from the others by more than
one or two marks.
" Children should be tested with the short list first.
Even the oldest and brightest should begin with
the easiest test. They should be carried through
the series until they have broken down with three
consecutive tests. The supplementary questions
should be given subsequently, and upon a different
day. Here it will be expedient to start, not at the
beginning of the series, but about four tests below
the level of the first serious failure made on the
short list ; and the child should be carried through
until he breaks down on at least six tests consecu-
tively."
BURT'S GRADED REASONING TESTS.
Seven Years
*i. Tom runs faster than Jim : Jack runs slower
than Jim. Who is the slowest — Jim, Jack or Tom ?
2. All wall-flowers have four petals : this flower
has three petals. Is this a wall-flower ?
3. It looks like rain : but I shall stay indoors.
Shall I want an umbrella to-day ?
*4. Kate is cleverer than May : May is cleverer
than Jane. Who is the cleverest — Jane, Kate or May ?
5. It is Sunday ; and on a Sunday afternoon Ada
usually takes the baby out, or goes by herself to the
pictures, or walks over to see her aunt, or else goes
by tram to the cemetery. To-day she has no
money with her : and the baby is asleep upstairs.
Where do you think she has probably gone ?
6. Tom said to his sisters : " Some of my flowers
94 MENTAL TESTS
are buttercups." His sisters knew that all butter-
cups are yellow. So Mary said : " All your flowers
must be yellow." Grace said : " Some of your
flowers must be yellow." And Rose said : " None
of your flowers are yellow." Which girl was right ?
*7. I have bought the following Christmas pre-
sents : a pipe, a blouse, some music, a box of
cigarettes, a bracelet, a toy engine, a bat, a book,
a doll, a walking-stick and an umbrella. My
brother is eighteen : he does not smoke, nor play
cricket, nor play the piano. I want to give the
walking-stick to my father and the umbrella to my
mother. Which of the above shall I give to my
brother ?
Eight Years
8. All great men work hard and long every day :
Sir John Smith worked three hours a day. Was
Sir John Smith a great man ?
9. Peter has a half-holiday on Wednesdays and
Saturdays, and a whole holiday on Sunday. I am
at work all day, except on Monday, Wednesday,
Friday and Sunday. I want to take Peter to the
tailor's to buy a new suit. Which afternoon could
we go together ?
*io. I don't like sea voyages ; and I don't like
the seaside. I must spend Easter either in France,
or among the Scottish Hills, or on the South Coast.
Which shall it be ?
1 1 . Ethel has twice as many apples as John :
Lucy has half as many as John : Lucy has ten.
How many has Ethel ?
12. Edith is fairer than Olive, but she is darker
than Lily. Who is darker — Olive or Lily ?
BURT'S REASONING TESTS 95
*I3« The person who stole Brown's purse was
neither dark, nor tall, nor clean-shaven. The only
persons in the room at the time were : (1) Jones,
who is short, dark and clean-shaven. (2) Smith,
who is fair, short and bearded. (3) Grant, who is
dark, tall, but not clean-shaven. Who stole Brown's
purse ?
Nine Years
14. C is smaller than B : B is smaller than A.
Is A greater than C ?
15. A burglar entered my room at the Hotel
Splendid last night. The windows were all securely
fastened on the inside, and the fastenings and the
window-panes are undisturbed. The opening up
the chimney is only nine inches square. The door
opening into the main corridor was locked and the
key left on the outside. The ceilings, walls and
floor have no other openings, either secret or forced,
through which he could have entered. How did
he get in ?
*l6. Three boys are sitting in a row : Harry is
to the left of Willie ; George is to the left of Harry.
Which boy is in the middle ?
17. If I have more than a shilling I shall either
go by taxi or by train : if it rains I shall either go
by train or by 'bus. It is raining, and I have half a
crown. How do you think I shall go ?
18. On one side of my street the houses all have
odd numbers, beginning with the grocer's, which
is No. 1. On the other side the numbers are even ;
No. 2, the baker's, being opposite No. 1. My
house is No. 16. Walter is my next-door neigh-
bour : you pass his house as you come up from the
96 MENTAL TESTS
baker's just before you get to mine. What is the
number on his door ?
*I9« In cold, damp climates, root crops like
potatoes and turnips grow best ; in temperate climates
there are abundant pastures, and oats and barley
flourish ; in sub-tropical climates, wheat, olives
and vines flourish ; in tropical climates, date palms
and rice flourish. The ancient Greeks lived largely
on bread, with oil instead of butter : they had wine
to drink and raisins for fruit. Which climate do
you think they had ?
Ten Years
20. Some children were asked : " Why are towns
nearly always more unhealthy than the country ? "
They gave the following replies : (i) " Some country
places are by the seaside." (2) " There are more
doctors in the towns." (3) " The smoke of the
houses and the breath of the people prevent the
air from being fresh." (4) " The cottages in the
country are dark, tiny, and badly built." (5)
" Disease spreads where people are crowded to-
gether." Which two children gave the best
answers ?
21. " Drinking the sea dry." " Catching the
wind in a cabbage net." " Gathering grapes from
thistles." " Washing a blackamoor white." " Touch-
ing the end of a rainbow." All these sayings mean
something that is ? (Give the meanings of all
of them in one word.)
*22. The doctor thinks Violet has caught some
illness. If she has a rash, it is probably chicken-pox,
measles or scarlet fever. If she has been ailing with
BURT'S REASONING TESTS 97
a cold or cough she may develop whooping-cough,
measles or mumps. She has been sneezing and
coughing for some days, and now spots are appearing
on her face and arms. What do you think is the
matter with Violet ?
23. "I sprang to the stirrup and Joris and he :
I galloped, Dirck galloped, we galloped
all three."
What was the name of the person referred to as
" he " in these lines of poetry ?
24. The Duchess of Dustiland's diamonds have
been stolen. After the ball at the palace she gave
them to her manservant to take home, with instruc-
tions to hand them over to her maid at once. When
he was half-way home he met a friend, and a few
minutes afterwards a man in blue uniform and helmet
came up to -them and said : " I arrest you for stealing
the Duchess of Dustiland's jewels." They were
taken to a large building outside which a blue lamp
was hanging with the words " Police Station "
printed on it. Here another man in uniform took
possession of the diamonds, and locked up both
the manservant and his friend in a small, bare room
for the night. When day dawned, hearing nobody
about they climbed out through the window, but
could see nothing of either the lamp or policemen.
The Duchess is still looking for her jewels. Who do
you think is the thief ?
*2$. There are four roads here. I have come
from the South and want to go to Melton. The
road to the right leads somewhere else : straight
ahead it leads only to a farm. In which direction
is Melton — North, South, East or West ?
98 MENTAL TESTS
Eleven Years
26. A man was found nearly dead with his throat
cut, and on the back of his left arm there was a
blood-stained mark of a left hand. The policeman
says he tried to kill himself. Do you think the
policeman was right ?
27. C is West of B : B is West of A. Is A to the
North, South, East or West of C ?
*28. Father has just come home in a brand new
overcoat : there is clay on his boots, and flour on his
hat. The only places he can have been to are North-
gate, Southgate, Westgate or the City. He has
not had time to go to more than one of these.
There is no clay anywhere in the streets except
where the pavement is up for repair. There are
tailor shops only in Southgate, Westgate and the
City. There are flour mills only in Northgate,
Westgate and the City. I know the roads are not
being repaired in the City, though they may be in
the other places. Where has father been ?
29. The following are some of the occasions on
which people shed tears : People laugh till they
cry. When they are very unhappy they weep.
A fly in the eye makes the tears flow. Peeling
onions, scraping horseradish, going through smoke,
a cold wind in the face — all make the eyes water.
These instances suggest two general causes which
produce tears. What are they ? Choose your
answer from the following phrases : (1) Moderate
happiness. (2) Bright colours such as red and green.
(3) Germs. (4) Violent emotions. (5) Irritation of
the eye-ball. (6) A warm temperature.
30. In our school a third of the school play
BURT'S REASONING TESTS 99
football, and a third play cricket. (1) Are there
any who play neither football nor cricket ? (2) Are
there any who play both ? (If it is impossible to
tell without asking further, say so.)
*3I. Where the climate is hot, aloes and rubber
will grow : heather and grass will only grow where
it is cold. Heather and rubber require plenty of
moisture : grass and aloes will grow only in fairly
dry regions. Near the river Amazon it is very hot
and very damp. Which of the above grows there ?
Twelve Years
32. My brother writes : " I have walked over
from Byford Wood to-day, where I had the mis-
fortune yesterday to break a limb." Can you guess
from this which he probably broke — his right arm,
left arm, right leg, or left leg ?
33. In the old world the most thickly-populated
parts have usually been India, China, and the South
and West of Europe. In India and China the
rainfall is high in the summer ; on the shores of the
Mediterranean it is high during the winter ; on the
shores of the Atlantic it is fairly high all the year
round. In the deserts of Russia, Persia and Africa,
it is dry all the year round. Africa is very hot ;
India and China are very warm ; South and West
Europe rather warm ; the deserts of Russia cold.
What kind of climate seems to have helped the
growth of civilisation most of all — cold and dry,
warm and dry, or hot and dry ? cold and wet, warm
and wet, or hot and wet ?
*34_. Field-mice devour the honey stored by the
humble-bees : the honey which they store is the
ioo MENTAL TESTS
chief food of the humble-bees. Near towns there
are far more cats than in the open country. Cats
kill all kinds of mice. Where, then, do you think
there are most humble-bees — near towns or in the
open country ?
35. My birthday is on December 27, and I am
just four days older than Tom. This year Christmas
Day comes on a Tuesday. On what day of the
week is Tom's birthday ?
36. If the train is late he will miss his appoint-
ment : if the train is not late he will miss the train.
We do not know whether the train was late or not.
Can we tell whether he kept his appointment ?
*37. I started from the church and walked 100
yards ; I turned to the right and walked 50 yards ;
I turned to the right again and walked 100 yards.
How far am I from the church ?
Thirteen Years
38. Explain how the following code is worked :
Message (in code) . dpnf up Mpoepo bu podf.
The same (translated) come to London at once.
What is the secret letter for " x " in this code ?
39. Dismal Johnny said to Sunny Jim : "If I
marry I shall be miserable, because I shall be bothered
with looking after my wife ; if I don't marry I shall
still be miserable, because I shall have no wife to
look after me. So in either case I shall be miser-
able." Sunny Jim replied : " On the contrary, you
ought to be happy in either case ; for, if you do not
marry, you will be happy, because you will not be
bothered with looking after your wife, and "
How do you think he finished his argument ?
BURT'S REASONING TESTS 101
*40. A pound of meat should roast for half-an-
hour ; two pounds of meat should roast for three-
quarters of an hour ; three pounds of meat should
roast for one hour ; eight pounds of meat should
roast for two hours and a quarter ; nine pounds of
meat should roast for two hours and a half. From
this can you discover a simple rule by which you
can tell from the weight of a joint how long it
should roast ?
41 . I walked 10 yards down High Street ; I turned
to the left and walked 15 yards down Thomas
Street ; I turned to the left again and walked 10
yards down James Street ; I turned to the left
again and walked 15 yards down another street;
I turned to the left again and walked 10 yards down
that street ; I turned to the left again and walked
5 yards. What street was I in ?
42. 1 is 1, that is, 1 times 1.
1 and 3 added together are 4, that is, 2 times 2.
1 and 3 and 5 added together are 9, that is,
3 times 3.
1 and 3 and 5 and 7 added together are 16,
that is, ?
Look at the above carefully. Can you see a simple
rule for guessing the answers without adding up the
figures ? Work the following sums yourself ; this
will help you to find the rule : (i) 1 and 3 and 5 and
7 and 9 added together are , because this is
times . (ii) What do the first seven odd
numbers (1, 3, 5, 7, 9, 11, 13) come to when added
together ? This is times . Use the rule
to find how much the first hundred odd numbers
would come to if added up.
102 MENTAL TESTS
*43. What conclusions can you draw from the
following facts ? Iron nails will not float in a pool ;
a cup of pure gold dust weighs nearly twenty times
as much as a cup of water of the same size ; if you
drop a silver sixpence or a copper coin into a puddle,
it will sink to the bottom ; a cubic inch (about a
tablespoonful) of water weighs less than half an
ounce ; a cubic inch of brass weighs over two
ounces ; a leaden weight will drop to the bottom
of the ocean. Sum up all these observations in one
short sentence of the following form : " Most
are ."
Fourteen Years
44. When you enter my house you will find a
window on your right in the side wall of the passage.
When the sun sets it shines straight through this
window on to the wall opposite. What direction
are you facing when you stand in the doorway and
look across the street ?
45. If the A's have a bigger army than the B's
we ought first either to fight the B's, or attack the
C's by sea, but not attack the A's ; if their army
is smaller we should attack the A's first. If the C's
have a bigger navy than we, we ought to fight either
the B's or the A's, but not the C's. If their navy
is smaller, we should first attack the C's by sea.
The size of their armies and navies is as follows —
A.
Men
7,000,000
Ships
300
B.
C.
Ourselves.
5,000,000
4,000,000
6,000,000
400
500
200
Whom should we attack first ?
BURT'S REASONING TESTS 103
*/\6. John said : " I heard my clock strike yester-
day ten minutes before the first gun fired. I did
not count the strokes, but I am sure it struck more
than once, and I think it struck an odd number."
John was out all the morning, and his clock stopped
at five to five the same afternoon. When do you
think the first gun fired ?
47. Mary has just taken a penny ticket. The
trains from this station all stop at Euston, but after
that some go to Chalk Farm and Golders Green ;
others go to Kentish Town and Highgate. They
stop nowhere else. The fare to Euston, Chalk
Farm or Kentish Town is a penny : to Highgate or
Golders Green twopence. Mary did not get in
the Golders Green train. To what station do you
think she is travelling ?
48. They say that in Dodoland hundreds of years
ago all the kingfishers had legs about six inches
long, and beaks about two inches long, and they
used to wade in the water to catch fish for food.
The individual birds might differ from one another
in the length of their beaks and legs by about half
an inch or so—- not more. But the offspring of the
birds with the shortest legs or beaks would inherit
legs and beaks equally short, though again the
brothers would differ a little from each other ; and
similarly with the birds whose parents had longer
legs and longer beaks. And the same happened
with each succeeding generation. Now in those
days the pools were only four inches deep. But
they got gradually deeper and deeper ; and to-day,
where the fish swim, the water is always a foot
deep at the very least. Kingfishers of the ancient
kind would now-a-days either drown in the deep
104 MENTAL TESTS
water, or starve for lack of food ; for they could
never learn to swim. What, then, do you think
has happened to these wading birds in the course
of centuries ?
*49« Captain Watts and his son James have been
found shot — the father in the chest, and the son in
the back. Both clearly died instantaneously. A
gun fired close to the person — as, for example,
when a man shoots himself — will blacken and even
burn the skin or clothes ; fired from a greater
distance it will leave no such mark. The two
bodies were found near the middle of a large hall
used as a rifle range. Its floor is covered with
damp sand, which shows every footprint distinctly.
Inside the room there are two pairs of footprints
only. A third man standing just outside the door
or window could aim at any part of the room, but
the pavement outside would show no footmarks.
Under Captain Watts's body was found a gun ;
no such weapon was found near James. In each
case the coat, where the bullet entered, was blackened
with gunpowder, and the cloth a little singed.
Captain Watts was devoted to his son, and would
have died sooner than harm him purposely ; hence
it is impossible to suppose that he killed him deliber-
ately, even in self-defence. But some think that
James secretly disliked his father, and hoped to inherit
his fortune at his death, (i) Was Captain Watts's
death due to murder, accident or suicide ? (-2)
Was James's death due to murder, accident or
suicide ?
50. The crust of the earth — that is, the outer
layer down to at least fifty miles below the top —
consists chiefly of rock and stone. Rock and stone
BURT'S REASONING TESTS 105
weigh about three times as much as a bulk of water
of the same size. The heaviest materials found in
the crust of the earth are metals ; but in the outer
layer of the earth these are, of course, comparatively
rare. The earth as a whole weighs over five times as
heavy as a globe of water of the same size. What
does this suggest that the interior and middle of
the earth are mainly composed of — water, rock and
stone, metal or hot gas ?
CHAPTER VI
THE MEASUREMENT OF KNOWLEDGE
The heart of the problem may best be made
manifest to the reader by supposing that a father
brings to him a boy of ten years of age with the
request that he should ascertain whether that boy,
who is, say, attending a dame's school, is up to the
average in his school studies. The father does not
want to know anything about the boy's intelligence
(no father ever doubts that), but he does want to
know whether the school is giving him value for
his money — a matter which all parents frequently
call into question. Have we any means of telling
with any degree of exactness whether the lad is of
average proficiency in such rudimentary branches of
instruction as reading, composition and arithmetic ?
Until quite recently we had not. We did not know
what the average child of that age could perform :
the most we could do was to make a rough guess ;
and the value of that guess depended purely upon
personal judgment, which again depended upon the
range and nature of personal experience. We had,
in fact, no stable standards of measurement. Each
man measured for himself, and each measured with
a private yard-stick. And even to-day the position
is not much better. We have only just begun to
standardise our tests, and have merely arrived at a
106
MEASUREMENT OF KNOWLEDGE 107
few tentative standards of achievement in the
simplest of processes. There is an urgent call for
the extension of this work.
Many are the motives that urge us to make the
attempt. In France the aim has been to find the
child who fails to benefit by the ordinary instruc-
tion of the e coles primaires. Binet's motive in for-
mulating his Bareme d 'Instruction was the same as
his motive in establishing his scale of intelligence —
the quest of the subnormal child. But other motives
are constantly operative. There is a need for stan-
dards of comparison between the achievements of
children of different races, of different historic
periods, of different environments, and under
different types and modes of education. Pedagogic
records are no less useful than athletic records. It
is surely as profitable to know how long it would
take a nine-year-old child to add a given column
of figures as it is to know how long it would take
a two-year-old horse to run a mile. Yet we have
records in the one case and not in the other.
Finally, there is the need for placing in the hands
of the class teacher some means of protection against
arbitrary and unreasonable criticism by the head-
master or the inspector.
It is well to be quite clear as to the precise
nature of the standards we wish to establish ; for
there are three distinct possibilities. We may ascer-
tain what children of a given age actually do do,
or what they can do, or what they ought to do.
The first standard is actual, the second maximal,
and the third ideal. And none of them can be
either deduced from a priori principles or extracted
from the inner consciousness of a Board of Examiners.
108 MENTAL TESTS
Our scale must rest on measurable facts, must be
built up by a careful investigation into the actual
achievement of children under clearly defined con-
ditions. What standard of proficiency in reading,
for instance, or in spelling, or in arithmetic, can be
expected of London children of a given age under
their present conditions of schooling ? The simplest
and most convenient means of ascertaining this is
to devise a suitable test (by no means an easy
matter), apply it to as large a number of children
as possible, and submit the results to statistical
analysis. The scale thus arrived at will afford what
may be called, by way of distinguishing it from the
other two types of standard, Norms of Performance.
These norms have a simple and definite meaning ;
they can be verified or modified by further testing,
and they have a definite range of usefulness.
Is it possible to establish either of the other two
standards ? And if possible, is it expedient ? Is
there any point, for instance, in trying to discover
what the result in arithmetic would be if all the
schools in London were to sacrifice for a time every
kind of intellectual activity in order to secure
maximal proficiency in this one direction ? There
evidently is not. Apart from the violence done to
the victims of the experiment, the results would
be of no value. Can we deduce from the norms
of actual performance some sort of ideal standard
which would serve as an index of what the children
ought to do as distinct from what they do ? I
myself see no scientific way of doing so. Where
ethics and aesthetics have failed, it does not seem
likely that pedagogics will succeed. We have there-
fore to fall back on norms of actual performance as
MEASUREMENT OF KNOWLEDGE 109
constituting the only type of standard which proves
to be both practicable and useful. Of the other
two, one is practicable without being useful, and
the other useful without being practicable.
Let us now consider what attempts have actually
been made to arrive at standards or norms of
instruction. They first appear in the guise of
curricula issued by a central authority. In our own
country the Board of Education's standards of
examination are familiar to all those who remember
the days of payment by results. Indeed, they have
not yet quite disappeared from the Code of Regula-
tions : they appear in a modified form in the section
on certificates of proficiency. It may be said at
once that they do not represent any pure type of
standard, as the word is used in this book. They
are neither actual, maximal, nor ideal. Partaking
of some of the worst features of each, they are
hybrids of very doubtful pedigree. Based originally
on the opinion of what certain authorities at the
Education Department thought reasonable, moulded
to suit the exigencies of examination, they lacked a
solid foundation of objective fact. Moreover, the
scheme was used, and, indeed, intended to be used,
in such a way as to exemplify what Professor Adams
has pointed out to be the one great danger to which
all norms or standards are exposed : it was used as
a goal, and not merely as a test. Since preparation
for annual examinations was at that time virtually
the sole aim and purpose of the school, it was
inevitable that these standards of examination
should become syllabuses of work ; should form the
ground of the classification of scholars ; should, by
simple metonymy, give their name to the classes
no MENTAL TESTS
preparing for the specific examinations ; and, finally,
be regarded as norms of performance for children
of various ages. Thus the word tried to do several
distinct pieces of work, and did none of them well.
As examination syllabuses and schemes of study,
the standards were marred by serious faults. In
the first place, they were often so vague as to be
almost useless. The reading requirement for Stan-
dard III, for example, was " To read a passage from
a reading-book " ; and for Standard IV, " To read
a passage from a reading-book or history of England."
Neither sentence tells us more than the first two
words alone. No indication is given of the mode
of judging whether a child has reached the specified
standard, nor any indication as to the amount of
progress in reading to be expected of a child between
the ages of ten and eleven, the normal ages for
" passing " these standards. Secondly, the scale
involved too high a standard of mechanical accuracy
— a standard which could not be reached without
an undue expenditure of time, and a consequent
narrowing of the curriculum and loss of interest.
With the present wide and generous scheme of
studies, so high a level of attainment in one direction
would be difficult, if not impossible. Finally, as
norms of achievement, the standards of examination
were woefully defective. Even supposing that some
objective means of testing were prescribed, so that
two independent examiners would inevitably arrive
at the same results (which was certainly not 'the
case), the standards themselves were arbitrary and
lacked the guarantee of a careful scientific study of
the normal capacities of children.
Nor did the schedule of standards represent a
MEASUREMENT OF KNOWLEDGE in
real age scale. Although a normal pupil was sup-
posed to pass through each of the seven standards
before he reached fourteen years of age, he could
only pass one a year ; and while a stroke of bad
luck would put him back a whole year, no amount
of good luck could put him forward a day. The
consequence was that retardates were plentiful and
accelerates entirely absent.
A very different kind of standardisation is that
of Binet. His Bar erne & Instruction was based on
the mean performance of a large number of Parisian
children. It supplies, in fact, real norms in three
important branches of instruction — reading, number
and spelling. But although the aim is laudable, the
table published in Les idees modernes sur les Enfants
is of no great value. As the term " Bareme "
implies, it claims to be a ready reckoner — a rough-
and-ready means of finding out whether a child of
a given age has profited adequately by his schooling.
But its very roughness and readiness constitute its
main defect. It is supposed to take only ten minutes
to assess a child's scholarship — to fix his position in
the age scale ; and it is hard to believe that so
meagre and hurried an examination can gauge the
effect of years of teaching.
Binet's method of judging the reading is based
on the position of the pauses made by the reader.
If he pauses between the syllables, it is marked
" syllabic " ; if he pauses between the words, it is
marked " hesitant " ; and if he does not pause at
all, except where the sense of the passage demands
it, the reading is marked " current." In all, five
grades of proficiency are indicated — subsyllabic,
syllabic, hesitant, current and expressive. This
ii2 MENTAL TESTS
seems a simple scheme, and if it were easy to apply,
its obvious lack of delicacy of gradation might be
overlooked. But, as a matter of fact, it is impos-
sible to apply it with any feeling of certainty. It
is very rare that any child's reading is found to fall
entirely within any one of the prescribed grades.
In arithmetic simplification is again aimed at.
Binet goes so far as to contend that it is unnecessary
to test both addition and subtraction, for a know-
ledge of the latter implies a knowledge of the former.
By parity of reasoning, a test in division is claimed
to render a test in multiplication unnecessary.
Indeed, three simple questions for each child seem
to be considered adequate ; but each question must
be given in concrete form. To illustrate the kind
of question he advocates, I will cite his typical
example for children between six and seven years
of age : " From 19 apples take away 6 apples :
how many remain ? "
The spelling test is of no value outside France ;
for it is a test of grammar as well as of spelling —
the two being inseparable in the French language.
This peculiarity is rendered obvious by the typical
sentence given in the Bareme : " Les jolies petites
filles etudient les plantes qu'elles on remasses hier "
— a sentence abounding in pitfalls.
Turning from France to America, we find there
the home of standardisation. A tremendous volume
of work has been done by Thorndike, Courtis, Ayres,
Terman, Munroe, Judd and others with a view to
reaching as much exactitude as the nature of the
subject will permit. But its usefulness for us is
marred by its insularity — by the fact that it is
American and not English. Their arithmetic tests
MEASUREMENT OF KNOWLEDGE 113
abound in dollars and cents, their problems often
refer to customs and transactions peculiar to them-
selves, and the spelling is Websterian. The next
serious defect, however, is that the results are not
embodied in an age scale, but in a grade scale. We
are told what level the children in Grade IV, say,
of a certain American school, did, on an average,
attain in arithmetic, spelling and reading. But we
are not told the actual age of these children. Nor
can we infer it from the grade. We learn from
their educational papers that the ages in the grades
are not what they are theoretically supposed to be.
They vary from school to school, and from city to
city. Moreover, it makes a very great difference
whether the tests were given at the beginning, the
middle, or the end of the school year. Binet's
scales are of universal value because they are age
scales ; and if the American scales are to be used
in England they must be translated from grade
scales to age scales. They will then cease to be
national and become international.
I must emphasise the fact that the tests are
merely tests, and that their value diminishes in the
proportion that they deflect the general course of
study. Although in those rudimentary habits which
are the beginnings of school pursuits this danger
may be met by making the tests comprehensive,
that device is impossible in those higher and more
advanced branches of study where no test can cover
more than a small fraction of the whole field. We
have, in fact, ultimately to rely on the co-operation
and good faith of the teacher. And my experience
goes to show that such confidence is rarely misplaced.
It has been urged as an objection to the use of norms
1
ii4 MENTAL TESTS
that they refer to the most measurable school sub-
jects, and the most measurable are the least valuable.
There is thus a danger that the teacher will con-
centrate his efforts on the more mechanical aspects
of school work, to the neglect of the higher and
more spiritual aspects. As a matter of fact, this
fear is groundless. Norms in the mechanical sub-
jects are just as much a protection as a menace.
They will probably show that in some schools the
mechanical work is too good : it is so far above the
normal that too much time and attention have been
devoted to it. When the norm is reached further
" drill " is for the time unnecessary. The teacher
would realise that he could with a clear conscience
reduce the drudgery and render the work more
recreational and more broadly cultural. What is
really aimed at in establishing the more mechanical
norms is the discovery of that degree of automatism
which best serves the interests of the higher pro-
cesses of thought. And in any given school it may
be just as necessary to slacken the standard as to
string it up.
CHAPTER VII
DISTRIBUTION AND DISPERSION 1
When we measure a multitude of natural objects
belonging to the same genus or class we find that
the results tend to assume a determinate shape.
The magnitudes occur in accordance with a general
law. Let us suppose, for instance, that we start
measuring the heights of adult Englishmen by
making one after the other stand against a post and
marking each height with a horizontal stroke. If
we choose our specimens at random we shall soon
find our marks tending to crowd round a fixed
height. By the time we had measured ioo the
marks would appear somewhat as in Fig. I. The
largest number of marks would probably fall between
6j and 68 inches, and the numbers would tail off
as they diverged in either direction from this central
position. As the number of men increased we should
find our range of heights extending, for we are
much more likely to find a giant or a dwarf among
10,000 men than among fifty. By the time 8000
had been measured we should probably find our
range extending from 57 to 78 inches. Let us now
1 For a fuller treatment of this subject the reader is referred
to Nunn's Exercises in Algebra, Vol. II. , Sec. IX. (Longmans).
Some excellent diagrams illustrating distribution and dispersion
will be found in Burt's Educational Abilities (King and Son).
i J 5
n6
MENTAL TESTS
take our ioo men, or, for statistical convenience,
let us say ninety-nine, and put them to stand in a
row in order of height so that the tallest stand at
one end of the row and the shortest at the other.
An unreflecting observer would expect the line
joining the tops of their heads to be a straight line.
Figs. land 2. Probable heights of 99
Englishmen- taken at
Quartile.
Inches
7+1
7 3 "
7*
—
V
70-
n
69-
- : 5^ UblwT
66-
6 7 -
^ — Nedian
66-
^52— Latter
=f Qvartik
65-
s
64-
=
63
~
61
t
6m
E
6o-
59
1
5$\
f'a
99»ien standing in a row in. oraitr of height (£ou«si s6"tnniiu