t
C. x \ “LL
.7... AN” 9rd...
13‘. 1“ V6 r.
FOL .0 Aft-l0 I99.- \
.I‘
V‘ .u ('9 L
‘05 I! ...
5H5)... 1. an.“ ‘Wool
o. o‘.‘o I b‘lIIJ I O. +.‘L
‘ihu Co:- h‘J“ . .
a: “06 3.0.1. 300 I :4: no
9005. ~ )‘o 53w“. “‘3‘
WHY- I“. (o inch 5 a.“ I
‘ .t . _ . .1
.nU ”5.3. 5.»: on... . .. . w Rh...
‘ LI': 5"“.
i P 5 Tu»... ”it.“ .9,.... ‘-
.V‘ot i o a... ‘I‘
0 9hr!“ o. . in mass...
«2.» 5. . “n... «J 2. “.3
‘1‘! PE .- v .w «Nov .b'AK 0.?
C .o .- 3. ‘0... A? ’orho {S
“H‘ r, a y..- a‘. "To. L ‘33
Fit 5:. ill u. :3 may
who... 3-... s . .. a .. L. n...
‘ .. my... 6»: S .h v
.3. as. h .. T: .4 0
:IW‘ ‘ - ...Dh . .k 4 . I ’ ..
H4; W. Io. 393‘“ .9. '1'. 1‘}! A u V
9t .9 H J .knJr. ad MVP... c. n
0'0”. 3%; o n Y on
3.... nun 3.x 3U ab PH.
1 {f 7 to d Iv... .59.. «an»!
I570... «St .‘I . (I
& ,
'

 

E,.,.,,§:,,,E.,,

I ’II |' _ I. .

‘ 3‘
h
«

 

 

 

MSU

LIBRARIES
m

‘—

 

 

RETURNING MATERIALS:
Place in book drop to
remove this checkout from
your record. FINES wilI
be charged if book is
returned after the date
stamped be10w.

 

 

 

2%”

RV
2823

 

ABSTRACT

A STUDY OF CERTAIN INSTRUCTIONAL MATERIALS
AS THEY INTERACT WITH SPATIAL AND
NUMERICAL PRIMARY MENTAL ABILITIES

By
Sarah A. Sprafka

Thirty fourth and thirty fifth grade children were
given three subtests of the Primary Mental Abilities Tests.
Based on scores on these tests, S's were assigned to one
of four groups: Number high/spatial low, or number low/
spatial high, fourth or fifth grade. Treatments consisting
of two versions of a paper and pencil program dealing with
the arithmetic of directed numbers were then administered
to the groups; Half of each group learned by a spatial
version of the program, the other half by a symbolic version
of the program. A Lancaster x2 showed that for this parti-
cular sample there was no interaction between spatial and
numerical Primary Mental Abilities and a spatial or symbolic

approach to learning about directed numbers.

A STUDY OF CERTAIN INSTRUCTIONAL MATERIALS
AS THEY INTERACT WITH SPATIAL AND

NUMERICAL PRIMARY MENTAL ABILITIES

By

Sarah A. Sprafka

A THESIS

Submitted to
Michigan State University
in partial fulfillment of the requirements
for the degree of

MASTER OF ARTS

Department of Counseling, Personnel Services
and Educational Psychology

1969

ACKNOWLEDGEMENTS

The author wishes to express her appreciation to
Dr. Lee Shulman, Dr. Stephen Yelon, Dr. Andrew Porter,
Dr. Maryellen McSweeney, Mr. Bert Sterling, Mrs. Ann Reed,
Mrs. Bessie Hetrick, Mrs. Gertrude Bissel, and Mrs.
Georgie Dillon for their cooperation and assistance in

the preparation of this thesis.

ii

TABLE OF CONTENTS

Page

ACKNOWLEDGEMENTS . . . . . . . . . . . . . ii

LIST OF TABLES . . . . . . . . . . . . . . v
Chapter

I. INTRODUCTION . .. . . . . . . . . . . 1

Identification of Aptitudes . . . . 3

Studying Aptitude x Treatment Interactions . . 7

The Aims of This Study . . . . . . . . 8

II. REVIEW OF LITERATURE . . . . . . . . . . 10
Relevant Literature . . . . . . . . . 10-
III. METHODS AND MATERIALS . . . . . . . . -. 19

Plan of the Study . . . . . . . . . . l9

Instructional Programs . . . . . . . . 20
Subjects . . . . . . . . . . . . . 22
Aptitude Measures . . . . . . . . . . 23
Outcome Measures . . . 23
Assignment of Subjects to Treatment Groups . . 2A

I‘xperimental Schedule . . . . . . . . . 26
IV. RESULTS . . . . . . . . . . . . . . 29

V. DISCUSSION . . . . . . . . . . . . . 39

The Criterion Test . . . . . . AO
Predictive Power of Tests (Predictors)

For Fourth and Fifth Grades . . . . Al
Possible Reasons for Inconclusive Results . . A2

VI. CONCLUSIONS . . . . . . . . . . . . . AA
Possibilities for Future Research . . . . . A6
BIBLIOGRAPHY . . . . . . . . . . . . . . A8

APPENDICES . . . . . . . . . . . . . . . 51

iii

Appendix
Appendix
Appendix

Appendix

iv

Table

A

0-3

D-A

LIST OF TABLES

Scores of 9 and Less Than 9 on Addition
Test (Frequencies) for Each Treatment
Condition

x2 Values for Main and Interaction Effects-—
Addition Test

Scores of 3 and Less than 3 on Diagram Test
(Frequencies) for Each Treatment Condition

x? Values for Main and Interaction Effects——
Diagram Test

Reliabilities of Spatial Relations and
Number Sense Subtests for Grades A and 5

Intercorrelations of Spatial Relations and
Number Facility Subtests for Fourth and
Fifth Grade . . . . . . . .

Frequencies and x2 Value for Main Effect of
Grade Level——Addition Test

Post Test Scores . . . . . . . .

Treatment: Symbolic; Type: Spatial high/
Number low; Fourth and Fifth Grades

Treatment: Symbolic; Type: Spatial low/
Number High; Fourth and Fifth Grades . .

Treatment: Spatial; Type: Spatial high/
Number low; Fourth and Fifth Grades . . .

Treatment: Spatial; Type: Spatial low/
Number High; Fourth and Fifth Grades . . .

Page

30

3O

32

32

37

37
53

62

63

6A

65

CHAPTER I
INTRODUCTION

In recent years interest in the individualization of
instruction has vastly increased. Programmed instruction
is used more and more in the classroom. In the absence of
programmed instruction, individualization is often achieved
using conventional textual materials, and audio-visual
materials. The teacher does not get up in front of the class
and make a presentation, but acts as a monitor for a class,
each member of which is working individually. Some individ—
ualization is being used a bit differently in at least one
other area. That is the area of mastery learning being
investigated by Bloom (1968). He contends that mastery of
a given amount and type of subject can be achieved by almost
everyone, but it will take some students a little longer than
‘others. To assure mastery, it may be necessary to make
available to a student materials and modes of presentation
which are different from (and supplementary to) the conven-
tional classroom presentation, or perhaps an individualized

approach using conventional materials. Cronbach (in Gagne,

1967, Chap. 2) suggests a number of approaches to implementation
of such a scheme. The first decision must be whether every-
one is going to learn everything to mastery, or if there are
some things that everyone ought to learn to mastery (possi-
bly reading and arithmetic) and other things which need not
be completely mastered. For our purposes, let us decide

that at least some aspects of simple mathematics ought to be
learned to mastery. Bloom suggests that we achieve this
through initial practice in one given mode with standard
materials. If this does not work, increased time spent in
practice as well as use of alternate modes and materials

will help the student achieve mastery. Cronbach (ibid)

does not favor simply extending the length of time of expo—
sure, hut instead strongly advocates the modification of
instructional technique so as to capitalize on the student's
abilities. He mentions teachers who have done this on an
intuitive level, and notes that teachers who do this are
prone to expecting too much from a student who seems to do
certain kinds of things well. They may also try to discour—
age a student from trying to do something of the type he

does not appear to do well. Whether a teacher is trying to
"bring out the best in a student" or to help another student
learn by proposing various study materials or using different
teaching techniques, there is always a lot of guesswork in-
volved. Which way of teaching would work best. Which mater-

ials would be most appropriate?

The guesswork arises partly because some of the avail-
able materials may be generally inadequate, as well as
because they may not present the material to be learned in
a way that the student can learn it. What is needed then
is to determine what kinds of tasks the student does best,
or identify his abilities. We can then select one or more
of these abilities-and design treatments we would expect to
interact with them (see Gagne, 1967, chapter 2 again for
more on this), i.e. create a situation wherein a student
possessing a high degree of this ability would profit most
from this treatment, and the student possessing little of
this ability would profit least from it by comparison with

others in the treatment group.

Identification of Aptitudes

 

If aptitude x treatment interactions are to be studied,
aptitudes must be identified. For this purpose the best
instruments available to us are multi—aptitude test batteries
of which there are quite a number and quite a variety. These
tests had their roots in an early--about l920--interest in
the specific abilities required to do certain jobs and excel
in certain school subjects. Multiple factor analysis was
used as a statistical method for separating out the various
abilities tested by the individual test items,

One of the earliest and best known sets of tests is
that which tests for Primary Mental Abilities, developed by
Thurstone through factor analysis of 56 tests given to stu—

dents at the University of Chicago. The test purported to

identify and measure the extent of seven "Primary Mental
Abilities": verbal, spatial, word fluency, memory, and
reasoning. It was Thurstone's conviction that these seven

abilities were ”primary" in that they were fundamental
abilities which combined to make up the aptitude for any
complex intellectual task. Thurstone thought these abili-
ties combined essentially additively and in different pro-
portions depending on the individual. Those Primary Mental
Abilities are still tested in the tests as used today. In
their original form the tests were appropriate for high school
and college students. They have since been adapted to apply
to children ages five through seventeen. For the younger
children some of the Abilities are not included. (Extensive
reviews of the Primary Mental Abilities Tests may be found
in the Fourth Mental Measurements Yearbook, Buros, 195“).
The Primary Mental Ability Tests (PMA) were intended
to be used for guidance purposes (e.g. for advisement on
possible school courses of possible interest to a student,
etc.). There are a number of other multi-aptitude test
batteries, two of which are quite well known and representa-
tive. The first of these is the Differential Aptitude Test
(DAT). These tests do not attempt to identify and isolate
any primary mental abilities. Instead they try to find com-
plex abilities that relate closely to various categories of
jobs and school requirements. The subtests are entitled
Verbal Reasoning, Space Relations, Numerical Ability, Abstract

Reasoning, Mechanical Reasoning, Clerical, Language Usage.

It is interesting to note that despite their different
emphasis, the first categories of the DAT are much like those
of the PMA, attesting to the influence of Thurstone's work.
The DAT tests are aimed at the eighth grader through high
school senior, and are also used for individual counseling
purposes and for giving advice as to what line of work

might be of most interest, or what sort of course work would
be most profitable.

The last set of multi-aptitude batteries to be men-
tioned here is the General Aptitude Test Battery (GATB). It,
too, shows a strong influence of Thurstone in that its sub—
tests measure verbal, numerical, and spatial aptitude in
addition to such abilities as clerical perception, motor
speed, finger and manual dexterity. Valid for ages 16 and
over, this set of tests is intended for finding people with
the right combinations of abilities to fit certain jobs (an
institutional decision is involved here) or for helping an
individual decide what kind of Job might be appropriate for
him, and fitting him to that job (this involves individual
decisions). Thus the GATB can work two directions, for
institutional as well as for individual decision making.

Perusing these sketches of some of the major test
batteries and their aims, one becomes aware of the variety
of tests and purposes they may serve. These tests do have
a couple of interesting things in common, however. With
the exception of the PMA, the tests such as those repre—

sented by the DAT and GATB are applicable to children age

6

l“ or 15 and older. This is probably partly because the
different abilities do not emerge clearly before that age
(Anastasi, 1967). It is also true that these abilities are
quite unstable in the earlier years, and that little long
range prediction can be done based on these aptitudes before
Grade 11 (Cronbach, 1960). In Grades 7—10 short range
advising as to possible courses which might be taken is
possible, but not much more. Another thing that should be
noted about these tests is that they are meant to be used
under rather static circumstances. That is, they are meant
to fit an existing individual into an existing situation,

be it a suitable tenth grade shop course or a good job. They
are not meant to be applied to predict how a situation might
be changed to interact favorably with the individual and his
aptitudes. Thus, if we are going to choose an aptitude and
attempt to design a treatment that will interact with it, we
have very little to base our decision on. We can only use
commOn sense which tells us such things as "verbal aptitude
should interact with verbal treatment" and "numerical aptitude
should interact with a treatment emphasizing the use of num-
bers (and possibly number-associated symbols), and a spatial
aptitude should interact favorably with a treatment using
spatial representation of concepts." Common sense is not
always right. The Spatial Relations scores on the DAT have
been found to be reasonable-to-poor predictors of success in
geometry. The Number scores were better predictors by far

(Cronbach, 1960). Common sense would tell us the opposite

might be true. This warning should put us off making common
sense decisions, but for want of a better basis for deciding,

common sense is what will have to be used.

Studying Aptitude x Treatment Interactions

 

If we follow Cronbach's advice (Gagne, 1967, chapter 2),
and adapt instructional techniques to fit the student, we
will be studying more and more aptitude x treatment inter—
actions. Following his advice again, we should identify one
(or more) abilities in a student and purposely design a
treatment that will interact with that ability. This treat-
ment should be an extreme example of the type which we might
propose for wider use. This is because, first, under labora—
tory conditions we are working with a limited sample. Second—
ly, as Cronbach also tells us (Gagne, 1967, p. 30), if we
want to use aptitude information usefully in the adaptation
of instruction, the interaction we get must be a disordinal
one, i.e. the regression line that relates aptitude to payoff
under one treatment must cross the one for the competing
treatment. If the lines do not cross, then the result may be
that one treatment is consistently better than the other, so
we need not take the trouble to gather all that information
about aptitudes.

There is also an important implication for the design
of teaching materials in these interaction studies. Not all
subject matter is amenable to being presented in such a way

as to capitalize on (interact with) various aptitudes. This

has been evident in at least two of the interaction studies
done (Hamilton, 1969 and Tallmadge, 1968, to be discussed
in Relevant Literature section). In both of these studies
the subject matter interacted with the treatment, i.e. the
students who got one treatment simply weren't learning the
same thing as those who got the other treatment. Thus any
effect due to ability or to teaching technique used in the

two treatments were not readily evident.

The Aims or This Study

 

In this particular study fourth and fifth grade children
worked on either a symbolically or numerically oriented in-
structional sequence in the addition of positive and negative
numbers. It was felt that those children identified as being
more spatially then numerically apt would profit most from a
spatial treatment, and those with greater numerical aptitude
would profit from a treatment involving rules for the mani-
pulation of signs and numbers.

In the spatial treatment no rules were given and the
children were directed to "figure out the answer" using num—
ber line diagrams. In the "number" or symbolic treatment
number line diagrams were used, but the students were never
required to draw such diagrams. All responses could be got
from just using the rule.

This procedure is somehwat like that suggested by
Cronbach and Gleser when they advise us to accentuate any

interaction "by deliberate design of the experimental course"

if we hope to find any interaction effects at all (reference
taken from Hamilton, 1969, p. 3). Cronbach also suggested
that this might be necessary in his statement (mentioned
above) that aptitude information cannot be employed usefully
unless the result is the crossing of the aptitude and treat—
ment regression lines. It is probable that an optimum
treatment of the addition of signed numbers would involve
the use of both rules and the number line diagram. However,
for our purposes the two were separated.

The verbal dimension was not considered in this study
for a number of reasons discussed under the "assignment of
subjects to treatment" heading.

The null hypothesis associated with this study was
that there would be no interaction between aptitude and

treatment.

CHAPTER II
REVIEW OF LITERATURE

Relevant Literature

A study of aptitude and treatment interaction is
essentially a study of a particular kind and use of individ—
ual differences. The literature relevant to individual
differences is vast, and much of it is not relevant to this
particular study. On the other hand, the literature rele-
vant to aptitude x treatment interactions (discussed below)
is minimal.

Aptitude x treatment interactions are a way of studying
individual differences as they relate to learning, especially
school learning. A discussion of individual differences
that relate to learning is given us by Jensen (in Gagne,
1967, Chapter 6). He chooses to postulate two kinds of
learning-related individual differences—-intrinsic and ex-
trinsic. He feels that attitudes and personality traits are
examples of extrinsic abilities, i.e. of "those subject
variables which operationally bear no resemblance to the
learning process . . ." (p. 121). Chronological age, sex,
IQ, and mental age would also be included in this category.

.Jensen then goes on to talk about intrinsic individual

10

11

differences, or those differences which are inherent in
learning. He suggests three major classes of these variables:
types of learning (conditioning, motor, rote, etc.); pro-
cedure (pacing, distribution of practice, etc.); and content
and modality (verbal, numerical, spatial, flexibility, etc.).
He suggests that we then cross each member of one category
with a member of each of the other two categories (apparently
by creating learning tasks that reflect these subcategories)
and see how much within subject variation we get as subjects
interact with this combination of categories. He admits it
will be quite a job. The outcome of this investigation would
be information about the "underlying factors or basic pro-
cesses" (apparently inherent in the learner) which cause
the patterns of interaction among the task characteristics
as delineated by the three categories mentioned above. Thus
Jensen is not hypothesizing certain inherent abilities and
attempting to design tasks which will cause them to be ex-
pressed. He is, instead, taking what he knows about tasks
and developing controlled types of tasks which, when presented
to subjects will yield information about what the abilities
actually are.

With this in mind he says that a tggk is spatial, ver—
bal, numerical, etc., not a person. Thurstone (1938)
attempts to prove that spatial, verbal, numerical, and so on
refer to attributes of people and not of tasks. He reasons
thus: some people seem to be able to do some types of things

much better than others. Some people can visualize things

12

in space. Others don't visualize as well, but can do just
about anything that has to do with computation. We may,
from these observations, hypothesize that the former has some
sort of visualizing ability and the latter has some sort of
computational ability. We can devise tasks which require
these two types to display their abilities. This done we
find that the visualizer does better on the visualization
test and the calculator does better on the computation test.
From this we are allowed to infer that people who do very
well on visualization tests and less well on computation
tests are more visually than computationally oriented.

Thurstone hypothesized a large number of mental acti-
vities (getting his ideas from items used in general intelli-
gence tests) and categorized them. When testing was com—
pleted and factor analysis was performed, he could claim to
be able to identify the Primary Mental Abilities, i.e. a
certain set of seven abilities which different people possessed
to different degrees and which could be said to account for
much of the individual difference variance in the performance
of certain complex tasks.

It may indeed by the case, as Jensen states, that the
innate abilities are yet to be identified. It may also be
the case that the identified abilities (spatial, verbal,
number, etc.) are unique to the learner and not a part of
the task. More important to the present study, however, is
that the latter interpretation is the most useful one. Tests

have been developed which supposedly identify the degree to

13

which an individual possesses these abilities. The results
are quantifiable and decisions can be based on them.

Once the existence of individual differences (i.e.
different abilities) is established, how can the different
abilities be related to learning? Abilities which are re—
lated to different kinds of learning have been tentatively
identified and their relationship to the learning process
has been discussed. Some abilities identified as relevant
to verbal learning are such things as rote memory, and span
memory (Fleishman in Gagne, 1967, p. 62). Other individual
difference variables are suggested by Jenkins (also in
Gagne, p. U8) such as the "ability to form new associative

bonds or habits.‘ Fleishman (Gagne, chapter 8) has done ex—
tensive investigation into the abilities involved in learning
motor skills. He has identified such abilities as manual
dexterity, finger dexterity, control precision and several
others. He has also found that the combinations of abili-
ties contributing to performance change as practice continues
and that the contribution of "non-motor" abilities——such as
verbal or spatial-~decreases with increased practice, whereas
pure motor abilities contribute progressively more (Gagne,

p. 179). Whether these phenomena (i.e. the increase or de-
crease with practice of different contributing abilities)
occur during the learning of verbal skills (e.g. small

children learning to read) has apparently not been investi—

gated.

1U

Another interesting investigation of abilities has
been done by Fredericksen (annual Review of Psychology,
1969, p. 357). He feels that the relationship of the abili-
ties an individual possesses to a learning task occurs
through strategies. Depending on what ability or abilities
may be dominant in an individual, he will choose different
strategies through which to learn a task. By his choice of
a specific learning strategy, the learner essentially restruc—

tures the learning task to conform to his abilities (and

J

883

choice of strategy). In addition, whether he posse:

U

certain abilities to a greater or lesser extent will influ—
ence his performance using the strategies he has chosen.
How Frederickson defines ability is not clear in the reference
cited. One ability mentioned is "associational fluency."
Adapting instruction to abilities is of direct interest
to this study. Bloom suggests indirectly that we adapt
instruction to individuals when he delineates his mastery
learning procedures (Bloom, 1968). Although he does not
mention adaptation based on previously obtained ability mea-
sures, this could possibly facilitate the carrying out of a
mastery learning program. The vast number of abilities
which have been proposed might make it difficult to choose
which ones to adapt to. The Primary Mental Abilities are a
possibility.
Cronbach also discusses the adaptation of instruction
to abilities (Gagne, 1967, chapter 2) and makes specific

reference to the aptitude x treatment interaction type of

15

study. Cronbach states that one way of dealing with indi-
vidual differences is to teach different people in different
ways. It remains then to define the differences between the
people and their degree, and to define how the differences

are going to be treated. He suggests that the first task
might be dealt with through the scheme suggested by Jensen
(see above). However, warns Cronbach, "aptitude information
is not useful in adapting instruction unless the aptitude

and treatment interact" (p. 30). The regression lines corres-

ponding to the two competing treatments must cross. If the

 

information about differences in aptitude is not good or if
the treatment and the aptitude do not interact, then the
teacher would do well to maintain the status quo. As Cron—
bach says: "Modifying treatments too much produces a worse
result than treating everyone alike" (p. 30).

Few aptitude x treatment interaction studies have been
done, and in those that have, the desired interaction has not
been obtained for various possible reasons. An experiment by
Bush g§_gl. (1965) attempted to find an interaction between
the learner and the mode of instruction. Abilities were
measured by the California Achievement Test. The Mathematics
Fundamentals and Reading Vocabulary subtests were used.
Differences between these two scores for each individual
were correlated with his pretest—posttest gain relative to
each treatment condition. It was found that those subjects
who scored significantly higher on a reading vocabulary test

than they did on a mathematics fundamentals test "tended to

16

acquire more learning" from a lecture-like treatment than

did students whose mathematics scores were higher than their
reading scores. Conversely, those students who showed rela-
tive strength in mathematics "tended to do relatively better"
in a laboratory-like situation than did those students whose
reading scores were higher than their math scores. This was

t

interpreted as meaning that 'students with relative strength
in reading vocabulary are superior to students with relative
strength in mathematics fundamentals when both are required
to learn from instructional situations that are highly
verbal" (p. 11). There were a number of drawbacks to this
study from a design point of View, not the least of them
being the diversity of subject matter and a number of con-
founding variables (e.g. content with treatment and degree of
participation with method).

Another study which sought an aptitude and interest x
treatment interaction was done by Tallmadge (1968). He
chose two procedures by which to teach the solution of maneu-
vering board problems to Aierorce recruits. One method
emphasized Gagne's type 3 (chaining) learning and required
that the subjects memorize the sequence of steps involved
in the solution of these types of problems. The competing
treatment emphasized Gagne's type 7 (problem solving) learn-
ing. In it the subjects were taught the underlying principles
and concepts, and the rationale of the procedure for solving
these problems.

Instructional Resources Center

133 ERICKSON HALL
MICHIGAN STATE umvensurv

EAST LANL-JNG, MiCHlGAN 48823

17

Many ability and interest tests were given the sub-
jects among them the Kuder Preference Test, and spatial
orientation and visualization tests. The result was that
over one hundred abilities (and levels of them) were identi-
fied. The subjects were rated "high” or "low" on 16 of
these measures of both ability and treatment.

Tallmadge found a main effect for treatment—-the Type
7 method being superior over the Type 3 method. But there
was no interaction between aptitude or interest and treat—
ment. Tallmadge gives as possible reasons for this, poor
choice of competing training methods, measurement of inappro—
priate aptitudes and interests, or other interactions between
materials and training that washed out any aptitude x treat—
ment interaction.

One last attempt to find an aptitude x treatment inter-
action will be mentioned. It is the study done by Hamilton
(1969): She administered three cognitive tests to junior
high school students: spatial visualization, spatial orienta—
tion, and wide-range vocabulary. The students then worked
on either a verbal or primarily pictorial version of an
instructional sequence in the geometry of crystals. The
result was that the pictorial treatment produced higher mean
post test scores than the verbal treatment. But no inter-
action between aptitude and treatment was found.

The study reported here investigated an interaction

between spatial or numerical ability and a spatial or symbolic

18

treatment of an arithmetic task. Spatial and numerical
abilities as they relate to success on mathematical tasks
have been discussed by Cronbach (1960) and Murray (1999).
Murray correlated numerical, verbal, spatial relations, and
reasoning test scores with scores on a geometry test, and
with final grades in geometry. It was found that the numeri—
cal score correlated most highly with the final grade, and
the spatial relations score correlated least with both the
geometry test scores and the final grade.

Cronbach (1960) quotes findings that, although a spa—
tial relations test score correlates positively with geometry
scores, mechanical drawing ability, etc., it is the numerical
ability measure which is the better predictor of success in

geometry.

CHAPTER III

METHODS AND MATERIALS

Plan of the Study

 

No pilot work was done on the materials, so a number
of assumptions had to be made. First it was assumed that
there would be no sex differences. Secondly it was assumed
that since the children in the sample were middle to upper
middle class whites in a primarily middle to high SES
school that they would be quite verbal. Hence none of the
vocabulary used in the program would have to be explained
at any length (e.g. terms like "positive," "negative," and
"direction"). Thirdly, it was assumed that the fifth grade
would fare systematically better (perhaps because they had
had more practice with the subject matter) than the fourth
graders. Lastly it was assumed that neither fourth nor
fifth graders would have studied positive and negative num—
bers thoroughly. Therefore it was assumed that pretest
scores would be near 0, whereas post test scores would show
some uniform degree of dispersion.

The only effect that was sought was an interaction of

ability with treatment (hopefully a disordinal one) for the

19

20

fourth and fifth grade. The design was such that main

treatment and aptitude effects would be detected, however.

Instructional Programs

 

Two instructional programs were prepared by the eXperi-

menter. One treatment was very dependent on use of number

line diagrams to obtain a correct response. This was the

"spatial" treatment, hypothesized to interact favorably with If
spatially apt subjects. The "symbolic" treatment, aimed at i

more numerical children involved application of a verbal rule

to obtain a correct response.

The sources for construction of these programs were:

High School Mathematics: Course 1 by Beberman
and Vaughan (196D)

 

Discovery in Mathematics,ga text for teachers by
Davis (1963)

 

Modern Mathematics, Book 1 by Eigen et a1. (1961)

 

Algebra I by Smith et al. (1960)

 

Advancing in Mathematics (J2) by Morris and
Topper (l96u)

 

The objectives for the teaching sequence (expected to
be attained by use of either program) were:

At the end of the sequence the student will be able
to:

1. find the sum of two positive numbers
2. find the sum of two negative numbers

3. find the sum of one positive and one
negative number

The numbers themselves did not exceed 15, so that any

effect would be due to whether the treatment was spatial or

21

symbolic and not depend on whether one subject could add or
subtract better than another. The two programs were identi-
cal in the first half. Since the experimenter had been told
(by the fourth and fifth grade teachers) that some of the
children "had had" some of this material, whereas others
"had not had" any of it, it was deemed wisest to include a
quick run down of what a positive and a negative number were
and how they were expressed on the number line. The first
half of the program was meant to serve this purpose. It would
probably act as review for many of the children, and be new
material for the others.

The second half of each program dealt with the addition
of positive and negative numbers, and the expression of equa-
tions on the number line. The symbolic (number oriented).
program demonstrated the use of the number line for the ex—
pression of equations, but never asked the student to diagram
an equation on the number line. The emphasis in this program
was on finding the sums of positive and negative numbers
using one of two rules. Thus by the end of the program the
student would (supposedly) use the appropriate rule to find
the sum of two numbers of identical sign or of two numbers of
opposite sign.

The spatial program in the second half continued the
use of the number line for the expression of equations. The
student was required to draw a diagram and from the diagram
he was to find the solution to the equation. The spatial

program gave no rules for finding sums. Thus by the end of

this program the student should be able to find the sum of
two numbers of the same sign or of two numbers of opposite
sign by drawing a diagram of the equation and getting the
solution from the diagram.

It was stated earlier that these two programs were
identical in content and differed only in that one was spa-
tial and the other was symbolic. They were identical in
that all students found the sums of exactly the same number
of pairs of integers. The distribution of equations involv-
ing two numbers with opposite signs and those involving two
numbers with identical signs was the same for both programs.
A possible uncontrolled difference between the programs arose
after the first half. The spatial program continued using
the number line as a central reference point, and required
that all tasks be done using the number line. The symbolic
program shifted emphasis from the number line to rules. This
shift may have caused the symbolic student to learn something
different or to learn more than did the spatial student
(i.e. student with spatial program). Such a phenomenon is
almost impossible to investigate and was not of specific
interest in this study. It could be investigated if one

wished to do so.

Subjects
The subjects for this study were rather difficult to

choose. In the Research Design outline (see Appendix C)

which served as the basis for this study, it is suggested

23

that eighth graders be used, with the assumption that they

do not yet know how to add directed numbers. Talks with
eighth grade teaChers yielded the information that their
students "had had" all that long ago. It was finally neces-
sary to go as low as fourth and fifth grade, and even those
teachers said their students had been exposed to this kind of
material. As it turned out, pretest scores were quite high.
Another problem arose when it came to using such young
children, and that was that differential aptitudes are not
‘particularly well defined at that age. However, despite this
drawback, fourth and fifth graders were used. Thirty fourth
graders and thirty fifth graders participated in the experi—

ment.

Aptitude Measures

 

Since the children were so young, the SRA Primary Mental
Abilities Tests were used. This is the only widely distri-
buted multi-aptitude test battery which provides tests
appropriate to that age group. Three tests from that battery
were given: Verbal meaning (V), Number Facility (N), and

Spatial Relations (S).

Outcome Measures

 

One post test was given to all the students. This
test also served as the pretest. The objectives of the
teaching sequence were that the student would be able to

find the sum of two numbers with identical signs and of two

 

‘I. _'-ih-o‘-

2“

numbers with opposite signs. The post test asked him to do

that. It consisted of nine equations:

One of the type +2 + +2 ==Ej
One of the type 2 + 2 =

One of the type 1 + 2 = (smaller positive

number first)
Two of the type 2 + 1 = (larger negative
number first)
Two of the type 2 + l = (larger positive
number first)

 

— +
Two of the type 1 + 2 = E] (smaller negative
number first)

Thus each permutation of possible positive/negative combina-
tions was treated, although one was represented only once.
In addition to solving the equations, the children were
asked to draw diagrams of the first three equations (number
lines were provided). They were told they could draw dia—V
grams of the second three equations if they wanted to. Num—

ber lines were not provided in this case.

ssignment of Subjects to Treatment Groups

 

Four classes participated in the experiment--two fourth
grades and two fifth grades. Grade level was used as a
blocking variable as it was thought that fifth graders would
perform systematically better than fourth graders. Children
in each grade were assigned to treatment groups according to
their scores on the Number Facility and Spatial Relations

tests. Scores on the Verbal Meaning test were not considered

25

for two reasons. The experimenter was interested in those
students who had significant differences between their spa-
tial relations and their number facility scores. The verbal
meaning scores were to be used to assure homogeneity on at
least one dimension. It was decided that only those child-
ren above the verbal median (national norms) would be used.
However, when it came time to do the assigning, it was noted
that to exclude those children below the verbal median would
considerably reduce the size of the treatment groups. This
was partly because the children were not as highly verbal as
had been anticipated. In the fourth grade a total of 10 of
the 36 tested were below the national verbal median. In the
fifth grade a total of 5 of the “0 tested were below the
national verbal median.

A second and more interesting reason for not excluding
those children was that all but one of the 15 showed a
difference between their spatial and numerical scores (see
below for definition of what constituted a difference). Of
those, 11 showed a higher spatial aptitude than verbal apti-
tude. Those 11 children were almost 85 percent of the total
number of children below the verbal median. Of those above
the verbal median, only uu percent had a higher spatial
aptitude than they did a numerical one. Thirty-eight percent
had a higher numerical aptitude. The others showed no differ-
ence in aptitude. Thus it can be seen that exclusion of the
children below the verbal median would have meant a noticeable

reduction in size of treatment groups.

 

"T

26

The students were assigned to one of the two treat-
ment conditions if they showed a difference of one stanine or
more between their number facility scores and their spatial
relations scores. The national norms for the PMA are re-
ported in stanines. Assignment was done based on the
national.norms and a difference of one national norm stanine
between number and spatial score was the criterion for being n
assigned to a treatment group. Usually a difference of two
stanines (approximately one standard deviation) is considered

significant. However had the criterion for assignment been r ‘

 

II

a difference of two stanines, the sample size would have been
reduced to 28 and of the eight cells, one would have con-
tained only two subjects. This was clearly undesirable.
Hence a difference of one stanine was considered acceptable
as a criterion for assignment. Once all the children had
been identified as "spatial high/number low" or spatial low/
number high," the lists of names were randomized. The first
half of each list was assigned to one treatment, and the
second half to the other treatment. In the case of an

uneven number of subjects, the last name on the list was

dropped.

Experimental Schedule

 

Aptitude tests were given first—-one week before treat-
ment. The three subtests plus instructions took a total of
about one and a half hours to administer. Pretest, instruc-
tional program, and post test took from U5 minutes to 2 hours,

depending on the speed at which the student worked.

27

The experimental design was not matched to the class-
rooms' schedules. Individualization of instruction was not
practiced regularly in any of the four classrooms. Thus the
experiment constituted somewhat of an intrusion on the regu—
lar classroom proceedings. Apparently this particular school
is accustomed to being "experimented on" and the teachers
gave their consent. There was little or no hostility ex—
pressed at the interruption. If such a program were to be
incorporated in a school similar to this one, it should pro—
bably be broken up into smaller sections, taking 20 to 30
minutes each to complete.

At the beginning of the treatment period the experimenter
explained how the materials were to be used. An explanation
of the pretest--treatment--post test procedure was also
given. Each programmed booklet had on it the name of the
child to whom it was destined. The booklets were handed to
each child as he turned in his pretest. Children in the
spatial treatment (who had to draw number line diagrams) took
10 to 15 minutes longer on the average to complete the pro-
gram than did those in the symbolic treatment. The class
who "had not had" any of the material treated in the teach-
ing sequence (and who averaged near 0 on the pretest) took
the longest to complete the treatment. Before any further
use of these materials with naive subjects, the materials
must be broken into shorter sections, and completed over

three or four work periods.

43.

j -1—-aﬁ§’iA.-=:‘E ' '
‘

ments

period

28

The children who finished early did homework assign-
or went to the library. Throughout the experimental

the teacher was present in each class.

 

CHAPTER IV
RESULTS

Analysis was performed as follows: reliability of
predictors; intercorrelation of numerical and spatial sub-
tests for this sample; tests for main and interaction
effects on addition part of post test and on diagram drawing;
z scores (spatial and number) for fourth and fifth graders.

The null hypothesis that there would be no interaction
between treatment and aptitude was not rejected. The null
hypothesis that the spatial treatment and the symbolic
treatment would not differ in their effect was also not
rejected.

Table 1 shows the number of subjects who were above
and below the median on the addition section of the post
test (see Appendix A for actual scores). These frequencies
were used as the observed frequencies in a Wilson procedure
for x2 analysis of this type of data (see Appendix B for
short explanation of Wilson procedure). The original intent
had been to analyze the data using an analysis of variance
technique. However an obvious ceiling effect was observed
(there were 35 out of 60 scores of nine, the maximum score

possible) and it was deemed wiser to use a median--or cut-off

29

30

TABLE 1.-—Scores of 9 and Less Than 9 on Addition Test
(Frequencies) for Each Treatment Condition.

 

 

 

 

 

Treatment
Symbolic Spatial
Score <9/score = 9 Score <9/score = 9
Sp Hi
Grade NumLo U/U 5/3
A 0
02 L0
NumHi 2/5 u/3
SpHi
Grade NumLo 5/5 3/7
5 SpLo
NumHi l/A 1/u

 

TABLE 2.—-x2 Values for Main and Interaction Effects--
Addition Test.

 

 

 

 

df x2
Rows
(grade'and
aptitude) 3 3.720
Columns
(treatment) 1 .069
Rows X

Columns 3 1.77A

 

31

point-—test which would essentially determine if there were
more scores of nine under any one circumstance than under

any other. The non—significant x2 showed that this was not
the case. The technique might be criticized for not consider-
ing a good deal of information due to the dichotomization of
results. However conservative it may be, it is one of the

few honest tests used in this sort of situation. Analysis

of variance or correlation techniques are based on assumptions
of normality of distribution and equality of variance across

treatment groups. Although the latter assumption may have

 

NICK 3.1—! .IV 1
'

been satisfied, the distribution of scores was distinctly
not normal. The analysis of variance technique is quite
robust to violation of the assumption of normality. Nonethe—
less, to violate this assumption means to err on the side of
conservatism. Therefore it was felt that use of an initially
conservative test which violated no assumptions would give
equally believable results. I
Table 3 and A show the results of the diagram drawing
test. The frequencies were dichotomized based on whether a
score was three or not three.. Table 3 shows the observed
frequencies in each cell. Table A shows the x2 values ob—
tained from those frequencies. It will be noted that there
is no interaction effect or main effect for aptitude. How-
ever there is a significant main effect for treatment. This
tells us that those students who had the spatial treatment
could diagram the equations significantly better than those

who had had the symbolic treatment. That is not a surprising

32

TABLE 3.-—Scores of 3 and Less than 3 on Diagram Test
(Frequencies) for Each Treatment Condition.

 

 

 

 

 

 

Treatment
Symbolic Spatial
Score <3/Score = 3 Score <3/Score = 3
SpHi
NumLo 7/1 3/5
Grade
'4 o
QRLO
NumHi U/3 A/3
SpHi
NumLo 9/1 3/7
Grade
5
SpLo
NumHi 3/2 2/3

 

TABLE A.—-x2 Values for Main and Interaction Effects--
Diagram Test.

 

 

 

 

df X2

Rows

(grade and

aptitude) 3 5.51
Columns

(treatment) 1 12.97 *
Rows X
Columns 3 .357

 

*Significant at .01 level.

 

33

result. The children in the spatial treatment group had had
a good deal more practice making this kind of diagram than
had the children in the symbolic treatment group. The
latter had had no practice doing this at all.

The reliabilities and intercorrelations of the predic—
tors will be discussed next. Table 5 shows the reliability
for this sample of the spatial subtest (fourth and fifth
grade) and for the "number sense" section of the Number
Facility subtest. The reliabilities reported by the pub—
lishers of the Primary Mental Abilities Tests are included
in this table. As may be seen those for the spatial sub-
tests are somewhat lower for this sample than for the national
norm group. There has been some criticism of these tests in
their reporting of reliabilities (see Anastasi's review of
PMA tests in Fourth Mental Measurements Yearbook, p. 716).
Although this criticism occurred some time ago, no more
recent bases for judgement of this aspect of the test can be
found. Both the spatial and numerical (both sections) sub-
tests are speeded, the second section of the numerical test
(addition) being extremely so. Criticism occurs when Kuder—
Richardson techniques are used to calculate the reliability
of the speed tests. This was apparently done by the pub—
lishers. Some qualification of the criticism may be in
order. Only under certain circumstances does speededness
have to be corrected for (see Guilford, 195A, p. 392). In
the case of the spatial test, it is not so speeded that a

correction need be applied and the split half technique for

L

1.: f2 mil-Sq

3“

TABLE 5.--Reliabilities of Spatial Relations and Number
Sense Subtests for Grades A and 5.

 

 

 

 

 

 

 

Reliabilities
Spatial Subtest Number Sense Test
Local National deal National*
Grﬁde .7u .63+
.96 .89
. a
Grade .73 .7“

5

 

*This is the reported reliability for the two sections of
the Number Facility Test: Number Sense and Addition.
T ‘ SDu _ ‘
Corrected for speeding - SEW — .65

SDu = .18

6Not corrected for speeding - SDw

35

finding reliability may be used. In the case of the number
sense test, the split half technique was appropriate for
fifth graders, but a correction for speededness had to be
applied to the reliability estimates for fourth graders.
Guilford (ibig.) advises us that there are certain circum—
stances under which even a correction of the split half
technique is not appropriate. In cases of extreme speeding
(see Guilford, p. 392, for what is meant by "extreme speed—
ing") no estimate of reliability can be found which does
not involve inordinate amounts of calculation. The addition
section of the PMA numerical subtest is extremely speeded.
More complex calculation than was warranted by this report
was required to find the reliability of this section for
the sample under consideration.

It has probably been noted that reliabilities for the
local sample were reported relevant to grade level whereas
those for the norm group were reported relevant to age.
Grade level is used here as a basis for grouping because it
is felt that in this particular circumstance a grade is a
more homogeneous group than an age group. In other words,
it would seem that nine and ten-year-olds in the fourth
grade would be more alike than ten-year-olds in the fourth
and fifth grades. Unfortunately this homogeneity was not
tested so the use of grade as a basis for grouping may not
be a legitimate one. It will be used nonetheless.

Table 6 shows the means, standard deviations, and inter—

correlations of the spatial and numerical tests for this

 

‘1'

36

sample as well as those for the national sample. It was
noted that the intercorrelation for the fourth grade is a
good deal higher than that for the fifth grade. It is
doubtful that this occurs due to emergence of distinct abili-
ties between the fourth and fifth grades. It is more pro-
bably due to chance.

It had been planned to report a correlation of differ— ;
ence-between-aptitude-tests-scores (z scores would be used) ‘
with post test scores to see whether those children who were '

more spatially (or numerically) inclined, and who were in

 

the treatment group consonant with their salient aptitude,
had scored consistently high on the post test. This would

be opposed to those who had been in a treatment group which
was not consonant with their aptitude. For this purpose the
fourth and fifth grades were pooled. Based on the results

of a x2 analysis (again the Wilson procedure) it was found
that the fourth and fifth grades did not differ significantly
on their post test scores. This finding warranted pooling
the two grades based on post test results (see Table 7).

Then 2 scores were calculated for each treatment condition.
It was found upon taking the differences between the z scores
that over half the subjects had been misassigned to treatment
groups. It may be remembered that assignment was done based
on stanine scores used in reporting the national norms for
the spatial and numerical subtests. Tables D-l through D-U

(see Appendix D) show two z scores for each subject, the

37

TABLE 6.--Intercorrelations of Spatial Relations and Number
Facility Subtests for Fourth and Fifth Grade.

 

Spgtial Subtest Number Subtest Intercorrelation

 

 

 

 

.Mean SD Mean SD of spatial and
number
Grﬁde. 1u.3o u.12 29.22 5.87 .u01
Grgde 15.u5 ' u.oo 31.52 5.91 .199

 

Reported for National Sample . .096

 

 

TABLE 7.--Frequencies and x2 Value for Main Effect of Grade
Leve1-—Addition Test.

 

 

 

 

 

 

 

 

Treatment
Symbolic Spatial.
Score <9/score = 9 Score <9/score = 9
avi
013(e 6/9 9/6
Grgde 6/9 u/11
df x2

Rows

(grade and

aptitude) 3 ‘ 3.720

Columns

(treatment) 1 .069

Rows X

Columns 3 1.77“

 

38

difference between the z scores (a negative difference indi-
cating a misassignment), and stanine scores on which assign-
ment was based. When these discrepancies were found the
correlation was decided against as it would not yield the
desired information. It was noted that over half (33 out of
60) the subjects were misassigned (based on the difference
between their 2 scores). That left 27 who were correctly
assigned. A quick calculation revealed that, although some—
what worse than chance, the correct assignments were not

significantly fewer than chance. Significantly worse than

chance would have meant 36 misassignments and only 2“ correct

assignments.

It would be advisable in any future study of this kind

to base cssignment to treatment groups on differences between

standard (2) scores for the sample in question.

 

CHAPTER V
DISCUSSION

The hypothesis that there would be no interaction
between treatment and ability was not rejected. There was a
main effect of treatment found for the diagram drawing task.
This was rather predictable since those children in the spa-
tial treatment group had had more practice at drawing dia-
grams than the other subjects. The study was not attempting
to find out if one treatment was better than the other. The
hope was that one treatment would be better for one group of
subjects and the other would be better for the other group.
During development of the treatments it was noted that neither
treatment was easier to develop, nor did either one or the
other seem more appropriate to teaching this subject matter.
The symbolic treatment, with some aspects of the spatial
treatment added in, is the approach most generally used in
arithmetic texts. This made the two a little hard to
separate, and when finished, both treatments seemed to be
"lacking a little something." It is possible that the
Optimal sequence for the teaching of this subject matter is
the combination of the spatial and symbolic approaches—~as

suggested by the arithmetic texts. It would be interesting

39

 

U0

to investigate the extent to which spatial and symbolic
treatment factors contribute to the optimization of learning
for this subject-~this as opposed to developing separate
treatments for different people in an effort to maximize

learning.

The Criterion Test

 

The criterion test was intended to measure acquisition
of the final objective-—that is it attempted to measure how
well the children could add positive and negative numbers.
Given the ceiling effect obtained by this test, the actual
effectiveness of the two teaching sequences might have been
better measured by a different test. The test could be
lengthened and diversified somewhat. The section requiring
the drawing of diagrams might be retained. Rather than test-
ing only the final objective, questions concerning intermedi-
ate objectives (e.g. identification of positive and negative
number, corresponding points and direction on number line,
etc.) might be included in the test. Verbal tasks (story
problems) could also be included. The subjects would be mea-
sured on a verbal dimension, and correlations could be found
between story problem performance, overall test performance,
and verbal and number test scores. One last thing that
might be added to the test would be a request for a statement
of rules governing the addition of positive and negative
numbers. This would be of interest for at least two reasons.

It would answer the question of whether children given the

 

Al

rule remember the actual verbalization or see the application
and continue to apply it while not being able to verbalize
it. Secondly it would tell us whether the children (at
least some of them) who were not given a rule generated one

themselves.

Predictive Power of Tests (Predictors)
For Fourth and Fifth Grades a

It has been stated earlier that the abilities (as tested
by PMA type paper and pencil tests) do not emerge until high

school age. This might lead us to believe that the predic— ;

 

tive power of these tests for fourth and fifth graders is L
rather low, either because the abilities are truly not dif-
ferentiable at this age or because the paper and pencil
test does not find these abilities in a young child even if
they are there. It could be hypothesized that rather than
paper and pencil tests, some sort of tasks would help identify
these abilities if indeed they were differentiable. One
might study Piagetian type tasks appropriate to the various
developmental stages and determine how loaded they were with
primary mental ability (or other yet unidentified) factors.
Then again the tasks could be modified so they would be
loaded with one_or another of these factors.

Such an undertaking would be a considerable one and
might encounter some problems, not the least of these being
that the factors comprising the abilities themselves are

still in a state of flux.

U2

It has also been stated earlier that the number factor
appears to have the greatest predictive power for high
school boys doing a geometry task. Whether this would apply
to younger children doing a different kind of mathematical

task deserves further investigation.

Possible Reasons for Inconclusive Results

 

Throughout this report mention has been made of factors
which might have contributed to the inconclusiveness of the
present study. Those factors will be summarized here:

1. The abilities on which the treatments sought to
capitalize are not discernible at this age.

2. Possibly the numerical test items do not bear very
much relationship to success in manipulating symbols. Al—
though Gagne (see Appendix C) states that "high numerical
ability should facilitate the learning of symbolic concepts"
this is not necessarily the case. It is also possible that,
although there may be a numerical/symbolic relationship, the
symbolic treatment in this study did not incorporate that
kind of symbolic concepts.

3. It is possible that the spatial relations test
does not predict success on the kind of spatial task used in
this treatment.

A. The criteria for assignment to one or the other
treatment group were not widely enough separated. Of the
60 subjects assigned, 31 had a number/spatial stanine dif-

ference of less than two (i.e. less than one standard

 

“3

deviation difference between the two scores). A larger
sample might allow for the assignment of only those subjects
with a difference of one standard deviation or more.

5. Assignment to treatment groups should be based on
local norms (z scores or percentile rankings) or national
test norms if these are reported in percentiles. Assignment
can be based on national norms only if it has been ascertained
that the set of scores obtained from the local sample does
not differ significantly frOm those used to establish the
national norms. A procedure using national norms apparently f
vastly different from the local norms (and which are reported
as stanines) resulted in a worse than chance assignment of
subjects to their respective treatment groups.

6. The post test did not test completely the effect
of the treatments. The test was possibly too short as well

as too homogeneous.

CHAPTER VI
CONCLUSIONS

The present study has made us aware of the complexity
involved in the study of aptitude x treatment interaction.
We found through this study that development of abilities
may or may not occur at a young age. We found that predic-
tive power of the tests currently available needs more
investigation. We also found that many children were not
strongly differentiated in the abilities tested which raises
a question about the very value of trying to adapt different
kinds of instructional treatments to identified aptitudes.

The study raises the question of the feasibility and
economy of such an endeavor. Should we go to the trouble of
developing special instructional materials that will capital-
vize on identified differential aptitudes or should we try to
develop a number of "optimal" treatments that will reach
everyOne to a greater or lesser degree?

If we are to continue investigating differential apti—
-tudes in the interest of developing instruction to capitalize
on them, we must concentrate very closely on the definition
of already identified aptitudes as well as the identification

of new aptitudes.

HA

“5

If we are to accept the existence of differential
aptitudes (and they are becoming more widely accepted), we
must, after defining them, clarify how separated the apti-
tudes are from one another, and which ones share enough
factors that for instructional purposes they might be
grouped.

We are trying to pair a student having certain aptitudes
with an instructional treatment that capitalizes on those
aptitudes. Too many "important" aptitudes can lead to the
desire to create too many "appropriate" instructional treat—
ments. Rather than being concerned over dealing with every
possible aspect of this highly complex relationship of apti—
tude and treatment, we should perhaps attempt to clearly
identify the elements involved and simplify the structure
down to something we can work with both physically and
economically.

Trait by treatment interactions appear to have a healthy
future. However at present we seem awed by the very complex-
ity of the situation. The complexity does exist as demon-
strated by the present (and other) studies (e.g. Hamilton,
1969). Our first task then is to reduce the complexity by
identifying the elements. What are the aptitudes which can
be addressed by different instructional procedures? What
subject matters call most on which aptitudes? If many apti-
tudes are identified, how closely related are they to one

another, and can they be pooled for instructional purposes?

U6

Once we have identified our field of operation, only
then can we launch into large scale development of differ—

ential instructional materials. We have a long way to go.

Possibilities for Future Research-

 

Rather than looking for an interaction of treatment and
ability, one might investigate a main effect for treatment
with children even younger than fourth grade. Using Bruner's F”
ideas that we go through three levels of representation,
enactive, iconic, and symbolic, we could hypothesize that

the more spatial (iconic) treatment would work better with

 

 

even younger children (second and third graders) than would
the symbolic treatment. Given that there were high pretest
scores in the fourth and fifth grades, introduction of this
material can (and does) occur in the third grade. Some
mention of positive and negative numbers is made in the
first grade. When this occurs, we could investigate treat-
ing it iconically--or even enactively--rather than in the
conventional manner which is symbolically.

As mentioned earlier one might advance toward resolving
the controversy over whether abilities emerge before high
school age by developing tasks to replace the paper and
pencil tests now used to identify the abilities. One might--
and should--investigate the usefulness of the existing tests
(intended for counseling) for predicting treatments, and
their appropriateness to aptitudes. It would seem that tests

which were more subject matter specific (or at least more

U7

specific to school like tasks) might be more powerful for

matching student to treatment.

iﬁa

BIBLIOGRAPHY

U8

BIBLIOGRAPHY

Anastasi, Anne. Psychological Testing. Macmillan, 1965.

Annual Review of Psychology. Vol. 20, 1969.

 

Bloom, Benjamin S. Learning for Mastery. Evaluation
Comment. May, 1968, Vol. 1, Num. 2.

Ruros, O. K. (ed.). Fourth Mental Measurement Yearbook.
Gryphon Press, 195U.

Bush, Wilma Jo; Gregg, Dolores Kathryn; Smith, Edgar A; and
McBride, Coit B. Some Interactions Between Individual
Differences and Modes of Instruction, Aerospace Medical
Research Laboratories, Wright-Patterson AFB, Ohio,
December, 1965. ' .

Cronbach, Lee J. Essentials of Psycholpgical Testing.
Harper Brothers, 1960. '

French, J. W. _"How Important is Spatial Ability." Contem-
porary Psychology, 1965, Vol. 25, p. 9—28.

Gagne, Robert M. Ability differences in the learning of
concepts governing directed numbers, in Research
Problems in Mathematics Education. Cooperative
Research Monographs. No. 3, 1960, pp. 112-113.

 

Gagne, Robert M. (ed.). Learning and Individual Differences.
Charles E. Merrill Book, Inc., 1967.

Chapter 2: Cronbach, Lee J. How Can Instruction be
Adapted to Individual Differences?

Chapter U: Anderson, Richard C. Individual Differ-
ences and Problem Solving.

Chapter 6: Jensen, Arthur R. Varieties of Individual
Differences in Learning.

Chapter 8. Fleishman, Edwin A. Individual Differ-
ences and Motor Learning.

“9

50

Guilford, J. P. Psychometric Methods. McGraw-Hill, l95U.

Hamilton, Nancy R. Differential Response to Instruction
Designed to Call upon Spatial and Verbal Aptitudes,
Stanford University, February, 1969.

Murray, John E. Analysis of Geometric Ability, Journal of
Educational Psychology, 19U9, Vol. U0, p. 118-12U.

 

 

Rao, C. R. Advanced Statistical Methods if Biometric Research.
New York, Wiley, 1952.

Tallmadge, G. K. Relationships between training methods
and learner characteristics. Journal of Educational
Psychology, Vol. 59, No. 1, 1968, pp. 32-367

 

 

Thurstone, J. L. Primary Mental Abilities. Psychometric
Monograph #1. University of Chicago Press, April, 1938.

 

 

 

Wilson, Kellogg V. A Distribution-—Free Test of Analysis of £3
Variance Hypotheses. Psychological Bulletin, Vol. 53, ”
NO. 1, 1956.

Math Texts

 

Beberman, Max and Vaughan, Herbert E. High School Mathema-
tics: Course 1. D. C. Heath and Co., l96U.

 

Davis, Robert B. Discoverygin Mathematics, a text for
teachers. (Madison Project), Addison-WeSley Pub. Co.,
Inc., 196U.

Eigen, Lewis 0. and Kaplan, Jerome D. Modern Mathematics,
iook 1. Science Research AsSociates, 1961.

Smith, C. A.; Totten, W. Fred; and Douglass, Harl R. Algebra

1. Second Edition (Teacher's Edition), Row—Peterson,

1960. '

Morris and Topper. Advancing in Mathematics (J2). Science
Research Associates, 196UT

APPENDICES

51

APPENDIX A

POST TEST SCORES

52

4 ur—
EMF LP . A!“

 

53

 

 

 

 

m a m m
m m o m
m o o m
m m m o
m w o m
m m m m o m H m m
m m o m m m o m compo
m m m a m m o m
o m m m m m o m
H m m m m m m m
mEmmeHQ coHuHop< memsmmHQ :oHqup< mEomeHQ coHpHpo< meLmMHQ COHHHUU<
m m m m
H m m o o H o m
m m H H o m o o
H m m m m m m m s
H o m a m m o w oomsm
m m m m m m o w
H w m w o m o m
m a o m m s o :
hemammHQ :oHuHoo< mEmesHQ :oHpHUU< meosmmHQ :oHpHoo< mEstmHo coHuHoo<
Hm ssz \ so am do 5:2 \ H: mm H: esz \ so dm 04 ssz \ Hm am
am pcoEpmosB mm pcoEummLB

 

.mmcHOOW ummB umomll.ﬁ.< Mdmdfb

APPENDIX B

5U

The Wilson (1956) procedure for calculating a x2 makes
a slight modification on a technique developed by Rao (1952,
p. 192-205) for the decomposition of a x2 statistic from a
contingency table into components similar to the manner used
for decomposing a total sum of squares in an analysis of
variance computation.

Wilson's modification allows one to make a distribution
free (non parametric) test of hypotheses concerning main
effects and interactions usually tested by an analysis of
variance technique.

The procedure is as follows:

The median value for the entire set of n observations
is determined. This median should not be interpolated but
should be determined only as a "boundary" which divides the
entire set of observations, as nearly as possible, into two
groups of equal size. In this particular study, for the
group having a score of 9, n=35, and for that having a score
of less than 9, n=25.

A contingency table is set up representing the frequen-
cies of above and below median scores in each cell (see
Table l).

The total x2 value is calculated using the following

formula for unequal cell frequencies:

55

56

 

 

2 2
2 _ Z 2 (afij-nijna/n) + (bfiJ-nijnb/n)
X Total 1 J nijn nijn
a/n b/n
Where:
afij = the number of scores = 9 in any one cell
bfij = the number of scores less than 9 in that cell
nij = the number of scores in that cell
na = number of scores = 9
nb = number of scores less than 9
n = total number of scores

The component corresponding to the cell in the lower right

hand corner of the Table 1 grid (l/U) would then be:

u _ 5)/6o>2 + (1 - 5 (25/60)2
5

5 (3
5 (373/60 5 (25)/6o
This procedure is recognizable as an elaborate version

(o - E)2

F technique. In this case the expected fre-

of the

quencies are obtained from the null hypothesis that the main
and interaction effects produce no change in the distribution

of scores.

1
x2 values for row effects are calculated using the

marginal row totals of the contingency table. In the present
case: 9/7, 6/8, 8/12, 2/8. x2 values for columns are cal-
culated using the marginal column totals of the contingency
table. In this case 12/18, 13/17. The x2 value for the
interaction effect is found by subtraction of row and column

values from the total value.

57

The respective degrees of freedom for each 2 value are:

 

X2 df
X2T rc—l
X2R r -l
X2C c — l

x I (P-l)(c—l)

 

APPENDIX C

58

Research Design 11

Ability Differences in the Learning of
Concepts Governing Directed Numbers

Robert M. Gagne
Princeton University

Problem

Differences in fundamental abilities appear to be
prominent in the learning of mathematics, as well as in the
way people use mathematical concepts. Well—established
factors in human abilities are spatial, numerical, and ver—
bal. Although there are studies which have revealed moder-
ate to high correlations between aptitude measures and grades
in mathematics, no studies have been conducted in the attempt
to make specific predictions concerning the facilitation of
different kinds of conceptual learning by different funda—
mental abilities. The possession of a high degree of spatial
ability should facilitate the learning of spatial concepts;
high verbal ability should facilitate the learning of verbal
concepts; and high numerical ability should facilitate the
learning of symbolic concepts.

The learning of concepts of addition of directed num-
bers may be done verbally, spatially, or symbolically. Verbal
rules are perhaps the best known method, occurring in most
conventional text books. Spatial concepts have been used with
considerable Success, notably in the textbook of the Univer-
sity of Illinois Committee on School Mathematics. Symbolic
concepts can readily be designed to serve the same purpose;
in one form they might resemble some of the symbolism of
boolean algebra. Thus in this mathematical topic, the oppor—
tunity exists of relating differences in fundamental abilities
to ease of learning the different types of concepts, as well
as to final performance in problem solving.

Method

The study requires three groups of school children, of
equivalent age, probably eighth graders. These children
should have no previous knowledge of or acquaintance with
algebra, or more particularly with the addition of directed
numbers.

All subjects are tested on a carefully chosen factor

reference battery designed to provide adequate measures of
spatial, numerical, and verbal abilities. Each group is then

59

-nr.

6O

instructed in the topic of adding directed numbers, using
one of three methods (spatial concepts, symbolic concepts,
or verbal concepts). Care should be taken to make the in—
structional materials equivalent in scope of coverage and
in amount of practice provided.

When instruction has been completed, the students
should be given a speeded test requiring the addition of a
relatively long and varied list of directed numbers. Al-
ternatively, two such tests could be given during instruction,
making possible the use of gain scores. In any case, this
type of test is used to measure degree of learning of the
concepts. The test itself will constitute a practice session.
Following this, students are given another test containing
verbally stated problems which require the application of
learned concepts to unfamiliar problems.

Correlations are obtained among: (a) ability scores,
(b) concept—learning scores, and (0) problem performance
scores.

Results '

To the extent that the hypotheses underlying the study
are verified, correlations between spatial ability scores
and spatial-concept learning scores should be significantly
. higher than between spatial ability scores and verbal-concept
learning or symbolic-concept learning scores. Similar hypo-
theses would be tested relative to numerical ability and to
verbal ability. Correlations between concept-learning and
problem-performance scores, on the other hand, may reveal
differences in the efficacy of different types of concepts
in problem solving. _

APPENDIX D

STATISTICAL DATA

61

62

 

 

O N em. I as. mm OH. u NH

O N me. n mm. Hm mm. u OH

H O mm.H mO.H- em 3:. OH

N O OO. mm. mm mN. Om

O O OH. a mO. mm 2:. OH

O O 2:. mm. em Nm.H mm
O N Hm.H- Hm.H Om OH. u NH
3 m NO. O Hm. . mm mm. n 2H
m O NH.H NH. u ON OO.H Hm
m O mm. a 2:. mm mm. I OH
m N mm. ON. . ON mm. . OH
O O Om. as. mm OO.H Hm
O O Os. . O:.H Nm OO.H Hm
O O NH.H mH. . Om OO.H Hm
m 3 OO. I mm.mu OH mm.mu O

N w on. u mO. mm NH. OH.
m s OH. mm.m- OH so.mu OH
O N ON. 1 NH. 1 mm mm. I OH

sooesz Hmemqm E3: am whoom osoom
mchwpm N I mpoom N Lonesz opoow N HmHqum

 

moowso cpmHm new gossom
30H HooE52\£ch Hmemdm "ooze

oHHooEzm

upCmEpmmLEII.HID mqm<8

63

 

 

 

 

m N Hm. mo. mm ON. I HH
0 N Nm. I mm. . mm cm. :H
m N ON. mm. mm ON. I HH
3 N mm. onH %H mm. I HH
m w Hm. I mm. :m om. 2H
m 0 mm. oo.H Nm :3. mH
m N mN. I H:. :m mH.H mH
O N N. I H:. :m OHH mH
: O Hm. I :mHI mm mmHI m
N m mo. I NN. om ow. :H
s m OOHI NONI am mm. I m
m m wH.H am. I am mo.mI o
Looezz HmeQO E3: on opoom whoom
mchmpm N I I whoom N smoesz whoom N HmHumdm
moomso cpNHm pew cpssom
:me sooESZ\30H HmHumom ”some

OHHopeNm "sedgemdseII.NIo Names

6U

 

 

O N H:. I OO. ON OO. I OH
O O OO.H ON. I NN OO.H NN
O O OO. OO. NO OH.H HN
O O HO. NN. OO OO. ON
O O OO. OO. O OO.H NN
O O HO. I OO. ON OO. I OH
N O OO. I OO.H OO OO. OH
N O OO. I OO.H OO OO. ON
O N OH. I OO. OO ON. I OH
O s Os.HI ON. I HH OH.NI OH
O O ON. OO. I ON OO. I OH
N O NN. I OO.H OO ON. OH
N O OO. I OH.H OO OO. OH
O O NO. I OO. I ON OO.HI NH
O N OO. H:. I ON OO. I OH
N O OH. I NO. .O OO. I NH
O O NO. OO. I ON ON. OH
H N OO.H OO.HI ON OO. I NH
O O OO. I OO.HI ON OO.HI HH
smnesz HNHpOam Es: om whoom whoom
ochspm N I osoom N LonEsz osoom N HOHumdm

 

moomsm cupHm was condom
30H LooE32\£wH£ HNHumdm ”some
HOHHOOO HOOOEHOOLBII.OIO OHOOB

65

 

 

O N OO. I OH. I am OO. :H
O O Om.HI NO.HI ON ON. I HH
2 O OO.H NH.H Om OO. I OH
H O NN.H OH. I :O NO.HI O
O O OO. I OH. I :m mO. :H
O O OO. I OH. I Om ON. OH
O N OH. OH. I HO OO. I OH
N O ON. NO.HI om NO.HI O
N O OO. N:.H Om OO.H NH
O N OO. I OH. I :m OO. OH
O O mm. ON.H O: OO. OH
N O OO. I OH. I :m OO. OH
sooesz HmHumom Es: om opoOm osoom
ochmum N I osoom N pooesz opoom N HmeQO

 

woomso numHm was npssom
:me sooESZ\30H HmeQO "maze

HOHHOOO

”psoEpmoHBII.:IQ mqm<e

Sprafka, S .
M.A. arah ’ A' 1969

’\

9%

Instructional R§urcos Center
133

MICH TATE uumsmr
EAST ING MICHIGAN 48823

 

 

MICHIGAN STATE UNIVERSITY LIBRARIES

II IIIIIILIIIII Hill

3 77 5780

3 1293