"\

. “D 2 V)

M0:

©0404?

ABSTRACT

INFANTS' PERCEPTION OF ACOUSTIC CUES FOR THE VOICED

ALVEOLAR STOP IN TWO VOWEL ENVIRONMENTS

BY

Kristine MacKain King

A critical question in infant Speech perception con-
cerns whether or not infants perceive speech linguistically.
The present study investigated this question using an infant
learning task and a generalization paradigm. Specifically,
it was asked whether or not infants could categorize as the
same two diverse acoustic cues which signal the same phone,
/d/. Twelve 3-to-5 1/2 month-old infants (average age = 4.4
months) were equally divided into two experimental groups
and one control group. Speech stimuli in the experimental
groups consisted of the discriminative stimuli /di/ and /ga/
and the test stimulus /du/ or the discriminative stimuli /du/
and /ga/ and the test stimulus /di/. In the control group,
the discriminative stimuli were /di/ and /du/ and the test
stimulus was /ga/. The experiment proper consisted of two
sequential stages, a conditioning phase and a test phase.
During the conditioning phase, each infant was conditioned

to discriminate between the two discriminative stimuli with

Kristine MacKain King

directional head turns. Following attainment of an a priori
criterion level of correct responses, test stimuli were
introduced along with the discriminative stimuli and each
infant's directional generalization responses to the test
stimuli were measured. Correct responses to the discrimina-
tive stimuli were reinforced; the generalization stimulus
was not reinforced.

Two infants successfully responded with directional
head-turns during the conditioning phase, one infant from
the experimental group di/ga and one infant from the control
group di/du. A third infant from the experimental group
di/ga showed signs of conditioning towards the end of the
test phase. Both conditioned infants' directional responses
to the test stimuli during the test phase were random. The
random responses of the control subject were expected for
the control condition. The experimental subject's random
responses, however, did not support the linguistic hypothe-
sis. The perceptual saliency of the steady—state vowel and
its possible role dominance in making gneralization responses
was offered as a possible explanation for failure to support

the linguistic hypothesis.

INFANTS' PERCEPTION OF ACOUSTIC CUES FOR THE VOICED

ALVEOLAR STOP IN TWO VOWEL ENVIRONMENTS

BY

Kristine MacKain King

A DISSERTATION

Submitted to
Michigan State University
in partial fulfillment of the requirements
for the degree of

DOCTOR OF PHILOSOPHY

Department of Psychology

1976

TABLE OF CONTENTS

Page
LIST OF TABLES . . . . . . . . . . . . . . . . . . . . v
LIST OF FIGURES . . . . . . . . . . . . . . . . . . . vi
INTRODUCTION . . . . . . . . . . . . . . . . . . . . . 1
LITERATURE REVIEW . . . . . . . . . . . . . . . . . 3

The Need for a Linguistic Code . . . . . . . . . 3
Differences Between Speech Perception and
Auditory Perception . . . . . . . . . . . . 5
Infant Discrimination of Speech Contrasts . . . . 13
Infant Discrimination of the Acoustic Cue
for Voicing . . . . . . . . . . . . . . . 15
Infant Discrimination of the Acoustic Cue
for Place of Articulation . . . . . . . . 25
Infants' Categorization of Phones . . . . . . 29
Purpose of the Present Investigation . . . . . . . 32

METHOD . . . . . . . . . . . . . . . . . . . . . . . . 33

Subjects . . . . . . . . . . . . . . . . . . . . . 33
Apparatus . . . . . . . . . . . . . . . . . . . . 36
Design and Procedure . . . . . . . . . . . . . . . 40

RESULTS . . . . . . . . . . . . . . . . . . . . . . . 47

Conditioning . . . . . . . . . . . . . . . . . . . 48
Test of the Linguistic Hypothesis . . . . . . . . 52
Analysis of Unconditioned Infants . . . . . . . . 55
Conditioned Head-Turning . . . . . . . . . . . 55
Directional Responses . . . . . . . . . . . . 58

DISCUSSION . . . . . . . . . . . . . . . . . . . . . . 60

Limitations of This Research . . . . . . . . . . . 62
Implications for Future Research . . . . . . . . . 64

SUMMARY AND CONCLUSIONS . . . . . . . . . . . . . . . 66

ii

BIBLIOGRAPHY . . . . . . . . . . . . . . . . . . . . .
APPENDICES . . . . . . . . . . . . . . . . . . .
A. Letter to Parents . . . . . . . . . . . . . .
B. Parental Consent and Release Form . . . . . .
C. Schematic Description of Speech Stimuli .
Cl. Fundamental Frequency Contour for
Stimuli /di/, /du/, and /ga/ . . .
C2. Schematic Representation of Formant
Structure for /di/ . . . . . . . . .
C3. Schematic Representation of Formant
Structuring for /ga/ . . . . . . . .
C4. Schematic Representation of Formant
Structure for /du/ . . . . . . . . .
D. Spectrograms . . . . . . . . . . . . . . . . .
E. Frequencies of Left and Right Responses
Across Baseline . . . . . . . . . . . . . .
F. Correct and Incorrect Response Frequencies To
Discriminative Stimuli Across Conditioning
And Test Phases For Conditioned Subjects
2 And 9 . . . . . . . . . . . . . . . . .
G. Conditioning Performance Across Conditioning

And Test Phases for Unconditioned Subjects .

Gl. Conditioning Performance Across
Conditioning and Test Phase for
Subject 3, Experimental Condition
di/ga -. . . . . . . . . . . . . . .
G2. Conditioning Performance Across
Conditioning and Test Phase for
Subject 4, Experimental Condition
du/ga . . . . . . . . . . . . . . .
G3. Conditioning Performance Across
Conditioning and Test Phase for
Subject 5, Experimental Condition
du/ga . . . . . . . . . . . . . .
G4. Conditioning Performance Across
Conditioning and Test Phase for
Subject 6, Experimental Condition
du/ga . . . . . . . . . . . . . . .
G5. Conditioning Performance Across
Conditioning and Test Phase for
Subject 7, Condition di/du . . . . .

iii

67
71
71
73

74

74

75
76
77

78

79

80

81

81

82

83

84

85

H.

I.

G6.

di/du . .
G7.

di/du . .

Response Frequencies to Discriminative
Stimuli Across Baseline Conditioning and

Test Trials . . .

Summary of Response Frequencies to
Discriminative Stimuli Over Conditioning

and Test Phases .

iv

Conditioning Performance Across
Conditioning and Test Phase for
Subject 8, Control Condition

Conditioning Performance Across
Conditioning and Test Phase for
Subject 10, Control Condition

86

87

88

89

LIST OF TABLES

Table Page

1. Comparison of Correct and Incorrect Response
Frequencies to Discriminative Stimuli

Across Test Phase . . . . . . . . .
2. Conditioned Infants' Correct Response
Performance to Test Stimuli Across Test
Phase . . . . . . . . . . . .
Summary of Results for Conditioned Subjects
2 and 9 . . . . . . . . . . . .

. 49

S4

56

3.

LIST OF FIGURES

Figure Page
1. Apparatus Diagram . . . . . . . . . . . . . . . 37
2. Sample of Recorded Data . . . . . . . . . . . . 38
3. Experimental Design . . . . . . . . . . . . . . 41

4. Conditioning Performance Across Conditioning
and Test Phase for Subject 2, Experimental
Condition di/ga . . . . . . . . . . . . . . . 50

5. Conditioning Performance Across Conditioning
and Test Phases for Subject 9, Control
Condition di/du . . . . . . . . . . . . . . . 50

6. Backward Conditioning Curve Showing Approach
to Criterion for Conditioned Subjects 2&9 . . 53

7. Mean Percent Response to Conditioning for
Unconditioned Infants . . . . . . . . . . . . 57

8. Conditioning Performance Across Conditioning

and Test Phase for Subject 1, Experimental
Condition di/ga . . . . . . . . . . . . . . . 59

Vi

INTRODUCTION

Studies of infant Speech perception indicate that
young infants can discriminate differences in speech sounds.
For example, infants can discriminate pairs of speech sti-
muli that differ in voice, place of articulation, and manner
of articulation for consonants or that differ in tongue
height and/or point of maximum constriction for vowels.
There are at least two possible explanations for the in-
fant's ability to make these perceptual discriminations.
First, the infant may be coding speech auditorily and dis-
criminating differences on the basis of auditory features
extracted from the speech signal. Second, the infant may
be coding speech linguistically and discriminating differ—
ences on the basis of extracted linguistic features from
the speech event.

Several investigators have tested the first explana-
tion that infants code speech linguistically (e.g., Cutting
and Eimas, 1974; Eimas, Siqueland, Jusczyk and Vigorito,
1971; Morse, 1972). Although these studies suggest that
infants perceive speech in a linguistic mode, serious
methodological problems preclude a clear interpretation of

their results.

The purpose of the present study was to determine
whether young infants perceive speech linguistically or
auditorily. Operantly conditioned head turning was used
to shape the initial perceptual discrimination. Then, a
generalization task was used to test the linguistic versus
auditory processing hypotheses. Specifically, this study

investigated infants' ability to equate two diverse acoustic

cues which signal the same phone.

LITERATURE REVI EW

The literature review is divided into three major
sections: the need for a linguistic code in speech percep-
tion, evidence (from adults) which supports linguistic
coding in speech perception and auditory coding in non-
Speech sounds, and evidence which supports linguistic

coding in the perception of Speech by infants.

The Need for a Linguistic Code

This section is designed to demonstrate the reasons
why the acoustic signal of speech has to be coded in order
for it to be perceived as speech and the conditions under
which speech is coded by infants if they, in fact, perceive
Speech linguistically. Specifically discussed are l) the
discrepancy between the acoustic speech signal and its final
representation as a linguistic event and 2) the subsequent
need for a transformation of speech sound into speech phone,
enabling speech to be segmented into a sequence of phonemes.
Further, it is suggested that if infants code speech linguis-
tically, they must be capable of coding discrepant acoustic
cues in the Speech event as the same phone.

In most cases of speech perception, there is no way

to map the acoustic signal directly onto a sequence of

phones such that any given acoustic segment will correspond
to a given phone. Speech perception is therefore consid-
ered a complex process which requires that the acoustic
Signal be decoded through some type of an analysis-by-
synthesis procedure in order for it to be perceived as
speech (Liberman, 1970; Stevens & Halle, 1967).

As an example of the complexity of Speech, the same
acoustic segment may provide simultaneous cues for the
perception of two distinct phones. For example, the
second formant transition which connects the consonant [C]
and vowel [V] in a CV or VC sequence provides Simultaneous
information about place of articulation for the consonant
and the identity of the vowel. In fact, at no point along
the formant transition can the consonant be segmented from
the vowel so that the two phones are perceived separately
(Liberman, Cooper, Shankweiler, & Studdert-Kennedy, 1967).
Interestingly, different acoustic events contain informa-
tion that signal the same phone, requiring a temporal
synthesis of cues for accurate phoneme perception (Mattingly
& Liberman, 1969). For example, in some CV sequences
identification of a specific stOp consonant requires infor-
mation provided by both the initial consonant burst and
the second formant transition connecting the consonant
and vowel. Therefore, perception of continuous speech
requires Simultaneous processing of cues for different

phones and temporal synthesis of cues for the same phone.

To add to the complexity of the speech (or linguistic)
code, the same acoustic cue may Signal the perception of
two distinct phones, depending upon its syllabic environ-
ment. To illustrate, Liberman, Dellatre, & Cooper (1952)
found that an 1800 Hz noise burst was perceived as /p/ when
it preceded /u/ or /i/ and as /k/ when the burst occurred
before /a/. Conversely, the same phone may be signalled by
two different acoustic cues depending upon its syllabic
environment. For example, the second formant transition
signalling the perception of /d/ rises when /d/ is followed
by /i/ and falls when /d/ is followed by /u/ (Liberman et
al., 1967). In this example, then, perception of /d/ re-
quires the transformation of a variable acoustic cue into
the same linguistic event, /d/. If infants are perceiving
speech linguistically, they must transform these cues into
the same phone. However, if they are perceiving Speech
auditorily, the transformation of sound into phone does not
take place and the two acoustic cues will be perceived as
different auditory events. This perceptual dichotomy pro-
vides a sensitive test for whether or not infants are coding
speech linguistically or auditorily and serves as the basis
for testing the hypothesis advanced in the present study.

Differences Between Speech Perception
and Auditory Perception

 

The purpose of reviewing several studies concerned

with adult speech perception is to provide experimental

evidence for the contention that speech is perceived in a
linguistic mode and other auditory events (classified under
the general heading, "nonspeech") are perceived in an audi-
tory mode. First, a model of Speech and nonspeech proces-
sing is presented which provides the framework for concep-
tualizing the way in which Speech and nonSpeech are per-
ceived. Following this discussion, selected research is
reviewed which experimentally demonstrates that speech and
nonspeech are differentially processed and which directly
or indirectly supports the differential processing model of
speech perception. Support for the differential processing
model comes from 1) behavioral and physiological evidence on
hemispheric specialization for speech, 2) evidence from the
perception of acoustic cues for speech in and out of a
Speech context, and 3) dichotic listening tasks.

Ample evidence exists to support the contention that
speech and nonspeech sounds are processed differently (e.g.,
Cutting, 1974; Mattingly, Liberman, Syrdal, & Hawles, 1971;
Studdert-Kennedy, Shankweiler, & Pisoni, 1972; Wood, 1973).
Theoretical accounts of speech perception explain speech
processing in terms of different levels or stages of proces-
sing (Day & Wood, 1972a, 1972b; Studdert-Kennedy et al.,
1972; Studdert-Kennedy & Shankweiler, 1970; WOod, 1973).
Specifically, Speech perception involves the processing of
both auditory and phonetic information as represented by

the hierarchial arrangement of auditory and phonetic stages

of processing. At the auditory level, processing is accom-
plished by the auditory system by which extracted auditory
features from the acoustic signal are converted into psy-
chological attributes of pitch, loudness, duration, and
timbre (Studdert-Kennedy et al., 1972). Based upon this
auditory analysis, phonetic processing transforms the
acoustic parameters of the Speech signal into phones
through the extraction of linguistic features. Acoustic
cues extracted from the speech Signal or other sounds (both
classified as nonSpeech) are processed at the auditory
level only. The resulting distinction between Speech and
nonspeech perception is that speech is perceived linguis-
tically (or phonetically) and nonspeech is perceived
auditorily.

The differential processing of speech and nonspeech
is substantiated both behaviorally and physiologically by
studies that demonstrate hemispheric specialization for
speech. Several studies have shown a right ear advantage
(left hemisphere representation) for speech stimuli and a
left ear advantage (right hemisphere representation) for
nonspeech stimuli (e.g., Kimura, 1961, 1964, 1967). More-
over, this differential ear advantage is determined not by
the stimuli themselves but by whether or not the stimuli
signal linguistic or nonlinguistic attributes. Dichotic
tasks using the same speech stimuli (e.g., /ba/) Show a
right ear advantage if the subject is asked to identify a

linguistic parameter of the signal (e.g., a stop consonant)

and a left ear advantage if the subject is identifying a
nonlinguistic parameter of the signal such as pitch

(Day, Cutting & Copeland, 1971; Day & Wood, 1971). Using
the same type of identification paradigm, Wood, Goff, &

Day (1971) found significantly greater auditory evoked po-
tentials over the right hemisphere during the nonlinguistic
task. Wood and his associates used the linguistic vs.
nonlinguistic task in order to clarify the results of

other studies measuring auditory evoked responses to Speech
vs. nonspeech stimuli. These studies (specifically, Cohen,
1971; Matsumiya, Tagiliaso, Lombroso, & Goodglass, 1972;
Morrell & Salamy, 1971) also demonstrated hemispheric
Specialization for Speech vs. nonspeech stimuli.

It can be concluded from the results of these studies
that Speech and nonspeech are processed differently. Spe-
cifically, the linguistic aspects of Speech are processed
by the left hemisphere, whereas nonspeech or the nonlinguis-
tic aspects of speech are processed by the right hemisphere.

That the perception of Speech is in some sense differ—
ent from the perception of other auditory events is further
substantiated by studies concerned with the perception of
acoustic cues for Speech in and out of a Speech context.

First, the acoustic cue for place of articulation for
stops--the second formant transition (F2)--sounds different
in and out of Speech. In Speech, the cue is heard as a
linguistic event, for example, [bae]. When presented alone

out of the Speech context, however, it is heard as a chirp

of a specified frequency or as a rising glissando (Mattingly
et al., 1971). Second, the acoustic cue for place is dis-
criminated differently, depending upon whether it is in a
Speech or a nonspeech context. For example, Mattingly et
al. (1971) demonstrated that when the direction and extent
of the F2 transition in a speech context were varied by
equal acoustic increments, the listener continued to hear
the same stop consonant until the increment crossed the
phoneme boundary. At this point, the listener heard a dif-
ferent stop (differentiated by place of articulation).
Changes in the acoustic Signal resulted in identification
of either one stOp or another; intraphonemic variations
were not heard. In other words, perception of the acoustic
cue for place in a Speech context was categorical. However,
when the acoustic variants were presented alone for dis-
crimination, discrimination responses were distributed
bimodally. That is, performance was either better or
worse than in the Speech context.

Similar results have been found with the discrimina-
tion of voice onset time (VOT), the acoustic cue which dif-
ferentiates voiced and voiceless stops. VOT is defined as
the interval between the release burst of the consonant
and the onset of vocal fold vibration. When this dimen-
sion was systematically varied by equal increments in a
Speech context, perception was nearly categorical; that
is, few sounds could be discriminated better than they

could be identified, with Speech discrimination functions

10

Showing high peaks at the phonetic boundaries (Liberman,
Harris, Kinney & Lane, 1961). However, when the cues were
extracted and presented alone for discrimination out of
speech, no peaks occurred at the boundaries and discrimi-
nation performance was at near chance levels.

It should be noted that these results (Mattingly
et al., 1971; Liberman et al., 1961) are not meant to imply
that only speech is perceived categorically. For example,
Cutting and Rosner (1974) have demonstrated categorical
perception of' nonspeeCh sounds (sawtooth waves) that vary
in rise time (rise time is defined as the time between
sound onset and the point at which the sound reaches its
maximum intensity). Thus, the same acoustic dimension
(e.g., VOT or changes in the extent and direction of the
F2 transition) is perceived differently in and out of
speech. This is an important distinction Since Liberman
and his associates have argued elsewhere (e.g., Liberman,
1970; Liberman et al., 1967; Mattingly & Liberman, 1969)
that categorical perception is an attribute unique to
Speech perception.

The perception of acoustic cues for Speech that vary
as a function of phonetic context are also perceived dif-
ferently in and out of Speech. In some CV or VC sequences,
the cue for place of articulation (the F2 transition)
varies as a function of the preceding or following vowel.
In the /di/-/du/ example, the F2 transition signalling

perception of /d/ rises in /di/ and falls in /du/. In

11

speech the /d/'s perceptually are identical. However, in
isolation they are perceived as a rising or falling
glissando or as chirps of a higher and lower frequency
(Liberman et al., 1967).

It seems clear that acoustic cues for Speech are
processed differently, depending upon whether or not they
are in a Speech context (Liberman et al., 1961; Liberman
et al., 1967; Mattingly et al., 1971). Out of speech the
acoustic cues are perceived as auditory events and can be
discriminated along a continuum. In speech, the cues are
perceived as phonemes and can be discriminated no better
than they can be identified. In other words, perception
of the acoustic cue for speech in a Speech context is
categorical, or linguistic. Out of speech, perception is
continuous, or auditory.

Dichotic listening tasks have provided a sensitive
test for determining whether Speech discriminations are
based upon an auditory or linguistic feature analysis.

In the tasks to be discussed, listeners identified CVC
sequences where the auditory or linguistic parameters of
the speech signal were experimentally manipulated.

The general pattern to emerge suggests that phonemic de-
cisions are based upon a linguistic (rather than an audi-
tory) feature analysis.

In two studies (Hawles, 1969; Studdert-Kennedy and
Shankweiler, 1970) listeners' errors were recorded to

dichotically presented CVC Sequences in which the stop

12

consonant in each pair was differentiated by two linguis~
tic features (e.g., place and voicing). The presentation
of /pa/, for example, to the right ear and /ga/ to the

left resulted in responses of either /ba/ or /ka/. A
response of /ba/ can be explained by the extraction and
integration of voicing from the left ear and place from

the right whereas /ka/ resulted from the integration of
voicing from the right ear and place from the left.
Responses which involved no integration of feature cues
(such as /da/ or /ta/) rarely occurred. These data support
the notion that linguistic features are psychologically
real and further demonstrate that linguistic features can
be extracted from phones and combined to form a "new"

phone not presented in the original stimulus pair. Al-.
though these studies suggest that Speech perception is
based upon extracted linguistic (rather than auditory)
features, the auditory parameters of the Speech signal

were not manipulated. No distinction can be made, there-
fore, between an auditory vs. linguistic feature dependency
during Speech perception.

The question of whether speech perception is lin-
guistically or auditorily based was specifically addressed
in an investigation where both parameters were experimen-
tally manipulated (Studdert-Kennedy et al., 1972).

Correct responses to dichotically presented CV pairs whose
stop consonants shared both phonetic and auditory informa-

tion (e.g., same place of articulation and identical F2

13

transitions as in /di/, /ti/) were compared with pairs
which shared only phonetic information (e.g., same place
of articulation but diverse F2 transitions as in /di/,
/tu/). Studdert-Kennedy et a1. hypothesized that if
(linguistic) feature sharing (e.g., same place of articu-
lation) is auditorily based, listeners should respond with
greater accuracy to pairs which shared the same linguistic
feature and identical F2 transitions. However, if speech
processing is linguistically based, no difference in per—
formance between the two stimulus sets should result.
Results Showed no Significant difference in performance
between the two sets of stimulus pairs, supporting the
authors' contention that phonemic decisions are based upon
the extraction of linguistic (rather than auditory)
features from the speech event.

Collectively, the studies reviewed in this section
indicate that 1) Speech and nonspeech (or the linguistic
and nonlinguistic aspects of speech) are processed in
different hemispheres of the brain, 2) the acoustic cues
for Speech are perceived categorically in speech and non-
categorically when extracted out of speech, and 3) phone-
mic decisions in speech are based upon a linguistic,

rather than an auditory, code.

Infant Discrimination of Speech Contrasts

 

A number of studies have demonstrated that infants

can discriminate between a variety of Speech contrasts.

14

For example, infants have discriminated pairs of speech
stimuli that are differentiated by voicing (Eimas et al.,
1971; Trehub and Rabinovitch, 1972; Trehub, 1973; Eimas,
1974), place of articulation (Moffitt, 1971; Morse, 1972;
Cutting and Eimas, 1974; Eimas, 1974) and manner of
articulation (Eilers and Minifie, 1975) for consonants,
and pairs that are differentiated by tongue height and/or
point of maximum constriction of the tongue for vowels
(Trehub, 1973).

Few of these studies, however, have addressed them-
selves to the question of whether or not the infant is
coding speech linguistically. Since this hypothesis is
the focal point of the present investigation, only those
studies which attempt to answer the linguistic coding
hypothesis will be critically reviewed. These studies can
be grouped into the following general categories: 1) dis—
crimination of the acoustic cue for voicing in stOp con-
sonants, 2) discrimination of the acoustic cue for place
of articulation in stop consonants, and 3) Similarity
responses to two different acoustic cues which Signal the
same stop consonant. One other line of research will be
discussed since it provides indirect evidence for linguis-
tic processing of speech, namely the study of auditory

evoked responses to speech and nonspeech stimuli.

15

Infant Discrimination of the
Acoustic Cue for Voicing_

 

 

The purpose of the following studies was to determine
whether infants perceive the acoustic cue for voicing in
stop consonants in a categorical manner. Specifically,
when the acoustic cue for voicing--voice-onset time (VOT)——
is systematically varied in equal acoustic increments, will
infants discriminate the change when it crosses the phone-
mic boundary but not discriminate the same increment when
it occurs within a phonemic class? If infants successfully
discriminate between phonemic categories but do not make
intraphonemic distinctions, their performance would match
that of adults'; that is, they would discriminate differ-
ences in phones no better than they could assign them to
one phonemic category or another.

The results of most of the studies reviewed here
suggest that infants are coding speech linguistically;
however, the results cannot be clearly interpreted because
of methodological problems in the discrimination paradigms
employed and confounding variables inherent in the stimulus
dimensions studied.

Eimas et a1. (1971) were the first to investigate
linguistic discrimination of Speech contrasts in young in—
fants. Their procedures have served as the basic method—
ology used in most subsequent investigations of infants'

discrimination of speech sounds.

16

Eimas et al. (1971) sought to determine whether one-
to four-month-old infants could perceive the acoustic cue for
the voicing distinction in stops, voice-onset time (VOT),
in a categorical fashion. Discrimination functions were
obtained both within and between phonemic categories for
/ba/ and /pa/. Each of three experimental conditions in-
volved discrimination of a 20 msec increment in VOT. For
the two within—category shift conditions, these values
were —20 and 0 msec VOT (both perceived by adults as
[ba]), and +60 and +80 msec VOT (both perceived by adults
as [pa]). The between-category Shift condition included
VOT values of +20 and +40 msec VOT, both perceived by
adults as [ba] and [pa], respectively. In the control
(no-shift) condition, subjects heard one of the six ex—
perimental stimuli throughout the entire session.

Discrimination responses were measured using a high-
amplitude sucking (HAS) procedure similar to that described
by Siqueland and DeLucia (1969). First, a baseline level
of high-amplitude sucking is obtained (approximately 20-30
responses/minute at a Specified amplitude) under condi—
tions of no auditory stimulation. Following baseline
stimulus presentation is made contingent upon the infants'
rate of sucking. Initially, the infant increases his rate
of sucking in response to the (presumably) reinforcing
prOperties of the stimulus. After a time the infant
satiates to the stimulus and his sucking rate decreases.

When a predetermined response decrement criterion is

17

reached, a new stimulus is presented. If sucking rate
increases it is assumed that the infant has discriminated
the stimulus. Significant group effects are computed by
determining whether or not there is a Significant differ-
ence between the mean sucking rate of the experimental
group who have been shifted to a new stimulus and the mean
sucking rate of a control group who continue to hear the
same stimulus. If there is a Significant difference in
the rate of responding between the experimental and control
groups during this period, it can be inferred that the
experimental group discriminated the stimulus change.

In subsequent studies investigating the linguistic
coding hypothesis, the Eimas et al. (1971) procedure was
modified by making the presentation of the stimulus con-
tingent upon the emission of each high-amplitude sucking
response.

In the Eimas et al. (1971) study, there was a sig-
nificant difference between experimental and control groups
for the between-category shift only, allowing the authors
to infer that the infants were perceiving the cues cate—
gorically, or according to phonemic categories.

Subsequently, Eimas (1975) investigated two- to
three-month-old infants' categorical perception of the
voiced-voiceless distinction in alveolar stop consonants.
The same general procedure was employed as in the Eimas
et a1. (1971) study except that stimulus presentation

was made contingent upon the emission of each sucking

18

response. Infants' ability to discriminate 50 msec varia—
tions in VOT was investigated both within and between the
phonemic categories, /d/ and /t/. Again, infants demon-
strated discrimination between phonemic categories but did
not evidence discrimination for within-category changes.

In addition to the voiced-voiceless distinction found
in English, Eimas (1975) investigated perception of a
third voicing distinction, prevoicing, found in languages
such as Thai. When a stop consonant is prevoiced, laryn-
geal vibration begins considerably before the consonant is
released. In this study, two- to three-month-old infants'
discrimination<ﬁ380 msec changes in VOT was investigated
within the prevoiced phonemic category (which had VOT
values of -70 and -150 msec) and across the phonetic
boundary separating prevoiced from voiced stop consonants
(with VOT values of -70 and +10 msec, respectively).
No significant difference in performance was found between
the within-category and between-category groups, although
the between-category group did show a reliable recovery
in sucking response from preshift to postshift stages.

These studies suggest that infants discriminate dif-
ferences along the VOT continuum for the voiced-voiceless
distinction in bilabial and alveolar stops; however,
whether or not infants can discriminate differences across
the prevoiced-voiced boundary is still unclear.

Two salient criticisms have emerged from studies in-

vestigating infants' discrimination of voicing contrasts

19

and from infant speech discrimination studies utilizing

the HAS procedure. One criticism concerns questions re—
garding the validity of the HAS procedure. The other ques-
tions whether the speech stimuli in the voice discrimina-

tion studies can be used as a measure of linguistic dis—

 

crimination of speech contrasts.

AS was mentioned earlier, the methodology employed
in all infant discrimination studies which test the lin-
guistic coding hypothesis is a variation of the high—ampli-
tude sucking procedure described by Siqueland and DeLucia
(1969). The high-amplitude (HAS) procedure assumes that
the infant will increase his rate of sucking above a
baseline level 1) when he learns that it is his HAS re-
sponse which determines stimulus presentation and 2) if he
finds the stimulus event reinforcing. When the stimulus
event is no longer reinforcing, the infant's sucking rate
decreases. After sucking rate has decreased to a pre-
determined criterion, a new stimulus iS presented con-
tingent upon the infant's sucking response. Given that the
infant both discriminates the new stimulus and finds it re-
inforcing, a subsequent increase in sucking rate will
result. If the experimental group demonstrates a signi-
ficant increase in recovery of HAS following stimulus
shift as compared with a control group (who receive the
same stimulus throughout the experimental session), it is
inferred that the experimental group detected a difference

between the two stimuli.

20

Butterfield and Cairns (1974) have challenged the
rationa1e(s) underlying the pre- and postshift phases of
the HAS procedure. Their criticisms revolve around the
fact that infant speech discrimination studies have not
provided adequate controls for effects which violate the
assumptions underlying this procedure. The need for such
controls is demonstrated in the following discussion.

During the preshift phase of the HAS procedure, an
increase in sucking relative to baserate is assumed to
result from the reinforcing properties of the novel stimu-
lus. This increase continues until the minute just prior
to the response decrement criterion. Butterfield and
Cairns argue that whether or not the initial stimulus
actually has reinforcing properties is not demonstrated
using this procedure. AS evidence for their position,
they experimentally demonstrated that infants reliably
increase their sucking rate above baseline during the
minute immediately preceding a response decrement criterion,
regardless of whether or not they are being presented with
a stimulus. Further, the decrement in response rate fol-
lowing repeated presentation of the auditory stimulus
cannot be attributed to a loss in the reinforcing pro-
perties of the stimulus since the same effect is experimen-
tally produced with no contingent stimulation. Control-
ling for these possible effects necessitates the inclusion
of two additional control groups, one group which receives

no auditory stimulation and another group which receives

21

noncontingent auditory stimulation. To date, infant speech
discrimination studies have not provided controls for
these effects.

Using a random walk procedure, Butterfield & Cairns
demonstrated that infants Show response decrement and a
subsequent increase in sucking rate by chance alone and
that this function closely parallels that obtained by the
experimental shift groups. This suggests that the pattern
of results shown by the Shift group could be occurring as
a random event and not as the result of discrimination.
Butterfield & Cairns, however, did not produce a random
walk for the control condition. Unless a random effect
can also be demonstrated under conditions of no-shift,
their argument is unsubstantiated.

In summary, a causal relationship between the pre-
sentation of a stimulus and the pattern of sucking responses
using the HAS procedure cannot be established 1) until
adequate controls are included in experimentation and 2)

2) until differences between experimental and control
groups can be demonstrated as nonsignificant.

A criticism that has specifically arisen from studies
investigating the categorical perception of VOT concerns
the basis from which the discriminations are being made.
The critical question, of course, is whether infants are
perceiving speech auditorily or linguistically. Because
categorical discrimination of VOT in stop consonants was

demonstrated, Eimas and his coworkers have inferred that

22

infants are perceiving these distinctions according to
linguistic categories.

Recently, Stevens & Klatt (1974) have offered an
alternative explanation to linguistic discrimination of
these Specific contrasts. They suggest that infants may
be making categorical discriminations on the basis of
an auditory (rather than a linguistic) feature analysis;
specifically, on the basis of whether or not a first
formant (Fl) transition occurs in the acoustic speech
signal.

Stevens and Klatt isolated two important cues for
perception of the voiced-voiceless distinction, voice-
onset time (VOT) and the presence or absence of a signifi-
cant Fl transition. The F1 transition prevails immediately
following voice onset for voiced Stops and is nonexistent
during production of voiceless stops. This transition
occurs as the result of the supraglottal articulators
changing positions from the consonant to the following

vowel after voice onset. During production ofaivoiceless

 

stop, the articulators have already reached their target
position by the time that voicing begins, and so no Fl
transition results. This effect occurs for all StOp con-
sonant pairs differentiated by voicing.

In an experiment where VOT and rate of formant motion
were independently manipulated, it was found that the pre-
sence of an F1 transition after voice onset was necessary

for perception of a voiced stop (/d/) and that the absence

23

of such a transition cued perception of /t/. Stevens and
Klatt concluded that the voiced-voiceless distinction in
stops is categorically perceived, but according to auditory
features rather than linguistic features. In their expla-
nation, VOT values of less than 20 msec are accompanied

by an F1 transition and the stop is perceived as voiced.
VOT'S greater than 20 msec are accompanied by no such tran-
sition and are perceived as voiceless. Although this
strategy for perception of the voicing distinction in stops
was specifically applied as an alternative explanation to
the Eimas et al. (1971) results (which used the stimuli
/b/-/p/ with a phonetic boundary of about 20 msec VOT),

the F1 transition vs. no F1 transition is a characteristic
of all stops differentiated by : voice and can therefore

be applied to all subsequent experiments investigating the
categorical discrimination of the voiced-voiceless distinc-
tion in stop consonants.

Although Stevens and Klatt introduced an alternative
explanation for the categorical discrimination of VOT in
stop consonants, the strategy used by infants in making
this discrimination is still unclear. Two possible alter-
natives still remain: auditory or linguistic.

Support for the notion that categorical discrimina-
tion of the voicing distinction can be made on the basis
of auditory features alone has been provided by Kuhl and
Miller (1975) in a speech perception experiment with

chinchillas. Chinchillas were selected because their

24

auditory system resembles that of the human. The chinchil—
las successfully discriminated the voiced-voiceless dis—
tinction in alveolar plosives in initial position when the
sequences were repeated by the same talker or by different
talkers and when the tokens were repeated by the same
talker and by different talkers in different vowel
environments. However, the most important outcome of this
experiment concerns the perception of acoustic variations
in VOT of synthetically produced alveolar plosives. The
discrimination function which resulted was nearly identical
to that obtained from infant and adult subjects. That is,
discrimination of the acoustic cue for the voicing distinc-
tion in the alveolar stops, /d/ and /t/, was nearly
categorical.

The results of Stevens and Klatt (1974) and Kuhl and
Miller (1975) suggest an interesting alternative to the
categorical perception of stOp consonants that are differ-
entiated by voicing--that is, the phonetic boundary separa-
ting these two phonemic categories into voiced vs. voice—
less stops may be the surface manisfestation of an under—
lying acoustic distinction. If this is true, then the
Speech events are coded auditorily and the accurate per-
ception of this "linguistic" distinction rests upon the

auditory system's ability to resolve these differences.

25

Infant Discrimination of the Acoustic
Cue for Place of Articulation

 

 

The purpose of most investigations of infants' per-
ception of the primary acoustic cue for place of articula-
tion in stops-—the second formant transition--has been to
demonstrate that infants are processing the cue linguis-
tically. Two types of evidence support a linguistic pro-
cessing interpretation; 1) infants' categorical perception
of the acoustic cue for place of articulation and 2) in—
fants' differential processing of the acoustic cue for
place of articulation in and out of a Speech context. In
each case, however, the results are inconclusive. First,
infants' categorical discrimination of the place distinc-
tion can be explained in terms of auditory processing of
the acoustic cues. Second, the demonstration that infants
process acoustic cues for speech differently when they are a
part of speech still does not directly answer the question
that they are processing these cues linguistically. The
second result is further weakened by the fact that not all
investigators have found a clear demonstration of differ-
ential processing of speech in infants.

Morse (1972) was the first to investigate infants'
linguistic perception of the place distinction in stops.
Using the nonnutritive sucking procedure, he investigated
two month olds' discrimination of the second (F2) and
third (F3) formant transitions (which signal perception

of /ba/ and /ga/) in a speech context and in isolation.

26

In [ba], the F2 and F3 transitions rise to the steady state
vowel. In [ga], the F2 transition falls while the F3 tran-
sition rises. The experimental procedure was similar to
that employed by Eimas, et a1. (1971). In the speech con-
dition, results showed a reliable recovery of sucking in
the experimental group relative to that of a control group.
However, in the nonspeech condition the results were
bimodally distributed. That is, some infants in the ex-
perimental nonspeech group significantly increased their
rate of sucking during postshift relative to postshift
performance of the experimental speech group whereas others
in the experimental nonspeech group ehﬂmx Showed no change
or significantly decreased their sucking relative to the
experimental speech group. Because of this distribution,
the results could not be clearly interpreted.

In the speech condition, there was strong evidence
of discrimination between [ba] and [ga]; however, nothing
can be said with respect to whether or not the discrimina-
tion was made on the basis of auditory or linguistic fea-
tures Since the acoustic parameters were not systematically
varied.

Eimas (1975) attempted to clarify these findings by
investigating categorical perception of place cues in
two- to three-month-old infants. The Stimuli, /daa/ and
/gaa/, were synthetically produced. The starting fre-
quency and direction of the F2 and F3 transitions were

systematically varied in equal increments, resulting in

27

two tokens of [daﬂ and one of [gas]. (Adults perceive
the acoustic distinctions when they cross the phonetic
boundary but intraphonemic discrimination are only slightly
above chance levels.) Results Showed discrimination of
the acoustic increment when it crossed the phoneme bound—
ary and no discrimination of within-category differences.

A similar experiment (Cutting & Eimas, 1974) was con—
ducted employing the stimuli [baa] and (deal. These two
stimuli differ in the direction and extent of the F2 tran-
sition. In [baa] there is a sharp increase in frequency
from approximately 1200-1600 Hz, whereas in [daa] there is
virtually no change in the F2 transition from its starting
frequency to the steady—state vowel. Six stimuli were
constructed which varied by equal acoustic increments
in the trajectory of the F2 transition. Four of these
stimuli signalled the sequence [baa], whereas two were
perceived as [daa]. Two- to three-month-old infants'
ability to discriminate the acoustic changes within and
across phoneme boundaries was investigated. Again, in-
fants discriminated pairs where each stimulus lay across
phoneme boundaries but intraphonemic variations were not
detected.

The results of these studies concerning the categor-
ical perception of place cues (i.e., Eimas, 1975; Cutting
& Eimas, 1974) cannot be interpreted as evidence for lin-

guistic processing of the acoustic cues for place of

28

articulation. However, an alternative explanation is
available, one which explains the discrimination function
in terms of auditory processing. For example, Cutting &
Eimas (1974) noted that the phoneme boundary separating
the linguistic categories /d&2/ and /baa/ also serves as
the boundary separating two distinct acoustic categories.
For [bae], each acoustic increment involves a Sharp rise
in frequency from its beginning to the steady-state level
of the vowel. In contrast, the two acoustic increments
signalling perception of [deal both involve very little

or no transition to the Steady—state level. Given these
two conditions, it is possible to categorize the acoustic
increments on the basis of extracted auditory features,
specifically, a sharp frequency change vs. no frequency
change. Since these two distinctions fall across the
natural phoneme boundary, this boundary may be the result
of auditory rather than linguistic processing of the acous-
tic cues. The same eXplanation can account for the discrim-
ination of acoustic changes for [gag] and [dag] (Eimas,
1975) except that in this case, the acoustic increments
within the [9&2] category are signalled by a sharp fall in
frequency.

Cutting and Eimas (1974) examined this possibility
by extracting the F2 transition from each stimulus item
(the [baa] and [daz] stimuli) and presenting F2 pairs for
discrimination. In the nonspeech context, infants success-

fully discriminated each pair of nonspeech stimuli. These

29

results are similar to those found by Mattingly et a1.
(1971) with adults. That is, out of a speech context,
intraphonemic variations as represented by equal acoustic
increments along an acoustic continuum can be discriminated
from one another until the increment crosses the phoneme
boundary. Although these data represent the strongest
evidence for perception in a linguistic mode,they are
clouded by the equivocal findings presented by Morse

(1972).

Infants' Categorization of Phones

 

Using a discrimination paradigm in the study of cate-
gorical Speech perception introduces the possibility that
discriminations will be performed successfully on the basis
of cues other than the ones under investigation. All
infant discrimination studies thus far reviewed are sub-
ject to this problem. My critical review of these studies
has suggested that the phonetic boundary separating stOp
consonants into two distinct categories (on the basis of
voicing or place distinctions) may in fact be a manifesta~
tion of an underlying auditory boundary. Thus, categorical
discrimination of Speech in infancy may be the result of
auditory rather than linguistic processing.

Another way of approaching the question of whether
or not the infant perceives speech linguistically or audi-
torily is to measure his responses to sounds which are

perceptually identical (i.e., share the same phoneme) but

30

where each sound is cued by very different acoustic charac-
teristics. If the infant responds in a similar manner
when he hears two phones which are phonetically identical
but acoustically distinct, it may be inferred that he is
responding to the linguistic (rather than the auditory)
features of the two sounds.

This approach was undertaken in an investigation by
Fodor, Garrett, & Brill (1975) with fourteen- to eighteen-
week-old infants. An attempt to condition head-turning
to reinforced and unreinforced monosyllabic sequences was
undertaken under the following conditions: 1) where the
two reinforced monosyllables were acoustically distinct
but phonetically identical (e.g., /pu/, /pi/) and a third
nonreinforced sequence differed both phonetically and
acoustically (e.g., /ka/) and 2) where the two reinforced
monosyllables were acoustically and phonetically distinct
(e.g., /ka/ and /pi/) and a third unreinforced monosyllable
Shared a phone present in one of the reinforced sequences
(e.g., /pu/). Results suggested that the infants were
re3ponding to the phonemic identity of the two phones in
the same phone group in that significantly more responses
occurred under the reinforcement condition when the CV
sequences shared the same initial consonant than when they
did not. However, when the three experimental conditions
are examined separately, one condition (/pi-ka-pu/) Showed
a Significant effect of reinforced over nonreinforced

syllables for both the same and different phones conditions.

31

Although Fodor et a1. dismiss this result as being the
consequence of a small sample size, it still weakens the
support found for their hypothesis.

Molfese (1972) measured auditory evoked responses to
sets of speech and nonspeech stimuli in infants, children,
and adults. Speech stimuli were natural tokens of [ba] and
[daa] and naturally produced [bal] and [dog]. Nonspeech
stimuli consisted of a piano chord and a speech noise burst
containing frequencies between 250 and 4000 Hz. Signifi-
cantly greater auditory evoked responses over the left
hemisphere occurred for all speech stimuli in each age
group. In addition, there were significantly greater
auditory evoked responses over the right hemisphere in
each group for the nonspeech stimuli. Although these data
do not relate directly to the phonemic coding capabilities
of the young infant, they do suggest that Speech lateraliza-
tion is present very early in infancy. Moreover, it appears
that speech and nonspeech are processed by different hemi~
spheres of the brain throughout the life span.

Several general conclusions are suggested by the
literature. First, speech processing in adults appears
to differ in several ways from nonspeech processing.
Specifically, evidence indicates that speech is perceived
phonemically. Second, there is limited evidence that
infants also perceive Speech phonemically. Third, the
auditory vs. linguistic processing hypothesis has not been

adequately tested with infants because problematic response

32

measures have been used and discriminative stimuli have
been selected in such a way that the auditory vs. linguis-

tic cues cannot be isolated from one another.

Purpose of the Present Investigation

 

Whether or not infants perceive speech phonemically
can be tested under conditions where two identical phones
are signalled by two very different acoustic cues.

The purpose of the present investigation was to
determine whether infants can categorize as the same two
diverse acoustic cues which signal the same phone. If
infants perceive the sounds phonemically, they should
respond to the two tokens as if they were the same lin-
guistic event. However, if infants process the sounds
auditorily, they should perceive the sounds as different
and be unable to make a similarity response.

Using differential head turning as the response
measure, consider the following example: the infant is
responding consistently with a right head turn to [di] and
a left head turn to [ga]. A new stimulus [du] is presented.
If the infant is perceiving the acoustic cues phonemically,
a right head turn should occur, on the basis of the initial
stop consonant. However, if he is perceiving the acoustic
cues auditorily, head turning should be random because

the cues are perceived as different.

METHOD

Subjects

The subjects were twelve home-reared infants residing
in Ingham County, Michigan. There were eight male and four
female infants ranging in age from 12 to 22 weeks (mean
age = 17.6 weeks). The infants were obtained from answers
to a written request distributed by pediatricians to their
patients (see Appendix A). Permission for participation
in the experiment was obtained in writing from each parent
(see Appendix B).

Each infant's level of arousal was operationally
defined as "quiet awake" as demonstrated by open eyes,
occasional movements of the torso and extremities, and
occasional vocalizations (Brackbill and Fitzgerald, 1969).

Two subjects were eliminated from the analysis be-
cause they failed to reach the criterion of four consecu-
tive correct responses to the conditioning stimuli. Ten
subjects remained in the analysis, three in each experi-
mental group and four in the control condition. The measure
of performance in this experiment was the frequency of
correct right or left head turns to stimuli across the con-

ditioning and test trials.

33

34

The stimuli were three synthetic monosyllabic se-
quences, /di/, /du/, and /ga/. The acoustic cues (i.e.,
F2 transitions) signalling perception of /d/ in synthetic
/di/ and /du/ are highly divergent. That is, the F2 tran-
sition in /di/ Shows a sharp rise in the frequency Spectrum,
whereas the F2 transition in /du/ shows a sharp fall. This
difference in the F2 transition characteristics is the
result of coarticulation from the consonant to the two
different vowel environments. In the present experiment,
the stimuli /di/ and /du/ were constructed so that the
only available cue Signalling perception of /d/ in the
two vowel environments was the F2 transition. Even in
natural speech the F2 transition plays a major role in
perception of /d/. In fact, there are no known invariant
acoustic cues which could Signal perception of /d/ in
these two contexts (Ronald Cole, personal communication,
1975).

Adults judge the vowels /i/, /u/, and /a/ to be
equally dissimilar from one another (Pols, vanderKamp,
& Plomp, 1969). Although this equivalence was established
for adults, the selection of these three vowels reduces
the possibility of infants' making a generalization re-
sponse on the basis of vowel Similarity.

All stimuli were synthetically produced on a PDP/9
computer using a terminal analog Speech synthesis proce-
dure (for a description of this procedure, see Klatt,

1974). Stimuli were synthesized without initial consonant

35

bursts. The fundamental frequency for all three sequences
began at 103 Hz at 0 msec, rising to 125 Hz at 35 msec.
At 85 msec, the fundamental frequency began drOpping from
125 Hz, reaching 94 Hz at 215 msec and terminating at 50
Hz at 255 msec. (A schematized representation of the fun-
damental frequency contour is presented in Appendix C.)

The acoustic parameters of each CV sequence were as
follows:
/ga/

F1: 200 Hz at O msec to 720 Hz at 45 msec

F2: 1640 Hz at 0 msec to 1240 Hz at 40 msec

F3: 2100 Hz at 0 msec to 2500 Hz at 45 msec

F4: 3600 Hz constant

F5: 4500 Hz constant
Formants are fixed for /a/:

F1: 720 Hz

F2: 1240 Hz

F3: 2500 Hz
/di/

F1: 180 Hz at 0 msec to 330 Hz at 15 msec

F2: 2000 Hz at 0 msec to 2200 Hz at 40 msec

F3: 2800 Hz at 0 msec to 3000 Hz at 40 msec

F4: 3900 Hz at 0 msec to 3600 Hz at 40 msec
Formants are fixed for /i/:

F1: 269 Hz

F2: 2200 Hz

F3: 3000 Hz

36

/du/

F1: 180 Hz at 0 msec to 370 Hz at 15 msec

F2: 1600 Hz at 0 msec to 1100 Hz at 15 msec

F3: 2700 Hz at 0 msec to 2350 Hz at 15 msec

F4: 3200 Hz constant

F5: 4500 Hz constant
Formants are fixed for /u/:

F1: 330 Hz

F2: 1100 Hz

F3: 2300 Hz

(Schematized representations of stimuli characteris-
tics can be seen in Appendix C. For spectrograms of each

stimulus, see Appendix D.)

Apparatus

 

A schematic representation of the apparatus (Figure
1) and a description of the recorded data (Figure 2) can
be seen on the following pages.

An Ampex 602-2 tape recorder (frequency response
50-15000 Hz, + 2 dB) was used for stimulus presentation.
Channels 1, 3, and 4 of the tape recorder were routed
through an audio channel selector (or manual switching
device) such that each channel of the tape recorder could
be manually selected.

The Audio Channel Selector received audio input from
each channel of the tape recorder. The audio signal was

then sent to an Ampex speaker (Model AA620) located in a

37

 

 

PLUG SELECTABLE

 

 

 

 

SPEAKER

PLUG SELECTABLE

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

REINFORCERS REINFORCERS
LEFT (L RIGHT
REINFORCEMENT REINFORCEMENT
SWITCH SWITCH -

LOOR
SWITCH
CUEING SWITCH
TO E2
“ A
1 CUEING LIGHT
# AF ESTER INE
AMPEX REINFORCEMENTL ' 5* webs
. CONTROLLER EL,
4 CHANNEL Aumo l D RECORDER
' CHANNEL J ’
C3.—.0 SELECTOR *
TAPE RECORDEFj * 4’
Fan
Q G

l::}—d>

REINFORCEMENT
DEPRESSOR

Figure 1.

Apparatus Diagram

stnmﬂuseammt

left respcnse

rigu2reagxse

rehﬁbnxment

chmuelllsthmﬂus
duumell3sthmﬂus
chmumlLlsthmﬂus

Figure 2.

38

 

ll

 

 

 

 

Q’IJMUOWP

 

 

 

Sample of Recorded Data

thma

 

39

sound attenuated booth one meter directly in front of the
infant. Audio output from the Audio Channel Selector was
also fed to an Esterline Angus Operation Recorder (Model
A602X) where the occurrence of each stimulus event was
recorded.

Each channel of the Audio Channel Selector was con-
nected to a separate channel on the event recorder. The
onset and offset of each channel was recorded on the event
recorder. The onset and offset of each channel was also
sent from the Audio Channel Selector to the Reinforcement
Controller.

The Reinforcement Controller received input from
both the Audio Channel Selector and the right and left
reinforcement switches. These switches were located in the
sound attenuated booth and were activated when the infant
made a right or left head turn. Right and left reSponses
were routed through the Reinforcement Controller to
separate channels of the event recorder where they were
recorded. The right or left response Signal also was
relayed through the Reinforcement Controller to the
reinforcers. A correct response resulted in the activa-
tion of each reinforcer.

Output from the Reinforcement Controller also was
fed through a small floor switch located inside the sound
attenuated booth. Depression of this switch activated one
red and one green blinking light (1 cm. in diameter)

which were embedded in the ears of one of the right and

40

left reinforcers. Depression of the switch following
stimulus presentation caused the lights to blink in the
direction of a correct response.

The Esterline Angus Operation Recorder provided a
record of the following: 1) the occurrence of reinforce-
ment, 2) a left response, 3) a right response, 4) the
stimulus event, 5) the onset and offset of [di] from
Audio Channel 1, 6) the onset and offset of [du] from
Audio Channel 3, and 7) the onset and offset of [ga] from
Audio Channel 4.

Depression of a small switch in the sound attenuated
booth activated a small cueing light located on top of the
Reinforcement Controller outside the booth. This light
signalled Experimenter l to switch channels. Extinction
of the light signalled Experimenter l to present the
stimulus.

A switch located on the floor outside of the booth
to the right of Experimenter l was connected to the Rein-
forcement Controller. Depression of the switch stopped
the presentation of reinforcement following a correct
response. This switch was manually controlled by Experi-

menter 1 during the baseline phase of the experiment.

Design and Procedure

 

Infants were randomly assigned to one of three con—
ditions: experimental condition di/ga, experimental con-

dition du/ga, and control condition di/du (see Figure 3).

41

Fixed Parameters Across Conditions

 

Number of subjects: 4
Presentation: randomized order
Number of trials:

Phase 1: 10 presentations of each stimulus type de-

fine the baseline period on any given day

Phase 2: variable
Phase 3: the infant's behavior (i.e., a change of

state) defines the length of the testing
period on any given day four successive
days of testing constitute the testing
phase

 

 

 

 

Session X Condition # Exp. Egp. Congrol
PHASE 1 stimuli di/ga/du du/ga/di di/du/ga
Baseline reinforcement no no no

discriminative
stimuli di/ga du/ga di/du
reinforcement
1. continuous to criterion (four'consecutive
PHASE 2 correct responses to the discriminative
Conditioning stimuli).
2. following attainment of the response cri-
terion, either a) begin testing or b)
terminate the session and begin testing
on the following day. The length of the
conditioning period preceding the esta-
blishment of the response criterion on
any given day determines which of the
two procedures is followed.
discriminative
stimuli di/ga du/ga di/du
generalization
PHASE 3 stimulus du di ga
Testing
reinforcement
continuous

discriminative stimuli: reinforced
generalization stimulus: not reinforced

 

Figure 3. Experimental Design

42

Each infant was conditioned to respond differentially to
the two discriminative stimuli with either a right or left
head turn. In each condition, the stimulus-response con—
tingency was counterbalanced so that half of the subjects
responded, for example, withaaleft head turn for /di/ and
a right head turn for /ga/ and half of the subjects re-
sponded with a right head turn for /di/ and a left head
turn for /ga/-

Each infant was tested individually approximately
1/2 hour after the last feeding. Infants were tested at
the same time on each subsequent day of testing. Occasion-
ally, an infant was tested twice in one day--one session in
the morning and one session in the afternoon with testing
times kept constant across days.

The experimental sessions took place in a small,
semi-darkened sound attenuated booth. The infant was
either seated on the floor (supported by Experimenter 2
who sat directly behind the infant) or in an infant seat
at a 45° angle. The infant sat facing a speaker at a
distance of one meter. Sounds were presented at a level
of 75 dB (re: .0002 dynes /cm2). The parent was permitted
to observe the session from inside the booth; however,
he/she remained behind the infant out of the infant's
visual field. Experimenter 2 also remained directly
behind the infant and was responsible for reinforcing the

infant's head turning responses.

43

The experiment consisted of three phases: baseline,
conditioning, and testing. The number of phases completed
during any one experimental session depended upon whether
or not there was a change in the infant's level of arousal.

A Single trial consisted of two repetitions of a
single stimulus. During baseline, 20 tokens of each
stimulus type were presented in random order for a total
of 30 trials (or 60 tokens). A response was defined as
a lateral right or left head turn from midline of 45° or
greater that occurred within four seconds following a
stimulus presentation. A subjective judgments was made by
Experimenter 2 for determining whether or not a head turn
occurred. A lateral head turn was counted as a response
if the left or right reinforcer was in the infant's
potential visual field. The reinforcers were placed so
that the infant had to make a lateral head turn of at
least 45° before he could see the reinforcer.

The conditioning phase began on a subsequent day.
Conditioning began with the presentation of n trials of
one discriminative stimulus. The number of trials pre-
sented was determined by each infant's anticipatory head
turning response. When the infant spontaneously made two
consecutive correct head turns in the same direction
immediately following a stimulus presentation, the second
discriminative stimulus was presented and the same proce-
dure was followed. When the infant reached the response

criterion of two consecutive correct anticipatory head

44

turns following stimulus presentation, the presentation
sequence was changed to four consecutive trials of each
stimulus type. This sequence presentation was maintained
until the infant demonstrated a discriminative response.
A discriminative response was defined as a correct head
turn to one stimulus followed by a correct head turn to
the second stimulus. For example, four trials of /di/
(correct response = right head turn) are presented. On
the fourth trial the infant responds correctly with a
right head turn. On the fifth trial /ga/ is presented
and the infant responds correctly with a left head turn.
This change in direction of response is operationally
defined as a discriminative response. Following the
demonstration of this response pattern, the order of pre-
sentation of each stimulus type was randomized under the
constraint that no stimulus was presented more than four
consecutive times.

If the infant did not make a spontaneous head turn
in the correct direction following stimulus presentation,
his head turning response was shaped through the presenta-
tion of one red and one green blinking cueing light, each
embedded in the ears of two of the reinforcers (the
dancing bears). The cueing lights were located approxi—
mately 75° to the right and 75° to the left of the infant's
midline.

Each correct head turn was followed immediately by

2 1/2 seconds of reinforcement presented at approximately

45

a 75° angle to the right or left of the infant. Through
pilot work, the two most effective reinforcers were found
to be 1) a stationary 30 cm high hollow yellow plastic
lion which was illuminated by a light bulb located inside
the lion or 2) a 20 cm high toy dancing bear which rotated
as it played the drums. Pilot work indicated that the
reinforcer alone or alternating between these reinforcers
maintained head turning at a high level throughout the
session. Reinforcers were presented to each infant either
1) alone, 2) alternating within a session, 3) alternating
from session to session, or 4) together in combination.

Experimenter 2 returned the infant's head back to
midline immediately before the next stimulus was presented.

Four consecutive correct responses to the discrimina-
tive stimuli marked the end of the conditioning phase.
The test phase generally began without interruption.
However, this procedure was altered under the following
condition: if the infant reached criterion towards the
end of what marked an average conditioning session for that
child (somewhere between 30 and 60 trials, depending upon
the individual infant), the session was terminated and
testing began on the following day. This procedure was
followed in order to avoid testing the infant when he
was approaching fatigue or satiation.

During Phase 3, a continuous reinforcement schedule

was maintained and a novel (generalization) stimulus was

46

introduced. Responses to the discrminative stimuli were
reinforced, the generalization stimulus was never reinforced.
The generalization stimulus and the two discriminative
stimuli were presented randomly. The length of each test—
ing session was determined by the infant's level of

arousal (Brackbill & Fitzgerald, 1969). Four successive

days of testing marked the end of the testing phase.

RESULTS

Two infants were successfully conditioned to respond
with directional head turns, one infant from the experimen-
tal group (di/ga) and one infant from the control group
(di/du). In the test phase, directional responses to the
test stimulus were random for both subjects. The experi-
mental subjects‘random responses to the test stimulus /du/
failed to support the linguistic hypothesis. The control
subjects' random responses to /ga/, however, were as ex-
pected for the control condition. A third infant from the
experimental condition di/ga showed signs of conditioning
to the two discriminative stimuli towards the end of the
test phase.

A striking feature of these data was the marked
variability found in individual response patterns across
time. Since group analysis would seriously misrepresent
the data, the data were analyzed individually with the
strength of the analysis lying in the large number of
observations obtained for each infant. Group analysis
was limited to situations where individuals formed homo-
geneous groups.

The results were analyzed by first determining
which infants performed above chance expectations during

the test phase of the experiment. Responses to the test

47

48

stimuli were then analyzed for these infants. Uncondi-
tioned infants were included in a separate analysis of
characteristic response patterns and trends in individual

performance.

Conditioning

 

In order to test the linguistic hypothesis, each
infant had to first meet the criterion of a discriminative
response to the two discriminative stimuli. Discrimination
was measured by determining whether each infant's correct
responses to the discriminative stimuli deviated from that
expected by chance. Each infant's response frequencies to
the discriminative stimuli were compared with those fre-
quencies expected by chance under the null hypothesis.

The null hypothesis assumes that, given a spontaneous
lateral head turn and no response bias, the probability
of a right or left response is 0.5.

Results showed that two infants achieved the discri-
mination criterion: one fromthe experimental group di/ga
and one from the control group, di/du. Neither infant
showed a reSponse bias (see Appendix E). Results for all
infants are summarized in Table l.

A visual representation of the general course and
pattern of conditioning for the two conditioned infants is
shown in Figures 4 and 5. Trials across conditioning
and test phases were segmented into twenty-trial blocks.

The prOportion correct was derived from the frequency of

49

 

 

 

 

 

 

 

 

 

 

. I m L a .
mo v «C a vm m A mm«
mm.o mH.H m.n~ m.>~ mm mm «m OH
ah.o «mm.aa m.vm m.vm mm om mv m
mm.o hm.o m.Hm m.Hm mv ma vm m
No.0 mn.o m.oa m.oa an m ma 5
Illléba
Honucoo
Hm.o ow.~ mm mm mm mm ov a
~m.o oo.o m.va m.vH mm vH ma m
mm.o mm.o He av mm hm me e
mm\sc
Hmucmeﬁnmmxm
m¢.o Ho.H m.am m.Hm mm mm mm m
hh.o *om.m ma ma mm m cm m
Ho.o vv.m m.m~ m.m~ mm mm mm a
mm\ac
Hmucmﬁﬂummxm
mmmnm umma Hmuoa
mmouod Nx uomnuoucH uomuuoo 30m uomuuoocH uomnuoo
uomuuoo uomnnsm
SOAuHomoum >ocmsvmum pouommxm hocmskum ©w>ummno
mmmgm umma mmouos Hadeﬂum
o>aum¢asﬂuomﬂo ou m0ﬁocmswwum omcoammm uomuuoocH 0cm uomuuou mo SOmHHMQEoo .H manna

50

conditioning test phase
phase 8
H
u
m
1.)
H
H
o

proportion
correct

0 20 4O 6O 80 100120140160

trials in blocks

Figure 4.

Conditioning Performance
Across Conditioning and

Test Phase for Subject 2,
Experimental Condition di/ga.

 

conditioning a: test phase
phase _2|
u:
1.0 3'
'H.
c u
0“ 0.8 0'
'38 0.6 I
81104 '
0.0 ' :
8° 0.2 n
d
o.0*::::::':::4..‘#

0 20 40 60 80 100120140160130200220240260

Figure 5.

trials in blocks

Conditioning Performance Across
Conditioning and Test Phases for
Subject 9, Control Condition di/du.

51

correct and incorrect responses. Incorrect responses in-
cluded both responses in the wrong direction and no-re—
sponses. (Response frequencies and correct response propor-
tions for the two conditioned infants are found in Appendix
F.)

Examination of the conditioning curves for these
two infants reveal no marked differences in the rate or
general pattern of conditioning. Although the criterion
level was reached sooner in the conditioning process for
the experimental subject (Trial 60) as compared with the
control subject (Trial 155), both infants were performing
above chance expectations by Trial 40. The control sub-
ject's average performance following her initial increase
remained above chance for the subsequent 80 trials, indi-
cating that She, in fact, had achieved conditioned dis-
crimination prior to attainment of the response criterion.

The conditioned infants' performance is characterized
by a relatively stable response pattern as compared to the
highly variable response pattern observed for the uncon-
ditioned infants (see Appendix 61-7). The overall course
of conditioning for the conditioned infants also generally
increased across conditioning trials, a trend not repre-
sented in the performance of most unconditioned infants.

A modification of the backward learning curve, des-
cribed by Zeaman and House (1962), was used to compare the
way in which the two conditioned infants approached a

conditioned level of responding. This information is

52

displayed in Figure 6 in a backward conditioning curve.
The backward curve was produced by shifting each infant's
conditioning curve to the right. The starting point on the
graph is the point at which each infant attained and pro-
longed a performance level above chance (or 0.7 correct).
This criterion was selected instead of the a priori cri-
terion level because it better represented the point at
which the conditioned state was actually reached. This
point was at Trial 40 for both the experimental and control
subject (see Figures 4 and 5). Trial 40 therefore became
Trial 0 on the new graph. The preceding block of twenty
trials is indicated by a minus Sign on the abscissa of
the graph. Both infants' initial peak level of performance
is therefore equated in space, providing a comparison of
the response Slope of each curve for the period immediately
preceding attainment of above chance responding.

The response Slopes in Figure 6 are very Similar, in-
dicating that both the rate and the approach to criterion

weretie same for both infants.

Test of the Linguistic Hypothesis

 

The linguistic hypothesis was tested by comparing
each conditioned infant's response frequencies to the
test stimuli against those frequencies expected under the
null hypothesis. The null hypothesis assumes that given a
spontaneous lateral head turn, the probability of a right

or left turn equals 0.5.

53

 

Experimental 52

----- Control 59

r 1.0
4.0.9
- 0.8
)( n-0.7
~r0.6

,’ 1.0.5
X: «r- 0.4
¢' "0.3
0.0.2
.. 0.1

F_q__+__r—1——f—ﬁp—f—d 0.0
-60 ~40 -20 0

l

 

trials preceding last block

Figure 6. Backward Conditioning
Curve Showing Approach
to Criterion for Con-
ditioned Subjects 2&9.

54

It was hypothesized that infants in the experimental
group would generalize their response to the test stimulus
in the direction of the discriminative stimulus which
shared the same phoneme, /d/. It was hypothesized further
that infants in the control group would not Show a general-
ization reSponse to the test stimulus /ga/, since it shared
no acoustic or linguistic similarity with either of the
discriminative stimuli.

Results for the two conditioned infants are shown in
Table 2.

Table 2. Conditioned Infants' Correct ReSponse Performance
to Test Stimuli Across Test Phase

 

 

 

 

 

 

Observed Frequency Expected Frequency Proportion
. Correct
Subject A ross
Cor- Incor- Row Cor- Incor- X2 T :t Phase
rect rect Total rect rect e
Experi-
mental
di/ga
2 9 14 23 11.5 11.5 0.69 0.39
Control
di/du
9 13 10 23 11.5 11.5 0.17 0.56
2

*x 3 3.84; df = 1; p_<_ .05

The experimental subject's directional responses to the
test stimulus /du/ were random, lending no support to the
linguistic hypothesis. The control subject's directional

responses to the test stimulus /ga/ were also random, as

55

expected for the control condition. A summary of the

results for the two conditioned infants are Shown in Table 3.

Analysis of Unconditioned Infants

 

Recall that infants were defined as conditioned only
if they demonstrated performance discrimination above chance
expectations for the two discriminative stimuli. Two infants
in this experiment met that criterion; eight infants did not.
Further analysis of the unconditioned infants, however, re-
vealed the following: 1) the number of head-turning re-
sponses to discriminative stimuli increased across experi-
mental trials for all infants, 2) all infants responded with
more correct than incorrect head turns to the discriminative
stimuli, and 3) one of the infants showed Signs of condition-

ing across the test phase of the experiment.

Conditioned Head-Turning .

 

The eight unconditioned infants were grouped in an
analysis of increases in the frequency of head-turning re-
sponses to discriminative stimuli across baseline and con-
ditioning trials. (The infants' response frequencies are
tabled in Appendix H. A summary of the results are shown
in Figure 7. Conditioned infants' responses are tabled but
not graphed.)

Average responses were plotted in ten-trial blocks
across baseline and in twenty-trial blocks across the con-
ditioning and test phases. Two infants contributed less

than 60 trials across the conditioning phase, leaving the

56

up“

:n
o
v
0.
F4
II

vm.m A Xs

N

 

 

 

 

 

 

 

ha.o mm.o 0H ma mm.HH« Hh.o om me oH ma mm nﬁ<ﬂc v m
HDHﬁHO
$6 $6 3 a ma. E6 8 om a 3 so 83. m m
H355
1.3%
uomuuou noon uowu uomnuoo noon uomu I 3 V S
mx coauuomoum THOUGH IHOO Nx counpuomoum IuoocH IuoO unmﬁm uqu m m % m.
4 m... T a.
S .4 u o
ﬂasaﬁum puma. «Haeﬂum o>ﬁumcweﬂnomﬂa a. T. 1.
mmmcommmm m. w W.
maﬁammmm .m. u
mmmcm msﬂummm. Hm>o mmmcommmm m m.
m pcm N muomhndm cocoauﬂpcou How muHSmmm mo mumEEsm .m canoe

100 '-

Mean Percent Response

57

90 "

60--

50 d-

40‘r

30‘b

20 ab

lO'P

 

O L L I L L l L J
I I I— I I r j I
1

2 3 l 2 3 4 1

 

994)-

Nd)-

 

 

 

ten-trial blocks twenty-trial blocks

Figure 7. Mean Percent Response to Conditioning for
Unconditioned Infants.

58

third and fourth trial blocks with an n of seven and six,
respectively.

The response pattern in Figure 7 is characterized by
a strong increase in mean level of responding across trials.
The curve itself is ogival in shape, Showing its greatest
acceleration across the conditioning phase of the experiment.
This curve demonstrates that although infants were not con-

ditioned in directional head-turning, conditioning did in-

 

fluence their level of head-turning activity.
(Individual conditioning curves for the unconditioned

infants are included in Appendix Gl-7.)

Directional Responses

 

Although the unconditioned directional responses to
the discriminative stimuli did not differ from chance, all
infants responded with more correct than incorrect head
turns (see Table 1, page 49). Further, the infant with the
greatest discrepancy in correct and incorrect responses
(Subject 1) showed a Sharp rise in conditioning performance
across the test phase (see Figure 8, next page). This con-
figuration suggests that Subject 1 was in the process of
being conditioned and may have reached the conditioned
state with continued experimentation.

(A summary of response frequencies to discriminative
stimuli for all infants across conditioning and test

phases is presented in Appendix I.)

59

.mm\wo conuﬂecoo
Hmpcmaﬂuwmxm .H wommndm How mmmnm umme can
msecoﬂuﬂpcou mmouod mocwEuomumm msecoﬂuwpcoo .m musmﬂm

mHMHuu macoSp mo mxooHn

oﬁoﬁoﬁamoaovmoNoaommomogommooa om ow ow ON 0

TI4IITI+llTlIII+IiII+IITI+I4TI+IITILIITI+IJ_oio

1
1091100 uorqzodoxd

 

--------
l‘
O
O

DISCUSSION

The purpose of the present study was to determine
whether infants code Speech linguistically or auditorily.
Specifically, it was predicted that infants would respond
to the linguistic characteristics of speech sounds by
grouping together two consonant-vowel monosyllables which
Shared the same consonant but whose consonants were sig-
nalled by very different acoustic cues.

In order to test the hypothesis, infants were first
required to discriminate stimuli with directional head
turns. Two infants succeeded at this task: one infant
from an experimental group successfully discriminated be-
tween /di/ and /ga/ and one infant from the control group
successfully discriminated between /di/ and /du/. General-
ization responses to the test stimulus were subsequently
measured. The control subject's responses to the test
stimulus /ga/ were random, as expected for the control
condition. The experimental subject's responses to /du/
were also random, failing to support the linguistic
hypothesis.

The experimental subject's failure to support the
linguistic hypothesis may have been due to the overriding
influence of the vowel in making a categorization response.
In all three CV stimuli, the vowel is perceptually more

60

61

salient than the consonant. The steady-state vowels /i/,
/u/, and /a/ are more intense and longer in duration,
characterized by higher energy concentrations in formants
extending over 200 msec (see Appendix C). In contrast,
the formant transitions which define the stop consonants
/d/ and /g/ are weaker in energy and extend over a much
shorter period (less than 35 msec). For adults, the vowels
/i/, /u/, and /a/ are perceptually equidistant. Assuming
that these vowels are also perceptually equidistant for
infants (a result Shown by Fodor et al., 1975), generaliza-
tion responses to test stimuli on the basis of the vowel
would serve as an explanation for the random responses ob-
tained for both the experimental and control subjects.
Since only two subjects were tested, however, these sugges-
tions are only speculative.

Nine of the ten infants did not demonstrate a re—
sponse bias across the baseline phase. One five-month-old
infant (Subject 3) Showed a strong response bias to the
right. The bilateral responses found for infants in the
age range three- to five-and one-half months were in con-
trast to other studies Showing a biastn the right for four—
month-olds (Siqueland, 1964) and a bias to the left for
three-month-olds (Levison & Levison, 1967).

Neonates characteristically Show a response bias to
the right which diminishes as the infant becomes neurologi-
cally more mature (Brockbill, 1970). Although the equi-

vocal results concerning the direction of the response bias

62

in young infants are difficult to explain, the lack of a
response bias in this study may be accounted for by the
older age of the infants. Six infants in this study were
approximately five-months-old, three infants were approxi-
mately four-months-old, and only one infant was three-
months-old (see Appendix I).

Fluctuations in state both within and across experi-
mental sessions may partially account for the marked
within—subject variability found in the responses of the
unconditioned infants. Although each session was terminated
when the infant exhibited noticeable state fluctautions
(e.g., crying or fussing), changes in attention or motiva-
tion could not be assessed. For example, during experi-
mental sessions, Subjects 3, 6, and 8 (Appendix G1, G4,
and G6) achieved a high level of correct responding which
then ceased abruptly and never recovered. This suggests a
transient conditioning effect. Why the conditioned re-
sponse could not be recovered is unknown; however, state
fluctuations and reinforcement satiation are possible ex-

planations.

Limitations of This Research

 

Procedural problems were encountered during measure-
ment of the head-turning reSponse. When the infant's head
was positioned at midline, Experimenter 2 signalled Experi-
menter l to present the stimulus. The latency between the

signal to Experimenter l and the actual occurrence of the

63

stimulus was approximately two seconds. Because the infant
was often making bilateral responses during this period,
Experimenter 2 was unable to maintain the infant's head

at midline. This resulted in responses which did not ori-
ginate from midline and could not therefore be assessed as
either correct or incorrect.

Since no mechanical head-turning appratus was used,
measurement of a criterion head-turn was based upon the
subjective judgment of Experimenter 2. A lateral re-
Sponse had to exceed 45° in order to be counted as a head-
turn; however, since Experimenter 2 had to estimate whether
or not the 45° criterion was reached, a response of 60°
: 15° was operationalized. The use of a range of responses
rather than a response locus may have made it more difficult
for some infants to learn the discrimination. Further dif-
ficulty may have been introduced by erroneously reinforcing
a correct response which did not originate from the midline
position.

Since there was no adequate a priori method for iden-
tifying the point at which the infant reached a conditioned
state and could be subsequently tested, a non-stringent
criterion of four consecutive correct responses was
employed. However, post hoc analyses revealed that one in-
fant (Subject 9) was conditioned before she reached
criterion (or before she received any test trials). To
avoid this situation, test trials could be introduced

following a given number of conditioning trials regardless

64

of the infant's performance up to that point. A post

hoc analysis would then be used to identify the conditioned
period and test trials prior to that period would be dis-
regarded. Such a procedure would prevent the loss of test-

trial data during a time when the infant was conditioned.

Implications for Future Research

If the vowel, rather than the consonant, serves as the
basis for making a generalization response, the generaliza-
tion paradigm is unsuitable for testing whether infants re-
spond to the linguistic characteristics of the stop
consonant, /d/. However, a discrimination paradigm may be
employed using two tokens of the same phonemic sequence,
/du/. The two tokens (/du/ and /du/) can be synthesized so
that the linguistic cues /d/ and /d/ are perceptually iden-
tical but the acoustic cues Signalling perception of each
/d/ are highly divergent (Ronald Cole, personal communica-
tion, 1975). Although the discrimination paradigms (i.e.,
high-amplitude sucking and heart rate) are problematic,
these methods avoid the serious problem of vowel dominance
and allow discrimination measures to be obtained from a
larger number of infants.

Because of the difficulty in teaching infants to make
a discriminative response using the head-turning paradigm,
this particular paradigm does not appear to be a fruitful
means for studying speech sound discrimination or

categorization. In a recent unpublished report, Morse,

65

Leavitt, Donovan, Kolton, Miller, and Judd-Engel (1975)
tested thirty-nine infants and encountered Similar problems
using the same paradigm. Only three infants acquired the
discriminative response and infants satiated without novel

reinforcement.

SUMMARY AND CONCLUS IONS

This research attempted to determine whether or not
infants could categorize two CV syllables which shared the
same initial stop consonant, /d/, but which were signalled
by highly divergent acoustic cues. Two of twelve infants
met the conditioning criterion of the experiment and their
categorization responses were subsequently measured.
Neither infant demonstrated a categorization response. The
random responses to /du/ by the infant in the experimental
group, di/ga, failed to support the linguistic hypothesis.
The random reSponses to /ga/ by the infant in the control
condition, di/du, were as predicted for the control condi-
tion.

The hypothesis examined in this study may not have
been adequately tested because of two prOblems. First, the
salience of the vowel in the CV sequences selected may be
determinant in making categorization responses. Second,
many infants did not acquire the initial discriminative
responses required to test the linguistic hypothesis. It
is concluded that the hypothesis may be best examined by
testing for discrimination of two different acoustic cues
that signal perception of /d/ in the same linguistic se-
quences, /du/ and /du/, using a high-amplitude sucking or
a heart-rate habituation paradigm.

66

B I BL IOGRAPHY

Brackbill, Y. & Fitzgerald, H. E. Development of the sen-
sory analyzers during infancy. Advances in Child
Development and Behavior, 1969, 4, 173-208.

 

 

Butterfield, E. C. & Cairns, G. F. Discussion summary--
infant reception research. In R. L. Schiefelbusch
and L. L. Lloyd (Eds.), Language Perspectives--
Acquisition, Retardation, and Intervention. Balti-
more: University Park Press, 1974}

Cohen, R. Differential cerebral processing of noise and
verbal stimuli. Science, 1971, 172, 599-601.

Cole, R. & Scott, B. Toward a theory of Speech perception.
Psyghological Review, 1974, 81, 348-374.

 

Cutting, J. E. Different speech-processing mechanisms can
be reflected in the results of discrimination and
dichotic listening tasks. Haskins Laboratories:
Status Report on Speech Research SR—37/38, 1974,
39-53.

 

Cutting, J. E. & Eimas, P. D. Phonetic feature analyzers
and the processing of speech in infants. Haskins
Laboratories: Status Report on Speech Research
SR-37738, 1974, 45-63.

Cutting, J. E. & Rosner, B.S. Categories and boundaries
in speech and music. Perception and Psychpphysics,
1974, lg, 564-571.

 

Day, R. 8., Cutting, J. E., & Copeland, P. M. Perception
of linguistic and nonlinguistic dimensions of
dichotic stimuli. Haskins Laboratories: Status
Report on Speech Research SR—31732, 197I, 193-197.

Day, R. S. & Wood, C. C. Interactions between linguistic
and nonlinguistic processing. Haskins Laboratories:
Status Report on Speech Research SR-31/32, 1971, 185-
192.

 

67

68

Eimas, P. D. Speech perception in early infancy. In L. B.
Cohen and P. Salapatek (Eds.), Infant Perception:
From Sensation to Cognition. Volume II: Perception
of Space, Speech, and Sound. New York: Academic
Press, 1975.

 

 

 

Eimas, P. D., Siqueland, E. R., Jusczyk, P. & Vigorito, J.
Speech perception in infants. Science, 1971, 171,
303-306.

Fodor, J. A., Garrett, M. F., & Brill, S. L. Pi ka pu:
the perception of speech sounds by prelinguistic
infants. Perception and Psychophysics, 1975, 18,
74-78.

 

Hawles, T. Effects of dichotic fusion on the perception
of speech. Unpublished doctoral dissertation, Uni-
versity of Minnesota, 1969.

Kimura, D. Cerebral dominance and the perception of verbal
stimuli. Canadian Journal of Psychology, 1961, 15,
166-171.

Khmma, D. Left-right differences in the perception of
melodies. _guarterly Journal of Experimental Psyphol-
ogy, 1964, 1_6_, 355-358.

Kimura, D. Functional asymmetry of the brain in dichotic
listening. Cortex, 1967, 3, 163-178.

Kuhl, P. K. & Miller, J. D. Speech perception by the
chinchilla: voiced-voiceless distinction in alveolar
plosive consonants. Science, 1975, 190, 69-72.

Liberman, A. M. The grammars of speech and language.
Cognitive Psychology, 1970, 1, 301-323.

 

Liberman, A. M., Delattre, P. C., & Cooper, F. S. Some
cues for the distinction between voiced and voice-
less stops in initial position. Language and Speech,
1958, 1, 153-167.

 

Liberman, A. M., Harris, K. S., Kinney, J. A., & Lane, H.
The discrimination of relative onset-time of the
components of certain speech and non-Speech patterns.
Journal of Experimental Psychology, 1961, pg, 379-388.

Liberman, A. M., Cooper, F. S., Shankweiler, D. P., &
Studdert-Kennedy, M. Perception of the speech code.
Psychological Review, 1967, 14, 431-461.

 

BIBLIOGRAPHY

69

Matsumiya, Y., Tagiliasco, V., Lombroso, C. T., & Goodglass,
H. Auditory evoked response: meaningfulness of

stimuli and interhemispheric asymmetry. Science,
1972, 175, 790-792.

Mattingly, I. G. & Liberman, A. M. The speech code and
physiology of language. In K. N. Leibovic (Ed.),
Information Processipg in the Nervous System.

New York: Springer-Verlag, 1969, pp. 97-117.

Mattingly, I. G., Liberman, A. M., Syrdal, A. K., & Hawles,
T. Discrimination in speech and nonspeech modes.
Cognitive Psychology, 1971, 2, 131-157.

Moffitt, A. R. Consonant cue perception by twenty-to
twenty-four—week-old infants. Child Develppment,
1971, 42, 717-731.

 

Molfese, D. L., Freeman, R. B., & Palmero, D. S. The
ontogeny of brain lateralization for speech, and
nonspeech stimuli. Brain and Language, 1975, 2,
356-368.

 

Morrell, L. K. & Salamy, J. G. Hemispheric asymmetry of
electrocortical responses to speech stimuli. Science,
1971, 174, 164-166.

Morse, P. A. The discrimination of speech and nonspeech
stimuli in early infancy. Journal of Experimental
Child Psychology, 1972, 14, 477-492.

Morse, P. A., Leavitt, L. A., Donovan, W. L., Kolton, S.,

Miller, C. L. & Judd-Engel, N. Head-turning to speech:

explorations beyond the heart and the pacifier.
Infant Development Laboratory: Research Status
Report #1, Harry A., Waisman Center, UnIversity of
Wisconsin, 1975.

Siqueland, E. R. & DeLucia, C. Visual reinforcement of
nonnutritive sucking in human infants. Science, 1969,
165, 1144-1146.

Stevens, K. N. & Halle, M. Remarks on analysis by synthesis
and distinctive features. In W. Walthen-Dunn (Ed.),
Models for the Perception of Speech and Visual Form.
Cambridge, Mass.: M.I.T. Press, 1967.

Stevens, K. N. & Klatt, D. H. Role of formant transitions
in the voiced-voiceless distinction for stops. Jour-
nal of the Acoustical Societypof America, 1974, 55,
653-659.

7O

Studdert-Kennedy, M. & Shankweiler, D. Hemispheric special-
ization for speech perception. Journal of the
Acoustical Society of America, 1970, 48, 579-594.

 

Studdert-Kennedy, M., Shankweiler, D., & Pisoni, D.
Auditory and phonetic processes in Speech perception:
evidence from a dichotic study. Cognitive Psychology,
1972, 3, 455-466.

 

Trehub, S. Infants' sensitivity to vowel and tonal con-
trasts. DevelOpmental Psychology, 1973, 9, 91-96.

Trehub, S. & Rabinovitch, M. S. Auditory-linguistic
sensitivity in early infancy. Developmental Psychol-
ogy, 1972, g, 74-77.

 

Wood, C. C., Goff, W. R., & Day, R. S. Auditory evoked
potentials during speech perception. Science, 1971,
173, 1248-1251.

Wood, C. C. Parallel processing of auditory and phonetic
information in speech perception. Haskins Laboratories:

 

Status Repprt on Speech Research SR-35/36, 1973.

APPENDICES

APPENDIX A

LETTER TO PARENTS

APPENDIX A

LETTER TO PARENTS

RESEARCH IN INFANT SPEECH PERCEPTION

PURPOSE: An investigation of the speech perceptual capa-
bilities of young infants is presently being
conducted through the Department of Audiology and
Speech Sciences at Michigan State University. The
purpose of the project is to determine if preverbal
infants reSpond to speech stimuli in the same
manner as adults do.

AGE OF INFANTS: three to six months. The experiment will
run until June 1 so if your child is three months
by about May 15 he will qualify.

BACKGROUND: It was previously thought that infants from 0
to about 10 or 12 months of age heard speech as
noise. There is recent evidence to suggest,
however, that infants hear speech during this
period in the same manner as adults do. This pro-
ject is aimed at determining more precisely the
way in which infants do perceive speech.

TASK: The task involves having the infant listen to Speech
sounds presented through a speaker. He is taught to
turn his head either to the left or right when he
hears one of the sounds. Each correct head turn is
reinforced with an illuminated yellow plastic lion or
a toy dancing bear.

LENGTH OF EXPERIMENT: Each session will last approximately
1/2 hour. The number of sessions required depends upon
how quickly the infant learns the task. It may take
between 19 and 15 sessions for learning to take place.
Therefore, please do not Sign up unless you are wil-
ling and able to make this kind of commitment.

TRANSPORTATION: Experimenters can provide transportation.
PLACE: Speech and Hearing Clinic, Michigan State University.

WHOM TO CONTACT: Kris King

71

72

PHONE NUMBER: If you are interested and can make the
commitment necessary to complete the experiment,

please call me at the Speech and Hearing Clinic,
353-5107.

BENEFIT: This is a difficult experiment to execute because
of the time required for training. The results of
the experiment, however, will greatly contribute to
our understanding of the normal Speech perception
process and its development. It will also serve as
the basis for our understanding of speech and lan-
guage problems in early childhood.

All participants will receive a summary of the
research findings.

Thank you for your consideration.

Sincerely,

Kris King

Doctoral Candidate

Department of Audiology and
Speech Sciences

APPENDIX B

PARENTAL CONSENT AND RELEASE FORM

APPENDIX B

PARENTAL CONSENT AND RELEASE FORM

I give consent for my child to participate in an investiga-
tion of infants' ability to perceive speech. I understand
the purpose and procedure of the experiment as it has been
explained to me. I understand that the individual data
obtained from my child will be kept confidential and that
names will not be used when reporting data from individual
subjects. I further understand that I may discontinue my
child from participation in the experiment at any time, and
that I may have my child's data destroyed if I so desire.

I understand that I will receive a summary of the results

of this experiment when it is available.

Mailing Address:

 

 

 

 

Signature Name (please print)
Date Address
City Zip

73

APPENDIX C

SCHEMATIC DESCRIPTION OF SPEECH STIMULI

APPENDIX C

200

150.L

100. /’/’r

50

 

l j

100 130 260 250 330 msec

cf'
m
«3%

C1. Fundamental Frequency Contour for
Stimuli /di/, /du/, and /ga/.

74

frequency in KHz

75

 

 

 

 

APPENDIX C
5
'F
4"
~\‘\__ F4
31. /’ F3
2.. / F2
F1
5 5 A J

 

 

‘V

50 100 150 200 250 300 msec

O -\

C2. Schematic Representation of Formant Structure
for /di/.

76

APPENDIX C

 

 

5.
F
F5
4(P
F4
F3

 

Fl

 

/
\ F2
/

frequency in KHz
I"

 

 

0 50 100 150 200 250 300 msec

C3. Schematic Representation of Formant Structuring
for /ga/.

 

frequency in KHz

C4.

77

 

 

 

 

 

 

 

Schematic Representation
for /du/.

APPENDIX C
‘F'
F5
1)-
TF4
L F3
.- \\‘ F2
F1
f
I. - 4 I. .- : 1
O 50 100 150 200 250 300 msec

of Formant Structure

APPENDIX D

SPECTROGRAMS

\ﬂp\

 

 

 

Ziluiill ti:

mzdmwomﬁommm

Q xHQmem4

\mm\

 

 

 

78

APPENDIX E

FREQUENCIES OF LEFT AND RIGHT RESPONSES ACROSS BASELINE

Frequencies of Left and Right Responses Across Baseline

APPENDIX E

 

 

 

 

 

 

 

 

Observed Expected
Condition 5:5; Frequengy Frequencyg
Left Right T2321 Left Right 2
Experimental
di7ga 1 5 8 13 6.5 6.5 .35
di/ga 2 10 9 19 9.5 9.5 .00
di/ga 3 2 l7 19 9.5 9.5 *10.32
Experimental
du/ga 4 12 10 22 11.0 11.0 .02
du/ga 5 4 2 6 3.0 3.0 no test
du/ga 6 ll 12 23 11.5 11.5 .00
Control
—__d—i7du 7 3 0 3 1.5 1.5 no test
di/du 8 2 2 4 2.0 2.0 no test
di/du 9 13 10 23 11.5 11.5 .20
di/du 10 5 4 9 4.5 4.5 no test
*X2 3 3.84; df = 1; p _<_ .05

no test: X2

79

statistic could not be computed for expected
frequencies less than 5

APPENDIX F

CORRECT AND INCORRECT RESPONSE FREQUENCIES TO
DISCRIMINATIVE STIMULI ACROSS CONDITIONING AND

TEST PHASES FOR CONDITIONED SUBJECTS 2 AND 9

 

 

 

mm. mv. mv. om. me. mm. om. Hm. mm. vm. mm. mm. uomuuou
SOHHHomoum
mm\sp
o H m N o N s o H v N N mmcommmu
:02 m Honucou
m m m H m m o m N o H o uomuuoocH
mH m m NH m m m m m h m H uoouuoo
om. om. on. he. Hm. mv. uomuuoo
coHuuomoum
o H H o o o mmcommmu mm\Hp
'02
m HmucwEHHmmxm
H m m m N v uomuuoocH
m m m 0H m m pomuuou
oem omm com omH omH oVH omH 00H om om ow om

 

mxoon SH mHMHuB mchoHqucoo

muommumu Homn

mucommmm

COH H CO
Thaw .0.6 o

 

 

m can m muomnnsm COCOHqucou How mommnm #009 can mchoHqucoo
mmouo¢ HHSEHum m>HumcHEHuomHo ou mmHocwswmum mmcommmm powuuoocH p00 uoouuoo

h xHDmem<

80

APPENDIX G

CONDITIONING PERFORMANCE ACROSS CONDITIONING AND

TEST PHASES FOR UNCONDITIONED SUBJECTS

.mm\Nc coHu
IHUSOO HmucmEHmexm .m pommndm
00m mmmnm #009 can mchoHUHpcou

mm0H04 OOSOEHOMHOA mchoHqucou .HO wusmHm

mHmHuu Munoz» mo mxooHn
smﬁemowomomoeﬁssmsﬁ s s 9

1| 1 1] d I

d

d
C

‘ Q 1‘

  
 

0 xHQmemd

BN 0

O
O

,_4
O

N
HOOOOOOOO
qoexxoo uorqxodOJd

c3 a.¢n P-\O U3‘¢ "1
O

81

 

82

.mm\0p SOHH
IHpcou HmucmﬁHuomxm..v pomnnsm
How 00030 Hume paw mchoHqucoo

mmouog mocmEHOMHmm mcHsoHqucou .mw musmHm

meHHu >u003p mo mxooHn

ommoomomHomHovHomH ooHS cm 8 8 o

O xHQmemﬂ

14—1

A

o.o
.05
TN.o
.m.o
its
$5
$5
.50
$5
11.0

o.H

 

1001100 UOIQJOdOId

83

.mm\sp coHqucou
HmucmEHummxm .m pomnbsm How Omanm umme can
mchoHqucoo mmouog mocmEnowme GSHSOHHHUSOU .mw

mHmHuu >ucmzu mo mxooHn

oqm 0mm 08 0% 0mm 3N 0mm oR omH WmHHNHHOWH 8H mm mm .9 c

P b
d .1 (q I I d d I d

  
 

.
.
.
.
.
.
.
.
.

O xHazmmm<

musmHm

Lu

1001100 uorq1od01d

 

 

84

.mm\sp

coHuHocoo HmucmsHummxm .m pommnsm

How 00020 umme cam mchoHqucou
mmouo< mocmEuomuwm mSHSOHuHUSOO .vm wustm

mHMHHu xucmsu mo mxooHn

ommoHNomHomHovHomH OOHO om ov om o

 

FL” 0, n u. a n u NIJ}I1IJ ego
.
. : H.o.d
. 1
. : N.o.m
" ..m.o m
. AI 0 T7
. v o m
. II moo
0
L. o.o m
. 1
. .. >6 0
. 0
. ..m.o 4
.
u .. m.o
O lﬁ OOH

 

U NHOmem¢

 

85

.sp\Hc coHuHccoo
Houucou .b pommndm How 00030 0005 can mchoHu
ancou mmouom mocmEHOHHmm mchoHqucoo .mo musmHm

mHmHHu mucus» mo mxooHn

oRomNommovNommoHNomHomH ovH ONHOOH .m cm .v .N o

I. H.o
.. N.o
.. m.o
.. v.o
.. m.o
it 0.0
N.o
.. N.o
.. m.o
.. o.H

 

-0---.----
L
t

O anzmnHmd

1001100 uot11odo1d

86

.0p\Hp.00Hqucou
Houucoo .m HOOnQSm How chasm 0008 can
mchoHqucou mmouo< mocmsuomumm mGHSOHqucou

mHmHnu mucm3u mo meoHn

oRBmowNovm oNBNomHomHnwvHomquH 5cm we we

omﬁv

 

I d u d u 4‘ d I d d d

.‘
.
.
.
.
.
.
.
o
.
.
.
.

H...

 

 
 

0 anzmmm¢

, o.o
. do
NS
.NS
. To
. To
Id
. To
. NS
.NS
. SH

 

.mw musmHm

10e1100 uor11odo1d

87

.56\N6 coHuHccoo Houucoo .OH nomﬂnsm 000 mmmam paws

paw qucoHqucoo mmouod mocmEHOMHmm mchoHqucou

BmomuovmommoﬁommoﬁovmoﬁoomomHomHQVHomHooH om om ov cm 0

mHmHuu Spams» mo mxooHn

D xHQmemd

.no musmHm

o.o
H.o
N.o
m.o
«.0
m.o
w.o
5.0
m.o
m.o
o.H

1001100 uor11odo1d

APPENDIX H

RESPONSE FREQUENCIES TO DISCRIMINATIVE STIMULI

ACROSS BASELINE CONDITIONING AND TEST TRIALS

Hm pcm m muoanSm pmcoHqucoo OpsHocH uoc Op mmcommmu w 0003 p00 uncommmn 0mm: "muozv

 

 

 

 

 

 

NNN NNN NNN NN4 NN4 N44 NNN NNN NNN NN4 "mmcommmm Hamoumm cums
N.HH o.HH 4.HH o.N 4.N N.N 4.N N.N N.N N.4 "mucommmm cums
NH NH NH 4 N N H N o N NH :6\Hc
HH NH NH N N N H N N N N sN\Ho
N OH OH NH N N o 4 o o N swap
HH N 4 N N N N N o N N sN\HN
Houucou
4H NH NH u . NH NH 4 NH N N mN\so
N HH NH N N N N 4 H H N mmxoo
NH NH NH I HH NH N N N N 4 mN\sn
HmucmEHmexm
OH NH NH NH NH N N N N N N NN\HN
N N NH HH HH N N N N 4 N mmxHN
NH N HH N N N N N N N H NN\HN
HmucmEHmexm
N N H 4 N N H N N H
Homnnsm coHHHccoo

 

 

meoon mHmHuelwucmsev

000nm 0009

meoon HOHHBTNHSOSBV
mmmnm mcHSOHqucou

Amxoon HOHHBISOBV
mmmnm wcHHmmmm

 

 

meoon 0H mHmHHBV mHmHHB umma p00 mCHCOHuHUSOO
OSHHmmmm mmouod HHSEHum m>HumcHEHHomHo on mmHocmsqum wmcommmm

m XHDmemﬂ

88

APPENDIX I

SUMMARY OF RESPONSE FREQUENCIES TO DISCRIMINATIVE

STIMULI OVER CONDITIONING AND TEST PHASES

mmmcommmu uomuuoo .mmmcommmu pogomuuoo .mwmcommou uomuuoo mopsHocH mHMHHu UOOHOMSHmm

.msHseHum any on annulpwms msomcmucomm m 00>Hm noncommmu uomuuoo mo coHuHomonm

 

N

.mmmcommmu @000 000 .ocHHUHE 00:0 Hmnuo SOHHHmom m 500w mcHumchHHo

 

 

 

 

 

H
NN.N N NN NN NNH NNH 44.N NH NN H4 NHN NNN soxHN N.N 2 NH
HN.N N NN N4 HN NN NN.N 4H NH H4 H4H NNH :N\HN 4 N N
4N.N 4 NN 4N NNH NNH N4.N N NN NN NNH NNH 5N\HN N m N
NN.N N N NH NN NN NN.N NN NN NN NNH HNH NNNHN 4 2 N
Houucou
HN.N N NN N4 NNH NHH 4N.N N N NH N4 N4 mesN N.N z N
NN.N N 4H NH NN NN NN.N N NN NH HNH HNH mmxan N.4 z N
NN.N N NN N4 NNH NNH NN.N H NN NN 4N NN mmxao N N 4
HmucoEHHmmwm
NN.N N NN NN 4NH NNH NN.N 4 NN H4 NHH 4NH mN\HN N z N
NN.N N N NN NN NN 4N.N N N NH NN NN NNxHN N N N
NN.N N 4N N4. NNH HNH HN.N N NH NH NNH NNH NNNHN N 2 H
HmucmEHuwmxm

«0d N I 3 1:3 q4m «01. N 41 «0 Jun mum
01 O u 0 08 80 01 O ..u o 08 80
10 D 1 IT. 11. IO 0 I 1.... 13.
NN N N N N... N.N N N N N N... N4. .55.. .2...
N.N. N N N. Hp 1N NN.N. N N N. 9 NM 80348 E 4N4 .0... -3
N N. N N N.N N. N N N.N .

u S P. u S P.

9 RV II S I
ommnm puma mmmcm mchoHqucoo

mommzm umma paw mchoHqucoo Hm>o

H xHQmem<

HHSEHum m>HumcHEHHOmHQ o» mmHocmsvam mucommmm mo NHOEEdm

89

   

1|HlllllllHUIIWIWIMIIWUIHIH‘IWIUIHNHHI
31293103217141