A RELIABILITY TEST. OF THE TWENTY
STATEMENTS TEST".

Thesis for the Doom of M. A.
MICHIGAN STATE UNIVERSITY

Carole Overmier Beﬁinghaus
I966

 

ABSTRACT
A RELIABILITY TEST OF THE
TWENTY STATEMENTS TEST

by Carole Overmier Bettinghaus

The Twenty Statements Test (TST) is an instrument designed to
access self-identification in a manner consistent with the "self" theories
of George Herbert Mead. As such, its utility is limited to research which
approaches the study of social psychology from the vantage point of
sociology.

To administer the TST, the researcher asks a respondent to:

Write twenty answers to the simple question "Who am I?"

in the (twenty numbered) blanks. Just give twenty

different answers to this question. Answer as if you

were giving the answers to yourself, not to somebody

else. Write the answers in the order that they come to

you. Don't worry about logic or "importance". Go along

fairly fast, for time is limited.

This open-ended question acts as a stimulus for a set of reSponses
from each reSpondent. Each set of responses is called a protocol. Every
protocol is content analyzed on the basis of a coded set of discrete cate-
gories.

The present study attempts to measure the reliability among six
coders in the task of coding 150 such protocols. The 150 protocols were
sampled randomly from a population of 1,528 protocols. The 1,528
protocols were obtained from reSpondents in a modified area probability

sample of the United States general public (outside of institutions),

age 21 or older.

Carole Overmier Bettinghaus

The code applied to the protocols contained MB discrete categories,
and the coding task was done under independent conditions. Scores of
reliability (range: from 0 to 10) were obtained for each reSponse and
each protocol. These reliability scores were based upon a combined scale
of consistency (range: from 1 to 6), and agreements (range: from O to 15).

The basic hypothesis was: ”there will be a high level of reliability
among the six coders in the coding of the TST protocols, and responses."
”High level of reliability" was defined as (l) 75 percent of all protocols
will have a mean score of eight or better, and (2) 75 percent of all
responses will have a score of eight or better, or the mean score of all
responses will be eight or better. Results obtained were: (1) 72 percent
of all protocols had a mean score of eight or better, (2) 72 percent of
858 responses had a score of eight or better, and the mean score of all
reSponses was 8.19.

A set of sub-hypotheses was also tested. Sub-hypothesis one was:
"when the difference between consensual reSponses and evaluative-con-
sensual responses is overlooked, reliability is eXpected to appear as
, greater than under realistic conditions.” Sub-hypothesis two was:

"when differences between coders in the decision of whether to divide

a statement into more than one statement for coding purposes are
adjusted so that only the immediate statement is affected rather than
all subsequent statements in the protocol, reliability is expected to
appear as greater than under realistic conditions." Both of these
hypotheses were supported. The suggestion is made that use of coders
with training in grammar would contribute to an increase in inter-coder

reliability.

Carole Overmier Bettinghaus

Sub—hypothesis three was: "the more statements made on a TST
protocol the lower will be the mean reliability score for that protocol."
This hypothesis was supported. The conclusion was that reliability will
be enhanced when fewer than 20 reSponses are sought.

Sub—hypothesis four was: "as content categories increase in com-
plexity, reliability will lessen." This hypothesis was not SUpported.

Subrhypothesis five was: "there will be no relationship in the
pattern of coder agreements which contribute to unreliability." This
hypothesis was used to operationalize the assumption made in the study
that the coders were independent. The test of this hypothesis may be
thought of as a validity check on the reliability study. Under one
condition the hypothesis was rejected. This suggested that enhanced
independence of coders in future studies may be eXpected to lower
measured reliability.

Finally, some potentially useful forms of the TST are suggested:
(1) structured and standardized with factor loadings for individual
items, (2) a Q-sort form for self-styling by the reSpondent, and

(3) unchanged, but with provision for self-coding by the respondent.

A RELIABILITY TEST OF'THE

TWENTY STATEMENTS TEST

By

Carole Overmier Bettinghaus

A THESIS

Submitted to
Michigan State University
in partial fulfillment of the requirements
for the degree of

MASTER OF ARTS

Department of Sociology

1966

ACKNOWLEDGMENTS

I wish to express my appreciation to my advisor, Dr. Hideya Kumata,
who guided me through the report writing stage of this research; and to
the other members of my committee, Dr. Frederick Waisanen, and
Dr. Mason Miller. During Dr. Kumata's absence from the country, Dr.
Robert Stewart served as my advisor, and it is he to whom I am grateful
for aiding me in the formative stages of the research.

The data used to check inter-coder reliability in this study, as
well as the coding of the Gallup coder, were graciously made available to
me by Dr. Charles Loomis, Dr. Kumata, Dr. Stewart, and Dr. Waisanen.

I wish to thank them for this consideration.

The four individuals, other than myself, who recoded the data were
Clark McPhail, Dr. Stewart, Charles Tucker and Dr. Erwin Bettinghaus.

I thank them for conscientious attention to detail in a time-consuming,
somewhat non-stimulating task. Any aSpersions cast on their grammatical
competence in Chapter IV, Conclusions, should be discounted. Grammar

is not their recognized area of competence or training; and one would
not expect those in the role of social scientist to exhibit the level of
grammatical soPhistication needed here.

My sincere thanks go to Mrs. Shirley Sherman, who typed the final
manuscript for this thesis, and was a great help with many formal details.

Finally, I wish to thank all the individuals, within my acquaintance,
whose consistent expectations have been that this research and report
either should be completed, or had already been completed. This "generalized
other" has had great influence on the personal perseverence which I brought

to the task.

CHAPTER

II

III

IV

REFEREN

TABLE OF CONTENTS

BACKGROUND 0 O O O O O O O O O O O O O O 0

Introduction 0 o o o o o o o o o o o o o 0

Relation of a Self Identification Instrument to

Meadian Theory . . . . . . . . . . . . .

Review of Types of Self Identification Instruments

Historical Development of the Twenty Statements Test . .

Review of Previous Investigations Relevant
S ummary O O O O O O C O O O O O O O 0 O O

PROCEDURE 0 O O O O O O O O 0 O O O O O O

The Experimental Context of this Study . .
The Coders and Data Collection Procedure .
The Hypotheses and Scoring Procedure . . .
Physical Analysis Procedure . . . . . . .
Data Analysis Procedure . . . . . . . . .
Statistical Analysis Procedure . . . . . .

RESULrl‘S O O O O O O O O O O O O O O O O 0

General . . . . . . . . . . . . . . . . .
When the Effect of Qualifiers is Removed .

When the Accidental Effect of Splits is Removed

Factors Present in Low Reliability . . . .

to this

Study

0 O O
O O O
0

Reliability Related to Complexity Level of Code Category
Reliability Related to Number of Statements in Protocol

Results of Testing for Independence of Coders

COIJCLUSIOI‘IS o o o o o o o o o o o o o o

The Basic Hypothesis of Reliability . . .

Knowledge of Grammar Related to Coder Reliability

Independence Related to Coder Reliability

Factors in the Present Code which Limit Reliability

Length of the TST as a Factor in Reliability . .
Future Utility of the Twenty Statements Test . . . . . .

CBS 0 O O O O O O O O O O O O O O O O O 0

iii

Page

2”
25
27
35
36
H2

H3

H3
H7
48
H8
”9
52
52

57

57
59
60
61
6M
6M

66

TABLE

II

III

IV

IVA

IVB

VI

VII

VIII

LIST OF TABLES

Distinctions Between Categories for McPartland's

Plan . . .

Growth and DeveIOpment of Code Plans for the

TST SIP .

Reliability Score - Based on a Scale of
Agreement-Consistency

Mean Reliability Score of 150 TST Protocols by N

and % for Three Conditions

Mean Scores of Eight or Over . . . . . . . . . .

Mean Scores of Less than Eight . . . . . . . . .

Reliability Scores for all TST ReSponses by N and

% for Three Conditions .

Code Categories by Frequency of Use for Nine
Coded Least Reliably

Modal Complexity Score of ReSponses by N and

0

Reliability for Two Conditions . . . . . . .

X2 Values for Coder Pair Agreements Under Two

Conditions

iv

Responses

 

Page

 

15

18

28

H3

”3

#4

HS

50

51

54

LIST OF FIGURES

Page

 

Figure 1 Model of Coder Relationships . . . . . . . 55

APPENDIX
A
B
C

Section

Section

II

LIST OF APPENDICES

Page

 

Form of the TST in the Interview Schedule . . . 69
T8 T COde Bock I O 0 C O O O O O I O O O O O O O 7 l
150 TST Protocols . . . . . . . . . . . . . . . 88

Protocols Scoring the Same Under All Three
Conditions, N = nu . . . . . . . . . . . . 89

Protocols Scoring Differently When
Qualification Effect islkmoved, N = 53 . . . 95

Section III Protocols Scoring Differently When Accidental

Section

IV

Effect of Splitting is Removed, N = 17 . . . ion
Protocols Scoring Differently When Accidental

Effect of Splitting is Removed, and When
Qualification Effect is Removed, N = 37 . . 109

vi

CHAPTER I

BACKGROUND

Introduction

 

Any open-ended item in an interview schedule depends upon its
code, rather than its form, for reliability. Therefore, this study is
designed to test a comparatively complicated, formal code system for the
content analysis of responses to a fairly simple, informal question,
"Who am I?", asked as if the respondent were soliloquizing (16, p. 69).
The test is one of reliability.

Reliability is operationalized as very good agreement between six

 

individual coders in the task of coding identical sets of re8ponses to
the above question. Following eXplanations shall refer to each set of
re5ponses as a protocol, and to the question as one form of a self
identification problem or §I3_(23, p. 7a). This Specific SIP is known as
the Twenty Statements Test, or TST, Thus, a TST-SIP protocol is under-
stood to be a set of reSponses made by an individual resPondent to the
question "Who am I?" At times the six coders may be referred to in
pragmatically equivalent terms as eXperimental subjects. A discussion

of the high level of reliability in Chapter II will deal with the concise
operational definition of the above phrase "very good agreement".

In this chapter, I will (1) discuss the need for a SIP which
creates minimum bias in operationalizing Meadian self-theory within any
experimental design context, (2) review the characteristics of several
types of SIP, (3) discuss the history of the TST-SIP which asks a direct

question, while creating a non-directive bias, and (u) review previous

investigations into the reliability of content analyses.

Relation of Self Identification Instrument to Meadian Theory

From the standpoint of a researcher in behavioral science, any
verbalization designed to elicit communication from a reSpondent is a
tool of the trade, or an instrument. From the vantage point of an
individual respondent such a verbalization creates or increases the
imbalance between work required and work completed. That is, for the
respondent, it creates a task or problem to deal with. Since it is the
same verbalization from different standpoints, we will adept the practice
of using the term "self identification instrument" as interchangeable
with SIP. A SIP is, then, an instrument for eliciting communication
from a respondent about the concept of self; Specifically, what objects
he identifies or associates self with_as well as what objects he identifies
or defines self as:

The concept of self is central to the self—role theories of
George Herbert Mead. He held that the self mediated between society and
the individual and allowed the development of the mind. The self was
composed of two phases, the "I", and the "Me". The "I" is the impulsive
tendency of the individual which begins social acts and gives prepulsion
to them. The ”Me" comprises the incorporation of other individuals
within each individual. It gives direction to each social act, and is
the final state of a social act. The "Me" or object phase of the self
is formed through the process of role-taking (26, p. 15). Kuhn tells
us that Mead saw the self as carrying on an internal conversation

between the "I" and the "Me” (14, p. 629). Meltzer interprets Mead's

belief that ”the ability (of the "I" phase of self) to act toward

oneself (in the ”Me" phase) makes possible an inner experience which

need not reach overt expression ... which constitutes mipd," (26, p. 18)
Others, especially Charles Horton Cooley, a contemporary of Mead,

and Gordon Allport, a later reviver of Meadian theory, have added their

contributions1o self-theory (in, p. 629). Cooley characterizes self

as feeling or primitive emotion and claims that the feeling of self

"is, perhaps, to be thought of as a more general instinct, of which

anger, (fear, grief, etc.) etc. are differentiated forms...” (6, p. 171).
Allport, Speaking of the study of personality says,

...all psychological functions commonly ascribed to a
self or ego, must be admitted as data in the scientific
study of personality. These functions are not, however,
co-extensive with personality as a whole. They are
rather the Special aSpects of personality that have to
do with warmth, with unity, with a sense of personal
importance...I have called them "propriate" functions.
If the reader prefers, he may call them self-functions,
and in this sense self may be said to be a neCessary
psychological concept (1, p. 55).

Of social psychology Allport says,

Scholars interested in culture and personality deal
primarily with the function of ego-extension, for their
task is to account for the process of socialization (l, p. 57).

Of the "prepriate" function of ego-extension he has this to say,

Soon the process of learning brings with it a high regard
for possessions, for loved objects and later for ideal
causes and loyalties. We are Speaking here of whatever
objects a person calls "mine." They must at the same
time be objects of importance, for sometimes our sense

of "having" has no affective tone and hence no place in
the prOprium. As we grow older we identify with groups,
neighborhood, and nation as well as with possessions,
clothes, home (1, pp. #1 ff.).

 

However, Allport evinces a view contrasting to Mead's, that the self
directs and controls an individual's behavior, gives rise to the mind,
and is necessarily covert (26, pp. 11 8 18). Allport contends,

What is unnecessary and inadmissable [as data in the

scientific study of personality] is a self (or soul)

that is said to perform acts, to solve problems, to

steer conduct, in a trans-psychological manner,

inaccessible to psychological analysis (1, p. 55).

In addition to the views of self put forth by Allport and Cooley,
several changes have crept into Meadian theory through interpretation.
The self has come to be thought of not as the locus for an internal
conversation, nor as an entity carrying on a conversation with itself
and within itself, but, rather, as embodied in that conversation; or
carried to an extreme form, the self is thought of merely as a re-
flected image of significant or generalized others (18, p. 629).

Kuhn, in pointing to this trend on the part of others (1n, pp. 156 S
198) (11, p. 316), makes no mention of his own like tendency in the
development of the TST, the instrument dealt with in this study. The
TST, is purported to elicit communication from an individual about his
self. In operation it uses a cue verbalization of a form approximating
what one might imagine the ”I" asking of "Me." (Who am I? - as if I
were asking and answering myself, with no other involved.) This elicits
communication of a "stream—of—consciousness" nature, directed toward
the concept "I".

Now Kuhn 8 McPartland found that reSponses to this question could
be divided into two gross categories which seem to fit the needs of

Meadian theory (1) consensual responses which ”refer to groups and

classes whose limits and conditions of membership are matters of common

knowledge, (16, p. 69) " and (2) subconsensual reSponses which "refer
to groups, classes, attributes, traits or any other matters which would
require interpretation by the reSpondent to be precise or to place him
relative to other people." (16, p. 69) It seems to me that the former
category might be characterized as statements one could expect to be
made by the "Me," or internalized other phase of the self, while the
latter category might be characterized as statements one could expect
to be made by the "I," in disagreeing retort to the "Me," or in Simple
interjection.

Indeed the finding that ”respondents tended to exhaust all of the
consensual references they would make before they made any subconsensual
ones," (16, p. 70) points to an order in which the ”Me" reSponds to the
one question with consensual responses, followed by a rejoinder from
the "I" with subconsensual statements. Perhaps Waisanen also viewed
subconsensual reSponses as related to the "I," for having accounted for
some sub-consensual reSponses as evaluative consensual (reSponses of a
consensual nature, but which included modifiers) he termed all remaining
sub-consensual responses ”idiosyncratic."* Dictionary definitions of
idiosyncracy characterize it as a queer, peculiar, unusual or individual
way of acting or expressing oneself (34, p. 310) (2, p. 599), while
Mead characterizes the "I" as saying "something that was novel to
himself." (25, p. 197)

Now it is apparent that the data elicited by the TST is data

from the internal conversation which Mead refers to, rather than data

 

§This category system is attributed to Waisanen by Dyer (10, p. 39)

from the self. Furthermore, because the subsequent analysis of data
obtained using the TST has largely neglected the sub—consensual responses,
the "I" portion of the data has lain dormant. In essence then, Kuhn

has, with good intention, created an instrument which, effectively, treats
the self only as a reflected image of Significant or generalized others.

It is indeed very likely, that what Allport saw as a self
inadmissable to scientific study was as much a matter of convenience as
anything. I see no useful way of getting at data which is internalized
telow the level of language. However, more rigorous treatment of the
TST'S subconsensual data may bring greater insight into the nature of the
"I" phase of the self.

As we shall see shortly, there are a great many instruments
designed as SIP'S. Most of them have been used to measure the self in
the manner of the psychologically oriented social psychologist, as a
concept deduced from eXperimental data, rather than a self in the manner
of the Sociologically oriented social psychologist, as a concept induced
from Meadian theory. (1Q, p. 629)

The very fact that so many self identification instruments are
extant, documents a great need for an adequate SIP. The TST with its
direct approach to the internal conversation of gestures comes closer
than perhaps any other current instrument to allowing operationism of
Meadian theory. Furthermore, the direct approach to data overcomes the
bias of inferential research which attempts to deduce the nature of
the self from non-verbal behavior or from tangential verbal behavior.
(16, p. 68) Kuhn also strove to minimize the effect of directive or

structural bias. (12, p. 283) There is evidence that even when the

TST is administered in a structured context respondents are not led
to make reSponses reflecting that context. (10, p. 22)

However, the TST entails other biases; (l) a self-selection bias
on the part of those reSpondents who choose to (a) respond volubly or

(b) reSpond minimally, and (2) a post factum structural bias, dependent

 

upon what form of content—analytic code the data is forced into. The
former bias produces non-equivalent numbers of reSponses between
protocols, making any rigorous statistical treatment impossible. However,
if the TST incorporated any manner of intensity measurement, which it

does not, this lack of equivalence would be lessserious. The TST is

an SIP designed to overcome experimental bias, and as such it has

sacrificed a great deal of experimental utility.

Review of types of self-identification instruments

 

Wylie (35) has reviewed self-identification instruments at great
length, including measures of both phenomenological and non-phenomenological
self. The latter type of measurement includes the TAT and Rorschach.

Here we are mainly concerned with reviewing these instruments which
measure phenomenological self, however.

Wylie divides these instruments into three general types,

(1) Q-Sorts, (2) rating scales, questionnaires, adjective check lists,
and (3) coding plans for interview materials. Although she does not
honor the TST with inclusion in her list of measurements, it could.be
placed legically in the third class. This omission is undoubtedly due

to Wylie's psychological rather than sociological orientation.

Her list (35, pp. 61-6u) includes 23 different Q-sorts, of which
she deals most extensively with the Q-sorts of (a) Butler and Haigh and
(b) Hilden. Wylie, writing in 1961 found that the second major
category of instrument accounted for the most use. However, it was this
author's experience, in surveying dissertations (l954-present) which
dealt with "self" in a sociological orientation to social psychology,
that Q-sort technique was used in a majority of cases.

Wylie lists (35, pp. 87-97) 83 instruments of the type gathered
together under the heading "rating scales, questionnaires, and adjective
check lists." For only one-third of these is reliability information
available and it is usually of the Split-half coefficient type.

In addition, Wylie lists seven coding plans for interview materials.
(35, p. 107-110) Although the TST is essentially a coding plan type,
it differs from all those listed by Wylie in that it is used to code
data from a pen and pencil or oral interview which could be administered
in either an experimental or field setting. The seven code plans listed
by Wylie are for coding material sampled from a pepulation of clinical
clients. Reliability between judges is reported in six of the seven
code plans.

Raimy (28) reported, "three out of four judges coded 356 client
responses with percentages of agreement ranging from 50.5 percent for
'ambivalent' to 81 percent for informational questions. Bugental (36)
reported five raters rated five protocols with reliabilities for "unit
determination, 87.u percent; for categorization, 59.1 percent; and for

evaluation, 75.0 percent."

Vargas (33) reported a 96 percent agreement between two judges on
which statements constituted self-description, in general. For Stock (37)
two judges categorized each statement from three interviews. The per-
centages of exact agreement for each interview ranged from 68.5 percent to
82.2 percent.

Scheerer's (29) scheme used a 5 point scale for each variable.
"To explore reliability, three judges rated all units in six interviews.
At least two out of the three judges agreed 93.8 percent of the time. On
only four out of 178 Self units did any one judge's rating deviate by two
scale points from both other judges ratings." Raskin (38) using a
similar technique with a four-point scale reports an "interjudge reliability
on 59 items" of .91.

For Lipkin (18), "interjudge agreements were obtained ranging between
81.9 percent and 100 percent" using a "complicated reliability formula."

It is interesting to note that Wylie includes Bugenthal and
Zelen's W-A-Y Test or "Who are you" as a questionnaire (5), even though
it has great similarity to the "Who am I" of the TST, and employs a

1? category code plan for the coding of three responses.

Historical Development of the Twenty Statements Test

The TST was developed by Kuhn and McPartland and first reported.by
them in 1958. The instructions to reSpondents which they used in their
initial applications were:

There are twenty numbered blanks on the page below.
Please write twenty answers to the simple question,

'Who am I?’ in the blanks. Just give twenty different
answers to this question. Answer as if you were giving
the answers to yourself, not to somebody else. Write

the answers in the order that they occur to you. Don't
worry about legic or 'importance.' Go along fairly fast,
for time is limited." (16, p. 69)

10

These instructions were given in writing to some groups, and orally

to other groups in a college class setting. The time was limited to

twelve minutes. The number of reSponses obtained in this way varied

from 20 to one or two, with a median of 17. Each reSponse either began

with "I am ...” or omitted that phrase and consisted of single words.
The responses were content analyzed into two gross categories,

either consensual or subconsensual. A consensual Statement refers to

 

"groups and classes whose limits and conditions of membership are

matters of common knowledge. Those which refer to "groups classes,
attributes, traits or any other matters which would require interpretation
by the respondent to be precise or to place him relative to other people"

were considered subconsensual.

 

Examples of the consensual variety are 'student,’ 'girl,‘
'husband,‘ ‘Baptist,' 'from Chicago,‘ 'pre-med,‘ 'daughter,‘
'oldest child,’ 'studying engineering,’ that is, statements
referring to consensually defined statuses and classes.
Examples of the subconsensual category are 'happyf

'bored,' 'pretty good student,' 'too heavy,’ 'good wife,’

'interesting'; that is, statements without positional

reference, or with reference to consensual classes obscured by

ambiguous modifiers. (16, pp. 69-70)

An intercoder reliability check between two coders was reported
as resulting in less than three differences per 100 responses. The
major score developed from the reSponses was a locus score, or the number
of consensual statements made before any but erroneous (Guttmann scale
"error") subconsensual statements are made in a TST SIP protocol. This
locus score was developed after submission of the data to Guttmann
scalogram analysis techniques indicated that subjects tended largely to

exhaust all their consensual reSponses before beginning to make any

subconsensual reSponses. Kuhn and McPartland equated the size of this

ll

locus or consensuality with the respondent's amount of anchorage or
self-identification.

It is not clear what utilitarian purpose was served by submitting
the data to Guttmann analysis. A locus score could be obtained simply
by counting the number of consensual items made as responses. Only as
evidence that consensual reSponses in general are more salient is such
a method useful.

The salience score was based upon the assumption that earlier

 

statements were more salient for reSpondents than later statements.
This assumption was made in Spite of the Specific instruction to the
respondent to the contrary. Therefore, this writer assumes that the
TST'S developers had some type of subconscious salience in mind when
they made this assumption. The salience score of any statement is
obtained from its rank position in the protocol, beginning with #20
for the first statement.

This assumption that sequence is equivalent to salience is based
on a statement by Newcomb, who says that salience "refers to a person's
readiness to reSpond h a certain way. The more salient a person's
attitude the more readily will it be eXpressed with a minimum of outer
stimulation." (16, p. 72) They also advance an alternative explanation
that consensual statements are made first because individuals are
habituated to such responses in our culture by remembered external
stimuli, such as applications, questionnaires and census takers. They
refute this alternate eXplanation on the evidence that three and four-
year-olds answer the question, ”Who are you?” with the consensual

statements of name, sex and age, though they have not yet any memory

12

of such habituating stimuli.

The research reported in connection with this introduction of the
TST SIP was a correlational study between religious denomination of the
respondent and amount of anchorage of the reSpondent. Wylie says of
this research, "Their results are not meaningful psychologically for
several reasons. First, as we have said above, there is no clear re-
lationship between religious affiliation and psychologically relevant
variables. In addition, there was no control over reSponse total in
obtaining scores, and there was no attempt to match groups of varying
religious affiliations with respect to variables relevant to the ”Who
am I" responses." (35, p. 182)

Following this original report of the TST, several dissertations
were done under Kuhn's direction at the University of Iowa employing
the TST to gather data. Notable among these is that of McPartland (19)
which was completed a year prior to the Kuhn E McPartland article, and
the master's thesis of Robert Stewart (31).

Other of Kuhn's students who have employed the TST in their
post-doctoral research are Fred Waisanen and Carl Couch, along with
McPartland. Under the influence of the presence of Waisanen, Couch and
Stewart at Michigan State University the TST was employed by Wilbur

Brockover, in a study of self-concept of Junior High (u) students.
The TST was also used by Charles Tucker (32), Clark McPhail (23), and
Delwyn Dyer (10) in their dissertations, and by this author in the

present research.

13

Writing in 1956 Kuhn alluded to the TST as non-specific, non-

suggestive and non-structured. He adds that locus scores increase with

age during the first 20 years, and that there are significant variations

in locus by sex and occupation. He tells Of a different category system

used on the responses of "more than 200 respondents”;

(1) references to statuses in social categories and social
groups; (2) references to ideolOgical beliefs —- moral,
philosophical, ethical; (3) statements of personal aSpir-
ation and achievement; (4) identification in terms of
interests and aversions, including positively and negatively
held social objects; and (5) self evaluations. (12, p. 2H5)

 

Unfortunately, he reports no inter-coder reliability for this coding

plan.

In another source he elaborates on this coding plan:

These five categories (sufficient to order all_the reSponses
made) are the following: (1) social groups and classifications
(such as age, sex, educational level, occupation, marital
status, kin relations, socially defined physical characteristics,
race, national origin, religious membership, political
affiliation, formal and informal group memberships); (2) Ideo-
logical beliefs (including statements of a religious,
philosophical or moral nature); (3) interests (including
statements relating objects to the self with either positive
or negative affect); (u) ambitions (and all anticipated
success themata); (5) self-evaluations (such as evaluations
of mental and physical and other abilities, physique and
appearance, relatedness to others, aspirations, persistence,
industriousness, emotional balance, material resources, past
achievements, habits of neatness, orderliness, and the like,
and more comprehensive self—typing in clinical or quasi-
clinical terms).(15, pp. uo-ui)

In viewing these two explanations of the same code plan by the

same author, there is one glaring contradiction which might lead to

coder error: In the first explanation ”aSpirations" is included in the

type three, aChievement, ambition, aspiration category. In the second

explanation "aSpirations” are included in the examples of type five,

self-identification category. Furthermore, the type three and Type

14

four categories interchange content in the two eXplanations which could
lead to further confusion.

There are other sources of possible confusion which can be stated
in more operational terms. These all pertain to a difference between
one of the last four categories, and the first category. That is, they
are differences between the consensual category and the four new
categories into which Kuhn has Split the subconsensual category.

(1) While physical description is category one, physical evaluation is
category five; (2) while membership is category one, aSpiration to
membership is category three or four, interest or affect toward a
_group is category three or four, and beliefs associated with group
membership (such as religious or political belief) is category two.
McPartland, meanwhile, over the years 1958 to 1961, set out
another method by which the TST might be coded, while suggesting still

other methods. In his Manual for the Twenty Statements Problem, (22,

 

p. 2) he suggests category systems based on "literal content, referen-
tial frame or logical form". Kuhn's category system discussed above,
as well as the code plan developed by McPartland are of the referential
frame type. Drawing upon three separate explanations of the McPartland
category System Table I was constructed to aid the reader in grasping
all the distinctions drawn by MoPartland.

McPartland reports a time limit of 15 minutes on his reSpondents,
all of whom were patients in one or another facility for treatment of
mental illness. His instructions were printed for reading and read
aloud for listening for each eXperimental group, as follows: "There

are 20 numbered Spaces on this sheet. Just write 20 different things

15

CH now>mnmm onmz who coaummocooumaom: .comumnnmw .m mcoamz pom nmcHEEDU .m crow

 

.saaaaaa .aa .Hema .mase .snboeohoom .mampnmmoz canvaseosmm oze

. . m mmeoﬁ .eqmﬂnmmoza

 

.Apomapnmmoz on ow csocx nocpsmv .mmma .znmscmw .ommﬂ>mm .EoHQOpm

 

mpcoEmwmumtmpnoze one now Hmoemz .cosmowom mo wcoEmeomQ .coﬂpmocsom guano: Hopes: huwo mowemx geomonw ones
.mm-:a .aa .mmma .ms .sa .Ho> .eoﬂbmaﬂammgo

cmeom :.£pamw: Hmvcmz pom ﬁmwmao Hmwoom .cowpaoocooumaow: .wwcweeoo .m crow pom .m mmeonh .ocmaphmmoze

 

 

mmwsommpmo

Hmeamhpo

wamomcomcoooom wamomoomoooosm wamowcomcoo wamomcomcoo EMMmeHMMW
Z .-

wpommoo Hmﬂoom < chﬂpmoﬂHoEH

epommhsm Hmﬂoom m

wuoomosm Hmﬂoom m

myoomoo.HMﬂoom m

 

wopoppmo

wcnoppmm

=mnmewo= mo

 

 

 

whospo Hmcowuouﬂpmcw so me poo ameOMpoqumcﬂ cm cw
ampcmocoomcmne .rouo pmnwamsocow < nmcuo powwamnmcmw < wmcoz soaprﬂHQEH
:onmcs no cogs: poo poo
. :sog: mo coHuowpwnm woowuowomnm mvﬁEpom
meow>ma muwenmm .oahum poopmww .now>maoo mo soap noﬁ>mnon mo
zoo Hoonm mcowumvooa 1:00 new: poo mcowpm: Iawpommna ow%womem cowpowompm pom
axe cascades oz :uwm oownm> ow omwnm> pom mane: mowamEH % mcoz mcompmowaoEH
woaomoﬂcoeeoo .wcwmmos
wcowpmowcoeeoo mo GOEEoo m wcﬁ>mn ewcoEmnowmmE
mahmwmwnm> poo .oopmowam> .mnmcpo he ooowmnmon: empwwoom he oopmowam> so monooms I a a ho
mHHwomcmmcoo poz poo .maAMﬂwwwm> poo maamowcomooo ow coconmmwn so coﬂpmomam>
wpcmEpwEEoo «cowpomnmpcw
no mocmowmmcoo «onouosnpm amﬁoom wcowuomnmuow smoonnp amwoow chanson ou uumwoom

o>wyomwmwnw oz

cw pawso>ao>=ﬂ omooq

mpowoom ma oomwwom

mponocoo 009 I maoz

o» oowpmHmm

 

emcowpmopwm

«.cowpmenom

 

woosuﬂwpm emaome: o» mousnvnppm Hmﬁoom omnsposnpm new mo smash memo

no .uom .pxmucoo Homecommwn may gown: >Hywowaexw cw coﬂuwoﬁmwpcmow: pom Pompcoo
amasowpnma oz now>mzon mo woahpm pco5m>ao>aw mmHHQEH mopsnﬂnuum Hoowmmcm
«masbm empoapumcoo

«m>wwcmcmnmeoo mommacowpmouwm «Hmcowpwmom «pom muonocoo omamamq

 

swam mooo m.oamayhmmoz

H magma.

pom weapowmpmo cooxvom mcowwocwuwwm

16

about yourself in the spaces. Don't worry about how important they are or
the order you put them in. Just write the first 20 answers you think of to the
question: 'WHO AM I?'"

McPartland reports that for a sample of 60 reSpondents, three in-
dependent coders agreed on more than 97 percent of reSponses. He also

developed another type of score for the TST, the modal reSponse. Bach

 

respondent's protocol was characterized as type A, B, C, or D, dependent
upon which category of response was modal for the reSponses of the
protocol. Of the mode he says, "about nine patients in ten write responses
which show a clear mode in some one category, that is, they made at least
one more statement in some one category than in any other. The remaining
10 percent either tied in two categories or gave so few responses that the
mode was obviously unreliable. These few respondents, nevertheless, were
categorized by mode when there was one, or in the more ”distinctive
category involved in a tie; in 'A' or 'D' if one of these was tied with'B'
or 'C' and in 'D' if that category was tied with 'A!‘ The rationale is
that more unusual reSponses should receive greater weight in analysis."
Further, when a respondent made no responses he was classified as Type A
on the assumption that he was reSponding in a "constricted manner."

(22, p. 9) The three coders reached 100 percent agreement on the modal
type of 60 respondents. (20, p. 118)

Couch seems to be the first investigator to use a category system
stressing the ”literal content” referred to by McFartland. Using the
same instructions as the original Kuhn a McPartland study, with an eight
minute time limit, he content analyzed for "references to a role in, or

an attachment to, a major institution of our society...responses that had

17

a reference to religion, the family or education." (8, pp. u92—u93)

In a later study his code plan "noted whether or not reSpondents had
identified themselves on the TST as male or female, boy or girl, or man
or woman." (7, p. 118)

The code plan under consideration in this study (see Appendix B)
is perhaps the most elaborate in terms of "literal content" analysis,
separating Kuhn's original ”consensual" category into 23 separate content
areas (see Table II). Each of these categories was required as a relevant
variable in at least one of the research interests bound together by
cooperative data collection. Because "wastebasket" categories are
included, these categories comprise an exhaustive code plan for the
consensual responses.

In a suggestion for a ccde plan based on ”logical structure,"
McPartland lists categories of reSponses (l) in the form of the verb "to
be", (2) in the form of the verb ”to have", (3) in the form of action
verbs, and (u) in a form in which the ”self drOps entirely out of
explicit attention." (22, pp. 9-10) Perhaps as a follow-up to this
suggestion, Couch has studied the classes of verb forms used in reSponse
to a modified form of the TST. (9)

Probably the most useful innovation in the categories for TST codes
was one dealing with logical or formal structure, made by Waisanen.

His three major categories are (l) consensual, (2) evaluative-consensual,
and (3) idiosyncratic or non-consensual. Essentially he has divided
Kuhn's ”sub—consensual” category into evaluative-consensual (positional
references which are, in some way, modified) and idiosyncratic (all

remaining sub-consensual references). Then he provides for 5 sub-

18

 

 

 

 

 

 

 

 

 

 

 

 

38w

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

mHm Ems 0:» Rom mamam mwoo mo PamEQOHm>oQ cam AH3OQU

HH «dawn.

 

 

Hmnwsmmnooncoz lawnmoH w.
owvmnocmooon no owpmnochmOﬂoH imam
Hmsowmmm .m - Hmsowmwm .m, macaw
Hmzowmmn .m _m>wpowompm nhm>¢ w
owuwosoo .w .coz u m>wm Hmom mymopop doom
oweocooouowoom .o vuconmnmsoo noomsoo unH Icemnoo
cowumuwcmwno Hmowpwaom .o . Insm loam
comeNwamwno mﬂowwwﬁwn . . .
8m.» .833 .m u (I Flags
. amuaaowmpmo 5 Hmowmmmwmd .i. . -Boaﬁa
b.3333.“ .m _ :83
:Oﬁpmoooo .0 ” nonaam<.
cowumuwcmwno Hmsnom .o .
xnoz .0 Warm ovum
cowpmasooo .n nmowmeHHm mmmaamm .b
hwwcsssoo .m
Emwcooom .m Emimcoomm Hm-
Hmzowmmn .v
mucownm .o
Hmuﬂnma .A
awsmcwx .m sown
mumswnm .N 1 smmswnm .w Imosvm .:
spammn .w macaw
wow .m sawemm .m umoamw
Towpmwnopomnmno Hwowmmnm .o Tmmmao we doom
mmmnoom .o Hchﬁpmmom seam mesonw anomcoo
88: .a ....Jmm .a Hmwoom
mmm .m
Hmcomnwm .H a a m wow .a
.llllllllwmmmmmmmmo (Huqmauuqmw. epownuwnoo unmuaoo
o>wpmsam>m § Hmswammsou gaumsamxrm.‘ Hormoomaoo a 38280 Hmsmnomcou mownowwumo
4 moswm ucommnm - c ocmaphmmuz
mo 52 $8 353m: 332302 ﬁaoo 53 _ 28 use aoﬁae

l9

categories to analyze the ”literal content" of both consensual and
evaluative-consensual categories. The utility of this innovation lies in
its ability to extract content data from a great number of TST reSponses
which previously were classified sub-consensual and, therefore, paid
little heed. His five content categories are; (1) Personal (physical
characteristics), (2) Primary (continuing intimate relationships),
(3) Secondary (associational, segmentalized, Specialized interaction),
(H) Categorical ("man," "citizen,” ”white,” "American," are examples),
and (5) Residual (all responses that do not fit the first a content
categories). This is essentially the basic category system studied
here, with the addition of the 23 sub-Sub-categories already referred
to in the form of Specific roles or physical attributes.*

The present study also involves an innovation in the administration
of the TST. The instructions were read to respondents to

Ask this question of yourself, 'Who am I?‘ Think of as

many answers as you can in answer to the question, 'Who

am I?’ In a moment I would like you to give me the answers

as if you were giving them to yourself, not to me or anyone

else. Take a little time to think about it. (pause) Now,

please make what you consider to be the most important

statement about yourself first. (pause) Now, make what you

consider to be the next most important statement about

yourself. (pause) Now, make what you consider the next

most important statement about yourself. (pause) Are

there any other statements you could make about yourself in
answer to the question, ’Who am I?’ (see Appendix A)

 

 

The interview schedule contained only 10 Spaces for reSponses and
interviewers were instructed to write down only as many as ten reSponses

and to Spend no more than 3 minutes on the SIP. (13, p. 7—8) We see,

*Attributed to Waisanen by Dyer, (10, pp. 39-40).

20

by comparison with prior uses of the TST that, (l) fewer responses were
sought, (2) less time was allowed, and (3) reSpondents were directed into
a format of most important statements first. These innovations were the
result of a pre-test of the interview schedule.

The main aim of this pre-test was to test the schedule on persons
who were functionally illiterate, to find what impairments to communication
this would present to an interviewer. The pre-test was conducted with
persons living in a rural, Michigan, migrant farm-workers enclave, and
with persons living in an urban, Michigan, Negro and Puerto Rican slum
_ ghetto. Such a pre-test was as essential for the TST as fer any other
phase of the schedule, for until now the TST had been employed only on
extremely homogeneous groups, and often on middle-class college-age
persons. Now we were pre-testing for its utility with an extremely
heterogeneous sample, by testing the opposite extreme type of homogeneity.

We found reSpondents on the pre-test made many fewer statements,
many non-directive probes were used, and a great deal of time was Spent
on administration of the TST in relation to other items, although there
was only one hour available for the administration of the entire
schedule. For the sake of expedience, then, the TST was administered
With built-in probes which provided more structure than previously, and
in a foreshortened format. Brookover had previously used a "Ten

Statements Test”, (4, p. 28) and Dyer had questioned the need for obtaining

as many as twenty statements. (10, p. ”1)

21

A further innovation in administration of the TST was developed
and tested by McPhail. After obtaining TST protocols from reSpondents
he presented them with three additional problems:

(1) rank these statements in the order of their importance
to you...

(2) assign a plus (+), minus (-) or zero (0) to each

statement, depending on whether you feel positive,

negative, or neutral toward that statement... and

(3) attempt to estimate (in units of five percentage points)

what percentage of those persons who are in a position to

know, would agree with you on each separate statement you

have made. (24)

This was an attempt to (I) test the correlation between chronological
order of reSponse and salience for the respondent, (2) test for any
correlation between negative fonn of reSponse with negative feeling about
the reSponse. For example, had a response "I am not good-looking" been
given, a coder might assume this to be a negative self-evaluation, but
unless we know whether his looks makes the respondent sad or glad, we
cannot tell if it is a negative or positive evaluation. Finally, (3)
this was an attempt to tap the respondent's perception of consensuality
or sub—consensuality for each Statement he had made. Conceivably a
respondent might estimate that 60% of the persons in a position to know
would agree with his statement "I am stubborn”, thus changing it from an
Iidiosyncratic reSponse to a consensual reSponse. Again, a forerunner to
the McPhail approach to salience was used by Brookover, when he asked
respondents to circle the most important (no specified number) of their
Statements. (4, p. 28) Unfortunately, no report was made of the associa-

tion between this self-coding and the ordering of the statements.

Looking toward the future for the TST SIP, Dyer has suggested that

22

responses to it be submitted to factor analysis, rather than content
analysis or scaling techniques.(10, p. Ml) Mielke has suggested a more
Specific plan for factor analysis of TST reSponses.

A factor analytic method could also be used to devise a value
adherence instrument, using a two Stage pretest. The first
stage would be eXploratory, going to the field with o en-end
questions...Everytime a respondent would mention...something
that sounded like a general value, this would be recorded

for the second pretest stage. The entire pool of general
talues thus recorded would then be administered to a second
pretest sample for "agreement" or "applicability to me” ratings.
These ratings would then be submitted to factor analysis. Here
the factors would represent a parsimonious set of independent
value clusters. By discerning an underlying commonality of
content, each value "cluster" could be called a single general
value. The items most highly loaded on each factor would
become the basis for the value adherence instrument in the

main study. (27)

In summary, Table II represents a history of the growth and

development of code plans for the TST SIP.

Review of Previous Investigations Relevant to this Study:

 

Berelson, (8) in his article on "Content Analysis” in the Handbook

of Social Psychology presents a summary of content analysis reliability

 

studies. He finds that reliability is reported in 15-20 percent of the
content analytic studies, and that "the reports on reliability which do
appear are uniformly high". In a population of ”some 30 studies and
experiments” he found "a range of correlation coefficients between about
.78 and .99 with a concentration at about .90 and a range of percentage
agreements between 66 percent and 96 percent with a concentration over

90 percent." He cautions that "many attempts at content analysis may

have been abandoned because of low reliability in the preliminary results...

published content analysis studies which do not report on reliability

23

may be presumed to have had less satisfactory results...most of the
reported reliability results apply to relatively simple versions of
content analysis.” (3, p. 51H)

He Summarizes several reliability experiments as fellows:
”Reliability is higher under these conditions: the simpler the categories
and the unit, the more eXperienced and better trained the coders, the
more precise and complete the set of coding rules, the fuller the
illustrations." (3, pp. Sin-515)

Schutz (30) compared a code system using "one psychological
operation" on the part of the coder, with a code system using a step-
by-step series of deciébns related to a complex category system and
found the latter system produced a "little less” reliability.

Summary

In this chapter we have attempted to (1) present a. general
statement of the aims of this study; (2) Show the need for a minimally
biased SIP in tests of Meadian theory; (3) point to a number of existing
SIP's; (H) outline the historical deveIOpment of the Twenty Statements
SIP regarding (a) code plans, (b) derived scores, (c) reliability tests,
(d) administration procedures, and (6) future deveIOpment; and (5)
review the findings of other checks on the reliability of content analysis.

The remaining chapters deal with (l) the procedures employed in
this study, including the hypotheses being tested, (2) the results

obtained by testing these hypotheses, and (3) the conclusions arrived at,

 

based on the results of the hypothesis testing procedure.

CHAPTER II

PROCEDURE

The Experimental Context of this Study_

The present reliability test was conducted in conjunction with a
cooperative research project which collected data in five nations:
Mexico, Costa Rica, Japan, Finland and the U.S.A. This very extensive
study was under the direction of Hideya Kumata, Charles Loomis and
Frederick Waisanen in cooperation with Yrjo Littunen and Robert Stewart.

The United States sample was constructed and drawn by the Gallup
Organization, which also supervised and conducted all U. 8. interviews.
It is "a (modified) area probability sample of the United States general
public (outside of institutions), age 21 or older." Size of the sample
was 1,528. The interviews were conducted September 2 to October 6, 1963.
(39, p. l) The data collection instrument was an approximately hour-
long scheduled interview.

Using the 1,528 TST protocols thus obtained, a random sample of 150,
or approximately 10 percent of the population of protocols was drawn
(see Appendix C). This sample was the data assigned to each coder for
coding. The data sampled contained 858 responses. This amounts to an
average number of responses per protocol of 5.72.

It is assumed, by sampling randomly from a sample designed to
represent the U. 8. adult population, age 21 and over, that the range of

coding difficulty found in these protocols will be closely representative

2“

25

of the range of coding difficulty which would be found in protocols

generated by the 0.8. adult population, 21 and over.

The Coders and Data Collection Procedure

 

A coder employed by the Gallup Organization had coded the 150
protocols prior to the eXperiment. Future references shall refer to

the Gallup coder. In addition, five non-Gallup coders Operating

 

independently, coded each of the 150 TST protocols. In Chapter IV I
will deal with the possibility that these five coders were not truly
independent. However to safeguard the independence of the five coding
Operations, each coder did the physical coding of protocols in a separate
booklet of protocols. To further safeguard independence, the protocols
in each booklet were ordered in a random fashion and differently from
each of the four other booklets. We Should note that this procedure may
facilitate the contribution of varying levels of coder fatigue to
differences in coding, thus biasing the results in the direction of
lower reliability.

The booklets were divided into two sections of 75 protocols each.
One and a half hours coding time was estimated for each section. Coders

were instructed to ”code each section on a separate day, and code all

 

sections within a two week period. Please code each section during one

 

 

continuous work period. Try to do this work at a time of day when you
are mentally alert and relatively free from external distraction." Un-
fortunately not all coders were able to follow these instructions. While
all coding was accomplished within a two week period, some coders found
it necessary to use more than two separate days and Shorter than one and

a half hour work sessions. A change in the procedure of this sort,

26

however, would lead to eXpectations of lowered fatigue for the task.

On each page of the booklet in typewritten, dittoed form were the
responses of a TST protocol. Only one protocol appeared on each page,
and no protocol appeared divided between two pages, regardless of the
number of reSponseS it contained. The number of responses per protocol
ranged from one to ten. A reSpondent number in four digits appeared in
association with each protocol. We should note that the Gallup coder
performed the coding task on data handwritten by interviewers under
field conditions, and that this was the original data from which the ex-
perimental booklets were transcribed.

The code categories used were in the form of two digit numbers
representing punches in two columns of an IBM data card. Each coder
was instructed to: ”Code in red pencil....Indicate by red Slash marks
within the statement the division of a reSpondent's statement into more
than one statement for coding purposes. Write your codes in the left
margin in the order the statements appear on each page."

At the beginning of each experimental booklet the following in—
formation was given the coders regarding the process of transcribing
the protocols from the original handwriting:

Following this page you will find a reproduction of the

questionnaire section which asked and recorded the TST.

(see Appendix A) Reference to this format may be helpful

in coding some of the responses. In general, each statement

which was written next to a number l—lO has been reproduced

as a separate response for you to code, by: (a) Triple Spacing

it from the other statements, (b) Capitalizing the first letter,

if it were not capitalized by the interviewer (except in some

cases where capitalization could affect meaning), and (c)

Ending the statement with a period (.). Since in some cases,

it appeared that though a reSponse was written on several

lines, it was really one statement, we have shown this by

ending the line with a dash (--) and beginning the next line

in lower case. Every effort was made to reproduce the state-

ment as recorded by the interviewers. Therefore, grammar and
Spelling errors have been reproduced.

27

It is assumed that the Gallup coder had become highly aware of
the 15 page coding instructions and code for the TST developed by
Stewart and others. (see Appendix B) Four of the five non-Gallup
coders had helped develop the coding instructions and code book, and
were therefore very familiar with the code book. All five coders were
instructed to "Read through the instructions in the code book for coding
the TST befOre you begin this work. Follow the instructions in the code
book during the coding and refer to the code book as frequently as
necessary." All five coders were at the graduate or post-graduate level
in the social sciences.

(See Appendix A for the format of the TST in the interview schedule,
Appendix B for the TST coding instructions and code book, and Appendix C
for the 150 TST protocols.)

The Hypotheses and Scoring Procedures
A basic hypothesis of this study is that there will be a high level

of reliability among the six coders on the coding of the TST protocols.

 

I will define "high level of reliability among six coders” as a high
percentage of protocols with a high mean reliability score. The mean
reliability socre of a protocol is an average across the reliability

scores of all reSponsein the protocol. The reliability score of a reSponse
is based upon a scale of agreement—consistency among the six coders as to
how they coded the reSponse. The scale of agreement-consistency is based
upon two values (1) Agreement, or the number of paired agreements obtained,
considering that a total of 15 paired agreements are possible among six
coders, and (2) Consistency, or the Size of the modal category assigned

to the item by six coders, which mode may range from one to Six. The scores thus

28

obtained from this scale range from 0-10. Each score represents a
discrete structural pattern of category combinations, and,in each case,
knowing nothing but the score we can determine the number of paired
agreements and size of mode which it represents, but not the code

category. Table III will make this relationship more apparent.

Table III

Reliability Score- Based on a Scale of Agreement-Consistency

 

Combination of Categories Number of Size of

Score (a structural pattern) Agreements Mode
10 AAAAAA 15 6
9 AAAAAB 10 5
8 AAAABB 7 4
7 AAAABC 6 H
6 AAABBB 6 3
5 AAABBC H 3
H AAABCD 3 3
3 AABBCC 3 2
2 AABBCD 2 2
l AABCDE l 2
0 ABCDEF 0 l

 

To understand the formal patterns one must realize that the letters
A,B,C,D,E,F represent the order in which the analyzer notes different
code category assignments by different coders. There is no relationship
between these letter notations and the different numbers assigned to
code categories in the code book. For example, if the analyzer notes
that all the coders coded an item with the same code category, that item
would be assigned an AAAAAA structure, whether it was coded a 2l—category
or a 53-category by all six coders. At the other end of the scale, if

the analyzer notes that each coder assigned a different code category

29

to the same item, that item would be assigned an ABCDEF structure,
whether it was coded as 72, 56, 12, 58, 60, 61, or any other combination
of six different categories. Thus, the difference between a score of
seven and eight on this scale is not found-in the mode, for in each
situation there are four A's. The difference between the reliability
scores of seven and eight is found in the number of paired agreements,
for an eight score represents one additional paired agreement between

the two coders who assigned non-modal categories, i.e., "BB” represents

a paired agreement which is missing in "BC”.

For purposes of operationalizing this hypothesis a high reliability
score was defined arbitrarily as any score of 8 or over, thus representing
roughly the top quarter of the scale. "A high percentage of protocols
with a high mean reliability score" was defined as 75 percent or more
of all protocols have a mean reliability score of 8 or better. The per-
centage figure was chosen as an amalgam of (1) previously attained
percentages of inter-coder reliability for the TST, and (2) indications
from related research in content analysis that factors Specific to this
Study indicate a lower percentage of inter—coder reliability can be
expected.

The history of TST reliability checks has been discussed in
Chapter I. We should recall that in their original study, Kuhn and
McPartland reported 97 percent reliability between only two coders,
coding material into only two categories. McPartland reports 97 percent
P61iability between only three coders, coding into only four categories.
Berelson reports percentages of reliability for content analyses range

between 66 percent and 96 percent with a concentration over 90 percent,

30

but he adds that "the reported reliability results apply to relatively
simple versions of content analysis."

Berelson concludes that the simpler the categories the higher the
reliability, and the reliability eXperiment reported by Schutz indicated
that a Simple decision by the coder would produce higher reliability
than a step-by-step series of decisions by the coder. Reference to
Table II and Appendix B would indicate that there are 45 discrete code
categories used in the present study, requiring the coder to make
decisions step-by-step as to (1) whether a reSponse is a single or
multiple Statement, (2) whether a reSponse is or is not modified in some
way, (3) whether reSponse contains material which may be classified by
content (consensual) or does not contain material which may be classified
by content (idiosyncratic) and (u) what content category (from among 23)
Should be assigned to this reSponse.

Furthermore, although the difference between the use of three coders
in McPartland's reliability check and the six coders in the present
reliability check may seem inconsequential, a deeper consideration is
that with three coders there can be only three paired agreements but
with six coders there are 15 paired agreements. Adding only three coders
to the reliability test multiplies the chances for disagreement five times,
and we would expect such a procedure to lower the obtained reliability.

Thus three main factors differentiate this study from other
reliability checks of TST coding; (1) many more categories, (2) many
more decisions to be made by the coder, (3) many more paired agreements
among coders. Add the final consideration that in the present study

the data being coded, obtained, as it was, for the first time from a

31

heterOgeneous sample rather than a homogeneous sample,ndght be eXpected
to extend the range of expression thus making coding more difficult
and therefore less reliable. For these reasons the expected percentage
of reliability was lowered to 75 percent from the 97 percent obtained by
McPartland and Kuhn E. McPartland.

Now our original basic hypothesis, that there will be a high level
of reliability among the six coders on the coding of the TST protocols

has been operationalized. Its meaning has become, we expect to obtain

 

a situation in which 75 percent or more of all protocols have a mean

 

reliability score of eight or better, on the coding of l50 TST protocols

 

byysix coders. An item reliability of eight or better means the item will

 

 

have at least seven of 15 pairs in agreement and a modal category no

 

smaller than four of six codes. The mean reliability score of a protocol

 

is an arithmetic average to the nearest tenth, of the scores obtained

 

on all the items in the protocol.

 

A Somewhat similar basic hypothesis is that there will be a high

 

level of reliability among the six coders on the coding of the TST

 

responses. In this case, "high level of reliability” will be

operationalized as (l) 75 percent of all responses will be scored as eight

 

or better in reliability and (2) the average reliability of all responses

 

 

will be a score of eight or better.

 

The following sub-hypotheses will be operationalized in parallel
fashion to the basic hypothesis:

Hl When the difference between consensual reSponseS and
evaluative-consensual reSponses is overlooked, reliability
is expected to appear as greater than underrealistic
conditions.

H2 When differences between coders in the decision of
whether to divide a statement into more than one
Vstatement for coding purposes are adjusted so that
only the immediate statement is affected rather than
all subsequent statements in the protocol, reliability
is expected to appear as greater than under realistic
conditions.
It can be seen from the discussion of step-by-step decisions by
the coders that each of these hypotheses reflects an attempt to isolate
one of the first two decision steps. The effect of the decision "whether
a reSponse is or is not modified in some way," is removed from consideration
which allows us to hypothesize that this will make the coding appear more
reliable. The accidental effect (not the main effect) of the decision
"whether a reSponse is a single or multiple statement" is removed from
consideration, allowing us to hypothesize that this will make the coding

appear more reliable. Note that neither of these hypotheses expect a

change in reliability. Rather they expect a change in the appearance of

 

reliability under a hypothesized condition which is not a condition of
reality.

The actual data record on data cards or data computer tape, the
realistic condition, is one which records discrete numbers for two
statements which are alike except that one is modified and the other is
not modified. Thus, ”I am a man” would receive the code punch, 16, but
"I am a fairly honest man,” would receive the code punch 17, denoting
What would have been a "16" category, except for the modification,
”fairly honest". For our purposes "modified” is used in both the
strict, grammatical sense, as well as in the less strict sense of
overall meaning. Note also, that the terms "evaluated" and "qualified"

Will be used as if they were equivalent to the term ”modified”, since no

33

practical distinctions between them are necessary in this study.

The actual data record, the realistic condition, will also allow
accidental effect to accumulate for all items following an item which
was or was not Split into two statements in an unreliable manner. This
accumulation relates to the fact that each statement's category code
number is punched in a set of two discrete data card columns.

Thus if the first statement of seven statements is Split into two

statements in an unreliable manner,each of the following six statements

 

will be affected accidentally in the direction of unreliability by
necessarily placing what may be reliable code category decisions in an
unreliable set of columns. Under the condition of reliability the
second statement's code category number would have been in the second
set of two columns, and instead it is in the third set of two columns;
the seventh statement's code category number would have been in the
seventh set of two columns, and instead it is in the eighth set of two
columns. Thus the unreliability of a Split statement decision is com-
pounded accidentally, and eSpecially so where it occurs near the
beginning of a protocol, and the number of statements in the protocol is
relatively large.

Three factors are apparent in the coding situation, which might
be expected to affect reliability: (1) The data being coded, (2) The
code categories being used, and (3) The coders who do the coding. Each
of the following sub-hypotheses relates to one of these factors:

H3 The more statements made on a TST protocol the lower
will be the mean reliability score for that protocol.

Hu As content categories increase in complexity response
reliability will lessen.

34
H5 There will be no relationship in the pattern of coder
pagreements which contribute to unreliabiﬁty.

HSa There will be no relationship in the pattern of paired
agreements which contribute;t>unreliability.

HSb There will be no relationship in the pattern of triple
agreements which contributssto unreliability.

Hypothesis three has three bases. (1) As mentioned above, a
greater number of protocol statements is expected to lower reliability
in the Specific cases where an early protocol statement is Split un-
reliably. (2) A greater number of statements in a protocol increases
the appearance of the protocols' complexity. (3) A greater number of
statements in a protocol increases the number of step-by-step decisions
within the protocol.

Hypothesis four refers to the 23 specific content categories.
"Complexity of reference ” refers to a four place scale of complexity,
in which personal statements are defined as less complex than primary
statements, primary statements are defined as less complex than secondary
statements, secondary statements are defined as less complex than
categorical statements, and idiosyncratic statements (because this
category has a residual dimension) are defined as equal in complexity
with the personal statement, at the low end of the complexity scale.

Only those statements to which four, five or six coders assigned
categories of the same complexity level can be analyzed within this
hypothesis. This does not mean that at least four coders will have
assigned exactly the same numerical category code to these statements.

For example four of six code categories assigned to a statement are #0,
ul, #2, #3. Such a statement is at the secondary level of complexity,

since all four of these categories are secondary consensual statements.

35

If fewer than four coders coded the item at the same level of com-
plexity, it will not be clear what level of complexity we are dealing
with. Therefore, such items will not be analyzed under this hypothesis.

Hypothesis five is the expectation when all coders are independent.
This experiment assumes coder independence. Therefore, this is a
relevant hypothesis. To operationalize this hypothesis a Chi Square
procedure will be utilized. The obtained value in each cell is the number
of unreliable agreements between two coders, while the expected value in
each cell is the number of agreements expected between two coders if
unreliable agreement is equalized across the five other coders. We will
assume the hypothesis is supported if the chi—Square value is not Sig-
nificant. The same analysis will operationalize Hypothesis five-a and
Hypothesis five—b.

Physical Analysis Procedure

 

The first step in analysis was to reorder the pages in each of the
five experimental booklets from their random and different orders to
the same order.

Next the raw data of six codes for each item was associated with
the protocol number and item number in six two-digit columns of type—
writer notation. Typewriter notation proved most efficient because the
consistency of form in typewritten characters led to easier identification
of differences in categories. The visual discrimination was more time-
consuming with hand-written character notation. The column of two digit
codes for the Gallup coder was transcribed from a print-out Sheet of the
Control Data 3600 Computer at Michigan State University. The column of

two-digit codes for each eXperimental coder was transcribed from red

36

pencil notations in the left hand margin of each of the five ex-
perimental booklets. Examples using the typewriter notation method
may be found in Appendix C, Section two, three and four, as well as on
pages “1 and 43 of this chapter.

In association with this raw data for the reliability experiment,
notations were made of (l) the score of each item, (2) the average score
of the protocol, (3) whether qualification~modification contributed to
less than perfect agreement, (u) whether Splitting a statement contributed
to less than perfect agreement, (5) what levels of complexity of reference
were involved in less than perfect agreement, (6) the number of items in
the protocol, and (7) within the raw data columns, all code categories
which were non—modal were encircled, in the manner that scale errors are
encircled in typewriter notation of the sort develOped by Waisanen. The
end product of this procedure was 22 single spaced pages of raw data,
associated with all measures of the variables necessary to operationalize
the hypotheses.

Finally, additional typewriter notations were used to rescore
protocols in which either (or both) qualification and modification
Operated as a factor contributing to unreliability. You may wish to refer

again to the examples cited above.

Data Analysis Procedure

Protocols were rank ordered by mean reliability score. From this
Table IV was constructed, representing the number of protocols receiving
each score (in tenths) the perzent of protocols by scores, and the

cumulative percent of protocols above or at each score. This operation

37

was necessary for testing and illustrating the basic hypothesis and
hypothesis one and two. To further test the general reliability of the
coding procedures under the second basic hypothesis, a determination was
made of (l) the percent of items scoring eight or better, and (2) the
average reliability score figured across all statements. From this data
Table V was constructed.

To test hypothesis one the statements were rescored, deleting
qualification-modification as a source of unreliability and Similar
manipulations of these new scores were made. In this process the following
discrete code categories were lumped together and considered to be
equivalent categories for obtaining the reliability score: (10,11) (1u,15)
(16,17) (20,21) (22,23) (2u,25) (26,27) (30,31) (uo,u1,u2,u3) (uu,u5)
(u6,u7) (ua,s2,53) (50,51) (54,55) (56,57) (58,59) (60,61) (62,63) (70,71)
Please refer to Appendix B for the denotation of these codes.

To make more explicit the procedure followed in rescoring a protocol
for a test of hypothesis one, let us refer to an example from the data;

a protocol which reads as follows:

I'm a human being.

I'm able to work.

I'm able to carry my home.
I am proud to have children.

I am a citizen.

The reliability score for this protocol, under the realistic con-
dition, was 8.6, with five scores of ten, nine, five, nine and ten. Rescored
for testing hypothesis one, it obtained a reliability score of 9.8, with
five scores of ten, ten, nine, ten, ten. To realize how this apparent change
in reliability was obtained, it is necessary to compare the actual code

categories assigned to each reSponse by Six coders with the same data

38

after adjustments have been made to overlook, or remove the effect of;

qualification.

ASSIGNED CATEGORY UNDER REAL CONDITIONS:

ReSponse Coder
Number G A B C D E Score Verbal ReSponse:

60 60 60 6O 6O 60 10 I'm a human being.

L13 us u3® 3 11.3 I'm able to work.
63 63 63 @

I'm able to carry my home.
{ED 21 21 21 21 21 I am proud to have children.
50 50 50 50 50 50

I am a citizen.

U1£‘Q)k)ka

zte
CDCOOkOU'lto

.6 mean score of protocol

ADJUSTED CATEGORY UNDER HYPOTHCSIZED CONDITION:

Response Coder Verbal
Number G A B C D E Score Response;

60 6O 6O 6O 6O 60 10 I'm a human being.
43 43 43 43 43 43 10 I'm able to work.
63 63 63 63 (f2) 63 9 I'm able to carry my home.
21 21 21 21 21 21 10 I am proud to have children.
50 50 50 50 50 50 10 I am a citizen.

35' .
9.8 mean score of protocol

(n-FCOMl-J

Note that only the four encircled "errors” due to qualification
were changed under the hypothesized condition; and that the one encircled
"error" due to some other cause has remained unchanged. Prom inspection
of the three reSponseS involved in rescoring, we can see that the ”errors"
which were changed were evidently due to a differential treatment of the
three qualifying words, ”able", ”able”, and "proud".

To testliypothesis two those protocols affected by splitting were
reordered to delete the accidental affects of disagreements in Splitting,
then the statements were rescored. In this process blank code categories
became meaningful. In the case where a coder had not Split a statement

while others had Split the same statement, that coder was shown as

39

assigning a blank code category to the second row of codes devoted to
the second part of the Split statement.

To clarify this rescoring procedure, we refer to a protocol from
the data, an example which reads as follows:

I'm an American Citizen. 8

I believe in fairness.

I try to deal fair with everybody.

I can get credit anywhere I've been very fair.

I'm just a common-ordinary fellow.

I never had more than a country school education.

The reliability score for this protocol, under the realistic con-
dition, was 6.5, with eight scores of ten, eight, ten, nine, two, three, two
and eight. Rescored for testing hypothesis two, it obtained a reliability
score of 8.8 with eight scores of ten, eight, ten, nine, seven, nine,
seven, and ten. Note that while there are only six verbal responses in the
protocol, the coders have Split some reSponses, thus creating eight reSponse
scores. This fact makes it difficult to determine which scores were
assigned to which verbal reSponse, eSpecially under realistic conditions,
before accidental effects of Splitting are removed or overlooked. Therefore,
in the example below, we must view all six responses as a whole. We may rely only
on the fact that the first category assigned codes the first verbal
response, the last category codes the last verbal response, and all other
categories are in the same order as the verbal responses which they refer

to. The column headed "Response Number" goes unused in this example to

avoid confusion.

ASSIGNED CATEGORY PLACEMENT UNDER REAL CONDITIONS:

ReSponse Coder

Number: 6 A B c D
50 50 50 50 50
50 50 ('2) 50 50
72 72 72 72 72
72 72 72 72
72 GE 61 72 C7;

61 72 47 72 61

u7 é§§3 #7 gig;

'39

E
50

72
72
61
47

Score Verbal Response:

10
8
10
9

_3_
52
6

.5

I'm an American Citizen.

I believe in fairness.

I try to deal fair with everybody.

I can get credit anywhere I've
been very fair.

I'm just a common-ordinary fellow.

I never had more than a country
school education

= mean score of protocol

After accidental effect of Splitting is removed, it is necessary

to assign two lines to each verbal reSponse which was Split by any coder.

In general the categories and scores which are associated with one verbal

reSponse will appear on the same line as the response number and all

following lines, until the next verbal reSponse is begun. Note that some

verbal responses could be Split into more than two codable reSponses. Below

is the example of rescoring which goes with the above example of original

scoring. Response number refers to each verbal reSponse.

ADJUSTED CATEGORY PLACEMENT

Response Coder
Number G A B C D

1 50 so 50 50 50
50 50 (:3 50 50

2 72 72 72 72 72
a 72 72 72 72 61
4 ® a)

72 72 72 (it
5 61 61 ® 61
6 u7 u7 u7 u7 u7

UNDER HYPOTHESIZED CONDITION:

E

50
O
72
72

72
61
47

Score Verbal ReSponse:

10.
70

I'm an American citizen.

I believe in fairness.

I try to deal fair with everybody.

I can get credit anywhere I've
been fair.

I'm just a common-ordinary fellow.

I never had more than a country
school education.

8.8 = mean score of protocol

n+1

Now what was really happening among the coders becomes a bit
clearer. There was a four to two disagreement on the Splitting of
response one and four. Notice that even though removing the accidental
effect of Splitting changed the category content assigned to the second
codable response (second line) it did not change the obtained score of
eight. Under the realistic condition we have tried to circle all "errors”,
while under the hypothesized condition we have circled all "errors”
which remain and may be attributed to some other effect.

Specifically, in verbal reSponse number 1. evidently some coders
Split this into the two reSponses, "I am an American, and I am a citizen."
In verbal response number 4. evidently some coders Split this into the
two reSponses, "I can get credit anywhere, and I've been fair." Finally,
in verbal reSponse number five it appears that only coder A noticed the
word "fellow" and coded this reSponse as a qualified sex reference.

To test hypothesis four it was first necessary to select those
items which could be analyzed under this hypothesis (see discussion of
hypothesis) and then each of these items was assigned a score (1—4) of
"complexity". In the next section the ensuing statistical analysis is
explained.

To test hypothesis five and its sub hypotheses it was necessary to
note all the encircled errors which were in agreement and all the sets
of tripletons in agreement in the even Split situation (Reliability score =
6, structure = AAABBB). Counting all occurrences of 15 sets of agree-

ment pairs resulted in 15 obtained values for the Chi Square cells.

1:2

Statistical Analysis Procedure

 

To test hypothesis three it is necessary to correlate the mean
reliability score of 150 protocols with the number of items in each
protocol. Data analysis has produced a total of three sets of scores
(under three hypothetical conditions). It is necessary to do three
correlations to test this hypothesis, since there was no designation of
which set of scores would be used in this test. A significant negative
correlation will support the hypothesis.

To test hypothesis four it is necessary to correlate the complexity
_soore (1-4) with the reliability score of all statements testable under
the hypothesis. Again three correlations must be done to reflect the
three score sets develOped in data analysis. Again, a significant negative
correlation is needed to support the hypothesis.

To test hypothesis five and its two sub hypotheses it is necessary
to do three Chi Square analyses for each of the three score sets developed.
See page 35 for a description of obtained and eXpected values in these
analyses. The hypothesis is supported if the Chi Square value is not

Significant.

43

. .m.b no m.s mm omnoom momma Ham endow on Haw: m.s
e0 m 00 monoom 00000 map 0H0 sauce 0n Haas 0.0 000000 cameos 0:0 ca .0009

on «5 >kumwooEEH swamp one room an pom omenoooom mpaommn one news swamp no>m com
amnvcwv Cm>m

mpaomon may nonpmwow wcwmsonm >n mpHSmon omen“ ooanmv 0>mn 02

popcoooom momma mo 2 one owmomhocw m>mn awe swoop Hmonmwc new 0% xomn weep

emote: Geoaoo 0:9 CM

mean: .0.0 no
.oamow >vwaenmwaon one cm
0 he pom omensooom
ova; mohoom he you
soon mo mwmoonm one mocﬂme

I'll

 

 

 

 

 

 

 

 

 

 

 

 

 

0.0a 0.0e 0.00- mwmmgsmmme 0.0: 0.00 0.00 0.Sa 0.:H 0.0 0 0>npmaseeo (tn:
S.e e.0 0.e 0.0 0.0 0.0 0.0 0.0 S.m 0.0 0.0 0 .1.
s 0H 0H :0 m we :0 :0 0 0 :H z
oo>06om pﬂamm we poommm Hmpcmoﬂﬂwm
0.0» 0.0e p.00 p.00 p.00 0.00 0.3: 0.00 p.00 0.00 0.00 0 0>npmaseao 7::
e.m S.e 0.: 0.0 0.0 0.00 e.0 s.0 S.e s.0 0.0a S 11
e e 0 0 0 S0 0H 00 e 00 00 z
oo>o€mm pomwmm cowpmoﬂmwawmw
0.HS 0.00 0.S0 0.H0 0.0: 0.0: 0.00 0.00 0.0a 0.:H 0.0 0 0>eemaseso
0.» 0.e 0.0 0.0 S.: s.0 0.0 0.» 0.0 0.0 0.0 0
Ha He 0 0 S 0a 00 0H 0 0 :H z
om>OEem mvommmm mcﬂooo oz
0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0a emmnoom 000:

mem>o no prmﬁm mo mohoom emery
mcoﬂvwocoo ownnh pom w one 2 mm
maoeouona ems 00H e6 enoom epaaheeﬂaem 000:

< Hm<m
>H mam<9

 

0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.50 0.00 0.00 0.00 0.H0 0.00 0.H0

 

0.00 5. I I I I I 0.0 0.0 s. 5.0 0.0 0.0 0.0 0.0

 

00H H I I I I I 0 0 H 3 0 0 m 0

 

0.00H 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 H.o0

 

0.00H 5. I 5. I I 0.0 5. 5.0 5.0 0.3 0.0 5. 5.0 0.0

 

00H H I H I I 0 H 3 3 0 0 H 3 0

 

nu

0.00H 0.00 0.00 0.00 0.50 0.00 H.00 0.30 H.30 3.00 H.00 3.00 3.50 5.30 0.00 0.05 0.55 0.35

 

0.00H 5. I 0.H 5. 5. 5. 0.H 5. 5. 0.H 5.0 0.0 5.0 5.0 5.0 0.0 5.0 5.0

 

00H H I 0 H H H 0 H H 0 3 0 3 3 3 0 3 3

 

Hmp05 3.3 0.3 0.3 0.0 0.0 3.0 0.0 0.0 0.0 0.0 3.0 0.0 0.0 0.5 0.5 3.5 0.5 0.5
Aunmwm swab wmoa mo wonoom noozv mSOHpHocoo money
now ucoonom one 2 0n mHooouonm 905 00H pow 09000 thHHnoHHom com:

0 0000
>H 00000

a0>0m mo 0wm0nocH cm 900 pcsooom 03 >0: mwnu SH
000:00009 mo noneac CH 0mmmnocH :0 0909000000: 005 pH .HOUOponm 0 SH HHHmm 0H 00:00009 0:0 nmnp 0QOE nonxe

.GOHpHonoo mwcp n0oo: .z Hopow 02p :0 000:00009

.mpHHmm 000:“ we muo0mm0 Hepc0owoom 03p 0>oE0n ou

 

 

 

 

 

 

 

 

 

 

0.00 5.00 0.00 0.50 0.50 0.00 0.00 0.00 3.05 5.00 3.33 0 0>HpmHneso
0.00 H. 0. 3.H 0. 0.H 0.0 0.3 H.0 5.0 0.00 3.33 0
0000 H 5 0H 0 0H 30 50 mm 30 0H0 300 2
00.0 o0>oa0m “HHmm mo
#60000 Hmpceewoo<
H.00H 0.00 0.00 0.50 3.00 0.00 0.00 0.55 0.05 0.03 0 0>H00Hseoo
H.00H I 0. 0.H 5. 0.H 0.0 0.3 5.5 0.5 0.00 0.03 0
000 I 0 0H 0 0H 00 30 00 m0 000 003 2
00.0 U0>OE0m Po0wmm
GOHHMUHMHHMSO
I 0.00 0.00 0.00 5.30 3.00 H.00 0.00 5.H5 0.00 0.00 0 0>090Hseso
0.00 I 0.H 0.0 0. 0.0 0.5 0.3 0.0 5.0 0.30 0.00 0
000 I 3H 00 m 00 00 00 05 05 0H0 000 z
0H.0 om>os0m #00000
MCHooo oz
x 00060 0 a 0 0 3 0 0 e 0 0 00 00660 00000000000

wmmcomm0m 909 HHm now

moowaonoo 009:9 wow 0 0:0 2 00

000660 50000000000

> 0Hn09

CHAPTER III

RESULTS

General

Using the procedures outlined in Chapter II, it was feund that
72 percent of all protocols were assigned mean reliability scores of
eight or better under the realistic condition. In line with this finding,
72 percent of 858 reSponses obtained reliability scores of eight, nine or
ten, and the mean reliability score across all reSponses was 8.19, while the
modal reliability score was a perfect score of 10. 38.5 percent of all
items were coded with perfect reliability across 15 paired comparisons.

As was noted earlier, comparison of six coders entails comparisons of
15 pairs.

Table IV gives the N and percent of all protocols assigned discrete
reliability scores (to the nearest tenth) under conditions in which no
coding effects have been removed from consideration. Table V gives the
N and percent of all responses assigned discrete reliability scores
(O-lO) under conditions in which no coding effects have been removed from
consideration.

These results relate to the basic hypothesis that there will be a
high level of reliability among the six coders on the coding of the TST

pgptocols and on the coding_of the TST responses. Although the results

 

did not reach the eXpected level of 75 percent either for protocols or
reSponses, the mean reliability score per reSponse did exceed the minimum
expected value of eight. In addition, the large percentage of responses

H6

#7

which obtained a modal reliability score on the extremely high side
of the reliability range indicates the excessively skewed distribution
in the direction of high reliability. This skewed effect is made more

apparent in Table IV and Table V.

When the Effect of Qualifiers is Removed

 

Hypothesis one stated that when the difference between consensual

 

reSponses and evaluative-consensual reSponses is overlooked, reliabilitz

 

is exPected to appear as greater than under realistic conditions. The

 

basic hypothesis was tested under conditions that removed none of the
effects which would Operate in the reality of the coding procedure.
Hypothesis one must be tested under the condition which removes the
effect of qualification from the coding of protocols and responses. For
examples of this procedure see page “1 and Appendix C, Section 11. Under
this condition, 77 percent of all protocols obtained a mean reliability
score of eight or better, while 78 percent of all responses were assigned
scores of eight, nine or ten. The mean reliability score across reSponses
was 8.60, and while the modal reliability score persisted at the value
of 10, there was an increase in reSponses accounted for by this mode

of 8.4 percent (u6.9 percent) These results support hypothesis one
since they represent (1) an apparent increase of five percent in the
number of protocols having a mean reliability of 8 or better, (2) an
apparent increase of seven percent in the number of reSponses scored
reliable at the eight, nine or ten level, and an increase of .u in the
average reliability of all responses. More complete results for the

test of this hypothesis may also be found in Table IV and Table V.

l+8

When the Accidental Effect of Splits is Removed

Hypothesis two stated that when differences between coders, in

 

 

the decision of whether to divide a statement into more than one statement

 

for coding purposes, are adjusted so that only the immediate statement

is affected, rather than all subsequent statements in the protocol,

 

reliability is expected to appear as greater than under realistic

 

conditions. Hypothesis two also must be tested under a hypothetical

 

condition which removes a real effect from consideration. The results
obtained under that condition represent an apparent increase of eight
percent (80 percent) in the number of protocols with mean reliability
of eight or better. An increase of eight percent (79 percent) in the
number of reSponses scored as reliable at the eight, nine or ten level
was found. The mean reliability of all responses increased .MO to 8.59
and the reSponses accounted fOr by the modal score of 10 increased

5.9 percent to uu.u percent. Cleardy the apparent increase of
reliability which was hypothesized has been obtained. These results are

made more explicit in Table IV and Table V.

Factors Present in Low Reliability

 

At this point, to visualize what sorts of responses produce certain
levels of reliability, you may wish to turn to Appendix C, which lists
each item and protocol, in conjunction with its score under three con-
ditions. The most unreliable protocol was #736, which reads:

I am an individual common everyday person.

Try to live up to the laws of America 8 pay my bills.
Try to bring up my family 8 get as meddui (sic) as you I can.

H9

The most unreliable statements when accidental effects of Splits had

been removed were:

1. My family.

2. I stay home on the week ends.

3. I'm a Negro and proud of it because its Gods (sic) intention.
u. I sing in the choir.

5. I am a person who believes a happy marriage is the basis

for success.
. I live alone.
7. My occupation.
8. I am a person who would like to become head of large
business firm.
9. I love my family and want to provide for them any way I can.*

0)

For one of these there were no agreements, and for eight of these
statements there was only one pair of coders in agreement. It is
interesting to notice that the categories involved in these very un-
reliable reSponses were repeaters, as can be seen in Table VI.

The only non—existent category (73) used by any coder was used in

coding the one response which was scored as zero in reliability.

Reliability Related to Complexity Level of Code Category

In order to test hypothesis four as content categories increase in

 

complexity, reliability will lessen, each item was scored on the following

 

scale of complexity, if there were at least four of six coders in agree-

ment on complexity level:

Complexity
Score Name Categories
1 Personal Categories 10-17, 70-71,
2 Primary Categories 20-27
3 Secondary Categories uo-ua, 52-55
a Categorical Categories 30-31, 50—51, 56-59, 62-63
5 Catch-all Categories 60-61, 72

T“*‘-———' '
3These responses are from several protocols, not a Single protocol.

Category
72
Non-split
43

48

51

59
62
63
22
1+1
58
61
21
23
1+2
IILI
as

53

73

50

Table VI

Code Categories by Frequency of Use
For Nine Responses Coded Least Reliably

Description
Other idiosyncratic (residual)
(Not a category)
Qualified general reference to work

General religiosity references, (qualified
and unqualified)

Qualified reference to nation, race, ethnic or
language group

Qualified other categorical referents
Domestic references (unqualified)
Domestic references (qualified)

Marital terms (unqualified)

Qualified occupational title

Other categorical referents, (unqualified)
Qualified other consensual

Qualified kinship, excluding marital terms
Qualified marital terms

General reference to work, (unqualified)
Formal organizations, (unqualified)

Formal organizations, (qualified)

Qualified reference to religion, religious
organizations

(A non—existent code category)

Frequency
8

6

51

It was expected that a complexity score of five would produce about

the same reliability Score as a complexity score of one. The decision
to treat these two scores separately and merge them later was made
because they were unlike groups in many ways. The analysis for this
hypothesis was done under only two conditions (1) the realistic con-
dition in which no coding effects are removed from consideration, and
(2) the hypothetical condition in which the accidental effects of Splits
were removed from consideration. It was inappropriate to test the
hypothesis under the condition in which the effect of qualification was
removed from consideration, since that made up a large part of the effect
this analysis was concerned with. The results are reported here in

tabular form. There seems to be no relationship of the type hypothesized,

Table VII

Modal Complexity Score of ReSponses
By N and X Reliability for two Conditions

 

 

Conplexity Score 1 2 3 u. . 5 Total
No Coding Effects
Removed

N 83 209 166 102 231 *786
Beliability Score 9.1 9.2 8.9 8.5 9.0

 

Accidental Effect
of Split Removed

N 39 197 158 83 217 #694

iReliability Score 9.2 8.8 8.6 9.2 8.9

 

*Only responses judged by a majority of all coders to be on a single
complexity level were analyzed, thus accounting for a decrease of 112
resPonses from the total N of reSponses.

#Removal of the accidental effect of Split statements created enough
redistribution of categories to lower the N of responses analyzed.by an
additional 59, while the total N increased by seven, thus the net loss
of 52.

52

at least not when complexity is defined as it was here. It did not seem
appropriate to run a statistical correlation of complexity score with

reliability score as originally planned.

Reliability Related to Number of Statements in Protocol

 

Hypothesis three, the more statements made on a TST protocol, the

 

lower will be the mean reliability score for that protocol, was tested

 

by running a correlation between number of reSponses in the protocol
and mean reliability score of the protocol. Since no Specific condition
was specified in the hypothesis, all three conditions were tested. The

results of the three correlations are:

No Coding Effects Removed - r -.2177, (p<:.01)
Qualification Effects

Removed - r -.09117 (NS)

Accidental Effect of

Splits Removed - r +.000590 (NS)

The hypothesis is supported under the realistic condition. However, since
the correlation is very nearly zero when the accidental effect of Splits
is removed, prdbably the negative correlation can be accounted for
entirely by that accidental effect, rather than by any effect of "more
responses appearing more complex to the coder," on which this hypothesis

was also based.

Results of Testing for Independence of Coders

 

Since the reliability results reported.here depend to a great
extent on the assumption that the coders were independent, it seemed wise

to test this assumption, in the form of a hypothesis. This test may be

53

thought of as a validity check of the reliability test which is being
reported. To put it another way, if the level of reliability which we
have measured and reported is actually a valid measure it must be
measuring no contaminating variables along with "true" reliability. If
we find that the coders are related in their agreements when they are
contributing to unreliability, we could no longer assume that they were
not related, by some contaminating variable, in their agreements which
contribute to reliability. In such a situation, the level of measured
reliability would be higher than the level of true reliability by some
unknown amount of contribution from the contaminating variable.
Therefore, we translated our assumption of coder independence into

the hypothesis that there will be no relationship in the pattern of

 

coder agreements which contribute to unreliability, This hypothesis was

 

tested by a Chi Square analysis of the N of paired agreements for 15
pairs of coders. It was tested under only two conditions and not

tested under the condition in which qualification effect is removed.
Three separate tests were run for paired agreements, triple agreements
and a combination of these when they contribute to unreliability.

"No relationship” was operationalized by using as the expected value for
the Chi Square analysis an amount of agreements equal for each of the

15 pairs of coders, which is equivalent to the average obtained N of

agreements across all 15 pairs.

59

Table VIII

2
X Values for Coder Pair Agreements
Under Two Conditions

 

 

 

 

 

 

(df = 19)
N Expected . X2 Significance
Value
Nb‘deing’Eff5cts
Removed
Paired Agreement 168 11 15.9 as
Triple Agreement 239 16 6.1 NS
Paired + Triple 407 27 10.8 NS
Accidental Effect
of Split Removed
Paired Agreement 150 10 33.7 p<:305
Triple Agreement 210 19 3.0 NS
Paired + Triple 360 24 15.8 NS

 

It appears that there may be a contaminating variable present, which

only becomes apparent after removal of the accidental effect of Splits
from consideration. In order to make more explicit the relationships
between coders which may, unwittingly, be contributing to measured re-
liability, the relational model in Figure l. was constructed. Here

each solid line represents a positive relation and each broken line
:represents a negative relation between two coders, A,B,C,D,E, and G
(Gallup). The values associated with each line indicate the amount and
direction of difference between the obtained number of agreements and the

eXpected (average) number of agreements.

55

Note that the shortest line represents the most positive relationship
(+12), while the longest line represents the most negative relationship
(-6). It was not possible to reflect the value of each relationship in

the length of each line. To achieve this effect, it would be necessary

to go to three, rather than two dimensions for the model. The most notable

change would be the positioning of G at a point above this page.

*1
‘\\
/ \ <3, -6 /
-5/ \ \ /
./*~./'
// 1:“ ,0>:\(’

 

/

\6 $47175 /
1"

\O-‘ £4-11“;

Figure 1. Model of Coder Relationships

This figure is based only upon the one Chi Square analysis which
Proved significant. In explanation of the strong positive relationship

between C and B, and between E and D, I offer that C and B were two male

56

‘ graduate students who had shared office Space for several years, and that
E and D are a married couple. Probably the contaminating variable un—
earthed here, is that these two pairs of the Six coders tend to ”think
alike" due to habitual association and interaction. To this extent,
then, the coders can not be thought of as independent. The somewhat
strong positive relation between G and both E and D is without rationale,
since these two pairs had no symbolic contact of any kind, while a number
of written communiques regarding the TST coding procedures had passed

between A and G.

CHAPTER IV

CONCLUSIONS

The Basic Hypothesis of Reliability

 

.The basic hypothesis of high reliability, under the realistic
condition in which no coding effects wre removed from consideration was
not supported at the level arbitrarily selected, i.e., 75 percent or
more of all protocols, received a score of eight or better. The obtained
results it will be recalled were that 72 percent of the protocols received
a reliability score of eight or better.

Let's return for a second look at the basis for this hypothesis.
In the first place, the decision to set the score of eight as an acceptance
level, rather than a score of seven, while reasoned out in some detail,
makes the assumption that a difference of one in paired agreements
(between the score of eight and the score of seven) is somehow more
important than a difference of one in the Size of the mode (between the
score of seven and the score of Six). Therefore, we could also
rationalize, after the fact, that the acceptance level could have been
set at a score of seven, as easily as at a score of eight, and reference
to Part B of Table IV shows that such an alteration would have produced
high reliability in 85 percent of all protocols, well within the
acceptance level of 75 percent or over.

In the second place, the hypothesis was based upon (1) greater
complexity of code categories than had been used in the past to code the

TST, plus (2) possibly a greater range in complexity of the TST reSponse

57

58

protocols, due to a unique use of the TST on a heterogeneous sample rather
than a homogeneous group, plus (3) the fact that testing reliability across
six coders and 15 pairs of coders, rather than two or three coders as in

past studies, was a more rigorous test of reliability, and finally

(9) upon the obtained reliability results (97 percent) of Kuhn and McPartland.

From this line of reasoning it is possible to decide, as we did, to
lower the expectation of reliability from the 97 percent level obtained
in the past. However, there were no guidelines as to how much lower was
low enough to account for the three differences in treatment. Therefore,
we feel the obtained reliability figure supports our contention that the
reliability figure should be lower than 97 percent. Beyond that one
could rationalize that either (1) three percent is a very small difference
from accurate prediction, or (2) we were mistaken in our arbitrary choice
of a 75 percent acceptance level. We will not indulge in either line of
rationalization, however, and the fact remains that reliability did not
reach the level we thought it would.

The problem remains for other behavioral researchers to assess the
processes which operate in arbitrary decisions regarding numerical levels.
While there may be individual differences between researchers, I suSpect
that there are also cultural factors involved in such matters as (1) odd
number versus even number, (2) how much is big, (3) how much is
majority, (u) how much is little, (5) how much or how little is
significant, etc. Obviously the most important cultural variables are
the numerical base used and the habitual numerical operations reflected

in monetary matters and pricing practices. I suSpect in this instance

59

that a numberical base of ten influenced the arbitrary decision to set
the acceptance level at 75 percent. Furthermore, had the researcher
been raised in a culture in which the monetary system used pennies (.01),
nickels (.05), dimes (.10) and tomens (.30), the arbitrary choice may
well have been set at 70 percent. An experimental study of the
psychological and social processes involved in setting numberical
acceptance levels would be of importance to the study of philosophy of

science.

Knowledge of Grammar related to Coder Reliability

 

Both of the hypothetical conditions studied are related to the coders

knowledge of grammar. The hypothesis that Removing the effect of quali-

 

fication will operate to raise the appearance of reliability can be

 

translated back to the belief that, at least for the coders involved in
this reliability test, the coder competence was not at the highest
possible level regarding ability to (1) identify adjectives, or (2) decide
what sorts of clauses, phrases or allusions serve to modify the main

clause. The hypothesis that removing the accidental effect, caused by

 

non agreement between coders in the decision to Split a response into

 

two reSponseS for coding purposes, will operate to raise the appearance

 

of reliability_was based upon the belief that, at least for the coders
involved in this study, coder competence was not at the highest possible
level regarding (l) ability to identify a compound sentence, and (2)
ability to identify separate clauses associated with compound objects.

AS pointed out before, the removal of these two effects was only

hypothetical, and in each case the hypothesis was supported by the results.

60

From this we can conclude that if coders were employed whose levels of
competence, at these four grammatical problems, were higher, then
reliability would increase in a real, rather than an apparent fashion.
While it is probably impossible to remove these effects entirely, due to
the difficulties of the language structure, it should be possible to
lessen these effects.

It seems an appropriate suggestion, that a replication of this
study, using the same 150 protocols, but employing six high school English
teachers, or six professional editors as coders would produce an even
higher level of reliability than that obtained here, assuming adequate

training in the code's content categories.

Independence Related to Coder Reliability

 

The hypothesis that there will be no relationship between coders in

their agreements which contribute to unreliability, was based upon the

 

assumption that the coders in the present study were independent.

Recall that, at least after the accidental effect of Splits were removed,
this hypothesis was not supported in the test of paired agreements which
contributed to unreliability.

A legitimate conclusion is that a replication of this study using
coders who are selected in some manner so as to make them more independent
would measure reliability at a lower level than the level measured in
this study. For example, using the same TST protocols one might ask
Six graduate students in social psychology to act as coders, with the
limitation that each graduate student be studying at a discrete

university, and never have met any of the other coders.

6l

In line with the suggested replication using coders of increased
competence in grammar, one would eXpect reliability to increase if the
six high School English teachers all knew each other and taught at the
same high school, while one would eXpect little change in reliability
if the six high school English teachers taught in separate cities or had
never met; one would expect increased reliability if the six editors all
worked in the same publishers office, while one would expect little change
in reliability if the six editors worked for discrete publishing companies

and had never met. Because an increase in the grammatical competence of the

coder is expected to increase real reliability, while an increase in coder

independence is expected to decrease measured reliability, it is probable

that the two factors would cancel each other out. Therefore, an increase

in coder grammatical competence coupled with an increase in coder in-

dependence might well create little effect on the level of measured
reliability.

Factors in the Present Code which Limit Reliability

The casual finding that certain code categories contribute to ex—
treme low reliability in a disprOportionately greater amount than other
code categories, while still other code categories do not contribute at
all to extreme low reliability suggests that there are real differences

between code categories as to the reliability one can impute to their

assignment to a reSponse by an individual coder. Originally it was our

intent to additionally study these differences in the several code
categories, since, on the face of it, some code categories seem less

clear-cut than others. However, our own bias places higher value on a

62

study which tests hypotheses than on a study which does eXploratory

counting, and there seemed no basis upon which to hypothesize this

variable, the definitiveness of the code category. An attempt was made

by testing the hypothesis that as complexity level of the category
increased, reliability would decrease, but this hypothesis was not born

out. Since the scale used to Operationalize ”complexity level" does not

appear to be measuring what I referred to above as definitiveness of the

code category, it was not really surprising that the hypothesis was not

supported.

' If some scale of definitiveness can be developed, then a hypothesis

and corresponding test of the correlation of definitiveness level with

reliability level could be run in a future reliability study. To point

up levels of definitiveness, let us examine the actual code, as it

appears in Appendix B. If as Berelson generalizes, the "more precise

and complete the set of coding rules, the fuller the illustrations,"
(3, p. 514) the more reliable will be the coding; then this code is
lacking only precision. Certainly the code approaches completeness and

fuller illustrations would be beyond comprehension. Actually grave

attempts at precision were made in the writing of this code. However, its

very length, completeness, and fullness of illustration operate to

nullify the attempt at precision.

Following are some of the inconsistencies in the code which lead

to imprecision. Reference will be made by page number and row on the page,

numbering rows from top to bottom.

lO.

63

On page 73 #9, row 3, compared with row 6 -- ”I'd rather not answer
that question." is labeled a refusal (99), while "I would rather keep
that to myself" is labeled as idiosyncratic (72).

On page 73 #9, row 6, compared with 83 #61, last row-—"I would rather
keep that to myself," is labeled as idiosyncratic (72), while "I would
rather keep this to myself." is labeled as qualified other consensual

(61).

On page 75 #ll, row 3, compared.with 83 #61, row 9--"I am a young
single girl" of which the portion "I am young" is labeled as qualified
age (11), while "I am my plain old self" is labeled qualified other

consensual (61).

Page'ﬂi #19, row 7-— ”I am very tanned” is inaccurately labeled other

physical characteristic (unqualified) (In)

Page'"5 #17, row 9, compared with 78 #90, row 8, and compared with
page #02, row 6--"I'm a chore woman" is labeled a qualified sex
reference (17), while "I’m a charwomad'is labeled occupational title
(unqualified) (HO), and "I am a working man" is labeled general

reference to work (unqualified) (#2).

Page '"5 #20, row 9, 10 and 12, compared with #21, row 3-—"I have eleven
grandchildren" and "I am a mother of four daughters" and "I have

four children -- three in school and one babyf, are labeled kinship
(unqualified) excluding marital terms (20), while "I raised a family

of three” is labeled qualified kinship, excluding marital terms. (21).

 

Page 80 #148, row 6 and 7, compared to Page 81 , #53, row 7——"I am a
person who believes in God" and "I am a religious person," are

labeled Religiosity, (qualified or unqualified) (48), while ”I live by
my religious beliefs” is labeled Qualified reference to religion and

religious organizations (53).

 

Page 77 #25, row 3, compared to page 83 #61, row 19 and compared
'to 39' row 6--”I have lots of friends" is labeled Qualified reference

to a friend (25), while ”I am a friendly person," is labeled qualified
other consensual (61) and ”I am fr1endly“ is labeled idiosyncratic (72).

Page 83 #61, row 2 and row 6, compared with page 85 row 16 -- "I'm not
much of a talker" and "I keep my mouth shut” are labeled as qualified
other consensual (61), while ”I do not like to gossip," is labeled

idiosyncratic (72).

Page 33 #61, row 5, compared with page 89 , row 9 and 15-- "I am a
happy person” is labeled qualified other consensual (61), while

”I'm happy" and "I have always been pretty happy" are labeled
idiosyncratic, (72).

64

11. Page 82 # 59, row 2 compared with Page 85 row 18 -- "I am interested
in Sports" is labeled qualified other categorical referents (59),
while "I am interested in recreation" is labeled idiosyncratic (72).

12. Page 82 # 59, row 2, compared with page 84 row 22-- "I like music"
is labeled qualified other categorical (59), while "I am creative in

art" is labeled isiosyncratic (72).

These illustrations of some of the Specific difficulties in the present

code may indicate some of the confusion caused in the individual coder's
mind by the code. One would expect that such confusions would give rise
to unreliability in coding. More careful attention to the code could
operate to reduce unreliability.

Length of the TST as a Factor in Reliability_

The hypothesis that the more resppnses there were in a TST protocol
the lower the reliability of the protocol would be was upheld by a
negative correlation at a significant level under the realistic condition.
This evidence points to a value in using only ten statements, rather than
20, as has been done in the present study and in Brookover's study (4).

Such foreshortening of the TST should act as an aid to reliability.

Future Utility of the Twenty Statements Test

The ease of administration of the TST makes it a natural choice for
assessing self data in the field situation, and it has particular value
in any type of cross cultural study, where the question can be rather
easily translated into equivalent forms in several languages.

Perhaps the greatest utility for the TST in the future is to

compile a body of statements about the self which can in turn be scaled

by a large sample of persons. Finally by submitting these scale items

to factor analysis, it should be possible to construct a standardized

65

scale of self-attitude with factor loadings for each item. Such a

standardized scale would have the advantage of the TST in being self-
defined by a large, representative sample of selves, plus the added
advantage of structured and pre-coded reSponses, which would make un-
necessary any future studies of inter-coder reliability.

Another future product arising from the TST, as a generator of
items, would be a Q-sort of items designed to measure self-attitudes. Such
a standardized instrument would introduce the additional advantage of
allowing the individual respondent to add a good deal of self definition
to the measurement in the style in which he rank orders the items, while
reliability remains enhanced by holding the possible reSponses constant.

Another innovation which should enhance the reliability of the TST

is the self-coding form under development by McPhail (24). By placing the

coding under the direction of the respondent, coder reliability becomes
a meaningless concept.

In conclusion, let us look at some important ways this study differs
from the original reliability check by Kuhn and McPartland: (l) the data
being coded in the present study represented a wider range of expression,
generated as it was by a heterOgeneous rather than homogeneous group of
rspondents, (2) code categories being used represented 22 times as many
possibilities for unreliability as the original Kuhn and McPartland code
(45 categories XE: two categories), (3) six coders represent 16 times as
many possibilities for disagreement as the two coders used in the Kuhn
and McPartland reliability check, and (4) the present study set a standard

for acceptance of reliability (75 percent of all protocols and reSponses

should have a reliability score of eight or better) rather than accepting

65a

whatever reliability level was obtained.

All of these differences make a high level of reliability more
difficult to attain. The 72 percent level obtained XE: the 97 percent
level obtained by Kuhn and McPartland appears encouragingly good. In
Spite of the fact that the results did not quite reach up to the p_priori
acceptance level, we feel the TST can be counted upon for very good inter-

coder reliability even when heterogeneous populations are sampled, a

complex code category system is used and a relatively rigorous reliability

test is applied.

1.

10.

ll.

12.

13.

14.

15.

REFERENCES

Allport, G.W., Becoming: Basic Considerations for a Psychology of
Personality, Yale University Press, New Haven, 1955.

 

American College Dictionary, Random House, New York, 1947.

Berelson, Bernard, "Content Analysis,” in Gardner Lindzey (Ed.),
Handbook of Social Psychology, V. I., Addison-Wesley Publishing Co.,

Reading, Massachusetts, 1954, 488—522.

Brookover, Wilbur 8., Ann Paterson 5 Shailer Thomas, Self Concept of

Ability and School Achievement: The Relationship of Self Images .
to Achievement in Junior High School Subjects, Office of Research 8
Publications, Michigan State University, East Lansing, Michigan, 1962.

Bugental, J.F.T. and S. L. Zelen, ”Investigations into the Self—Concept.
I. The W-A-Y Technique," Journal of Personality, V. 18, 1950, 483-498.

Cooley, Charles Horton, Human Nature and the Social Order, Charles
Scribner's Sons, New York, 1902.

Couch, Carl J., "Family Role Specialization and Self—Attitudes in
Children”, Sociological Quarterly, April, 1962, 115-121.

, "Self-Attitudes and Degree of Agreement with Immediate
Others,” American Journal of Sociology, 1958, 491-96.

 

' , Unpublished manuscript dealing with classes of verb
forms used in response to a modified form of the Twenty Statements

Test.

 

Dyer, Del A., Self Concept, Role—Internalization and SalienCy in
Relation to 4-H Leader Effectiveness, Unpublished Ph.D. Thesis,
Michigan State University, East Lansing, Michigan, 1962.

Hartley, E. L., and R. E. Hartley, Fundamentals of Social Psychology,
Alfred Kn0pf, New York, 1952.

Hickman, C. Addison S Manford H. Kuhn, Individuals, Gropps and
Economic Behavior, Dryden Press, New York, 1956.

Interviewers' Bulletin, Gallup Organization, publication #GO/6368 and

 

#606369, September 3, 1963.

Kuhn, Manford H., "Self," in Julius Gould and William L. Kolb (Eds.),
A Dictionary of the Social Sciences, Free Press, Glencoe, Illinois,

1964, 628-30.

, "Self-Attitudes by Age, Sex and Professional Training,"
THE‘Sbciological Quarterly, V. I, #1, January, 1960, 39-55.

66

l6.

17.

18.

19.

20.

21.

22.

23.

24.

25.

26.

27.

28.

29.

67

. and Thomas S. McPartland, "An Empirical Investigation
of’SeIf—Attitudes,” American Sociological Revue, 1954, V. 19, 68-76.

Lindesmith, A. R. and A. L. Strauss, Social Psychology, Dryden Press,
New York, 1949.

Lipkin, S. "Clients' Feelings and Attitudes in Relation to the Outcome
of Client-Centered Therapy," Psychological Monogrpphs, 1954, V. 68,

1-30 0

McPartland, Thomas, S. D., The Self and Social Structure, Unpublished
Ph.D. Thesis, University of Iowa, Iowa City, Iowa, 1953.

McPartland, , John H. Cumming, Wynona S. Garretson, "Self-Con—
caption and Ward Behavior in Two Psychiatric HOSpitals,”

Sociometry, June, 1961, 111—124.

 

McPartland, and John H. Cumming, "Self—Conception, Social Class,
and Mental Health," Human Organization, Vol. 17, #3, 1958, 24-29.

 

Manual for the Twenty Statements Problem, Department of Research,
The Greater Kansas City Mental Health Foundation, Kansas City,

Missouri, 1959.

McPhail, S. Clark, Self-Identification Within a Specific Context of
Experience and Behavior, Unpublished Ph.D. Thesis, Michigan State

University, East Lansing, Michigan, 1965.

McPhail, Clark, Unpublished manuscript on develOpment of a self-
coding form of the Twenty Statements Test.

Mead, George Herbert, Mind, Self 8 Society, University of Chicago
Press, Chicago, 1943.

Meltzer, Bernard N., The Social Psychology of George Herbert Mead,
Division of Field Services, Western Michigan University,
Kalamazoo, Michigan, 1959.

Mielke, Keith W. Evaluation of Television as a Function of Self—
Beliefs, Unpublished Ph.D. Thesis, Michigan State University,
East Lansing, Michigan, 1965.

 

Raimy, V. C. "Self-Reference in Counseling Interviews,” Journal of
Consulting Psychology, 1948, V. 12, 153-163.

Scheerer, E. T., "An Analysis of the Relationship Between Acceptance
of and Respect for Self and Acceptance of and Respect for Others
in Ten Counseling Cases," Journal of Consulting Psychology, 1949,

V0 13, 169-1750

30.

31.

32.

33.

34.

35.

36.

37.

38.

39.

68

Schutz, H., Theory_and Methodology of Content Analysis, Unpublished
Ph.D. thesis, University of California at Los Angeles, 1950.

Stewart, Robert L., An Empirical Test of the Reference Group
Hypothesis, Unpublished M.A. thesis, University of Iowa, Iowa City,

Iowa, 1952.

 

Tucker, Charles W., Occupational Evaluation 8 Self—Identification,
Unpublished Ph.D. Thesis, Michigan State University, East Lansing,

Michigan, 1966.

Vargas, M. J., "Changes in Self-Awareness During Client-Centered
Therapy," in C. R. ROgers 6 Rosalind F. Dymond (Eds.), Psychotherapy
and Personality Change, University of Chicago, Press, Chicago,

1954, 145-166.

 

Webster's Elementary Dictionary, American Book Company, New York, 1941.

Wylie, Ruth C., The Self Concept, A Critical Survey of Pertinent
Research Literature, University of Nebraska Press, Lincoln, Nebraska,

1961.

Bugental, J. F. T., ”A Method of Assessing Self and Not-Self Attitudes
During the Therapeutic Series,” Journal of Consulting Psychologists,

1952, v. 16, 435-439.

Stock, Dorothy, ”An Investigation into the Intercorrelations Between
the Self-Concept and Feelings Directed Toward Other Persons and Groups.”

Journal of Consulting_Psyphologists, 1949, V. 13, 176-180.

Raskin, N. J., "An Objective Study of the Locus-of-Evaluation Factor
in Psychotherapy," in W. W. Wolff 8 J. A. Precker (Eds.), Success
in Psychotherapy, 1952, New York, Grune 8 Stratton, 143-162.

 

A Manual for the Analyzed Data from United States and Mexico,
Charles W. Tucker, principal editor, Michigan State University,

Sociology Department, East Lansing, Michigan, September 18, 1964,
mimeo.

APPENDIX A

Form of the TST in the Interview Schedule

70

APPENDIX A

Form of the TST in the Interview Schedule

Do not spend more than three minutes on the next item.

INTERVIEWER:
Probe unti1_you get at least five statements from the
When you must_probe, indicate with a

respondents.
horizontal line below the number of his last statement.

Although probably new

Now, we have quite a different thing for you to do.
to you, it is easy and I think you will find it quite enjoyable. Everyone
Now, let me tell

we have asked to do this has found it to be interesting.

you what we have in mind.
Think of as many answers as you

Ask this question of yourself, "Who am I?"
can in answer to the question, "Who am I?"

In a moment I would like you to give me the answers as if you were giving
them to yourself, not to me or anyone else. Take a little time to think
about it. (PAUSE HERE MOMENTARILY.)

3. Now, please make what you consider to be the most important statement

 

about yourself first.

1.
b. Now, make what you consider to be the next most important statement

about yourself.

2.
Now, make what you consider the next most important statement about

yourself. -

3.
(PROBE) Are there any other statements you could make about yourself

in answer to the question, "Who am I?"

 

 

 

 

 

APPENDIX B

TST Code Book

72

READ CAREFULLY - GENERAL CODING INSTRUCTIONS FOR ITEM 17 - Who am I?
A STATE-

1. The object is to code all statements as separate items.
MENT CAN BE A SINGLE WORD, A PHRASE, OR A COMPLETE SENTENCE.

Thus, you may find more than one statement on a particular line in
Each such statement will be coded separately.

the questionnaire .
Examples: a. carpenter; a happy fellow; I am a good father
(This is three separate statements.)
(This is two statements.)

b. I am a father and husband.
I am a very hardworking, unhappy father. (This is one

statement.)
I am handsome but weak. (This is two statements.)

. I am a young single girl. (This is three statements.)
I am a American Negro citizen. (This is three statements.)

"hmg,

GENERAL RULE: Any element which can be construed as a single thought
A noun and its accompanying

about oneself will be coded as a statement.
modifiers is usually a single thought. Adjectives modifying nouns are
to be included with the noun (as in example c. above). Adjectives which
stand alone as self descriptions (as in example d. and e. and f. above)

will be counted as statements.
Usually only one state-

2. Go throngh question 17 and mark off statements.
ment will be fbund on a line. In those cases where more than one state-
ment is found on a line, code each of the statements in order, beginning
with the first mentioned statement.

3. Each statement will be coded into a two column code described below.
In case there are less

When you get to ten statements, ignore the rest.
than l0 statements, put code 88 into the unfilled columns.

u. Coding of this item can best be done by following two procedures:

a. Matching statements on the protocols with statements provided as

examples for each category (see following pages.)
b. Following these General instructions with Special attention to the

Arbitrary Rules (see following pages.)

5. The titles of the categories are only suggestive of the range of state-
ments to be included in each category. Basically, the category serves
However the logic

to collect statements which have some common referent.
of placing certain statements in the same category may not always be
apparent. Attention to such questions as "What did the reSpondent really
mean to say?", or "How can this category placement make sense in the
context of what the respondent said in another statement?", will lead to
random coding, and low inter—coder reliability. Therefore, the best
check on where a statement is to be coded is to try to find an example
like the statement in the list of examples, or one clearly close to it.

A final check should be made to make certain that the category assignment

does not violate any of the arbitrary rules and exceptions.

73

GENERAL CODING INSTRUCTION - Who am I? (Continued)
6. Definition of a qualified and unqualified statement:

a. A QUALIFIED statement is any statement which is modified by the
inclusion of an adjective or adverb, along with the noun and verb.
Sometimes a verb will modify a statement, as in the statement
"I hate work." -

b. AIUNQLALIFIED statement is any statement which is not modified by
the inclusion of an adjective or adverb. This would be only the
noun or verb, in cases where the verb has no qualifying effect.

Examples: I am a poor student. (This is a qualified statement.)
I am a father. (This is an unqualified statement.)

7. Each statement should be coded disregarding (a) negation or affirmation
and (b) verb tense.

Examples: Treat the statement "I am not a mother." as in the same
class as "I am a mother."
Treat the statement "I was a foreman." as in the same class

as "I am a foreman."

8. The distinction between "consensual" and "idiosyncratic" statements can
best be understood by reading‘through the list of examples provided.
All of the references in the list of idiosyncratic statements are to
events, states of mind, or processes which are private, lacking in
Specificity, or lacking in general agreement as to meaning. On the
other hand, all consensual statements have at least one word which
points to something most people would recognize, and have some agreement

for the meaning of.

9. Use a 99 to code all columns in the case of a clear refusal.
We define a refusal as one of the following:

a. I'd rather not answer that question.
b. I don't want to answer that question.
c. Would you please go to the next question.

"I don't know." and "I would rather keep that to myself" are not defined
as refusals to the question but as reSponses to the question which should

be coded as Idiosyncratic (72)

ARBITRARY RULES

A. In cases of statements which fit into two or more categories, aSSign
the statement to the category closest to the top of the list numerically,
with the following exceptions:
1. Any reference to health, whether consensual or not, should be coded
as a 70 or 71, depending on the matter of qualification.
2. Any reference to an organized church or religion should be coded as
53 or 52 regardless of what other codable items are mentioned in the

statement.

7“

GENERAL CODING INSTRUCTIONS - Who am I? (Continued)

3. Any reference to race, irrespective of other references in the
statement which are codable, should be coded 50 or 51.

A. General religiosity references frequently include words that refer
to kinship, sex, work, etc. However, all religiosity references
should be coded as N8. (see examples)

Code the statement as it is recorded on the schedule. Do not add or
subtract words or impute meanings. The only exception is described
in rule C below.

In compound sentences with a single stem, treat the clauses as if each
were separate statements with the stem attached to each clause.

Examples 1. "I am interested in my house, in recreation, and in education."

should be treated as the following three statements:
a. I am interested in my house.
b. I am interested in recreation.
c. I am interested in education.

2. "I am an American Negro citizen." should be treated as the
following three statements:
a. I am an American.
b. I am a Negro.
c. I am a citizen.

3. "I am a young single girl." should be treated as the
following three statements:
a. I am young.
b. I am single.
c. I am a girl.

In cases where titles are mentioned which include several words, treat
the entire title as the object of the sentence.

Examples: 1. I am a U.S. citizen (Do not treat U.S. as qualifying citizen.)
2. I am an executive secretary (Do not treat executive as
qualifying secretary)
3. I am a retired boilermaker (Do not treat retired as
qualifying boilermaker.)
u. I am a successful medical doctor. (Treat successful as

qualifying medical doctor)

When coding initials (e.g., B.P.O.E., G.O.P., I.0.0.F., etc.) which
have not been defined by the interviewer on the schedule, all initials,
whether they are recognized or not recognized by the coder, should be
coded as 58 or 59 depending upon qualification.

Example: "I am a B.P.O.E." (This is an unqualified statement - 58)
"I am a good B.P.O.E.” (This is a qualified statement - 59)

75

TST CODE

10 - Age (unqualified) Only include Specific age such as "I am 25 years old.”

11 -

12-

13 -

l4 -

15 -

16 -

Statements referring to "old" or "young" are NOT
to be included in this category but are coded as 11,

I am 25 years old
Qualified age

I am almost 35.

I am (a) young (single) (girl) *Note: This is the first example of
a sentence in which there is more
than 1 codable referent. In this
case ”I am young" is coded as ll;
"I am single" is coded as 22;
and, "I am a girl" is coded—as 2&3

Name This would be the first, last or both names with or without the

prefix of Mr., Mrs., or Miss.

I am Ardith Weaver
I am Willie Ella Barnell
My name is Joseph Smith

Address This would include only street address and not the name of

the community or nation.

I live at 712 Creek Road

Other Physical Characteristics (Unqualified) -This could include
references to height, weight, skin color, eye color, etc.
This does not include "colored", "negro", etc., which is

coded under 50, 51.

I am bald-headed
I am 5' 2" tall.
I am very tanned.

Qualified physical characteristics

I am nearly 5'2" tall.
My hair is turning gray.

Sex (unqualified) This would include the words male or female. It

 

would also include the words; man, woman, boy,
girl, lady, and gentlemen.

I am a man
I am a female
I am a woman

l7 -

18 -

19 -

20 -

21 -

76

Qualified Sex Reference

 

I am a contented woman
I am a nice gentleman
I'm a chore woman

*I am a man who is loved by his friends *In the fourth statement there

I am a good man are two possibilities of

I am a self-made man ‘ coding: friends or sex ref.
I am a fortunate man categories. Following Rule 1
I am a man able to help my children the statement is pushed up _'
I am a free man. to the first of the two

I'm the man of the house. possible categories.

I am a man who does a lot of public service work free.
You are a nice gentle woman that paid me a visit.

OPEN

OPEN

Kinship (unqualified) Excluding marital terms. Including such terms as
family, child, parent, relative, daughter, son, father, mother,
sister, brother, uncle, aunt, grandfather, grandmother, great
aunt, great uncle, godmother, godfather and all in-law referents.
Do not count number of children or relatives as a qualification.

I am a grandmother

have children

am a mother

have eleven grandchildren

am a mother of four daughters

am a father to my children

have four children -- three in school and one baby.

am making a living for my family.

HHHHHHH

Qualified Kinship. Excluding marital terms.

I'm a good relative.

I raised a family of three.

I come from a good family.

I have a very happy family.

My family is all gone.

can educate my children.

am a referee for my daughter.

am proud of my kids.

am interested in my children' 3 education.

hope to have made my folks happy in my life.

am a farmer' 3 daughter

am not the same as I would be if my mother was living.
enjoy being close to my family.

am loyal to my family.

The funre lies in the hands of mothers

1, as a mother, am reSponsible for the future generation.

My family is the most to me.

HHHHHHHHH

22

23 -

21+-

77

I am a family man.

I am a devoted family man.

As a mother, I can't be done without.

And my children whom I couldn't do without.

I am a person whose parents are both working.

I am not the person I would be if I didn't have a family.

Marital terms (unqualified) This would include the terms husband and
wife, married and single, and those referring to marital
status including divorced and separated.

 

am a wife.

am single.

am divorced.

am married.

am my husband's wife.

HHHHH

Qualified Marital Terms

 

I'm a wife of a progressive businessman.

I lost my wife.

I am proud of my husband.

I'm going to be married.

I have a nice wife.

I am not the person I would be if I wasn't married.
I married a farmer.

I'm important to my husband.

Financially, I like to feel I am helping my husband.
Ask my wife, she would say I am a gun nut.

I help mate financially.

Reference to Friend (unqulaiified) This would include those statements
with the word "friend" in them. A statement referring
to the reSpondent's own friendly quality would not be
included, e.g., "I am friendly" would be coded as 22,

I am a friend.
I have friends.

25 - Qualified Reference to a Friend

26

I have many friends.

I have lots of friends.

I am people's friend.

I am a friend to other men.

I'd like to make more friends than I have.

Other (Residual) Primary Terms (unqualified) This would include
neighbor, have neighbors, peers, bridge clubs, buddy,

pal, colleague, associate, companion.

I am a neighbor.
I am a companion.

27

28

29

30

31

32

33
39

35

36

37

38

39

no

__q——..~ .
m*.~. .-.-\._.._ ,

78

Qualified Other (Residual) Primary Terms

I hope that I am a good neighbor

I am a good neighbor

I have never had much trouble with neighbors.
OPEN

OPEN

Reference to Community (Unqualified) This would include any reference
to the respondent's town, city, community, or state.

I live in New York.
I am a Californian.
I am a Michigander.

Qualified reference to community

I am from a poor community. I am a community service worker.
I am a citizen of a nice community. I am a community worker.

I like to mix with community groups. I am a good community supporter.
I am an example in the community. I am a civic minded person.

I am loyal to my community.
I am well-liked in the community.

OPEN

OPEN
OPEN

OPEN
OPEN
OPEN
OPEN

OPEN

Qgpupational Title (Unqualified). This would include all names of
specific occupations including the term "laborer",

housewife.

 

I'm a dish-washei I am a teacher
*I'm a retired housewife I am a chauffer
I am a seamstress I'm a painter
I am a punch press operator I am a pastor
I am a beautician I am a housewife
I am a musician I'm a char-woman
a“Retired" in conjunction with an occupational title is not, for our purposes,

considered as a qualifier of occupation and should be coded as 393

79

I am a post-master I'm a lawn mower
I am a banker I am a gardner
I'm a filling station operator I'm a knitter
I'm a carpenter I am a civil worker
*I am a retired truck driver I am a cook
I am a farmer I am a Spiritual advisor
I am a nurse I am a nursemaid
I'm a counselor I am a political worker
I'm an entertainer I am a foreman on my job
I am an educator I am a religious instructor

#1 - Qualified Occupational Title

 

am an enthusiastic advertising man.

am a happy housewife

am a very average businessman

am a good housekeeper

hope to become a good author.

ran a foster home for five years in New York state.

I'm a housewife, all inclusive.

I'm a farmer, still able to do a day's work.

I don't like housework. I love housework.

I'm tops at housework. I just finished some housework.

HHHHHH

H2 - General Reference to Work (Unqualified). Includes reference to work,
worker, retired, unemployed.

I work.

I am a laborer.

I work 10 hours a day.
I am a working man.
I'm retired.

I am employed.

I am making a living.

43 - Qualified General Reference to Work

I am a worker by trade. I am good at work.
I am a good worker.
I am a poor working man.
I'm dedicated to my work.
When I work and am hired to work, I
do the best of my ability.
I don't Shirk my job.
I work everyday.
I'm a hard working man.
I'm a good employer.
I like my work.
I am fair and average in my work.
I like to work.
I am strong in buying and selling property.
I enjoy making a living

 

*"Retired" in conjunction with an occupational title is not, for our purposes,
considered as a qualifier of occupation and should be coded as NO.

un-

us-

HB -

1+7-

1+8-

ug-

80

I live by the sweat of my brow.

My job comes first.

I have pride in my work.

You have to (work) if you want to keep even.

Formal Organizations (Unqualified) This would include all educational,
professional, civic, social, farm and labor
organizations. THIS WOULD NOT INCLUDE ANY RELIGIOUS
OR POLITICAL ORGANIZATIONS. All initials should be
coded as 58 or 59 depending on the matter of
qualification. -—-

I'm a Rotarian. I am a member of the Eastern Star.
I'm a Mason. I am a member of organizations.
I'm a class Sponsor.

I am a Boy Scout Committeeman.

goalified Formal Organizations. Excluding Religious and Political Orgs.
I am a loyal unionist.

Reference to Education (Unqualified). Excluding occupational titles
such as teacher, instructor, sponsor, advisor, etc.

I am a student.
I would say I am a student.

anlified Reference to Education
I enjoy studying.
I am a tired Student.

I am going to get my BA degree next winter.

Religiosity (Qualified or Unqualified) Excluding references to church,
—~A formal religion or religious organizations.

am one of God's workers put here to make this world better than when
I came here.

H

I try to serve the Lord in every way possible.

I am a person who believes in God. .

I am important to the Lord as an indiv1dual

I was blessed by the Lord in everything.
*I am a Christian lady. *Where kinship referents are used here,
*I am a child of God. (child, children), and sex referents,
*I am one of God's children. (lady) disregard usual coding by kin-

I lead a Christian life. ship and sex and place all such

I am a creature created by God. referents to religiosity in ﬁg,

OPEN

81

50 - Reference to Nation, Race, Ethnic or Language Group (Unqualified).

51 -

52 -

53 -

*I enjoy my church work.

This would include reference to nationality and
citizenship. American, English, Finnish, Negro,
White, Spanish (Speaking) etc. Any referrent to
being Jewish should be coded as either 52 or 53
depending upon qualifications. '_— -__

I'm English.
I am a Negro.
I am a white woman

*I am a American (Negro) (citizen) *Sentence containing more than 1
*I am a (American) Negro (citizen) codable referent. In each case
*I am a (American) (Negro) citizen. here, the code is 50.

I am a (middle-class) citizen. .__

Qualified Reference to Nation, Race, Ethnic or Language Grogp

I am a good U.S. Citizen

I am a common everyday American

I am a privileged American

I'm glad that I am an American by choice

I am a person with opportunities unequaled in U.S.

I am as deep as anyone in the progress of our country
I am glad to be in America

I am a visitor to this country

Reference to Religion and Religious Organizations (unqualified) This
would include all church or church-related groups,
membership and participation in same. Jewish would
be included in this categOry. Initials that can
not be recognized by the coder should be coded
either 60 or 61 depending upon qualification.

I am a Catholic

I'm a church member
I am a church goer
I go to church.

I am Jewish

Qualified Reference to Religion and Religious Organizations

I am a good church member
I am important to my church

I am loyal to my church .
*In the fourth item "work" is con-

sidered here as a modifier of church.
Church is categorized as referring
to Religion Organization and it is
thus coded as 52 or 53 depending on
qualification. It should not be

pushed up to #2 or 43.

I am free to choose my religion
I live by my religious beliefs.
I go to church every Sunday.

I go to church regularly.

82

SN - Reference to Politics and Political Organizations (Unqualified) This

55 -

56 -

57 -

58 -

59 -

6O -

would include reference to political, politics, all
political parties, political action groups and
affiliated clubs and associations. Initials that
can not be recognized by the coder as falling in
any category should be coded either as 60 or 61
depending upon qualification.

I am a voter.
I am a Democrat

Qualified Reference to Politics and Political Organizations

I'm a poor politician

I am an unhappy Democrat

I'm interested in national and international concerns

I feel myself as much concerned about public life as any other human
I am :ble to vote

Reference to Socio-Economic Status (Unqualified) This would include
working class, lower class, middle class, upper
class, rich, poor, wealthy or well-to-do. An
evaluation by the respondent of working ability
will not be included.

I am middle—class
I am in the upper-class
I am (a) middle-class (citizen)

Qualified Reference to Socio-Economic Status.

I am making an average income
I don't have much money

Other Categorical Referents (Unqualified) These would include such
referents as sports fan, athelete, lover, member,

hobbies.

I belong to the AFL-CIO
I am a server.
I am a stamp collector

Qualified Other Categorical Referents

I am interested in Sports I like music.
My hobbies are gardening I love dogs.
My hobbies are bowling I love German shepards.

I do serve a lot I'm interested in sports.

I'm an outdoor person

Other Consensual (unqualified) These would include such referents as:

person, individual, human being, myself, me, human,
homo sapien.

61 -

62 -

63 -

83

I am an individual
I am me

Qualified Other Consensual

 

I'm not much of a talker

I am law abiding

I try to better myself

There is no one like me

I keep my mouth shut

I'm a very conscientious person

am a mere individual

am my plain old self

am an average person

am a person brought into the world
to do something

am a person who lives in the future
am a friendly person

am a very nice person

I'm a very unimportant person

I'm an individual with my own beliefs
I am a busy person

To me I'm just myself

I am a person who forgives easily

I am a good person

I would rather keep this to myself

HHHH

P£P1P4

I'm a person who enjoys living

in the present

I am not the person I'd like to be

I am a happy person

I am a common person

I'm a good natured person

I am an individual who thinks for
himself.

I am a good guy

Domestic References (Unqualifiedl_ including such statements as I cook
good, I can food, I wash the car, etc.

am a provider

am a homeowner

am a breadwinner
own my own property
don't owe any debts
am a home body

have no car

pay my bills

am a tax payer
work in the yard

hihihihihihihihihihi

Qualified Domestic References

I don't like to cook

I keep my place up so I can be proud of it

My home is my haven

I love my home

I love canning

I love food

I am interested in my home
I like to sew

Around the house, I am actually lord and master.

6H

65

66

67

68

69

7O

71

84

OPEN

OPEN

OPEN

OPEN

OPEN

OPEN

References to Health (Unqualified)

 

I am healthy

Qualified References to Health

 

I am in good health
I am not in good health

*I am a sick person

It is important to help ill and
handicapped people

72 - Idiosyncratic

 

I help those in need
I'm not a hypocrite

J.

I
I

I am given the opportunity to choose

I'm sociable

I'm friendly

I am one who leads a life morally
and financially successful

I'm happy

am easily satisfied

have responsibilities

am not enthusiastic

am fair

have always been pretty happy

am different

am dissatisfied

am contented

am not too smart

I'm not too dumb

I am congenial

I am concerned about my position

I enjoy making other peOple
lead a fuller life

I like peOple

HHHHHHHHHH

have been beat out of some money

I

I
I

HHHHHHHHHHHHHHH

"Any time there is a reference to

health it is coded as 70 or 71
regardless of referents such as
person, begin, individual, man,
woman, etc.

have everything else

was lucky in every way

I'm not too much really

like to be friendly with every-
body

try to be nice to other people
try to treat everyone as I like
to be treated

like to see everyone happy

am concerned for my fellow man
have room to improve

abide by peace

like my comfort

am a small amoeba

am uncertain

am confused

am not very confident

am honest

am upright

am square

am creative in art

have an optimistic attitude

am urged to do things I can't and
I become frustrated

85

am honest

don't steal

am agreeable

can carry on

have things done on time

To be honest is always the first
thing that comes to mind

I have always been helpful

I go out of my way to help others

I believe that I cast an
influence for good

I am a very small ingredient of

a vast mixture

like to do things

cooperate with everyone

think

am a nervous wreck

am the same as everybody else

am interested in recreation

am good to animals

think good thoughts

have a clean mind

I'm morally clean

I think on clean and noble things

I do not live in the past

I am important to life

I am glad to be able to do what I
am doing

I'm glad tole able to be in this

apartment

HHHHH

HHHHHHHHH

get emotional about little things

have a temper

am stubborn

am not much of anybody right now

don't pattern after anyone else

I appreciate it

You are bringing me something

I feel humble and honored

Actually we are so insignificant
today

I think I have a place no one else

can get to which gives me strength

do not like to gossip

am being of service to others

like to be around other people

can carry on

have my own individuality

love helping other children

Iam a nobody

I am a free soul

I am trying to get along with
everybody

I love to take care of people

HHHHH

HHHHHH

 

86

Column Number: 13, 1“
Page Number: a
Question Number: 17

Item Description: Now, we have quite a different thing for you to do.
Although probably new to you, it is easy and I think
you will find it quite enjoyable. Every one we have
asked to do this has found it to be interesting. Now
let me tell you what we have in mind.

Ask this question oijourself. "Who am_I?” Think

of as many answers as you can in answer to the question,
"Who am I?"

In a moment I would like you to give me the answers
as if you were giving them to yourself, not to me or
anyone else. Take a little time to think about it.
Code for first statement (unit of thought).

Coder note: This item will hereafter be referred to as "Who am I?”.
There will be two columns for each unit of thought (see General Instructions).

 

 

 

 

 

 

 

 

 

 

Codes:
1 31.
10 - Age (unqualified)
ll - Qualified age
12 - Name
“~_ 13 - Address
__g, 14 - Other physical characteristics (unqualified)
___ _—I 15 - Qualified other physical characteristics
16 - Sex (unqualified)
I 17 - Qualified sex
18 -
19 -
20 - Kinship (unqualified and excluding marital terms)

 

21 - Qualified kinship (excluding marital terms)
22 - Marital terms (unqualified)

23 - Qualified marital terms

2“ - Reference to friends (unqualified)

25 - Qualified reference to friends

26 - Other (residual) primary terms (unqualified)
27 Qualified other (residual) primary terms

28

29
3O - Reference to community (unqualified)
31 Qualified reference to community

32 -

33
3M

35 -

 

 

 

 

 

 

 

 

Column Number:

Page Number:

Question Number:

Item Description:

 

 

 

 

 

 

 

 

 

 

 

 

l J

 

 

 

 

 

 

H!

 

 

 

 

 

 

 

 

1H1!!! 'Ill’

 

u

36
37
38
39
1+0
111
112
1+3
1111
1+5
116
1+7
1+8
119
50

51

52
53
54
55
56
57
58
59
60
61
62
63
7O
71
72
88
99

 

 

87

13, 111

17

"Who am I?"

Code for first statement (unit of thought)

Occupational title (unqualified)

Qualified occupational title

General references to work (unqualified)

Qualified general references to work

Formal organizations (unqualified)

Qualified formal organizations

References to education (unqualified)

Qualified references'to education

General religiosity references (qualified and unqualified)

Reference to nation, race, ethnic or language group
(unqualified)

Qualified reference to nation, race, ethnic or language
group

Reference to religion, religious organizations (unqualified)

Qualified reference to religion, religious organizations

Reference to politics, political organizations (unqualified)

Qualified reference to politics, political organizations

Reference to socio-economic status (unqualified)

Qualified reference to socio-economic status

Other categorical referents (unqualified)

Qualified other categorical referents

Other consensual (unqualified)

Qualified other consensual

Domestic references (unqualified)

Qualified domestic reference

References to health (unqualified)

Qualified references to health

Other Idiocyncratic (residual)

No statement to be coded in these columns

Refusal

APPENDIX C

150 TST Protocols

 

89

Section I

Protocols Scoring the Same Under All Three Conditions
N=1111

 

*Where no score is noted for a response it is understood to have
a perfectly reliable score (10)

9.6 Mean Score

 

 

I am a mother.
I am a wife.
I am a woman.
9 I am an artist.
9 I am a friendly woman.
10.0
I am a religious man.
I do the best I can.
10.0
I am a beautician.
I am a housewife.
I am a mother.
9.2

I'm a worker.

I'm a good mother.

I'm a faithful mother.

I am the breadwinner here.
6 I am a clean housekeeper.

Disgruntled man.
Lost confidence in life.
Ambitious for son.

5 Bitter at stupidity of gov't.

is:
I'm myself.
A housewife.
A mother.
I like fun.
9 I do the best I can, in all I do.
A cook.
10.0

——_

I am a person.
I am a mother.
I am a wife.
Teacher.
Companion.
Friend.

9O

 

9.3 Mean Score
I am a wife.
I am a mother.
7 I am a companion.
I am a teacher.
I am a nurse.
9 That's about all I can think of.
8.0
8—— I am a good family man.
5 I am law abiding.
I am very healthy.
9 No, I have scraped the bottom of the barrel now.
9.
8—_ I'm a business man.
I'm a public accountant.
8 I love tax work (income tax).
9 I'm a school board member.
9 I'm a school board member.
9 I love Sports.

I think I'm a good father -- (3 children).
Like to play golf.

8.5
9__ An honest man.
A religious man.
9 Member of Fraternity.
6 Family man.
10.0
_—h—I am me.

CD
0
U"

Iii.”

I53

LOCO

I am nothing.
I speak what I think.

Child of God.
Good husband.
Good father.
Good name.
Good friend.
Religious.

I

HHHH

HHHHH

am a man.
am honest.
work hard.
don't drink.
do unto others like I would have them do unto me.

am a citizen.
am an everyday person. .
am trying to make a decent home for my family.

am a father.
am a taxpayer.

91

10.0 Mean Score
Housewife.
Mother.

 

I am a wife.
A mother.
8 A home-maker.
It feels good to be important to those I have chosen to make a part of my
life.

I'm a mother.
I'm a wife.

7 I'm a clean person.
8 I'm a nutritious cook.
My church is important.
6 It's important to me to have recreation outside the home.
9 It is necessary for me to have my friends in at least once a week.
9 I help my children with homework.
Another important thing is to have a reliable baby sitter.
9.2
15- I'm a housewife 8 mother.
10
9 I'm a cook.
9 I'm a seamstress.
8 I enjoy reading.
9
9. I'm a small gear in a big machine.
8 I'm wrangler - (my nickname).
I'm free as heck.
8.8
___'A mother.
A wife.
8 Home maker.
Waitress.

6 Gemocratic. (sic)

I am an American.
I am a Negro.
I am healthy.

9 I am honest.

I guess the most important my honor.
Concern for others.
Well I guess answered anotherin general form.
Difficult to think of another one.
9 I'm someone who tries to set an example for the youth.

92

8.8 Mean Score

am an American.
am a father.

am a husband.

am a leader.

am well adjusted.
am a thinker.

 

 

LO
HHHHHH

9

10.0
Human being.
Negro.
Husband.
American.

8.2

(.0

am an individual.

an talented person.
am good housewife. ‘
am good dresser.

am good cook.

(.0
HHHHH

(DOW

am a mother.

am a wife.

am a grandmother.

am trying to help people.
am making best of my life and ability.

hihikihdki

am a human king.

am an individual.

am a creature of God.
am a husband.

am a father.

HHHHH

1010
Mother.
Individual.
Citizen.

*

I am a mother.
D.K.

1.11.2
Wife.
Mother.
Home.

10.0
I'm me.
A child of God.

9.5

93

Mean Score

 

 

LO
0
U1

10.

I am a housewife.
A homemaker.

A mother.

I am a Baptist.

I am a father.
A Husband.
Teacher.

A Friend.

A Scholar.

A Citizen.

A human Being.
I'm a man.

A son.
A brother.
A writer.
A camera bug.
am a Father.

am a Husband.

try to be a good Church worker.
think I am a good provider.

try to be a good citizen.

am a Grand Father.

Piklhihiklhi

_

LO

.6

An average human being.

2S

CCU

.K.
K

Just
I am
I am

I am
I do

9
9

a small cog in a big wheel.
not too important.

not ambitious.

a man.

my best.

10

.O

Steady worker.

8.2

Dependable.
Reliable.
Young.
Fortunate.

I am a salesman.

I am in the middle class bracket.

I am a resident of a very nice neighborhood.
Own my home.

I am satisfied with what money I'm making.

99

Mean Score

 

Getting along very well with majority of people I know.
If I do a job I do it to the best of my ability.

to

(.0
U1

LDLO

'0 CD
0
(A)

£0

03me

No.
No.

I am mother.

Wife.

Mother.
Competent.

Loved.

Country I love.
Wonderful future.

I am a man.

I do my best at my job.
I try to help people.
None.

A mother.

An American.

I'm a Presbeterian. (sic)
I am easy going.

I am friendly.

I like kids.

I like to be by myself.
I'm a thinker, not a doer.

95

Section II

Protocols Scoring Differently
When Qualification Effect* is Removed
N = 53

 

*Qualification effect is the effect caused by disagreement between coders as
to the interpretation of a reSponse in regard to qualification, modification
or evaluation y§_non-qualification, non-modification or non-evaluation.
Below is one example of what is actually happening when qualification effect
is removed:

ASSIGNED CATEGORIES UNDER REALISTIC CONDITIONS:

Response Coder Verbal
Number: G A B C D E Score ReSponse:

1. 1&8 N8 118 118 ® ® 8 Christian.
2. 22 22 22 22 22 22 10 Wife.

3. 2O 20 20 2o 20 2O 10 Mother.

1+. 72 58 72 @ 58 @ Leader.

5. 6O 60 Q9 60 60 60 9 Person.

6 . 60 60 at 60 60 60 9 Individual.
7. 1+0 1+0 140 1+0 110 ® 9 Artist.

8. no no 62 62 no 62 (E) Homemaker.
9

l2 l2 12 12 12 12 19_ Mrs. Edward E. Rockelt.
73
8.1 = mean score of protocol

ADJUSTED CATEGORIES UNDER HYPOTHESIZED CONDITIONS:

ReSponse Coder Verbal
Number G A B C D E Score ReSponse:

#8 #8 #8 ”8 98 48 10 Christian.
22 22 22 22 22 22 10 Wife.

20 2O 2O 2O 2O 2O 10 Mother.

72 58 72 ® 58 (9 Leader.

60 60 60 60 60 60 10 Person.

60 60 6O 60 60 60 10 Individual.

110 110 110 1+0 110 ® 9 Artist.

no no 62 62 1+0 62 @ Homemaker.

l2 12 12 l2 l2 12 19_ Mrs. Edward E. Rockelt.
77
8.6 = mean score of protocol

coco-q Oiuus'o>k>pu
0

Note that in reSponse 5. and 6. the qualification effect may be attributed
to one coder mistakenly identifying the reSponse as qualified or modified.
In reSponse l. the effect is due to a disagreement between coders as to
Whether the response is a "general religiosity reference" (category #8)

or a "reference to religion, religious organizations" (category 52). .The
disagreement probably hinges upon the capitalization of the word Christian.

See page #1 for a second example.

l!

96

The following discrete code categories were lumped together and con-
sidered to be equivalent categories for rescoring under the hypothetical
condition of overlooking qualification or evaluation of the response. In
all cases qualification and nonqualification categories have been lumped
together. In addition, general and Specific categories have also been lumped
together for the subjects of "occupation-work" (categories u0,ui,u2,u3)
and "religion-religious" (categories #8,52,53).

(10,11) (14,15) (16,17) (20,21) (22,23) (24,25) (26,27) (30,31) (no,ui,u2,
us) (uu,u5) (N6,u7) (M8,52,53) (50,51) (54,55) (56,57) (58,59) (60,61)
(62,63) (70,71)

#NR“ No Coding Effects Removed, QR = Qualification Effect Removed.

+No score notation, in QR denotes no change in score under this condition.
#NR #QR

.M 9.2 Mean Score

+ I'm dependable.
10 My work is of excellent quality.

a)

 

 

0|

6 10 I have a good business.
7 I make a good living for my family.
My family is happy.

8.7 9.0
17— — Mother.
10 Housewife.

8 Constant driver.

7 8 Help husband in newspaper work.
9 10 Good Christian, I hope, I try.
8

9

8.7 9.0
15——- —_—- Trucking contractor.
10 Interest's in family.

8 10 Father of married children.

8 Trying to do and make a success.

9 Not fulfilling ambition as liked.

7 Too much gone from income before personal use.

 

6.9 7.0

7 '__. I am supporter of my family.
10 A good father.
10 A good husband.

2 5 Give them an education.

3 A good helper.

8.0 10.0

9 10 I am a religious leader.

9 10 Being a parent.

6 10 Maintaining a home.

re

0
0
CD

I

p...
O

(I)
o
m

l

00
O
0')

°’ l

£0
01

8.8
10
10

8.6

10
IO

 

97

Mean Score

I am a usher of the church.
I am a grandmother.

I am a wife.

I am a day hand.

 

I'm a mother.

Good person.

Good neighbor.
Interior decorator.
Beautifier.

I am a husband and a father.
I am a provider of a family.
A citizen of a free country.
No.

I'm a father.

I'm a husband.

I'm a religious man.

I'm an office worker.

I'm a helper to other people - I try to get them out of
trouble.

I'm employed and I hate to have to work.

I'm a good church member.
I sing in the choir.
I get along fine with everyone.

Christian.

Wife.

Mother.

Leader.

Person.

Individual.

Artist.

Homemaker.

Mrs. Edward E. Rockelt.

I am a healthy person.
I am a Negro.
I am an honest person.

98

 

 

#913.
10.0 Mean Score
10 I am a Christian.
10 I am an honest Person.
I am a church goer.
I am a Negro.
£5.
I am a husband.
10 An officer in the Navy.
That would about do it.
No.
2e.
I'm Mrs. David Lambiotte.
10 I am Marcias Mother.
I am a woman.
I am an American.
I guess that is about all of me there is.
9.7
'__- I'm just an ordinary guy.
10 I'm an American, a Hoosier.
10
10 I'm a good barber 8 I'm not conceited.
10
10 I'm a Quaker, belong to Friends Church.
10
8 I'm a family man 8 a child of God.
10
21.6.
That I am a good mother.
10 That I keep my home up.
I have raised 5 fine children.
I am a happy person.
My health is good.
10 I am a grandmother.
8 I am a person that has the love of her family.
9.3
IO_' I am just a retired person.
Ive lived my life.
That's all.
9.8
_——' I am an individual.
I am a housewife.
I am very optimistic
I am a pretty happy-go-lucky person.
9 I am a foreigner.
I am not too hard to get along with.

(.0
O
O

OGDLOO

C) CD CD
0 [—1 |~--l o o
LDLDMQ (Doom-F; \ILDmmO

\)
O
0.)

H

F‘
U'lt—IOLDCDO

'IO'
10
10
10

99

Mean Score

 

I'm an average male.

Content being the head of the family.
I'm satisfied in type work I'm doing.
I'm a good husband.

I'm a good father.

I am a housewife.
I am a mother of five children.
I am a Christian.

I am free to do whatever I want.
I am happy.

We are financially solvent.

I am engaged to be married.

Breadwinner for 3 people.

I am a middle income bracket person.
That's all.

None whatsoever.

I am a retired professional man.

I was successful.

I am an individual.

I am a citizen of the United States.

I am a deep thinker.

My name, of course.
My health.
Love.

Tried to live right.
Nice.

Lived a Christian Life.
I am a mother.

I live alone.

Do my own housework.

I have been a good mother.
I have worked hard.

I try to help others.

Try to Family (loyal).

To my Church loyal.

100

Mean Score

 

10.0
10

10

00

 

I am a Church Deacon.

I raised 5 boys and u girls.

I can still do a good days work.
I goes to Church every Sunday.

H

am a father of 6 children.
am a husband.
I am a plumber.

H

A child of God.

A citizen of U.S.

A person with opportunities unequaled
An average American.

An average job.

An average family.

I live by the sweat of my brow.

I enjoy making a living.

I work for a living.

I get along with others.

I am pretty well satisfied.

I am old.

I've raised 2 boys.

I'm raising another.

Use to live on a farm.

Got my education the hard way.

Fathers.
Dishwasher.
Lawnmower.
Grandfather.
mason.

Good neighbor.

am a mother.

am a housewife.

have a lot of reSponsibility.
don't care for Sports.

work too hard in house.

don't work in yard.

I'm not lazy.

I love to sleep.

Pihihihihihi

except in U.S.

\1
O
O

101

 

I'm trying to live a Christian life.

I work outside of home.

I
I
I am a church worker.
I am active in community.
P

I'm a human being.

I'm able to carry my home.
I am proudin have children.

I am an individual.
An average person.
Try to be a Good Neighbor.

C00perative Person.

I am a helper of people.
I am head of a family.
I am a church member.

My name is Viola Gustafson.
I am raising 2 children.

I must make sure the children have enough

clothes on their back.

QB.
9.5 Mean Score
10
I'm honest.
I'm a mother.
Housewife.
I'm not lazy.
9.0
am a mother.
am a wife.
6 OTOA.
9.8
10 I'm able to work.
9
10
I am a citizen.
10.0
10
Cheerful person.
10 Good wife.
8.5
10
None.
8.2
9
7
8.7
8

Im an uncertain person.
Im a good mother.
But a very poor housekeeper.

to eat and

102

 

93
9.5 Mean Score
I guess I just live a Clean life.
10 I like my neighbors.
10 And my family.
I like my car.
9.8
'_—’ I am a good husband.
A good worker.
Agreeable.
Not quarrelsome.
Not in good health.
10 Hope a good neighbor.
9.6
_—_- I am a very contented person.
I am fortunate to have such a wonderful family.
10 I am not very tall.
I enjoy being myself.
I wouldn't want to change.
10.0
Glad to be a citizen of the U.S.
I'm a carpenter.
10 Happy to have opportunity to have a home 8 work.
8.8
_—— I'm a woman.
I'm a housewife.
I'm a mother.
I'm an individual.
I am part of a group.
10 I am a sister.
10 I am a daughter.
I am a consumer.
I am a homemaker.
I am a cook.
9.1
10_ I've kept my home together.
I've kept my job.
10 I've kept my children together.
I've kept my family as a family.
I've always kept my temper.
I was never frivolous with a dollar.
I gave my boy the best education you could get.
I have a good wife.

103

 

 

NR QR
§;0_ 8.8 Mean Score

10 Good mother.

10 Good wife.

5 9 Overseeing the home.
5 Good living women - all

.7 9.2
To“ "- A wife.

10 A mother.

10 A teacher.

9 10 Community leader.

7 9 Sportswoman.

6 Poor housekeeper.
9.2 9.8

9“" —- Idea of God.

9 10 Wife.

10 Painter.

9 10 Lover of Animals.

9 10 Lover of flowers.
9.0 10.0

10 Mother.

10 Worker.

10 Chinese.

6 10 Going to be a Grandmother.
8.8 9.0

ID_- -—_’ I'm a worker.

10 I'm an American.

9 10 I'm a music lover.
10 I'm a husband.

5 No.

9.u 9.8
I0_' -—_' A human bang.

10 I try to be a nice person.
10 Generous to a fault.
10 Helpful.

7 9

me and my husband.

6.7 7.0

com-cl

01!

There's nothing more important to me than whats

I live all alone.
I am an old man.

my life I've tried to do what is right.

between

109

Section III

Protocols Scoring Differently
When Accidental Effect* of Splitting is Removed
N = 17

*Accidental effect of Splitting Should not be confused with the main effect
of Splitting. If we were treating the main effect here, we would
arbitrarily remove or add code categories to the real judgements of non-
conforming coders. Thus, if four of six coders had Split a verbal response
into two or more codable reSponses, we would arbitrarily add some sort of
code categories to the judgements of the two nonconforming coders. Like-
wise, if four of six coders had not Split a verbal reSponse into two or
more codable responses, we would arbitrarily subtract code categories from
the judgements of the two non-conforming coders. Such a procedure was

not feasible, fer at least three reasons; (1) often the disagreement on
splitting is not modal, but rather an even Split, artifact of an even
number of coders; (2) There is little basis for deciding which of two code
categories should be removed from a coders judgement, unless it is done

on the basis of conforming to a modal category; and (3) There is even less
basis for deciding which of many code categories should be added to a
coders judgement, unless it is done on the basis of conforming to the single
category already assigned to the verbal reSponse, as when a single code of
"12" is changed to two codes of the category "12" by the same coder.

Therefore, removal of the accidental effect of Splitting does not
necessitate adding or subtracting or changing the categories of different
codes. The latter method was used when the qualification effect was
removed in Appendix C, SeCtion II. What is involved here is a rearrangement
of the existing categories in the code matrix, so that reliability may be
gauged upon a comparison of the categories assigned by all coders to the
same reSponse. We will, in effect, be Straightening out the reSponse rows
across the code matrix by leaving some blank gaps within the code matrix
where no Split was made by five or fewer coders, and, therefore, no category
was assigned by five or fewer coders. Following is an example of what
happens. (see page ”3 for another example)

ASSIGNED CATEGORY PLACEMENT UNDER REAL CONDITIONS:
(ReSponse Number not used to avoid confusion)

ReSponse Coder Verbal
Number. A B C D E Score Response:

0'! 17 17 17 m 17 8 Iamanoldwoman.
(E 25 25 25 £13 25 8 I've got lots of friends.

12;? 72 72 72 013 5 I like people.
72 72 72 72 72 72 10 D.K. I'm awfully dumb.

72 O72 72 720 33?"
7.8 = mean score of protocol

 

105

ADJUSTED CATEGORY PLACEMENT UNDER HYPOTHETICAL CONDITIONS:
(Response number refers to verbal response)

Coder Verbal

ReSponse
C D E Score Response:

Number G A B

1 CD % 8 _I am an old woman.
(I) .17 17 l7 l7 8
2 5 25 25 25 25 25 10 I've got lots of friends.
3 72 72 72 72 72 9 I like people.
72 72 é é 72 72 10 D.K. I'm awfully dumb.
8
_818 = mean score of protocol

This example is particularly helpful in showing how the rescoring
procedure to test this hypothesis can create a larger number of scores for
the protocol. Here we see that the N of scores increases from five to six.
The reason for this is that adjustment within the code matrix to straighten
response-rows necessitated eXpansion of the matrix. To be exact, two
coders, (A and E) did not Split any responses, two coders (G and D) split
only the first reSponse, and the remaining two coders (B and C) Split the
last reSponse only. If any of the six coders had, under the realistic con-
dition, Split both the first and last reSponse, there would have been six
codes for the realistic condition, and therefore, no difference in the N

of scores under the two differing conditions.
+NR = No coding effect removed, SR = Accidental effect of Splitting removed,

R# = reSponse number of verbal reSponse. For the NR scores it is not

useful to match verbal reSponse to scores, by number.
% = No score notation denotes no change in score.

5%: 113* 5.1:"

 

 

8.3 9.0 Mean Score

1 10 8 I'm a good mother.

2 3 8 That I am able to hold a position at my age.
9 5

3 9 10 I am an efficient worker.

a 9 10 I am a grandmother.

5 9 10 I am a religious person.

6 9 10 I am a good citizen.
7.2 8.6

1 —9_- -—_- Let's pass that up. I'm not very cooperative.
7 8

2 7 8 Just a teacher.

3 5 9 The community thinks we're just a servant.

u 8 9 So you can write your own criticism.

 

 

03M

0":

£114:wa

4:0)er O)

can)».

0‘4:

If:

\J
o
m

|

(bl—'Uimm
O

rateco
OO.
O

0.
L0

CD
0
w
(I)
0
U1

wonoococnmgppmp
00 0|

‘0 OWPLOI-Jtoo
- o o l.
a: o)

¢>F4e1~1a
<3c>c>c>Eq

CO
e
w

I

‘9CD1D1mld-oto
o1

10

106

Mean Score
I am an old woman.

I've got lots of friends.

I like people.
D.K. I'm awfully dumb.

A good mother.
Important to my husband.
A good housekeeper, I hope.

A good Church member.
Important to my community.

I am a good daughter.

I am a reasonably successful secretary.
I am a good and loyal friend.

I could be more active in community life,

but I have a busy schedule.
I am a little cog in the wheel.

I'm very happy person.
I'm healthy.

I'm young.

I have high hopes.

Hope to have a healthy 8 happy family.

Very lucky to have a wonderful husband.
Wonderful mother 8 father in law.

Wonderful mother and step-father.

am a widow.

have a great burden in life.

have always sacrificed my life for my family.
have raised my own children and now must raise

grandchildren.
I have lived a clean life above reproach.

HHHH

I'm a woman.
I'm a mother.

I hope I'm useful. I think I am useful.

I'm needed by others.
I help - or try to help others.

#0501:

tutor—a 01 MI—J 4': MP to

0301

HO)
O.
U"

0

\I (DNOJMLDl-‘m

O

C» mmmztomqqmtol

...a
O 0
CD

(I) @010)

l-‘i-J
000
\1

L0 coucoHHi—‘cn £00100
' COO-
00 co

(OI-'l-‘(D
00

L080

10
10

107

Mean Score

 

I'm an American citizen.

I believe in fairness.

I try to deal fair with everybody.

I can get credit anywhere I've been very fair.
I'm just a common-ordinary fellow.

I never had more than a country school education.
I like football, basketball.

I like to drink, smoke.

am a father 8 mother.

H

like women.

like to go to parties.

don't like onions.

don't like the Chicago Bears 8 L.A. Dodgers.

HHHH

I am an American citizen.

am a Husband.
am unemployed.

HH

Father.
Citizen, a damn good one too.

Good Blacksmith.
Good husband.
Good gardner.

A mother.
I am a middle class citizen.

I'm a Negro.
I'm a nurse.
I'm concerned about my position.

Very small ingredient of a vast mixture.

That Speck ingredient is as small as a grain of salt

Of itself in itself its insignificient. (sic)

But in the whole matter it can influence the taste
of the finished product.

Important to my family.

Important to my country.

dams-mm...

1‘: M

O)

m

108

 

 

 

NR §B_

L1 L7 MmSwm

9 I'm a good human being.

10 I enjoy life.

6 5 I appreciate many things.

9 Art,

5 7 Music,

6 8 People and,

5 6 Nature.

8.0 8.0

9——' -_—' I'm not what I'd like to be.

10 I'm not as intelligent as I'd like to be.

7 I don't persue educating myself to the degree I
feel I should.

5 I neglect myself physically - in appearance.

6 I think of myself as a person with tremendous
am't of love.

9 Emotional.

9 Concern for others 8 the world esp. children.

9

9

5 9 Someone who can't say no if it hurts others.

9 5 Sometimes I think I have a lot of knowledge

9 8 8 sometimes I think I'm a phoney.

8.6 8.u

10 I'm a relatively unimportant person who likes

9 7 to do things for others.

10 I think it's important to watch out for others
as well as ourselves.

7 10 I have a good sence (sic) of humor and that's

7 5 important.

109

Section IV

Protocols Scoring Differently When
Accidental Effect* of Splitting is Removed
And When Qualification Effect* is Removed
N = 37

 

*See Appendix C, Section III for an explanation of the rescoring process
in removing accidental effect of Splitting. See Appendix C, Section II
for an explanation of the rescoring process in removing qualification
effect. Following are examples which Should clarify the rescoring done
in this section:

ASSIGNED CATEGORY UNDER REAL CONDITIONS:
(Response number is meaningless)

Response Coder Verbal ,,
Number G A B C D E Score ReSponse:
@ 72 72 72 72 7 I am a guy with a lot of problems.
M3 #3 43 43 M3 9 I am concerned about finding a good job.

H

#7 72 M7 72 #3 #3 have reasonable experience but do not

#1 #6 91 46 have a college degree.

22 #0 22 #0 am ex claim insurance adjuster.
MO 22
6')

22 22 23 22 am single 8 I wish I was married.

23 62) 23 23 like people.
72 72 72 6)) 72 Enjoy working with people.
113 ® 03 113 (3) 1+3

I like work that is worth while.
72 72 72 #3 u #3 I like to do something that is
72 G) 72
ReSponse Coder Verbal

beneficial to other people.
Number G A B C D E Score Response:

HHH

m
010)“) \Iximq
O
01

= mean score of protocol.

am an American citizen.

am honest.

am good.

work every day.

stay home on the week ends.
go to church on Sunday.

H
O

50 50 50 50 50 50
50 50 72 50 72 72
72 72 72 72 72 72

H
lemme)
HHHHHH

.1:-
air»

.9 = mean score of protocol.

110

ADJUSTED CATEGORY UNDER HYPOTHESIZED CONDITION:
REMOVAL OF QUALIFICATION EFFECT
(ReSponse number meaningless)

Response Coder Verbal
Number G A B C E Score Response:

D
(ED 72 72 72 Q 72 7 I am a guy with a lot of problems.
43 43 43 43 75) 43 9 I am concerned about finding a good job.
47 72 47 72 43 43 @ I have reasonable experience but do not
46 46 46 5 have a college degree.
8 40 g 40 g 40 5 I am ex claim insurance adjuster.
22 22 22 22 22 9 I am Single 8 I wish I was married.
8 23 ® 23 23 23 8 I like people.
72 72 72 Q 72 7 ‘Enjoy working with people.
113 ® 113 113 as 113 7 I like work that is worth while.
72 72 72 43 43 43 I like to do something that is
72 43 (f§> 72 Egg beneficial to other people
9
76'
6.5 = mean score of protocol
Response Coder Verbal

Number G A B C D E Score ReSponse:

50 50 50 50 50 50 10 I am an American citizen.
50 50 72 50 72 72 G I am honest.
72 72 72 72 72 72 10 I am good.
72 72 113 72 113 113 @ I work every day.
43 43 ® 43 43 7 I stay home on the week ends.
.@ 53 ® 53 53 4 I go to church on Sunday.
53 53 53 Q
49
7.0 = mean score of protocol

When qualification effect is removed, there seems to be disagreement
between coders on the first protocol whether (1) "do not" in reSponse .
three is a qualifier, (2) "ex" in reSponse four is a qualifier, (3) "wish"
in response five is a qualifier, (4) ”worth while" in response eight 13 a
qualifier. In the second protocol there seems to be disagreement between
coders on whether (1) "every day" in reSponse four is a qualifier, and
(2) "on Sunday" in response six is a qualifier.

 

 

111

ADJUSTED CATEGORY PLACEMENT UNDER HYPOTHESIZED CONDITION:
REMOVAL OF ACCIDENTAL EFFECT OF SPLITTING
(Response number refers to verbal reSponse)

Response Coder Verbal
Number G A B C D E Response:

U)
0
O
3

 

l. (:3) 9 I am a guy with a lot of problems.
@ 72 72 72 72 72 9
2 43 43 43 43 43 43 10 I am concerned about finding a good job.
3. 72 72 43 43 ‘8 I have reasonable experience but do not
47 46 47 46 46 47 ‘3 Have college degree.
4 41 4O 41 4O 40 41 ‘3 I am ex claim insurance adjuster.
5 22 22 22 22 22 22 10 I am single 8 I wish I was married.
® 23 23 23 23. 8
6. 72 72 72 72 a) 72 9 I like people.
7 113 ® 113 113 113 8 Enjoy working with people.
8 43 43 43 4 43 9 I like work that is worth while.
9. 72 72 72 ® (1% 72 _7_ I like to do something that is
94 beneficial to other people
7.8 = mean score of protocol
Response Coder Verbal

Number' G A B C D E Score ReSponse:

l. 50 50 50 50 50 50 10 I am an American citizen.

50 50 50 ©

72 72 72 72 72 72 10
72 72 72 72 72 10
3

72
@11 11311311
999999

I am honest.

I am good.

I work every day.

I stay home on the week ends.
I go to church on Sunday.

070')th

.4 = mean score of protocol

When accidental effect of Splitting is removed, we can see that there
is, in the first protocol, disagreement between coders on whether to Split
(1) "I am a guy with a lot of problems" into the two reSponseS, "I am a guy,
and I have a lot of problems", and (2) "I have reasonable experience but
do not have college degree" into two clauses; but there is no disagreement
that "I am Single 8 I wish I was married." should be Split into two clauses.
In the second protocol there is only disagreement on whether :0 Split the
response "I am an American citizen." into the two responses, I am an
American, and I am a citizen." However, this disagreement is so near the
beginning of the protocol that it creates a good deal of acc1dental effect.

In the fellowing material, NR = no coding effects remoyed, QR =
qualification effect removed, SR = accidental effect of Splitting removed.
Notice that none of these results amounts to the removal of both qualifi-
cation effect and accidental effect of Splitting in the same operation.

R# = reSponse number, which refers to the verbal response and can be a1
meaningfully linked to the score only in the condition in which accident

effect of Splitting is removed (SR).

112

+When there is no score notation in column QR or SR, it denotes that no
score occurred due to the rescoring.

 

 

I am dependable and steady.

I try to do the best for my community and country.

I do my job as well as I can.
I want my wife and children to respect-—

one so I do try to have high standards of behavior.

I have a lot of duties to perform.
I like social life - lodge work.

I am a guy with a lot of problems.

I am concerned about finding a good job.

I have reasonable experience but do not have
college degree.

I am ex claim insurance adjuster.

I am single 8 I wish I was married.

Enjoy working with people.
I like work that is worth while.
I like to do something that is beneficial to other

I am an American citizen.

work every day.
stay home on the week ends.
go to church on Sunday.

I am most important to my family.
I hold a place in my church, I feel important.

I love my neighbors.

change in
59.189.13.93.
6.3 7.2 7.7 Mean Score
l 10 + +
9
2 5 9 8 I'm a family man.
3 7 9
5 4
4 7 10
5 2 5 5
4 5 5
6 8 9
4.8 6.2 5.2
l 5
2 5
l 4
3 5 9 My family.
4 8 9 Nothing else.
5.5 6.5 7.9
1—7—"T
9
2 3 10
3 2 5 3
2 5 6
4 7 9 6
5 5 8 10
7 8
6 7 9 I like people.
7 6 8
8 2 3 9
9 9 8
people.
5.9 7. 7.4
170——
6
2 10 I am honest.
3 5 6 10 I am good.
4 4 7 8 I
5 l 4 O I
6 5 6 8 I
5.0 6. 6.6
1—9—‘_
2 8 10
4 5 5
3 2 7 6
4 2 5 I'm too much fat.

[—4 \lmUl-FCO M

030'!sz

c'oahaha

 

 

 

 

113
1.1.13. 9.13. SR
6;9_ Z;Z_ 7.1 Mean Score
8 Outside of just an ordinary home maker.
10 Mother.
9 And citizen.
2 5 I try to work in school affairs.
6 10 And a few fraternal organizations.
9 Well, when I was working in the biz (sic) world
I felt--
5 6 I should try to be sincere 8 make myself valuable——
4 9
9 5 8 not loaf on the job.
7.7 8.6 8.3 g
—9_- -__' -——- I am a chauffer, bookkeeper (sic), laundress, E‘-
9 cook, housekeeper. 1
9 1;
9 ...
5 9 6 I am just a wife and mother. i i
5 9 6
7 9 Don't know any more answers.
7 9 Just don't know.
6.8 7.5 8.5
9 10 A human being as everyone.
6 I am an honest man who never went to jail.
7
6 10 I know how to work.
2 7 8 I am not lazy.
8 I like my neighbors.
8 10 I think I am a friend to everyone.
8 10 I like to work.
8.3 9.0 8.7
_5_- _IT_ -___ I am a good provider for my family.
10 9
7 9 10 I am a father.
9 8 I am a christian (sic).
9 I like people.
9 10 I am a Negro.
9 10 I am a man.
7.5 7.8 7.0
—5_- —9- —__' I'm just me.
8 5 Absolutely.
8 9 Refused - even—-
9 after probing.

 

N)

014-?

 

114

Mean Score

 

 

 

 

 

15989.1:
6.3 6.8 6.8
10
10

7 9

7 9
1 4 10
7 1
l 10
2 4
9 2
9 5
4.5 5.2 8.2
6 10

6 10
2 9
2 4
5 10
6 10
8.7 9.0 9.3
8 10 9
9

9 10
8 9
9 9
9 10
6.5 7.0 8.3
9

9

2 5 10
5

5 8
9
5.6 6.2 7.5
8

8

4 7 9
l 4 9
4 9
1 4
‘4 10
8 4
9 8
9 6

Your attitude.
The way I carry myself thru life, my job, etc.

I love my family 8 want to provide for them
any way I can.

I'm a Negro and proud of it because its Gods (sic)
intention.

Refusal.

I am an individual common everyday person.

Try to live up to the laws of America 8 pay
my bills.

Try to bring up my family 8 get as meddui (sic)
as you I can.

I am a mother 8 happy to be one.

I like people.
D.K. -- I am a wife.

I can't think of anything more.

I'm a helper to my family.

I like people.
I'm a working mother.

I'm really not much.

I'm just a poor working gal.

I'm just an ordinary person.
I go to work, come home attend church.

I like to travel.
I enjoy reading when I have the time.

(JON

0.)

\ICDUI-FODN

M

U‘l-F’

Z
70

01 01
° (DUIUTOOMG) 0
O) M

IQCVQ-Sl’l-‘QQI

\l
(I)

GO

«compact—11a

\1
M

..l

Q-PCDIDMOI—‘F'

(D
O

\JHQQQOO
O

(D
O
(.0

I8

(OLDKDCD

115

Mean Score

 

First of all a mother, and still needed by
family 8 husband.

I'm a companion to my husband.
I try to be a good homemaker.
Nothing else. '

If I was well I would like to work.

I'm retired was 2 weeks in hOSpital have
ulserated (sic) foot.

If nobody bothers me I'm O.K.
I don't like to nag or fight.

I'm a farmer.
A father.
I'm a boater - sailor when I have time.

I'm a director of many organizations.
I'm a bridge player.
But I work late most nights.

I have 29 grandchildren so a Grandmother.

And mother.
A wife.
And the official telephone answerer here.

I'm quite active in church.
And I love music.
And sing in the choir.

I'm very congenial I think.
I'm understanding.

I'm very unselfish I think family comes before me.

I always try to look out for my fellow man.
I like to apply the Golden Rule.
I try to do things other people ask me to do.

01-5300

m>ka

4:0.)

 

EB. Q_R if:
§_:_l_ 6.3 6.11
l T '9—
1

7 8
7 1
9 8
9

9

g_:_l_ 9.3 9.3
9 '15— ‘—
9

9

10 9
9

9 10
9 10
6.8 7.2 7.5
"'5" T —
5 6 8
9 10
8 7
7.2 7.7 8.5
"10 —‘ ‘—
5 9

8

5 9
7 10
8 9
4.8 5.6 7.7
T _‘ ‘5—
6 8
6 10
5 10
l 11 5
2 9
11 8
8 5
- 7
- 10
9.0 9.3 9.5
10

8 10

9

9 10
9 10
9 10

 

116

Mean Score

I am a person who would like to become head
of a large business firm.

I am a person who believes a happy marriage
is the basis for success.

I am a person who believes one Should have a
Supreme Being in which to believe.

I like to enjoy life.

 

I am a citizen of the United States.
I am a good understanding honest woman.

I am religious.
I am respectable.

 

An old woman.

I live alone.
I like to be alone.

I'm a believer of God.
A wife that loves her family.

I give untireing (sic) of my service to others.

Good American.
Don't know.

I am responsible for my family's health 8 well-
being.

I am a mother.

I am a wife.

I am able to sew and knit for my family.

I am said to be a good shopper as I know prices.

I participate in Sports with other women.

I am a man.
A Christian man.

A husband.
A farmer.
Father.

 

81;. 1':
8.2

1 “1‘0—
2 10
3 10
11 10
5 10
7

6 7
7 2
8 8
7

9

9 9
7.7

l "9
9

2 10
3 7
2

11 9
8.0

l "‘9’
2 9
3 9
11 9
5 7
7

6 9
7 9
8 7
9 5
lo -
8.5

1 —8-
2 10
10

3 6
8.5

l .10
5

U'I-F'OJM
}_1
O

10
10

10

 

117

Mean Score

 

am a happy mother.

am a good wife.

am helpful to other people.

am an eager student.

am an eager outdoor Sports woman.

Pihikihiki

H

am a good houseekeeper.

enjoy gardening.

I am a good needleworker, sewing and knitting and
crocheting.

H

I enjoy the arts, (painting in oils).

Old every day worker.

I have a family.
Father and Husband.

No.

am nobody.

intend to be somebody.

would like to make other people happy.

am a person of reSponsibility.

would like to have a large family about
12 kids.

am sure of myself.

am confident with respect to work.

have reSpect from my friends.

feel a little unsettled as far as my family
is concerned.

I have a friendly relationship with neighbors.

HHHHH

Pihihihi

am another individual.
I have certain rights and opportunities.

H

I have been fortunate in life.

I'm just me - a person.

I'm satisfied with my way of life.
I get along ok.

D.K.

D.K.

 

 

«Cle-J

C501

wlvra 01 ; wlo1a

Q0301;

R51.

014:

 

118

Mean Score

 

 

 

111191398
7.1 7.7 7.7
7‘6““—
10
5 6
7 9
9
5 8
7 8
6.4 7.7 7.2
‘IO_—-
10
8 10
2 9
7
1 9
7 10
- 2
7.9 8.4 8.6
10
8 10
9 10
4 5 9
5 7
9 8
9
9
6.7 6.8 7.5
10
9
7 8
5 8
5 9
4 5 1
8.6 9.9 8.7
"1‘0""—
10
9
7
5 8
10
9 10

I am the man of the house.

I made a change for better instead of worse.
I try to provide for my family.

I like to help my neighbors 8 friends.

H

like to help myself last.
am a Christian.

H

am a housewife.

am a mother.

am a part-time worker.

am a baby sitter sometimes for friends or
relatives.

HHHH

I am married.
N. F. I.

I'm Bernard Perry.
I'm Father of 5 children.
I'm a married man.

I'm average fellow.

Right now I've got a h--- of a cold.

Had Pneumonia last year 8--

I'm afraid of this cough getting to be
Pneumonia again.

Honesty, Truthful, Sincere.

Love for home 8 country.

My occupation.

I try to be a good mother.

I feel I am a good wife.

Fianancially1(sic), I like to feel I help my
husband.

I love helping other children. We ran a--
foster home for 5 years in New York State.

I believe when I work and am hired to work I--
do the best of my ability. In other words, I--

don't Shirk my job.

Illll1111111111111111111111111111Ill

 

31293101714