A RELIABILITY TEST. OF THE TWENTY STATEMENTS TEST". Thesis for the Doom of M. A. MICHIGAN STATE UNIVERSITY Carole Overmier Befiinghaus I966 ABSTRACT A RELIABILITY TEST OF THE TWENTY STATEMENTS TEST by Carole Overmier Bettinghaus The Twenty Statements Test (TST) is an instrument designed to access self-identification in a manner consistent with the "self" theories of George Herbert Mead. As such, its utility is limited to research which approaches the study of social psychology from the vantage point of sociology. To administer the TST, the researcher asks a respondent to: Write twenty answers to the simple question "Who am I?" in the (twenty numbered) blanks. Just give twenty different answers to this question. Answer as if you were giving the answers to yourself, not to somebody else. Write the answers in the order that they come to you. Don't worry about logic or "importance". Go along fairly fast, for time is limited. This open-ended question acts as a stimulus for a set of reSponses from each reSpondent. Each set of responses is called a protocol. Every protocol is content analyzed on the basis of a coded set of discrete cate- gories. The present study attempts to measure the reliability among six coders in the task of coding 150 such protocols. The 150 protocols were sampled randomly from a population of 1,528 protocols. The 1,528 protocols were obtained from reSpondents in a modified area probability sample of the United States general public (outside of institutions), age 21 or older. Carole Overmier Bettinghaus The code applied to the protocols contained MB discrete categories, and the coding task was done under independent conditions. Scores of reliability (range: from 0 to 10) were obtained for each reSponse and each protocol. These reliability scores were based upon a combined scale of consistency (range: from 1 to 6), and agreements (range: from O to 15). The basic hypothesis was: ”there will be a high level of reliability among the six coders in the coding of the TST protocols, and responses." ”High level of reliability" was defined as (l) 75 percent of all protocols will have a mean score of eight or better, and (2) 75 percent of all responses will have a score of eight or better, or the mean score of all responses will be eight or better. Results obtained were: (1) 72 percent of all protocols had a mean score of eight or better, (2) 72 percent of 858 responses had a score of eight or better, and the mean score of all reSponses was 8.19. A set of sub-hypotheses was also tested. Sub-hypothesis one was: "when the difference between consensual reSponses and evaluative-con- sensual responses is overlooked, reliability is eXpected to appear as , greater than under realistic conditions.” Sub-hypothesis two was: "when differences between coders in the decision of whether to divide a statement into more than one statement for coding purposes are adjusted so that only the immediate statement is affected rather than all subsequent statements in the protocol, reliability is expected to appear as greater than under realistic conditions." Both of these hypotheses were supported. The suggestion is made that use of coders with training in grammar would contribute to an increase in inter-coder reliability. Carole Overmier Bettinghaus Sub—hypothesis three was: "the more statements made on a TST protocol the lower will be the mean reliability score for that protocol." This hypothesis was supported. The conclusion was that reliability will be enhanced when fewer than 20 reSponses are sought. Sub—hypothesis four was: "as content categories increase in com- plexity, reliability will lessen." This hypothesis was not SUpported. Subrhypothesis five was: "there will be no relationship in the pattern of coder agreements which contribute to unreliability." This hypothesis was used to operationalize the assumption made in the study that the coders were independent. The test of this hypothesis may be thought of as a validity check on the reliability study. Under one condition the hypothesis was rejected. This suggested that enhanced independence of coders in future studies may be eXpected to lower measured reliability. Finally, some potentially useful forms of the TST are suggested: (1) structured and standardized with factor loadings for individual items, (2) a Q-sort form for self-styling by the reSpondent, and (3) unchanged, but with provision for self-coding by the respondent. A RELIABILITY TEST OF'THE TWENTY STATEMENTS TEST By Carole Overmier Bettinghaus A THESIS Submitted to Michigan State University in partial fulfillment of the requirements for the degree of MASTER OF ARTS Department of Sociology 1966 ACKNOWLEDGMENTS I wish to express my appreciation to my advisor, Dr. Hideya Kumata, who guided me through the report writing stage of this research; and to the other members of my committee, Dr. Frederick Waisanen, and Dr. Mason Miller. During Dr. Kumata's absence from the country, Dr. Robert Stewart served as my advisor, and it is he to whom I am grateful for aiding me in the formative stages of the research. The data used to check inter-coder reliability in this study, as well as the coding of the Gallup coder, were graciously made available to me by Dr. Charles Loomis, Dr. Kumata, Dr. Stewart, and Dr. Waisanen. I wish to thank them for this consideration. The four individuals, other than myself, who recoded the data were Clark McPhail, Dr. Stewart, Charles Tucker and Dr. Erwin Bettinghaus. I thank them for conscientious attention to detail in a time-consuming, somewhat non-stimulating task. Any aSpersions cast on their grammatical competence in Chapter IV, Conclusions, should be discounted. Grammar is not their recognized area of competence or training; and one would not expect those in the role of social scientist to exhibit the level of grammatical soPhistication needed here. My sincere thanks go to Mrs. Shirley Sherman, who typed the final manuscript for this thesis, and was a great help with many formal details. Finally, I wish to thank all the individuals, within my acquaintance, whose consistent expectations have been that this research and report either should be completed, or had already been completed. This "generalized other" has had great influence on the personal perseverence which I brought to the task. CHAPTER II III IV REFEREN TABLE OF CONTENTS BACKGROUND 0 O O O O O O O O O O O O O O 0 Introduction 0 o o o o o o o o o o o o o 0 Relation of a Self Identification Instrument to Meadian Theory . . . . . . . . . . . . . Review of Types of Self Identification Instruments Historical Development of the Twenty Statements Test . . Review of Previous Investigations Relevant S ummary O O O O O O C O O O O O O O 0 O O PROCEDURE 0 O O O O O O O O 0 O O O O O O The Experimental Context of this Study . . The Coders and Data Collection Procedure . The Hypotheses and Scoring Procedure . . . Physical Analysis Procedure . . . . . . . Data Analysis Procedure . . . . . . . . . Statistical Analysis Procedure . . . . . . RESULrl‘S O O O O O O O O O O O O O O O O 0 General . . . . . . . . . . . . . . . . . When the Effect of Qualifiers is Removed . When the Accidental Effect of Splits is Removed Factors Present in Low Reliability . . . . to this Study 0 O O O O O 0 Reliability Related to Complexity Level of Code Category Reliability Related to Number of Statements in Protocol Results of Testing for Independence of Coders COIJCLUSIOI‘IS o o o o o o o o o o o o o o The Basic Hypothesis of Reliability . . . Knowledge of Grammar Related to Coder Reliability Independence Related to Coder Reliability Factors in the Present Code which Limit Reliability Length of the TST as a Factor in Reliability . . Future Utility of the Twenty Statements Test . . . . . . CBS 0 O O O O O O O O O O O O O O O O O 0 iii Page 2” 25 27 35 36 H2 H3 H3 H7 48 H8 ”9 52 52 57 57 59 60 61 6M 6M 66 TABLE II III IV IVA IVB VI VII VIII LIST OF TABLES Distinctions Between Categories for McPartland's Plan . . . Growth and DeveIOpment of Code Plans for the TST SIP . Reliability Score - Based on a Scale of Agreement-Consistency Mean Reliability Score of 150 TST Protocols by N and % for Three Conditions Mean Scores of Eight or Over . . . . . . . . . . Mean Scores of Less than Eight . . . . . . . . . Reliability Scores for all TST ReSponses by N and % for Three Conditions . Code Categories by Frequency of Use for Nine Coded Least Reliably Modal Complexity Score of ReSponses by N and 0 Reliability for Two Conditions . . . . . . . X2 Values for Coder Pair Agreements Under Two Conditions iv Responses Page 15 18 28 H3 ”3 #4 HS 50 51 54 LIST OF FIGURES Page Figure 1 Model of Coder Relationships . . . . . . . 55 APPENDIX A B C Section Section II LIST OF APPENDICES Page Form of the TST in the Interview Schedule . . . 69 T8 T COde Bock I O 0 C O O O O O I O O O O O O O 7 l 150 TST Protocols . . . . . . . . . . . . . . . 88 Protocols Scoring the Same Under All Three Conditions, N = nu . . . . . . . . . . . . 89 Protocols Scoring Differently When Qualification Effect islkmoved, N = 53 . . . 95 Section III Protocols Scoring Differently When Accidental Section IV Effect of Splitting is Removed, N = 17 . . . ion Protocols Scoring Differently When Accidental Effect of Splitting is Removed, and When Qualification Effect is Removed, N = 37 . . 109 vi CHAPTER I BACKGROUND Introduction Any open-ended item in an interview schedule depends upon its code, rather than its form, for reliability. Therefore, this study is designed to test a comparatively complicated, formal code system for the content analysis of responses to a fairly simple, informal question, "Who am I?", asked as if the respondent were soliloquizing (16, p. 69). The test is one of reliability. Reliability is operationalized as very good agreement between six individual coders in the task of coding identical sets of re8ponses to the above question. Following eXplanations shall refer to each set of re5ponses as a protocol, and to the question as one form of a self identification problem or §I3_(23, p. 7a). This Specific SIP is known as the Twenty Statements Test, or TST, Thus, a TST-SIP protocol is under- stood to be a set of reSponses made by an individual resPondent to the question "Who am I?" At times the six coders may be referred to in pragmatically equivalent terms as eXperimental subjects. A discussion of the high level of reliability in Chapter II will deal with the concise operational definition of the above phrase "very good agreement". In this chapter, I will (1) discuss the need for a SIP which creates minimum bias in operationalizing Meadian self-theory within any experimental design context, (2) review the characteristics of several types of SIP, (3) discuss the history of the TST-SIP which asks a direct question, while creating a non-directive bias, and (u) review previous investigations into the reliability of content analyses. Relation of Self Identification Instrument to Meadian Theory From the standpoint of a researcher in behavioral science, any verbalization designed to elicit communication from a reSpondent is a tool of the trade, or an instrument. From the vantage point of an individual respondent such a verbalization creates or increases the imbalance between work required and work completed. That is, for the respondent, it creates a task or problem to deal with. Since it is the same verbalization from different standpoints, we will adept the practice of using the term "self identification instrument" as interchangeable with SIP. A SIP is, then, an instrument for eliciting communication from a respondent about the concept of self; Specifically, what objects he identifies or associates self with_as well as what objects he identifies or defines self as: The concept of self is central to the self—role theories of George Herbert Mead. He held that the self mediated between society and the individual and allowed the development of the mind. The self was composed of two phases, the "I", and the "Me". The "I" is the impulsive tendency of the individual which begins social acts and gives prepulsion to them. The ”Me" comprises the incorporation of other individuals within each individual. It gives direction to each social act, and is the final state of a social act. The "Me" or object phase of the self is formed through the process of role-taking (26, p. 15). Kuhn tells us that Mead saw the self as carrying on an internal conversation between the "I" and the "Me” (14, p. 629). Meltzer interprets Mead's belief that ”the ability (of the "I" phase of self) to act toward oneself (in the ”Me" phase) makes possible an inner experience which need not reach overt expression ... which constitutes mipd," (26, p. 18) Others, especially Charles Horton Cooley, a contemporary of Mead, and Gordon Allport, a later reviver of Meadian theory, have added their contributions1o self-theory (in, p. 629). Cooley characterizes self as feeling or primitive emotion and claims that the feeling of self "is, perhaps, to be thought of as a more general instinct, of which anger, (fear, grief, etc.) etc. are differentiated forms...” (6, p. 171). Allport, Speaking of the study of personality says, ...all psychological functions commonly ascribed to a self or ego, must be admitted as data in the scientific study of personality. These functions are not, however, co-extensive with personality as a whole. They are rather the Special aSpects of personality that have to do with warmth, with unity, with a sense of personal importance...I have called them "propriate" functions. If the reader prefers, he may call them self-functions, and in this sense self may be said to be a neCessary psychological concept (1, p. 55). Of social psychology Allport says, Scholars interested in culture and personality deal primarily with the function of ego-extension, for their task is to account for the process of socialization (l, p. 57). Of the "prepriate" function of ego-extension he has this to say, Soon the process of learning brings with it a high regard for possessions, for loved objects and later for ideal causes and loyalties. We are Speaking here of whatever objects a person calls "mine." They must at the same time be objects of importance, for sometimes our sense of "having" has no affective tone and hence no place in the prOprium. As we grow older we identify with groups, neighborhood, and nation as well as with possessions, clothes, home (1, pp. #1 ff.). However, Allport evinces a view contrasting to Mead's, that the self directs and controls an individual's behavior, gives rise to the mind, and is necessarily covert (26, pp. 11 8 18). Allport contends, What is unnecessary and inadmissable [as data in the scientific study of personality] is a self (or soul) that is said to perform acts, to solve problems, to steer conduct, in a trans-psychological manner, inaccessible to psychological analysis (1, p. 55). In addition to the views of self put forth by Allport and Cooley, several changes have crept into Meadian theory through interpretation. The self has come to be thought of not as the locus for an internal conversation, nor as an entity carrying on a conversation with itself and within itself, but, rather, as embodied in that conversation; or carried to an extreme form, the self is thought of merely as a re- flected image of significant or generalized others (18, p. 629). Kuhn, in pointing to this trend on the part of others (1n, pp. 156 S 198) (11, p. 316), makes no mention of his own like tendency in the development of the TST, the instrument dealt with in this study. The TST, is purported to elicit communication from an individual about his self. In operation it uses a cue verbalization of a form approximating what one might imagine the ”I" asking of "Me." (Who am I? - as if I were asking and answering myself, with no other involved.) This elicits communication of a "stream—of—consciousness" nature, directed toward the concept "I". Now Kuhn 8 McPartland found that reSponses to this question could be divided into two gross categories which seem to fit the needs of Meadian theory (1) consensual responses which ”refer to groups and classes whose limits and conditions of membership are matters of common knowledge, (16, p. 69) " and (2) subconsensual reSponses which "refer to groups, classes, attributes, traits or any other matters which would require interpretation by the reSpondent to be precise or to place him relative to other people." (16, p. 69) It seems to me that the former category might be characterized as statements one could expect to be made by the "Me," or internalized other phase of the self, while the latter category might be characterized as statements one could expect to be made by the "I," in disagreeing retort to the "Me," or in Simple interjection. Indeed the finding that ”respondents tended to exhaust all of the consensual references they would make before they made any subconsensual ones," (16, p. 70) points to an order in which the ”Me" reSponds to the one question with consensual responses, followed by a rejoinder from the "I" with subconsensual statements. Perhaps Waisanen also viewed subconsensual reSponses as related to the "I," for having accounted for some sub-consensual reSponses as evaluative consensual (reSponses of a consensual nature, but which included modifiers) he termed all remaining sub-consensual responses ”idiosyncratic."* Dictionary definitions of idiosyncracy characterize it as a queer, peculiar, unusual or individual way of acting or expressing oneself (34, p. 310) (2, p. 599), while Mead characterizes the "I" as saying "something that was novel to himself." (25, p. 197) Now it is apparent that the data elicited by the TST is data from the internal conversation which Mead refers to, rather than data §This category system is attributed to Waisanen by Dyer (10, p. 39) from the self. Furthermore, because the subsequent analysis of data obtained using the TST has largely neglected the sub—consensual responses, the "I" portion of the data has lain dormant. In essence then, Kuhn has, with good intention, created an instrument which, effectively, treats the self only as a reflected image of Significant or generalized others. It is indeed very likely, that what Allport saw as a self inadmissable to scientific study was as much a matter of convenience as anything. I see no useful way of getting at data which is internalized telow the level of language. However, more rigorous treatment of the TST'S subconsensual data may bring greater insight into the nature of the "I" phase of the self. As we shall see shortly, there are a great many instruments designed as SIP'S. Most of them have been used to measure the self in the manner of the psychologically oriented social psychologist, as a concept deduced from eXperimental data, rather than a self in the manner of the Sociologically oriented social psychologist, as a concept induced from Meadian theory. (1Q, p. 629) The very fact that so many self identification instruments are extant, documents a great need for an adequate SIP. The TST with its direct approach to the internal conversation of gestures comes closer than perhaps any other current instrument to allowing operationism of Meadian theory. Furthermore, the direct approach to data overcomes the bias of inferential research which attempts to deduce the nature of the self from non-verbal behavior or from tangential verbal behavior. (16, p. 68) Kuhn also strove to minimize the effect of directive or structural bias. (12, p. 283) There is evidence that even when the TST is administered in a structured context respondents are not led to make reSponses reflecting that context. (10, p. 22) However, the TST entails other biases; (l) a self-selection bias on the part of those reSpondents who choose to (a) respond volubly or (b) reSpond minimally, and (2) a post factum structural bias, dependent upon what form of content—analytic code the data is forced into. The former bias produces non-equivalent numbers of reSponses between protocols, making any rigorous statistical treatment impossible. However, if the TST incorporated any manner of intensity measurement, which it does not, this lack of equivalence would be lessserious. The TST is an SIP designed to overcome experimental bias, and as such it has sacrificed a great deal of experimental utility. Review of types of self-identification instruments Wylie (35) has reviewed self-identification instruments at great length, including measures of both phenomenological and non-phenomenological self. The latter type of measurement includes the TAT and Rorschach. Here we are mainly concerned with reviewing these instruments which measure phenomenological self, however. Wylie divides these instruments into three general types, (1) Q-Sorts, (2) rating scales, questionnaires, adjective check lists, and (3) coding plans for interview materials. Although she does not honor the TST with inclusion in her list of measurements, it could.be placed legically in the third class. This omission is undoubtedly due to Wylie's psychological rather than sociological orientation. Her list (35, pp. 61-6u) includes 23 different Q-sorts, of which she deals most extensively with the Q-sorts of (a) Butler and Haigh and (b) Hilden. Wylie, writing in 1961 found that the second major category of instrument accounted for the most use. However, it was this author's experience, in surveying dissertations (l954-present) which dealt with "self" in a sociological orientation to social psychology, that Q-sort technique was used in a majority of cases. Wylie lists (35, pp. 87-97) 83 instruments of the type gathered together under the heading "rating scales, questionnaires, and adjective check lists." For only one-third of these is reliability information available and it is usually of the Split-half coefficient type. In addition, Wylie lists seven coding plans for interview materials. (35, p. 107-110) Although the TST is essentially a coding plan type, it differs from all those listed by Wylie in that it is used to code data from a pen and pencil or oral interview which could be administered in either an experimental or field setting. The seven code plans listed by Wylie are for coding material sampled from a pepulation of clinical clients. Reliability between judges is reported in six of the seven code plans. Raimy (28) reported, "three out of four judges coded 356 client responses with percentages of agreement ranging from 50.5 percent for 'ambivalent' to 81 percent for informational questions. Bugental (36) reported five raters rated five protocols with reliabilities for "unit determination, 87.u percent; for categorization, 59.1 percent; and for evaluation, 75.0 percent." Vargas (33) reported a 96 percent agreement between two judges on which statements constituted self-description, in general. For Stock (37) two judges categorized each statement from three interviews. The per- centages of exact agreement for each interview ranged from 68.5 percent to 82.2 percent. Scheerer's (29) scheme used a 5 point scale for each variable. "To explore reliability, three judges rated all units in six interviews. At least two out of the three judges agreed 93.8 percent of the time. On only four out of 178 Self units did any one judge's rating deviate by two scale points from both other judges ratings." Raskin (38) using a similar technique with a four-point scale reports an "interjudge reliability on 59 items" of .91. For Lipkin (18), "interjudge agreements were obtained ranging between 81.9 percent and 100 percent" using a "complicated reliability formula." It is interesting to note that Wylie includes Bugenthal and Zelen's W-A-Y Test or "Who are you" as a questionnaire (5), even though it has great similarity to the "Who am I" of the TST, and employs a 1? category code plan for the coding of three responses. Historical Development of the Twenty Statements Test The TST was developed by Kuhn and McPartland and first reported.by them in 1958. The instructions to reSpondents which they used in their initial applications were: There are twenty numbered blanks on the page below. Please write twenty answers to the simple question, 'Who am I?’ in the blanks. Just give twenty different answers to this question. Answer as if you were giving the answers to yourself, not to somebody else. Write the answers in the order that they occur to you. Don't worry about legic or 'importance.' Go along fairly fast, for time is limited." (16, p. 69) 10 These instructions were given in writing to some groups, and orally to other groups in a college class setting. The time was limited to twelve minutes. The number of reSponses obtained in this way varied from 20 to one or two, with a median of 17. Each reSponse either began with "I am ...” or omitted that phrase and consisted of single words. The responses were content analyzed into two gross categories, either consensual or subconsensual. A consensual Statement refers to "groups and classes whose limits and conditions of membership are matters of common knowledge. Those which refer to "groups classes, attributes, traits or any other matters which would require interpretation by the respondent to be precise or to place him relative to other people" were considered subconsensual. Examples of the consensual variety are 'student,’ 'girl,‘ 'husband,‘ ‘Baptist,' 'from Chicago,‘ 'pre-med,‘ 'daughter,‘ 'oldest child,’ 'studying engineering,’ that is, statements referring to consensually defined statuses and classes. Examples of the subconsensual category are 'happyf 'bored,' 'pretty good student,' 'too heavy,’ 'good wife,’ 'interesting'; that is, statements without positional reference, or with reference to consensual classes obscured by ambiguous modifiers. (16, pp. 69-70) An intercoder reliability check between two coders was reported as resulting in less than three differences per 100 responses. The major score developed from the reSponses was a locus score, or the number of consensual statements made before any but erroneous (Guttmann scale "error") subconsensual statements are made in a TST SIP protocol. This locus score was developed after submission of the data to Guttmann scalogram analysis techniques indicated that subjects tended largely to exhaust all their consensual reSponses before beginning to make any subconsensual reSponses. Kuhn and McPartland equated the size of this ll locus or consensuality with the respondent's amount of anchorage or self-identification. It is not clear what utilitarian purpose was served by submitting the data to Guttmann analysis. A locus score could be obtained simply by counting the number of consensual items made as responses. Only as evidence that consensual reSponses in general are more salient is such a method useful. The salience score was based upon the assumption that earlier statements were more salient for reSpondents than later statements. This assumption was made in Spite of the Specific instruction to the respondent to the contrary. Therefore, this writer assumes that the TST'S developers had some type of subconscious salience in mind when they made this assumption. The salience score of any statement is obtained from its rank position in the protocol, beginning with #20 for the first statement. This assumption that sequence is equivalent to salience is based on a statement by Newcomb, who says that salience "refers to a person's readiness to reSpond h a certain way. The more salient a person's attitude the more readily will it be eXpressed with a minimum of outer stimulation." (16, p. 72) They also advance an alternative explanation that consensual statements are made first because individuals are habituated to such responses in our culture by remembered external stimuli, such as applications, questionnaires and census takers. They refute this alternate eXplanation on the evidence that three and four- year-olds answer the question, ”Who are you?” with the consensual statements of name, sex and age, though they have not yet any memory 12 of such habituating stimuli. The research reported in connection with this introduction of the TST SIP was a correlational study between religious denomination of the respondent and amount of anchorage of the reSpondent. Wylie says of this research, "Their results are not meaningful psychologically for several reasons. First, as we have said above, there is no clear re- lationship between religious affiliation and psychologically relevant variables. In addition, there was no control over reSponse total in obtaining scores, and there was no attempt to match groups of varying religious affiliations with respect to variables relevant to the ”Who am I" responses." (35, p. 182) Following this original report of the TST, several dissertations were done under Kuhn's direction at the University of Iowa employing the TST to gather data. Notable among these is that of McPartland (19) which was completed a year prior to the Kuhn E McPartland article, and the master's thesis of Robert Stewart (31). Other of Kuhn's students who have employed the TST in their post-doctoral research are Fred Waisanen and Carl Couch, along with McPartland. Under the influence of the presence of Waisanen, Couch and Stewart at Michigan State University the TST was employed by Wilbur Brockover, in a study of self-concept of Junior High (u) students. The TST was also used by Charles Tucker (32), Clark McPhail (23), and Delwyn Dyer (10) in their dissertations, and by this author in the present research. 13 Writing in 1956 Kuhn alluded to the TST as non-specific, non- suggestive and non-structured. He adds that locus scores increase with age during the first 20 years, and that there are significant variations in locus by sex and occupation. He tells Of a different category system used on the responses of "more than 200 respondents”; (1) references to statuses in social categories and social groups; (2) references to ideolOgical beliefs —- moral, philosophical, ethical; (3) statements of personal aSpir- ation and achievement; (4) identification in terms of interests and aversions, including positively and negatively held social objects; and (5) self evaluations. (12, p. 2H5) Unfortunately, he reports no inter-coder reliability for this coding plan. In another source he elaborates on this coding plan: These five categories (sufficient to order all_the reSponses made) are the following: (1) social groups and classifications (such as age, sex, educational level, occupation, marital status, kin relations, socially defined physical characteristics, race, national origin, religious membership, political affiliation, formal and informal group memberships); (2) Ideo- logical beliefs (including statements of a religious, philosophical or moral nature); (3) interests (including statements relating objects to the self with either positive or negative affect); (u) ambitions (and all anticipated success themata); (5) self-evaluations (such as evaluations of mental and physical and other abilities, physique and appearance, relatedness to others, aspirations, persistence, industriousness, emotional balance, material resources, past achievements, habits of neatness, orderliness, and the like, and more comprehensive self—typing in clinical or quasi- clinical terms).(15, pp. uo-ui) In viewing these two explanations of the same code plan by the same author, there is one glaring contradiction which might lead to coder error: In the first explanation ”aSpirations" is included in the type three, aChievement, ambition, aspiration category. In the second explanation "aSpirations” are included in the examples of type five, self-identification category. Furthermore, the type three and Type 14 four categories interchange content in the two eXplanations which could lead to further confusion. There are other sources of possible confusion which can be stated in more operational terms. These all pertain to a difference between one of the last four categories, and the first category. That is, they are differences between the consensual category and the four new categories into which Kuhn has Split the subconsensual category. (1) While physical description is category one, physical evaluation is category five; (2) while membership is category one, aSpiration to membership is category three or four, interest or affect toward a _group is category three or four, and beliefs associated with group membership (such as religious or political belief) is category two. McPartland, meanwhile, over the years 1958 to 1961, set out another method by which the TST might be coded, while suggesting still other methods. In his Manual for the Twenty Statements Problem, (22, p. 2) he suggests category systems based on "literal content, referen- tial frame or logical form". Kuhn's category system discussed above, as well as the code plan developed by McPartland are of the referential frame type. Drawing upon three separate explanations of the McPartland category System Table I was constructed to aid the reader in grasping all the distinctions drawn by MoPartland. McPartland reports a time limit of 15 minutes on his reSpondents, all of whom were patients in one or another facility for treatment of mental illness. His instructions were printed for reading and read aloud for listening for each eXperimental group, as follows: "There are 20 numbered Spaces on this sheet. Just write 20 different things 15 CH now>mnmm onmz who coaummocooumaom: .comumnnmw .m mcoamz pom nmcHEEDU .m crow .saaaaaa .aa .Hema .mase .snboeohoom .mampnmmoz canvaseosmm oze . . m mmeofi .eqmflnmmoza .Apomapnmmoz on ow csocx nocpsmv .mmma .znmscmw .ommfl>mm .EoHQOpm mpcoEmwmumtmpnoze one now Hmoemz .cosmowom mo wcoEmeomQ .coflpmocsom guano: Hopes: huwo mowemx geomonw ones .mm-:a .aa .mmma .ms .sa .Ho> .eoflbmaflammgo cmeom :.£pamw: Hmvcmz pom fimwmao Hmwoom .cowpaoocooumaow: .wwcweeoo .m crow pom .m mmeonh .ocmaphmmoze mmwsommpmo Hmeamhpo wamomcomcoooom wamomoomoooosm wamowcomcoo wamomcomcoo EMMmeHMMW Z .- wpommoo Hmfloom < chflpmoflHoEH epommhsm Hmfloom m wuoomosm Hmfloom m myoomoo.HMfloom m wopoppmo wcnoppmm =mnmewo= mo whospo Hmcowuouflpmcw so me poo ameOMpoqumcfl cm cw ampcmocoomcmne .rouo pmnwamsocow < nmcuo powwamnmcmw < wmcoz soaprflHQEH :onmcs no cogs: poo poo . :sog: mo coHuowpwnm woowuowomnm mvfiEpom meow>ma muwenmm .oahum poopmww .now>maoo mo soap nofi>mnon mo zoo Hoonm mcowumvooa 1:00 new: poo mcowpm: Iawpommna ow%womem cowpowompm pom axe cascades oz :uwm oownm> ow omwnm> pom mane: mowamEH % mcoz mcompmowaoEH woaomoflcoeeoo .wcwmmos wcowpmowcoeeoo mo GOEEoo m wcfi>mn ewcoEmnowmmE mahmwmwnm> poo .oopmowam> .mnmcpo he ooowmnmon: empwwoom he oopmowam> so monooms I a a ho mHHwomcmmcoo poz poo .maAMflwwwm> poo maamowcomooo ow coconmmwn so coflpmomam> wpcmEpwEEoo «cowpomnmpcw no mocmowmmcoo «onouosnpm amfioom wcowuomnmuow smoonnp amwoow chanson ou uumwoom o>wyomwmwnw oz cw pawso>ao>=fl omooq mpowoom ma oomwwom mponocoo 009 I maoz o» oowpmHmm emcowpmopwm «.cowpmenom woosuflwpm emaome: o» mousnvnppm Hmfioom omnsposnpm new mo smash memo no .uom .pxmucoo Homecommwn may gown: >Hywowaexw cw cofluwofimwpcmow: pom Pompcoo amasowpnma oz now>mzon mo woahpm pco5m>ao>aw mmHHQEH mopsnflnuum Hoowmmcm «masbm empoapumcoo «m>wwcmcmnmeoo mommacowpmouwm «Hmcowpwmom «pom muonocoo omamamq swam mooo m.oamayhmmoz H magma. pom weapowmpmo cooxvom mcowwocwuwwm 16 about yourself in the spaces. Don't worry about how important they are or the order you put them in. Just write the first 20 answers you think of to the question: 'WHO AM I?'" McPartland reports that for a sample of 60 reSpondents, three in- dependent coders agreed on more than 97 percent of reSponses. He also developed another type of score for the TST, the modal reSponse. Bach respondent's protocol was characterized as type A, B, C, or D, dependent upon which category of response was modal for the reSponses of the protocol. Of the mode he says, "about nine patients in ten write responses which show a clear mode in some one category, that is, they made at least one more statement in some one category than in any other. The remaining 10 percent either tied in two categories or gave so few responses that the mode was obviously unreliable. These few respondents, nevertheless, were categorized by mode when there was one, or in the more ”distinctive category involved in a tie; in 'A' or 'D' if one of these was tied with'B' or 'C' and in 'D' if that category was tied with 'A!‘ The rationale is that more unusual reSponses should receive greater weight in analysis." Further, when a respondent made no responses he was classified as Type A on the assumption that he was reSponding in a "constricted manner." (22, p. 9) The three coders reached 100 percent agreement on the modal type of 60 respondents. (20, p. 118) Couch seems to be the first investigator to use a category system stressing the ”literal content” referred to by McFartland. Using the same instructions as the original Kuhn a McPartland study, with an eight minute time limit, he content analyzed for "references to a role in, or an attachment to, a major institution of our society...responses that had 17 a reference to religion, the family or education." (8, pp. u92—u93) In a later study his code plan "noted whether or not reSpondents had identified themselves on the TST as male or female, boy or girl, or man or woman." (7, p. 118) The code plan under consideration in this study (see Appendix B) is perhaps the most elaborate in terms of "literal content" analysis, separating Kuhn's original ”consensual" category into 23 separate content areas (see Table II). Each of these categories was required as a relevant variable in at least one of the research interests bound together by cooperative data collection. Because "wastebasket" categories are included, these categories comprise an exhaustive code plan for the consensual responses. In a suggestion for a ccde plan based on ”logical structure," McPartland lists categories of reSponses (l) in the form of the verb "to be", (2) in the form of the verb ”to have", (3) in the form of action verbs, and (u) in a form in which the ”self drOps entirely out of explicit attention." (22, pp. 9-10) Perhaps as a follow-up to this suggestion, Couch has studied the classes of verb forms used in reSponse to a modified form of the TST. (9) Probably the most useful innovation in the categories for TST codes was one dealing with logical or formal structure, made by Waisanen. His three major categories are (l) consensual, (2) evaluative-consensual, and (3) idiosyncratic or non-consensual. Essentially he has divided Kuhn's ”sub—consensual” category into evaluative-consensual (positional references which are, in some way, modified) and idiosyncratic (all remaining sub-consensual references). Then he provides for 5 sub- 18 38w mHm Ems 0:» Rom mamam mwoo mo PamEQOHm>oQ cam AH3OQU HH «dawn. Hmnwsmmnooncoz lawnmoH w. owvmnocmooon no owpmnochmOfloH imam Hmsowmmm .m - Hmsowmwm .m, macaw Hmzowmmn .m _m>wpowompm nhm>¢ w owuwosoo .w .coz u m>wm Hmom mymopop doom oweocooouowoom .o vuconmnmsoo noomsoo unH Icemnoo cowumuwcmwno Hmowpwaom .o . Insm loam comeNwamwno mflowwwfiwn . . . 8m.» .833 .m u (I Flags . amuaaowmpmo 5 Hmowmmmwmd .i. . -Boafia b.3333.“ .m _ :83 :Ofipmoooo .0 ” nonaam<. cowumuwcmwno Hmsnom .o . xnoz .0 Warm ovum cowpmasooo .n nmowmeHHm mmmaamm .b hwwcsssoo .m Emwcooom .m Emimcoomm Hm- Hmzowmmn .v mucownm .o Hmuflnma .A awsmcwx .m sown mumswnm .N 1 smmswnm .w Imosvm .: spammn .w macaw wow .m sawemm .m umoamw Towpmwnopomnmno Hwowmmnm .o Tmmmao we doom mmmnoom .o Hchfipmmom seam mesonw anomcoo 88: .a ....Jmm .a Hmwoom mmm .m Hmcomnwm .H a a m wow .a .llllllllwmmmmmmmmo (Huqmauuqmw. epownuwnoo unmuaoo o>wpmsam>m § Hmswammsou gaumsamxrm.‘ Hormoomaoo a 38280 Hmsmnomcou mownowwumo 4 moswm ucommnm - c ocmaphmmuz mo 52 $8 353m: 332302 fiaoo 53 _ 28 use aofiae l9 categories to analyze the ”literal content" of both consensual and evaluative-consensual categories. The utility of this innovation lies in its ability to extract content data from a great number of TST reSponses which previously were classified sub-consensual and, therefore, paid little heed. His five content categories are; (1) Personal (physical characteristics), (2) Primary (continuing intimate relationships), (3) Secondary (associational, segmentalized, Specialized interaction), (H) Categorical ("man," "citizen,” ”white,” "American," are examples), and (5) Residual (all responses that do not fit the first a content categories). This is essentially the basic category system studied here, with the addition of the 23 sub-Sub-categories already referred to in the form of Specific roles or physical attributes.* The present study also involves an innovation in the administration of the TST. The instructions were read to respondents to Ask this question of yourself, 'Who am I?‘ Think of as many answers as you can in answer to the question, 'Who am I?’ In a moment I would like you to give me the answers as if you were giving them to yourself, not to me or anyone else. Take a little time to think about it. (pause) Now, please make what you consider to be the most important statement about yourself first. (pause) Now, make what you consider to be the next most important statement about yourself. (pause) Now, make what you consider the next most important statement about yourself. (pause) Are there any other statements you could make about yourself in answer to the question, ’Who am I?’ (see Appendix A) The interview schedule contained only 10 Spaces for reSponses and interviewers were instructed to write down only as many as ten reSponses and to Spend no more than 3 minutes on the SIP. (13, p. 7—8) We see, *Attributed to Waisanen by Dyer, (10, pp. 39-40). 20 by comparison with prior uses of the TST that, (l) fewer responses were sought, (2) less time was allowed, and (3) reSpondents were directed into a format of most important statements first. These innovations were the result of a pre-test of the interview schedule. The main aim of this pre-test was to test the schedule on persons who were functionally illiterate, to find what impairments to communication this would present to an interviewer. The pre-test was conducted with persons living in a rural, Michigan, migrant farm-workers enclave, and with persons living in an urban, Michigan, Negro and Puerto Rican slum _ ghetto. Such a pre-test was as essential for the TST as fer any other phase of the schedule, for until now the TST had been employed only on extremely homogeneous groups, and often on middle-class college-age persons. Now we were pre-testing for its utility with an extremely heterogeneous sample, by testing the opposite extreme type of homogeneity. We found reSpondents on the pre-test made many fewer statements, many non-directive probes were used, and a great deal of time was Spent on administration of the TST in relation to other items, although there was only one hour available for the administration of the entire schedule. For the sake of expedience, then, the TST was administered With built-in probes which provided more structure than previously, and in a foreshortened format. Brookover had previously used a "Ten Statements Test”, (4, p. 28) and Dyer had questioned the need for obtaining as many as twenty statements. (10, p. ”1) 21 A further innovation in administration of the TST was developed and tested by McPhail. After obtaining TST protocols from reSpondents he presented them with three additional problems: (1) rank these statements in the order of their importance to you... (2) assign a plus (+), minus (-) or zero (0) to each statement, depending on whether you feel positive, negative, or neutral toward that statement... and (3) attempt to estimate (in units of five percentage points) what percentage of those persons who are in a position to know, would agree with you on each separate statement you have made. (24) This was an attempt to (I) test the correlation between chronological order of reSponse and salience for the respondent, (2) test for any correlation between negative fonn of reSponse with negative feeling about the reSponse. For example, had a response "I am not good-looking" been given, a coder might assume this to be a negative self-evaluation, but unless we know whether his looks makes the respondent sad or glad, we cannot tell if it is a negative or positive evaluation. Finally, (3) this was an attempt to tap the respondent's perception of consensuality or sub—consensuality for each Statement he had made. Conceivably a respondent might estimate that 60% of the persons in a position to know would agree with his statement "I am stubborn”, thus changing it from an Iidiosyncratic reSponse to a consensual reSponse. Again, a forerunner to the McPhail approach to salience was used by Brookover, when he asked respondents to circle the most important (no specified number) of their Statements. (4, p. 28) Unfortunately, no report was made of the associa- tion between this self-coding and the ordering of the statements. Looking toward the future for the TST SIP, Dyer has suggested that 22 responses to it be submitted to factor analysis, rather than content analysis or scaling techniques.(10, p. Ml) Mielke has suggested a more Specific plan for factor analysis of TST reSponses. A factor analytic method could also be used to devise a value adherence instrument, using a two Stage pretest. The first stage would be eXploratory, going to the field with o en-end questions...Everytime a respondent would mention...something that sounded like a general value, this would be recorded for the second pretest stage. The entire pool of general talues thus recorded would then be administered to a second pretest sample for "agreement" or "applicability to me” ratings. These ratings would then be submitted to factor analysis. Here the factors would represent a parsimonious set of independent value clusters. By discerning an underlying commonality of content, each value "cluster" could be called a single general value. The items most highly loaded on each factor would become the basis for the value adherence instrument in the main study. (27) In summary, Table II represents a history of the growth and development of code plans for the TST SIP. Review of Previous Investigations Relevant to this Study: Berelson, (8) in his article on "Content Analysis” in the Handbook of Social Psychology presents a summary of content analysis reliability studies. He finds that reliability is reported in 15-20 percent of the content analytic studies, and that "the reports on reliability which do appear are uniformly high". In a population of ”some 30 studies and experiments” he found "a range of correlation coefficients between about .78 and .99 with a concentration at about .90 and a range of percentage agreements between 66 percent and 96 percent with a concentration over 90 percent." He cautions that "many attempts at content analysis may have been abandoned because of low reliability in the preliminary results... published content analysis studies which do not report on reliability 23 may be presumed to have had less satisfactory results...most of the reported reliability results apply to relatively simple versions of content analysis.” (3, p. 51H) He Summarizes several reliability experiments as fellows: ”Reliability is higher under these conditions: the simpler the categories and the unit, the more eXperienced and better trained the coders, the more precise and complete the set of coding rules, the fuller the illustrations." (3, pp. Sin-515) Schutz (30) compared a code system using "one psychological operation" on the part of the coder, with a code system using a step- by-step series of deciébns related to a complex category system and found the latter system produced a "little less” reliability. Summary In this chapter we have attempted to (1) present a. general statement of the aims of this study; (2) Show the need for a minimally biased SIP in tests of Meadian theory; (3) point to a number of existing SIP's; (H) outline the historical deveIOpment of the Twenty Statements SIP regarding (a) code plans, (b) derived scores, (c) reliability tests, (d) administration procedures, and (6) future deveIOpment; and (5) review the findings of other checks on the reliability of content analysis. The remaining chapters deal with (l) the procedures employed in this study, including the hypotheses being tested, (2) the results obtained by testing these hypotheses, and (3) the conclusions arrived at, based on the results of the hypothesis testing procedure. CHAPTER II PROCEDURE The Experimental Context of this Study_ The present reliability test was conducted in conjunction with a cooperative research project which collected data in five nations: Mexico, Costa Rica, Japan, Finland and the U.S.A. This very extensive study was under the direction of Hideya Kumata, Charles Loomis and Frederick Waisanen in cooperation with Yrjo Littunen and Robert Stewart. The United States sample was constructed and drawn by the Gallup Organization, which also supervised and conducted all U. 8. interviews. It is "a (modified) area probability sample of the United States general public (outside of institutions), age 21 or older." Size of the sample was 1,528. The interviews were conducted September 2 to October 6, 1963. (39, p. l) The data collection instrument was an approximately hour- long scheduled interview. Using the 1,528 TST protocols thus obtained, a random sample of 150, or approximately 10 percent of the population of protocols was drawn (see Appendix C). This sample was the data assigned to each coder for coding. The data sampled contained 858 responses. This amounts to an average number of responses per protocol of 5.72. It is assumed, by sampling randomly from a sample designed to represent the U. 8. adult population, age 21 and over, that the range of coding difficulty found in these protocols will be closely representative 2“ 25 of the range of coding difficulty which would be found in protocols generated by the 0.8. adult population, 21 and over. The Coders and Data Collection Procedure A coder employed by the Gallup Organization had coded the 150 protocols prior to the eXperiment. Future references shall refer to the Gallup coder. In addition, five non-Gallup coders Operating independently, coded each of the 150 TST protocols. In Chapter IV I will deal with the possibility that these five coders were not truly independent. However to safeguard the independence of the five coding Operations, each coder did the physical coding of protocols in a separate booklet of protocols. To further safeguard independence, the protocols in each booklet were ordered in a random fashion and differently from each of the four other booklets. We Should note that this procedure may facilitate the contribution of varying levels of coder fatigue to differences in coding, thus biasing the results in the direction of lower reliability. The booklets were divided into two sections of 75 protocols each. One and a half hours coding time was estimated for each section. Coders were instructed to ”code each section on a separate day, and code all sections within a two week period. Please code each section during one continuous work period. Try to do this work at a time of day when you are mentally alert and relatively free from external distraction." Un- fortunately not all coders were able to follow these instructions. While all coding was accomplished within a two week period, some coders found it necessary to use more than two separate days and Shorter than one and a half hour work sessions. A change in the procedure of this sort, 26 however, would lead to eXpectations of lowered fatigue for the task. On each page of the booklet in typewritten, dittoed form were the responses of a TST protocol. Only one protocol appeared on each page, and no protocol appeared divided between two pages, regardless of the number of reSponseS it contained. The number of responses per protocol ranged from one to ten. A reSpondent number in four digits appeared in association with each protocol. We should note that the Gallup coder performed the coding task on data handwritten by interviewers under field conditions, and that this was the original data from which the ex- perimental booklets were transcribed. The code categories used were in the form of two digit numbers representing punches in two columns of an IBM data card. Each coder was instructed to: ”Code in red pencil....Indicate by red Slash marks within the statement the division of a reSpondent's statement into more than one statement for coding purposes. Write your codes in the left margin in the order the statements appear on each page." At the beginning of each experimental booklet the following in— formation was given the coders regarding the process of transcribing the protocols from the original handwriting: Following this page you will find a reproduction of the questionnaire section which asked and recorded the TST. (see Appendix A) Reference to this format may be helpful in coding some of the responses. In general, each statement which was written next to a number l—lO has been reproduced as a separate response for you to code, by: (a) Triple Spacing it from the other statements, (b) Capitalizing the first letter, if it were not capitalized by the interviewer (except in some cases where capitalization could affect meaning), and (c) Ending the statement with a period (.). Since in some cases, it appeared that though a reSponse was written on several lines, it was really one statement, we have shown this by ending the line with a dash (--) and beginning the next line in lower case. Every effort was made to reproduce the state- ment as recorded by the interviewers. Therefore, grammar and Spelling errors have been reproduced. 27 It is assumed that the Gallup coder had become highly aware of the 15 page coding instructions and code for the TST developed by Stewart and others. (see Appendix B) Four of the five non-Gallup coders had helped develop the coding instructions and code book, and were therefore very familiar with the code book. All five coders were instructed to "Read through the instructions in the code book for coding the TST befOre you begin this work. Follow the instructions in the code book during the coding and refer to the code book as frequently as necessary." All five coders were at the graduate or post-graduate level in the social sciences. (See Appendix A for the format of the TST in the interview schedule, Appendix B for the TST coding instructions and code book, and Appendix C for the 150 TST protocols.) The Hypotheses and Scoring Procedures A basic hypothesis of this study is that there will be a high level of reliability among the six coders on the coding of the TST protocols. I will define "high level of reliability among six coders” as a high percentage of protocols with a high mean reliability score. The mean reliability socre of a protocol is an average across the reliability scores of all reSponsein the protocol. The reliability score of a reSponse is based upon a scale of agreement—consistency among the six coders as to how they coded the reSponse. The scale of agreement-consistency is based upon two values (1) Agreement, or the number of paired agreements obtained, considering that a total of 15 paired agreements are possible among six coders, and (2) Consistency, or the Size of the modal category assigned to the item by six coders, which mode may range from one to Six. The scores thus 28 obtained from this scale range from 0-10. Each score represents a discrete structural pattern of category combinations, and,in each case, knowing nothing but the score we can determine the number of paired agreements and size of mode which it represents, but not the code category. Table III will make this relationship more apparent. Table III Reliability Score- Based on a Scale of Agreement-Consistency Combination of Categories Number of Size of Score (a structural pattern) Agreements Mode 10 AAAAAA 15 6 9 AAAAAB 10 5 8 AAAABB 7 4 7 AAAABC 6 H 6 AAABBB 6 3 5 AAABBC H 3 H AAABCD 3 3 3 AABBCC 3 2 2 AABBCD 2 2 l AABCDE l 2 0 ABCDEF 0 l To understand the formal patterns one must realize that the letters A,B,C,D,E,F represent the order in which the analyzer notes different code category assignments by different coders. There is no relationship between these letter notations and the different numbers assigned to code categories in the code book. For example, if the analyzer notes that all the coders coded an item with the same code category, that item would be assigned an AAAAAA structure, whether it was coded a 2l—category or a 53-category by all six coders. At the other end of the scale, if the analyzer notes that each coder assigned a different code category 29 to the same item, that item would be assigned an ABCDEF structure, whether it was coded as 72, 56, 12, 58, 60, 61, or any other combination of six different categories. Thus, the difference between a score of seven and eight on this scale is not found-in the mode, for in each situation there are four A's. The difference between the reliability scores of seven and eight is found in the number of paired agreements, for an eight score represents one additional paired agreement between the two coders who assigned non-modal categories, i.e., "BB” represents a paired agreement which is missing in "BC”. For purposes of operationalizing this hypothesis a high reliability score was defined arbitrarily as any score of 8 or over, thus representing roughly the top quarter of the scale. "A high percentage of protocols with a high mean reliability score" was defined as 75 percent or more of all protocols have a mean reliability score of 8 or better. The per- centage figure was chosen as an amalgam of (1) previously attained percentages of inter-coder reliability for the TST, and (2) indications from related research in content analysis that factors Specific to this Study indicate a lower percentage of inter—coder reliability can be expected. The history of TST reliability checks has been discussed in Chapter I. We should recall that in their original study, Kuhn and McPartland reported 97 percent reliability between only two coders, coding material into only two categories. McPartland reports 97 percent P61iability between only three coders, coding into only four categories. Berelson reports percentages of reliability for content analyses range between 66 percent and 96 percent with a concentration over 90 percent, 30 but he adds that "the reported reliability results apply to relatively simple versions of content analysis." Berelson concludes that the simpler the categories the higher the reliability, and the reliability eXperiment reported by Schutz indicated that a Simple decision by the coder would produce higher reliability than a step-by-step series of decisions by the coder. Reference to Table II and Appendix B would indicate that there are 45 discrete code categories used in the present study, requiring the coder to make decisions step-by-step as to (1) whether a reSponse is a single or multiple Statement, (2) whether a reSponse is or is not modified in some way, (3) whether reSponse contains material which may be classified by content (consensual) or does not contain material which may be classified by content (idiosyncratic) and (u) what content category (from among 23) Should be assigned to this reSponse. Furthermore, although the difference between the use of three coders in McPartland's reliability check and the six coders in the present reliability check may seem inconsequential, a deeper consideration is that with three coders there can be only three paired agreements but with six coders there are 15 paired agreements. Adding only three coders to the reliability test multiplies the chances for disagreement five times, and we would expect such a procedure to lower the obtained reliability. Thus three main factors differentiate this study from other reliability checks of TST coding; (1) many more categories, (2) many more decisions to be made by the coder, (3) many more paired agreements among coders. Add the final consideration that in the present study the data being coded, obtained, as it was, for the first time from a 31 heterOgeneous sample rather than a homogeneous sample,ndght be eXpected to extend the range of expression thus making coding more difficult and therefore less reliable. For these reasons the expected percentage of reliability was lowered to 75 percent from the 97 percent obtained by McPartland and Kuhn E. McPartland. Now our original basic hypothesis, that there will be a high level of reliability among the six coders on the coding of the TST protocols has been operationalized. Its meaning has become, we expect to obtain a situation in which 75 percent or more of all protocols have a mean reliability score of eight or better, on the coding of l50 TST protocols byysix coders. An item reliability of eight or better means the item will have at least seven of 15 pairs in agreement and a modal category no smaller than four of six codes. The mean reliability score of a protocol is an arithmetic average to the nearest tenth, of the scores obtained on all the items in the protocol. A Somewhat similar basic hypothesis is that there will be a high level of reliability among the six coders on the coding of the TST responses. In this case, "high level of reliability” will be operationalized as (l) 75 percent of all responses will be scored as eight or better in reliability and (2) the average reliability of all responses will be a score of eight or better. The following sub-hypotheses will be operationalized in parallel fashion to the basic hypothesis: Hl When the difference between consensual reSponseS and evaluative-consensual reSponses is overlooked, reliability is expected to appear as greater than underrealistic conditions. H2 When differences between coders in the decision of whether to divide a statement into more than one Vstatement for coding purposes are adjusted so that only the immediate statement is affected rather than all subsequent statements in the protocol, reliability is expected to appear as greater than under realistic conditions. It can be seen from the discussion of step-by-step decisions by the coders that each of these hypotheses reflects an attempt to isolate one of the first two decision steps. The effect of the decision "whether a reSponse is or is not modified in some way," is removed from consideration which allows us to hypothesize that this will make the coding appear more reliable. The accidental effect (not the main effect) of the decision "whether a reSponse is a single or multiple statement" is removed from consideration, allowing us to hypothesize that this will make the coding appear more reliable. Note that neither of these hypotheses expect a change in reliability. Rather they expect a change in the appearance of reliability under a hypothesized condition which is not a condition of reality. The actual data record on data cards or data computer tape, the realistic condition, is one which records discrete numbers for two statements which are alike except that one is modified and the other is not modified. Thus, ”I am a man” would receive the code punch, 16, but "I am a fairly honest man,” would receive the code punch 17, denoting What would have been a "16" category, except for the modification, ”fairly honest". For our purposes "modified” is used in both the strict, grammatical sense, as well as in the less strict sense of overall meaning. Note also, that the terms "evaluated" and "qualified" Will be used as if they were equivalent to the term ”modified”, since no 33 practical distinctions between them are necessary in this study. The actual data record, the realistic condition, will also allow accidental effect to accumulate for all items following an item which was or was not Split into two statements in an unreliable manner. This accumulation relates to the fact that each statement's category code number is punched in a set of two discrete data card columns. Thus if the first statement of seven statements is Split into two statements in an unreliable manner,each of the following six statements will be affected accidentally in the direction of unreliability by necessarily placing what may be reliable code category decisions in an unreliable set of columns. Under the condition of reliability the second statement's code category number would have been in the second set of two columns, and instead it is in the third set of two columns; the seventh statement's code category number would have been in the seventh set of two columns, and instead it is in the eighth set of two columns. Thus the unreliability of a Split statement decision is com- pounded accidentally, and eSpecially so where it occurs near the beginning of a protocol, and the number of statements in the protocol is relatively large. Three factors are apparent in the coding situation, which might be expected to affect reliability: (1) The data being coded, (2) The code categories being used, and (3) The coders who do the coding. Each of the following sub-hypotheses relates to one of these factors: H3 The more statements made on a TST protocol the lower will be the mean reliability score for that protocol. Hu As content categories increase in complexity response reliability will lessen. 34 H5 There will be no relationship in the pattern of coder pagreements which contribute to unreliabifity. HSa There will be no relationship in the pattern of paired agreements which contribute;t>unreliability. HSb There will be no relationship in the pattern of triple agreements which contributssto unreliability. Hypothesis three has three bases. (1) As mentioned above, a greater number of protocol statements is expected to lower reliability in the Specific cases where an early protocol statement is Split un- reliably. (2) A greater number of statements in a protocol increases the appearance of the protocols' complexity. (3) A greater number of statements in a protocol increases the number of step-by-step decisions within the protocol. Hypothesis four refers to the 23 specific content categories. "Complexity of reference ” refers to a four place scale of complexity, in which personal statements are defined as less complex than primary statements, primary statements are defined as less complex than secondary statements, secondary statements are defined as less complex than categorical statements, and idiosyncratic statements (because this category has a residual dimension) are defined as equal in complexity with the personal statement, at the low end of the complexity scale. Only those statements to which four, five or six coders assigned categories of the same complexity level can be analyzed within this hypothesis. This does not mean that at least four coders will have assigned exactly the same numerical category code to these statements. For example four of six code categories assigned to a statement are #0, ul, #2, #3. Such a statement is at the secondary level of complexity, since all four of these categories are secondary consensual statements. 35 If fewer than four coders coded the item at the same level of com- plexity, it will not be clear what level of complexity we are dealing with. Therefore, such items will not be analyzed under this hypothesis. Hypothesis five is the expectation when all coders are independent. This experiment assumes coder independence. Therefore, this is a relevant hypothesis. To operationalize this hypothesis a Chi Square procedure will be utilized. The obtained value in each cell is the number of unreliable agreements between two coders, while the expected value in each cell is the number of agreements expected between two coders if unreliable agreement is equalized across the five other coders. We will assume the hypothesis is supported if the chi—Square value is not Sig- nificant. The same analysis will operationalize Hypothesis five-a and Hypothesis five—b. Physical Analysis Procedure The first step in analysis was to reorder the pages in each of the five experimental booklets from their random and different orders to the same order. Next the raw data of six codes for each item was associated with the protocol number and item number in six two-digit columns of type— writer notation. Typewriter notation proved most efficient because the consistency of form in typewritten characters led to easier identification of differences in categories. The visual discrimination was more time- consuming with hand-written character notation. The column of two digit codes for the Gallup coder was transcribed from a print-out Sheet of the Control Data 3600 Computer at Michigan State University. The column of two-digit codes for each eXperimental coder was transcribed from red 36 pencil notations in the left hand margin of each of the five ex- perimental booklets. Examples using the typewriter notation method may be found in Appendix C, Section two, three and four, as well as on pages “1 and 43 of this chapter. In association with this raw data for the reliability experiment, notations were made of (l) the score of each item, (2) the average score of the protocol, (3) whether qualification~modification contributed to less than perfect agreement, (u) whether Splitting a statement contributed to less than perfect agreement, (5) what levels of complexity of reference were involved in less than perfect agreement, (6) the number of items in the protocol, and (7) within the raw data columns, all code categories which were non—modal were encircled, in the manner that scale errors are encircled in typewriter notation of the sort develOped by Waisanen. The end product of this procedure was 22 single spaced pages of raw data, associated with all measures of the variables necessary to operationalize the hypotheses. Finally, additional typewriter notations were used to rescore protocols in which either (or both) qualification and modification Operated as a factor contributing to unreliability. You may wish to refer again to the examples cited above. Data Analysis Procedure Protocols were rank ordered by mean reliability score. From this Table IV was constructed, representing the number of protocols receiving each score (in tenths) the perzent of protocols by scores, and the cumulative percent of protocols above or at each score. This operation 37 was necessary for testing and illustrating the basic hypothesis and hypothesis one and two. To further test the general reliability of the coding procedures under the second basic hypothesis, a determination was made of (l) the percent of items scoring eight or better, and (2) the average reliability score figured across all statements. From this data Table V was constructed. To test hypothesis one the statements were rescored, deleting qualification-modification as a source of unreliability and Similar manipulations of these new scores were made. In this process the following discrete code categories were lumped together and considered to be equivalent categories for obtaining the reliability score: (10,11) (1u,15) (16,17) (20,21) (22,23) (2u,25) (26,27) (30,31) (uo,u1,u2,u3) (uu,u5) (u6,u7) (ua,s2,53) (50,51) (54,55) (56,57) (58,59) (60,61) (62,63) (70,71) Please refer to Appendix B for the denotation of these codes. To make more explicit the procedure followed in rescoring a protocol for a test of hypothesis one, let us refer to an example from the data; a protocol which reads as follows: I'm a human being. I'm able to work. I'm able to carry my home. I am proud to have children. I am a citizen. The reliability score for this protocol, under the realistic con- dition, was 8.6, with five scores of ten, nine, five, nine and ten. Rescored for testing hypothesis one, it obtained a reliability score of 9.8, with five scores of ten, ten, nine, ten, ten. To realize how this apparent change in reliability was obtained, it is necessary to compare the actual code categories assigned to each reSponse by Six coders with the same data 38 after adjustments have been made to overlook, or remove the effect of; qualification. ASSIGNED CATEGORY UNDER REAL CONDITIONS: ReSponse Coder Number G A B C D E Score Verbal ReSponse: 60 60 60 6O 6O 60 10 I'm a human being. L13 us u3® 3 11.3 I'm able to work. 63 63 63 @ I'm able to carry my home. {ED 21 21 21 21 21 I am proud to have children. 50 50 50 50 50 50 I am a citizen. U1£‘Q)k)ka zte CDCOOkOU'lto .6 mean score of protocol ADJUSTED CATEGORY UNDER HYPOTHCSIZED CONDITION: Response Coder Verbal Number G A B C D E Score Response; 60 6O 6O 6O 6O 60 10 I'm a human being. 43 43 43 43 43 43 10 I'm able to work. 63 63 63 63 (f2) 63 9 I'm able to carry my home. 21 21 21 21 21 21 10 I am proud to have children. 50 50 50 50 50 50 10 I am a citizen. 35' . 9.8 mean score of protocol (n-FCOMl-J Note that only the four encircled "errors” due to qualification were changed under the hypothesized condition; and that the one encircled "error" due to some other cause has remained unchanged. Prom inspection of the three reSponseS involved in rescoring, we can see that the ”errors" which were changed were evidently due to a differential treatment of the three qualifying words, ”able", ”able”, and "proud". To testliypothesis two those protocols affected by splitting were reordered to delete the accidental affects of disagreements in Splitting, then the statements were rescored. In this process blank code categories became meaningful. In the case where a coder had not Split a statement while others had Split the same statement, that coder was shown as 39 assigning a blank code category to the second row of codes devoted to the second part of the Split statement. To clarify this rescoring procedure, we refer to a protocol from the data, an example which reads as follows: I'm an American Citizen. 8 I believe in fairness. I try to deal fair with everybody. I can get credit anywhere I've been very fair. I'm just a common-ordinary fellow. I never had more than a country school education. The reliability score for this protocol, under the realistic con- dition, was 6.5, with eight scores of ten, eight, ten, nine, two, three, two and eight. Rescored for testing hypothesis two, it obtained a reliability score of 8.8 with eight scores of ten, eight, ten, nine, seven, nine, seven, and ten. Note that while there are only six verbal responses in the protocol, the coders have Split some reSponses, thus creating eight reSponse scores. This fact makes it difficult to determine which scores were assigned to which verbal reSponse, eSpecially under realistic conditions, before accidental effects of Splitting are removed or overlooked. Therefore, in the example below, we must view all six responses as a whole. We may rely only on the fact that the first category assigned codes the first verbal response, the last category codes the last verbal response, and all other categories are in the same order as the verbal responses which they refer to. The column headed "Response Number" goes unused in this example to avoid confusion. ASSIGNED CATEGORY PLACEMENT UNDER REAL CONDITIONS: ReSponse Coder Number: 6 A B c D 50 50 50 50 50 50 50 ('2) 50 50 72 72 72 72 72 72 72 72 72 72 GE 61 72 C7; 61 72 47 72 61 u7 é§§3 #7 gig; '39 E 50 72 72 61 47 Score Verbal Response: 10 8 10 9 _3_ 52 6 .5 I'm an American Citizen. I believe in fairness. I try to deal fair with everybody. I can get credit anywhere I've been very fair. I'm just a common-ordinary fellow. I never had more than a country school education = mean score of protocol After accidental effect of Splitting is removed, it is necessary to assign two lines to each verbal reSponse which was Split by any coder. In general the categories and scores which are associated with one verbal reSponse will appear on the same line as the response number and all following lines, until the next verbal reSponse is begun. Note that some verbal responses could be Split into more than two codable reSponses. Below is the example of rescoring which goes with the above example of original scoring. Response number refers to each verbal reSponse. ADJUSTED CATEGORY PLACEMENT Response Coder Number G A B C D 1 50 so 50 50 50 50 50 (:3 50 50 2 72 72 72 72 72 a 72 72 72 72 61 4 ® a) 72 72 72 (it 5 61 61 ® 61 6 u7 u7 u7 u7 u7 UNDER HYPOTHESIZED CONDITION: E 50 O 72 72 72 61 47 Score Verbal ReSponse: 10. 70 I'm an American citizen. I believe in fairness. I try to deal fair with everybody. I can get credit anywhere I've been fair. I'm just a common-ordinary fellow. I never had more than a country school education. 8.8 = mean score of protocol n+1 Now what was really happening among the coders becomes a bit clearer. There was a four to two disagreement on the Splitting of response one and four. Notice that even though removing the accidental effect of Splitting changed the category content assigned to the second codable response (second line) it did not change the obtained score of eight. Under the realistic condition we have tried to circle all "errors”, while under the hypothesized condition we have circled all "errors” which remain and may be attributed to some other effect. Specifically, in verbal reSponse number 1. evidently some coders Split this into the two reSponses, "I am an American, and I am a citizen." In verbal response number 4. evidently some coders Split this into the two reSponses, "I can get credit anywhere, and I've been fair." Finally, in verbal reSponse number five it appears that only coder A noticed the word "fellow" and coded this reSponse as a qualified sex reference. To test hypothesis four it was first necessary to select those items which could be analyzed under this hypothesis (see discussion of hypothesis) and then each of these items was assigned a score (1—4) of "complexity". In the next section the ensuing statistical analysis is explained. To test hypothesis five and its sub hypotheses it was necessary to note all the encircled errors which were in agreement and all the sets of tripletons in agreement in the even Split situation (Reliability score = 6, structure = AAABBB). Counting all occurrences of 15 sets of agree- ment pairs resulted in 15 obtained values for the Chi Square cells. 1:2 Statistical Analysis Procedure To test hypothesis three it is necessary to correlate the mean reliability score of 150 protocols with the number of items in each protocol. Data analysis has produced a total of three sets of scores (under three hypothetical conditions). It is necessary to do three correlations to test this hypothesis, since there was no designation of which set of scores would be used in this test. A significant negative correlation will support the hypothesis. To test hypothesis four it is necessary to correlate the complexity _soore (1-4) with the reliability score of all statements testable under the hypothesis. Again three correlations must be done to reflect the three score sets develOped in data analysis. Again, a significant negative correlation is needed to support the hypothesis. To test hypothesis five and its two sub hypotheses it is necessary to do three Chi Square analyses for each of the three score sets developed. See page 35 for a description of obtained and eXpected values in these analyses. The hypothesis is supported if the Chi Square value is not Significant. 43 . .m.b no m.s mm omnoom momma Ham endow on Haw: m.s e0 m 00 monoom 00000 map 0H0 sauce 0n Haas 0.0 000000 cameos 0:0 ca .0009 on «5 >kumwooEEH swamp one room an pom omenoooom mpaommn one news swamp no>m com amnvcwv Cm>m mpaomon may nonpmwow wcwmsonm >n mpHSmon omen“ ooanmv 0>mn 02 popcoooom momma mo 2 one owmomhocw m>mn awe swoop Hmonmwc new 0% xomn weep emote: Geoaoo 0:9 CM mean: .0.0 no .oamow >vwaenmwaon one cm 0 he pom omensooom ova; mohoom he you soon mo mwmoonm one mocflme I'll 0.0a 0.0e 0.00- mwmmgsmmme 0.0: 0.00 0.00 0.Sa 0.:H 0.0 0 0>npmaseeo (tn: S.e e.0 0.e 0.0 0.0 0.0 0.0 0.0 S.m 0.0 0.0 0 .1. s 0H 0H :0 m we :0 :0 0 0 :H z oo>06om pflamm we poommm Hmpcmoflflwm 0.0» 0.0e p.00 p.00 p.00 0.00 0.3: 0.00 p.00 0.00 0.00 0 0>npmaseao 7:: e.m S.e 0.: 0.0 0.0 0.00 e.0 s.0 S.e s.0 0.0a S 11 e e 0 0 0 S0 0H 00 e 00 00 z oo>o€mm pomwmm cowpmoflmwawmw 0.HS 0.00 0.S0 0.H0 0.0: 0.0: 0.00 0.00 0.0a 0.:H 0.0 0 0>eemaseso 0.» 0.e 0.0 0.0 S.: s.0 0.0 0.» 0.0 0.0 0.0 0 Ha He 0 0 S 0a 00 0H 0 0 :H z om>OEem mvommmm mcflooo oz 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0a emmnoom 000: mem>o no prmfim mo mohoom emery mcoflvwocoo ownnh pom w one 2 mm maoeouona ems 00H e6 enoom epaaheeflaem 000: < HmH mam<9 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.50 0.00 0.00 0.00 0.H0 0.00 0.H0 0.00 5. I I I I I 0.0 0.0 s. 5.0 0.0 0.0 0.0 0.0 00H H I I I I I 0 0 H 3 0 0 m 0 0.00H 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 H.o0 0.00H 5. I 5. I I 0.0 5. 5.0 5.0 0.3 0.0 5. 5.0 0.0 00H H I H I I 0 H 3 3 0 0 H 3 0 nu 0.00H 0.00 0.00 0.00 0.50 0.00 H.00 0.30 H.30 3.00 H.00 3.00 3.50 5.30 0.00 0.05 0.55 0.35 0.00H 5. I 0.H 5. 5. 5. 0.H 5. 5. 0.H 5.0 0.0 5.0 5.0 5.0 0.0 5.0 5.0 00H H I 0 H H H 0 H H 0 3 0 3 3 3 0 3 3 Hmp05 3.3 0.3 0.3 0.0 0.0 3.0 0.0 0.0 0.0 0.0 3.0 0.0 0.0 0.5 0.5 3.5 0.5 0.5 Aunmwm swab wmoa mo wonoom noozv mSOHpHocoo money now ucoonom one 2 0n mHooouonm 905 00H pow 09000 thHHnoHHom com: 0 0000 >H 00000 a0>0m mo 0wm0nocH cm 900 pcsooom 03 >0: mwnu SH 000:00009 mo noneac CH 0mmmnocH :0 0909000000: 005 pH .HOUOponm 0 SH HHHmm 0H 00:00009 0:0 nmnp 0QOE nonxe .GOHpHonoo mwcp n0oo: .z Hopow 02p :0 000:00009 .mpHHmm 000:“ we muo0mm0 Hepc0owoom 03p 0>oE0n ou 0.00 5.00 0.00 0.50 0.50 0.00 0.00 0.00 3.05 5.00 3.33 0 0>HpmHneso 0.00 H. 0. 3.H 0. 0.H 0.0 0.3 H.0 5.0 0.00 3.33 0 0000 H 5 0H 0 0H 30 50 mm 30 0H0 300 2 00.0 o0>oa0m “HHmm mo #60000 Hmpceewoo< H.00H 0.00 0.00 0.50 3.00 0.00 0.00 0.55 0.05 0.03 0 0>H00Hseoo H.00H I 0. 0.H 5. 0.H 0.0 0.3 5.5 0.5 0.00 0.03 0 000 I 0 0H 0 0H 00 30 00 m0 000 003 2 00.0 U0>OE0m Po0wmm GOHHMUHMHHMSO I 0.00 0.00 0.00 5.30 3.00 H.00 0.00 5.H5 0.00 0.00 0 0>090Hseso 0.00 I 0.H 0.0 0. 0.0 0.5 0.3 0.0 5.0 0.30 0.00 0 000 I 3H 00 m 00 00 00 05 05 0H0 000 z 0H.0 om>os0m #00000 MCHooo oz x 00060 0 a 0 0 3 0 0 e 0 0 00 00660 00000000000 wmmcomm0m 909 HHm now moowaonoo 009:9 wow 0 0:0 2 00 000660 50000000000 > 0Hn09 CHAPTER III RESULTS General Using the procedures outlined in Chapter II, it was feund that 72 percent of all protocols were assigned mean reliability scores of eight or better under the realistic condition. In line with this finding, 72 percent of 858 reSponses obtained reliability scores of eight, nine or ten, and the mean reliability score across all reSponses was 8.19, while the modal reliability score was a perfect score of 10. 38.5 percent of all items were coded with perfect reliability across 15 paired comparisons. As was noted earlier, comparison of six coders entails comparisons of 15 pairs. Table IV gives the N and percent of all protocols assigned discrete reliability scores (to the nearest tenth) under conditions in which no coding effects have been removed from consideration. Table V gives the N and percent of all responses assigned discrete reliability scores (O-lO) under conditions in which no coding effects have been removed from consideration. These results relate to the basic hypothesis that there will be a high level of reliability among the six coders on the coding of the TST pgptocols and on the coding_of the TST responses. Although the results did not reach the eXpected level of 75 percent either for protocols or reSponses, the mean reliability score per reSponse did exceed the minimum expected value of eight. In addition, the large percentage of responses H6 #7 which obtained a modal reliability score on the extremely high side of the reliability range indicates the excessively skewed distribution in the direction of high reliability. This skewed effect is made more apparent in Table IV and Table V. When the Effect of Qualifiers is Removed Hypothesis one stated that when the difference between consensual reSponses and evaluative-consensual reSponses is overlooked, reliabilitz is exPected to appear as greater than under realistic conditions. The basic hypothesis was tested under conditions that removed none of the effects which would Operate in the reality of the coding procedure. Hypothesis one must be tested under the condition which removes the effect of qualification from the coding of protocols and responses. For examples of this procedure see page “1 and Appendix C, Section 11. Under this condition, 77 percent of all protocols obtained a mean reliability score of eight or better, while 78 percent of all responses were assigned scores of eight, nine or ten. The mean reliability score across reSponses was 8.60, and while the modal reliability score persisted at the value of 10, there was an increase in reSponses accounted for by this mode of 8.4 percent (u6.9 percent) These results support hypothesis one since they represent (1) an apparent increase of five percent in the number of protocols having a mean reliability of 8 or better, (2) an apparent increase of seven percent in the number of reSponses scored reliable at the eight, nine or ten level, and an increase of .u in the average reliability of all responses. More complete results for the test of this hypothesis may also be found in Table IV and Table V. l+8 When the Accidental Effect of Splits is Removed Hypothesis two stated that when differences between coders, in the decision of whether to divide a statement into more than one statement for coding purposes, are adjusted so that only the immediate statement is affected, rather than all subsequent statements in the protocol, reliability is expected to appear as greater than under realistic conditions. Hypothesis two also must be tested under a hypothetical condition which removes a real effect from consideration. The results obtained under that condition represent an apparent increase of eight percent (80 percent) in the number of protocols with mean reliability of eight or better. An increase of eight percent (79 percent) in the number of reSponses scored as reliable at the eight, nine or ten level was found. The mean reliability of all responses increased .MO to 8.59 and the reSponses accounted fOr by the modal score of 10 increased 5.9 percent to uu.u percent. Cleardy the apparent increase of reliability which was hypothesized has been obtained. These results are made more explicit in Table IV and Table V. Factors Present in Low Reliability At this point, to visualize what sorts of responses produce certain levels of reliability, you may wish to turn to Appendix C, which lists each item and protocol, in conjunction with its score under three con- ditions. The most unreliable protocol was #736, which reads: I am an individual common everyday person. Try to live up to the laws of America 8 pay my bills. Try to bring up my family 8 get as meddui (sic) as you I can. H9 The most unreliable statements when accidental effects of Splits had been removed were: 1. My family. 2. I stay home on the week ends. 3. I'm a Negro and proud of it because its Gods (sic) intention. u. I sing in the choir. 5. I am a person who believes a happy marriage is the basis for success. . I live alone. 7. My occupation. 8. I am a person who would like to become head of large business firm. 9. I love my family and want to provide for them any way I can.* 0) For one of these there were no agreements, and for eight of these statements there was only one pair of coders in agreement. It is interesting to notice that the categories involved in these very un- reliable reSponses were repeaters, as can be seen in Table VI. The only non—existent category (73) used by any coder was used in coding the one response which was scored as zero in reliability. Reliability Related to Complexity Level of Code Category In order to test hypothesis four as content categories increase in complexity, reliability will lessen, each item was scored on the following scale of complexity, if there were at least four of six coders in agree- ment on complexity level: Complexity Score Name Categories 1 Personal Categories 10-17, 70-71, 2 Primary Categories 20-27 3 Secondary Categories uo-ua, 52-55 a Categorical Categories 30-31, 50—51, 56-59, 62-63 5 Catch-all Categories 60-61, 72 T“*‘-———' ' 3These responses are from several protocols, not a Single protocol. Category 72 Non-split 43 48 51 59 62 63 22 1+1 58 61 21 23 1+2 IILI as 53 73 50 Table VI Code Categories by Frequency of Use For Nine Responses Coded Least Reliably Description Other idiosyncratic (residual) (Not a category) Qualified general reference to work General religiosity references, (qualified and unqualified) Qualified reference to nation, race, ethnic or language group Qualified other categorical referents Domestic references (unqualified) Domestic references (qualified) Marital terms (unqualified) Qualified occupational title Other categorical referents, (unqualified) Qualified other consensual Qualified kinship, excluding marital terms Qualified marital terms General reference to work, (unqualified) Formal organizations, (unqualified) Formal organizations, (qualified) Qualified reference to religion, religious organizations (A non—existent code category) Frequency 8 6 51 It was expected that a complexity score of five would produce about the same reliability Score as a complexity score of one. The decision to treat these two scores separately and merge them later was made because they were unlike groups in many ways. The analysis for this hypothesis was done under only two conditions (1) the realistic con- dition in which no coding effects are removed from consideration, and (2) the hypothetical condition in which the accidental effects of Splits were removed from consideration. It was inappropriate to test the hypothesis under the condition in which the effect of qualification was removed from consideration, since that made up a large part of the effect this analysis was concerned with. The results are reported here in tabular form. There seems to be no relationship of the type hypothesized, Table VII Modal Complexity Score of ReSponses By N and X Reliability for two Conditions Conplexity Score 1 2 3 u. . 5 Total No Coding Effects Removed N 83 209 166 102 231 *786 Beliability Score 9.1 9.2 8.9 8.5 9.0 Accidental Effect of Split Removed N 39 197 158 83 217 #694 iReliability Score 9.2 8.8 8.6 9.2 8.9 *Only responses judged by a majority of all coders to be on a single complexity level were analyzed, thus accounting for a decrease of 112 resPonses from the total N of reSponses. #Removal of the accidental effect of Split statements created enough redistribution of categories to lower the N of responses analyzed.by an additional 59, while the total N increased by seven, thus the net loss of 52. 52 at least not when complexity is defined as it was here. It did not seem appropriate to run a statistical correlation of complexity score with reliability score as originally planned. Reliability Related to Number of Statements in Protocol Hypothesis three, the more statements made on a TST protocol, the lower will be the mean reliability score for that protocol, was tested by running a correlation between number of reSponses in the protocol and mean reliability score of the protocol. Since no Specific condition was specified in the hypothesis, all three conditions were tested. The results of the three correlations are: No Coding Effects Removed - r -.2177, (p<:.01) Qualification Effects Removed - r -.09117 (NS) Accidental Effect of Splits Removed - r +.000590 (NS) The hypothesis is supported under the realistic condition. However, since the correlation is very nearly zero when the accidental effect of Splits is removed, prdbably the negative correlation can be accounted for entirely by that accidental effect, rather than by any effect of "more responses appearing more complex to the coder," on which this hypothesis was also based. Results of Testing for Independence of Coders Since the reliability results reported.here depend to a great extent on the assumption that the coders were independent, it seemed wise to test this assumption, in the form of a hypothesis. This test may be 53 thought of as a validity check of the reliability test which is being reported. To put it another way, if the level of reliability which we have measured and reported is actually a valid measure it must be measuring no contaminating variables along with "true" reliability. If we find that the coders are related in their agreements when they are contributing to unreliability, we could no longer assume that they were not related, by some contaminating variable, in their agreements which contribute to reliability. In such a situation, the level of measured reliability would be higher than the level of true reliability by some unknown amount of contribution from the contaminating variable. Therefore, we translated our assumption of coder independence into the hypothesis that there will be no relationship in the pattern of coder agreements which contribute to unreliability, This hypothesis was tested by a Chi Square analysis of the N of paired agreements for 15 pairs of coders. It was tested under only two conditions and not tested under the condition in which qualification effect is removed. Three separate tests were run for paired agreements, triple agreements and a combination of these when they contribute to unreliability. "No relationship” was operationalized by using as the expected value for the Chi Square analysis an amount of agreements equal for each of the 15 pairs of coders, which is equivalent to the average obtained N of agreements across all 15 pairs. 59 Table VIII 2 X Values for Coder Pair Agreements Under Two Conditions (df = 19) N Expected . X2 Significance Value Nb‘deing’Eff5cts Removed Paired Agreement 168 11 15.9 as Triple Agreement 239 16 6.1 NS Paired + Triple 407 27 10.8 NS Accidental Effect of Split Removed Paired Agreement 150 10 33.7 p<:305 Triple Agreement 210 19 3.0 NS Paired + Triple 360 24 15.8 NS It appears that there may be a contaminating variable present, which only becomes apparent after removal of the accidental effect of Splits from consideration. In order to make more explicit the relationships between coders which may, unwittingly, be contributing to measured re- liability, the relational model in Figure l. was constructed. Here each solid line represents a positive relation and each broken line :represents a negative relation between two coders, A,B,C,D,E, and G (Gallup). The values associated with each line indicate the amount and direction of difference between the obtained number of agreements and the eXpected (average) number of agreements. 55 Note that the shortest line represents the most positive relationship (+12), while the longest line represents the most negative relationship (-6). It was not possible to reflect the value of each relationship in the length of each line. To achieve this effect, it would be necessary to go to three, rather than two dimensions for the model. The most notable change would be the positioning of G at a point above this page. *1 ‘\\ / \ <3, -6 / -5/ \ \ / ./*~./' // 1:“ ,0>:\(’ / \6 $47175 / 1" \O-‘ £4-11“; Figure 1. Model of Coder Relationships This figure is based only upon the one Chi Square analysis which Proved significant. In explanation of the strong positive relationship between C and B, and between E and D, I offer that C and B were two male 56 ‘ graduate students who had shared office Space for several years, and that E and D are a married couple. Probably the contaminating variable un— earthed here, is that these two pairs of the Six coders tend to ”think alike" due to habitual association and interaction. To this extent, then, the coders can not be thought of as independent. The somewhat strong positive relation between G and both E and D is without rationale, since these two pairs had no symbolic contact of any kind, while a number of written communiques regarding the TST coding procedures had passed between A and G. CHAPTER IV CONCLUSIONS The Basic Hypothesis of Reliability .The basic hypothesis of high reliability, under the realistic condition in which no coding effects wre removed from consideration was not supported at the level arbitrarily selected, i.e., 75 percent or more of all protocols, received a score of eight or better. The obtained results it will be recalled were that 72 percent of the protocols received a reliability score of eight or better. Let's return for a second look at the basis for this hypothesis. In the first place, the decision to set the score of eight as an acceptance level, rather than a score of seven, while reasoned out in some detail, makes the assumption that a difference of one in paired agreements (between the score of eight and the score of seven) is somehow more important than a difference of one in the Size of the mode (between the score of seven and the score of Six). Therefore, we could also rationalize, after the fact, that the acceptance level could have been set at a score of seven, as easily as at a score of eight, and reference to Part B of Table IV shows that such an alteration would have produced high reliability in 85 percent of all protocols, well within the acceptance level of 75 percent or over. In the second place, the hypothesis was based upon (1) greater complexity of code categories than had been used in the past to code the TST, plus (2) possibly a greater range in complexity of the TST reSponse 57 58 protocols, due to a unique use of the TST on a heterogeneous sample rather than a homogeneous group, plus (3) the fact that testing reliability across six coders and 15 pairs of coders, rather than two or three coders as in past studies, was a more rigorous test of reliability, and finally (9) upon the obtained reliability results (97 percent) of Kuhn and McPartland. From this line of reasoning it is possible to decide, as we did, to lower the expectation of reliability from the 97 percent level obtained in the past. However, there were no guidelines as to how much lower was low enough to account for the three differences in treatment. Therefore, we feel the obtained reliability figure supports our contention that the reliability figure should be lower than 97 percent. Beyond that one could rationalize that either (1) three percent is a very small difference from accurate prediction, or (2) we were mistaken in our arbitrary choice of a 75 percent acceptance level. We will not indulge in either line of rationalization, however, and the fact remains that reliability did not reach the level we thought it would. The problem remains for other behavioral researchers to assess the processes which operate in arbitrary decisions regarding numerical levels. While there may be individual differences between researchers, I suSpect that there are also cultural factors involved in such matters as (1) odd number versus even number, (2) how much is big, (3) how much is majority, (u) how much is little, (5) how much or how little is significant, etc. Obviously the most important cultural variables are the numerical base used and the habitual numerical operations reflected in monetary matters and pricing practices. I suSpect in this instance 59 that a numberical base of ten influenced the arbitrary decision to set the acceptance level at 75 percent. Furthermore, had the researcher been raised in a culture in which the monetary system used pennies (.01), nickels (.05), dimes (.10) and tomens (.30), the arbitrary choice may well have been set at 70 percent. An experimental study of the psychological and social processes involved in setting numberical acceptance levels would be of importance to the study of philosophy of science. Knowledge of Grammar related to Coder Reliability Both of the hypothetical conditions studied are related to the coders knowledge of grammar. The hypothesis that Removing the effect of quali- fication will operate to raise the appearance of reliability can be translated back to the belief that, at least for the coders involved in this reliability test, the coder competence was not at the highest possible level regarding ability to (1) identify adjectives, or (2) decide what sorts of clauses, phrases or allusions serve to modify the main clause. The hypothesis that removing the accidental effect, caused by non agreement between coders in the decision to Split a response into two reSponseS for coding purposes, will operate to raise the appearance of reliability_was based upon the belief that, at least for the coders involved in this study, coder competence was not at the highest possible level regarding (l) ability to identify a compound sentence, and (2) ability to identify separate clauses associated with compound objects. AS pointed out before, the removal of these two effects was only hypothetical, and in each case the hypothesis was supported by the results. 60 From this we can conclude that if coders were employed whose levels of competence, at these four grammatical problems, were higher, then reliability would increase in a real, rather than an apparent fashion. While it is probably impossible to remove these effects entirely, due to the difficulties of the language structure, it should be possible to lessen these effects. It seems an appropriate suggestion, that a replication of this study, using the same 150 protocols, but employing six high school English teachers, or six professional editors as coders would produce an even higher level of reliability than that obtained here, assuming adequate training in the code's content categories. Independence Related to Coder Reliability The hypothesis that there will be no relationship between coders in their agreements which contribute to unreliability, was based upon the assumption that the coders in the present study were independent. Recall that, at least after the accidental effect of Splits were removed, this hypothesis was not supported in the test of paired agreements which contributed to unreliability. A legitimate conclusion is that a replication of this study using coders who are selected in some manner so as to make them more independent would measure reliability at a lower level than the level measured in this study. For example, using the same TST protocols one might ask Six graduate students in social psychology to act as coders, with the limitation that each graduate student be studying at a discrete university, and never have met any of the other coders. 6l In line with the suggested replication using coders of increased competence in grammar, one would eXpect reliability to increase if the six high School English teachers all knew each other and taught at the same high school, while one would eXpect little change in reliability if the six high school English teachers taught in separate cities or had never met; one would expect increased reliability if the six editors all worked in the same publishers office, while one would expect little change in reliability if the six editors worked for discrete publishing companies and had never met. Because an increase in the grammatical competence of the coder is expected to increase real reliability, while an increase in coder independence is expected to decrease measured reliability, it is probable that the two factors would cancel each other out. Therefore, an increase in coder grammatical competence coupled with an increase in coder in- dependence might well create little effect on the level of measured reliability. Factors in the Present Code which Limit Reliability The casual finding that certain code categories contribute to ex— treme low reliability in a disprOportionately greater amount than other code categories, while still other code categories do not contribute at all to extreme low reliability suggests that there are real differences between code categories as to the reliability one can impute to their assignment to a reSponse by an individual coder. Originally it was our intent to additionally study these differences in the several code categories, since, on the face of it, some code categories seem less clear-cut than others. However, our own bias places higher value on a 62 study which tests hypotheses than on a study which does eXploratory counting, and there seemed no basis upon which to hypothesize this variable, the definitiveness of the code category. An attempt was made by testing the hypothesis that as complexity level of the category increased, reliability would decrease, but this hypothesis was not born out. Since the scale used to Operationalize ”complexity level" does not appear to be measuring what I referred to above as definitiveness of the code category, it was not really surprising that the hypothesis was not supported. ' If some scale of definitiveness can be developed, then a hypothesis and corresponding test of the correlation of definitiveness level with reliability level could be run in a future reliability study. To point up levels of definitiveness, let us examine the actual code, as it appears in Appendix B. If as Berelson generalizes, the "more precise and complete the set of coding rules, the fuller the illustrations," (3, p. 514) the more reliable will be the coding; then this code is lacking only precision. Certainly the code approaches completeness and fuller illustrations would be beyond comprehension. Actually grave attempts at precision were made in the writing of this code. However, its very length, completeness, and fullness of illustration operate to nullify the attempt at precision. Following are some of the inconsistencies in the code which lead to imprecision. Reference will be made by page number and row on the page, numbering rows from top to bottom. lO. 63 On page 73 #9, row 3, compared with row 6 -- ”I'd rather not answer that question." is labeled a refusal (99), while "I would rather keep that to myself" is labeled as idiosyncratic (72). On page 73 #9, row 6, compared with 83 #61, last row-—"I would rather keep that to myself," is labeled as idiosyncratic (72), while "I would rather keep this to myself." is labeled as qualified other consensual (61). On page 75 #ll, row 3, compared.with 83 #61, row 9--"I am a young single girl" of which the portion "I am young" is labeled as qualified age (11), while "I am my plain old self" is labeled qualified other consensual (61). Page'fli #19, row 7-— ”I am very tanned” is inaccurately labeled other physical characteristic (unqualified) (In) Page'"5 #17, row 9, compared with 78 #90, row 8, and compared with page #02, row 6--"I'm a chore woman" is labeled a qualified sex reference (17), while "I’m a charwomad'is labeled occupational title (unqualified) (HO), and "I am a working man" is labeled general reference to work (unqualified) (#2). Page '"5 #20, row 9, 10 and 12, compared with #21, row 3-—"I have eleven grandchildren" and "I am a mother of four daughters" and "I have four children -- three in school and one babyf, are labeled kinship (unqualified) excluding marital terms (20), while "I raised a family of three” is labeled qualified kinship, excluding marital terms. (21). Page 80 #148, row 6 and 7, compared to Page 81 , #53, row 7——"I am a person who believes in God" and "I am a religious person," are labeled Religiosity, (qualified or unqualified) (48), while ”I live by my religious beliefs” is labeled Qualified reference to religion and religious organizations (53). Page 77 #25, row 3, compared to page 83 #61, row 19 and compared 'to 39' row 6--”I have lots of friends" is labeled Qualified reference to a friend (25), while ”I am a friendly person," is labeled qualified other consensual (61) and ”I am fr1endly“ is labeled idiosyncratic (72). Page 83 #61, row 2 and row 6, compared with page 85 row 16 -- "I'm not much of a talker" and "I keep my mouth shut” are labeled as qualified other consensual (61), while ”I do not like to gossip," is labeled idiosyncratic (72). Page 33 #61, row 5, compared with page 89 , row 9 and 15-- "I am a happy person” is labeled qualified other consensual (61), while ”I'm happy" and "I have always been pretty happy" are labeled idiosyncratic, (72). 64 11. Page 82 # 59, row 2 compared with Page 85 row 18 -- "I am interested in Sports" is labeled qualified other categorical referents (59), while "I am interested in recreation" is labeled idiosyncratic (72). 12. Page 82 # 59, row 2, compared with page 84 row 22-- "I like music" is labeled qualified other categorical (59), while "I am creative in art" is labeled isiosyncratic (72). These illustrations of some of the Specific difficulties in the present code may indicate some of the confusion caused in the individual coder's mind by the code. One would expect that such confusions would give rise to unreliability in coding. More careful attention to the code could operate to reduce unreliability. Length of the TST as a Factor in Reliability_ The hypothesis that the more resppnses there were in a TST protocol the lower the reliability of the protocol would be was upheld by a negative correlation at a significant level under the realistic condition. This evidence points to a value in using only ten statements, rather than 20, as has been done in the present study and in Brookover's study (4). Such foreshortening of the TST should act as an aid to reliability. Future Utility of the Twenty Statements Test The ease of administration of the TST makes it a natural choice for assessing self data in the field situation, and it has particular value in any type of cross cultural study, where the question can be rather easily translated into equivalent forms in several languages. Perhaps the greatest utility for the TST in the future is to compile a body of statements about the self which can in turn be scaled by a large sample of persons. Finally by submitting these scale items to factor analysis, it should be possible to construct a standardized 65 scale of self-attitude with factor loadings for each item. Such a standardized scale would have the advantage of the TST in being self- defined by a large, representative sample of selves, plus the added advantage of structured and pre-coded reSponses, which would make un- necessary any future studies of inter-coder reliability. Another future product arising from the TST, as a generator of items, would be a Q-sort of items designed to measure self-attitudes. Such a standardized instrument would introduce the additional advantage of allowing the individual respondent to add a good deal of self definition to the measurement in the style in which he rank orders the items, while reliability remains enhanced by holding the possible reSponses constant. Another innovation which should enhance the reliability of the TST is the self-coding form under development by McPhail (24). By placing the coding under the direction of the respondent, coder reliability becomes a meaningless concept. In conclusion, let us look at some important ways this study differs from the original reliability check by Kuhn and McPartland: (l) the data being coded in the present study represented a wider range of expression, generated as it was by a heterOgeneous rather than homogeneous group of rspondents, (2) code categories being used represented 22 times as many possibilities for unreliability as the original Kuhn and McPartland code (45 categories XE: two categories), (3) six coders represent 16 times as many possibilities for disagreement as the two coders used in the Kuhn and McPartland reliability check, and (4) the present study set a standard for acceptance of reliability (75 percent of all protocols and reSponses should have a reliability score of eight or better) rather than accepting 65a whatever reliability level was obtained. All of these differences make a high level of reliability more difficult to attain. The 72 percent level obtained XE: the 97 percent level obtained by Kuhn and McPartland appears encouragingly good. In Spite of the fact that the results did not quite reach up to the p_priori acceptance level, we feel the TST can be counted upon for very good inter- coder reliability even when heterogeneous populations are sampled, a complex code category system is used and a relatively rigorous reliability test is applied. 1. 10. ll. 12. 13. 14. 15. REFERENCES Allport, G.W., Becoming: Basic Considerations for a Psychology of Personality, Yale University Press, New Haven, 1955. American College Dictionary, Random House, New York, 1947. Berelson, Bernard, "Content Analysis,” in Gardner Lindzey (Ed.), Handbook of Social Psychology, V. I., Addison-Wesley Publishing Co., Reading, Massachusetts, 1954, 488—522. Brookover, Wilbur 8., Ann Paterson 5 Shailer Thomas, Self Concept of Ability and School Achievement: The Relationship of Self Images . to Achievement in Junior High School Subjects, Office of Research 8 Publications, Michigan State University, East Lansing, Michigan, 1962. Bugental, J.F.T. and S. L. Zelen, ”Investigations into the Self—Concept. I. The W-A-Y Technique," Journal of Personality, V. 18, 1950, 483-498. Cooley, Charles Horton, Human Nature and the Social Order, Charles Scribner's Sons, New York, 1902. Couch, Carl J., "Family Role Specialization and Self—Attitudes in Children”, Sociological Quarterly, April, 1962, 115-121. , "Self-Attitudes and Degree of Agreement with Immediate Others,” American Journal of Sociology, 1958, 491-96. ' , Unpublished manuscript dealing with classes of verb forms used in response to a modified form of the Twenty Statements Test. Dyer, Del A., Self Concept, Role—Internalization and SalienCy in Relation to 4-H Leader Effectiveness, Unpublished Ph.D. Thesis, Michigan State University, East Lansing, Michigan, 1962. Hartley, E. L., and R. E. Hartley, Fundamentals of Social Psychology, Alfred Kn0pf, New York, 1952. Hickman, C. Addison S Manford H. Kuhn, Individuals, Gropps and Economic Behavior, Dryden Press, New York, 1956. Interviewers' Bulletin, Gallup Organization, publication #GO/6368 and #606369, September 3, 1963. Kuhn, Manford H., "Self," in Julius Gould and William L. Kolb (Eds.), A Dictionary of the Social Sciences, Free Press, Glencoe, Illinois, 1964, 628-30. , "Self-Attitudes by Age, Sex and Professional Training," THE‘Sbciological Quarterly, V. I, #1, January, 1960, 39-55. 66 l6. 17. 18. 19. 20. 21. 22. 23. 24. 25. 26. 27. 28. 29. 67 . and Thomas S. McPartland, "An Empirical Investigation of’SeIf—Attitudes,” American Sociological Revue, 1954, V. 19, 68-76. Lindesmith, A. R. and A. L. Strauss, Social Psychology, Dryden Press, New York, 1949. Lipkin, S. "Clients' Feelings and Attitudes in Relation to the Outcome of Client-Centered Therapy," Psychological Monogrpphs, 1954, V. 68, 1-30 0 McPartland, Thomas, S. D., The Self and Social Structure, Unpublished Ph.D. Thesis, University of Iowa, Iowa City, Iowa, 1953. McPartland, , John H. Cumming, Wynona S. Garretson, "Self-Con— caption and Ward Behavior in Two Psychiatric HOSpitals,” Sociometry, June, 1961, 111—124. McPartland, and John H. Cumming, "Self—Conception, Social Class, and Mental Health," Human Organization, Vol. 17, #3, 1958, 24-29. Manual for the Twenty Statements Problem, Department of Research, The Greater Kansas City Mental Health Foundation, Kansas City, Missouri, 1959. McPhail, S. Clark, Self-Identification Within a Specific Context of Experience and Behavior, Unpublished Ph.D. Thesis, Michigan State University, East Lansing, Michigan, 1965. McPhail, Clark, Unpublished manuscript on develOpment of a self- coding form of the Twenty Statements Test. Mead, George Herbert, Mind, Self 8 Society, University of Chicago Press, Chicago, 1943. Meltzer, Bernard N., The Social Psychology of George Herbert Mead, Division of Field Services, Western Michigan University, Kalamazoo, Michigan, 1959. Mielke, Keith W. Evaluation of Television as a Function of Self— Beliefs, Unpublished Ph.D. Thesis, Michigan State University, East Lansing, Michigan, 1965. Raimy, V. C. "Self-Reference in Counseling Interviews,” Journal of Consulting Psychology, 1948, V. 12, 153-163. Scheerer, E. T., "An Analysis of the Relationship Between Acceptance of and Respect for Self and Acceptance of and Respect for Others in Ten Counseling Cases," Journal of Consulting Psychology, 1949, V0 13, 169-1750 30. 31. 32. 33. 34. 35. 36. 37. 38. 39. 68 Schutz, H., Theory_and Methodology of Content Analysis, Unpublished Ph.D. thesis, University of California at Los Angeles, 1950. Stewart, Robert L., An Empirical Test of the Reference Group Hypothesis, Unpublished M.A. thesis, University of Iowa, Iowa City, Iowa, 1952. Tucker, Charles W., Occupational Evaluation 8 Self—Identification, Unpublished Ph.D. Thesis, Michigan State University, East Lansing, Michigan, 1966. Vargas, M. J., "Changes in Self-Awareness During Client-Centered Therapy," in C. R. ROgers 6 Rosalind F. Dymond (Eds.), Psychotherapy and Personality Change, University of Chicago, Press, Chicago, 1954, 145-166. Webster's Elementary Dictionary, American Book Company, New York, 1941. Wylie, Ruth C., The Self Concept, A Critical Survey of Pertinent Research Literature, University of Nebraska Press, Lincoln, Nebraska, 1961. Bugental, J. F. T., ”A Method of Assessing Self and Not-Self Attitudes During the Therapeutic Series,” Journal of Consulting Psychologists, 1952, v. 16, 435-439. Stock, Dorothy, ”An Investigation into the Intercorrelations Between the Self-Concept and Feelings Directed Toward Other Persons and Groups.” Journal of Consulting_Psyphologists, 1949, V. 13, 176-180. Raskin, N. J., "An Objective Study of the Locus-of-Evaluation Factor in Psychotherapy," in W. W. Wolff 8 J. A. Precker (Eds.), Success in Psychotherapy, 1952, New York, Grune 8 Stratton, 143-162. A Manual for the Analyzed Data from United States and Mexico, Charles W. Tucker, principal editor, Michigan State University, Sociology Department, East Lansing, Michigan, September 18, 1964, mimeo. APPENDIX A Form of the TST in the Interview Schedule 70 APPENDIX A Form of the TST in the Interview Schedule Do not spend more than three minutes on the next item. INTERVIEWER: Probe unti1_you get at least five statements from the When you must_probe, indicate with a respondents. horizontal line below the number of his last statement. Although probably new Now, we have quite a different thing for you to do. to you, it is easy and I think you will find it quite enjoyable. Everyone Now, let me tell we have asked to do this has found it to be interesting. you what we have in mind. Think of as many answers as you Ask this question of yourself, "Who am I?" can in answer to the question, "Who am I?" In a moment I would like you to give me the answers as if you were giving them to yourself, not to me or anyone else. Take a little time to think about it. (PAUSE HERE MOMENTARILY.) 3. Now, please make what you consider to be the most important statement about yourself first. 1. b. Now, make what you consider to be the next most important statement about yourself. 2. Now, make what you consider the next most important statement about yourself. - 3. (PROBE) Are there any other statements you could make about yourself in answer to the question, "Who am I?" APPENDIX B TST Code Book 72 READ CAREFULLY - GENERAL CODING INSTRUCTIONS FOR ITEM 17 - Who am I? A STATE- 1. The object is to code all statements as separate items. MENT CAN BE A SINGLE WORD, A PHRASE, OR A COMPLETE SENTENCE. Thus, you may find more than one statement on a particular line in Each such statement will be coded separately. the questionnaire . Examples: a. carpenter; a happy fellow; I am a good father (This is three separate statements.) (This is two statements.) b. I am a father and husband. I am a very hardworking, unhappy father. (This is one statement.) I am handsome but weak. (This is two statements.) . I am a young single girl. (This is three statements.) I am a American Negro citizen. (This is three statements.) "hmg, GENERAL RULE: Any element which can be construed as a single thought A noun and its accompanying about oneself will be coded as a statement. modifiers is usually a single thought. Adjectives modifying nouns are to be included with the noun (as in example c. above). Adjectives which stand alone as self descriptions (as in example d. and e. and f. above) will be counted as statements. Usually only one state- 2. Go throngh question 17 and mark off statements. ment will be fbund on a line. In those cases where more than one state- ment is found on a line, code each of the statements in order, beginning with the first mentioned statement. 3. Each statement will be coded into a two column code described below. In case there are less When you get to ten statements, ignore the rest. than l0 statements, put code 88 into the unfilled columns. u. Coding of this item can best be done by following two procedures: a. Matching statements on the protocols with statements provided as examples for each category (see following pages.) b. Following these General instructions with Special attention to the Arbitrary Rules (see following pages.) 5. The titles of the categories are only suggestive of the range of state- ments to be included in each category. Basically, the category serves However the logic to collect statements which have some common referent. of placing certain statements in the same category may not always be apparent. Attention to such questions as "What did the reSpondent really mean to say?", or "How can this category placement make sense in the context of what the respondent said in another statement?", will lead to random coding, and low inter—coder reliability. Therefore, the best check on where a statement is to be coded is to try to find an example like the statement in the list of examples, or one clearly close to it. A final check should be made to make certain that the category assignment does not violate any of the arbitrary rules and exceptions. 73 GENERAL CODING INSTRUCTION - Who am I? (Continued) 6. Definition of a qualified and unqualified statement: a. A QUALIFIED statement is any statement which is modified by the inclusion of an adjective or adverb, along with the noun and verb. Sometimes a verb will modify a statement, as in the statement "I hate work." - b. AIUNQLALIFIED statement is any statement which is not modified by the inclusion of an adjective or adverb. This would be only the noun or verb, in cases where the verb has no qualifying effect. Examples: I am a poor student. (This is a qualified statement.) I am a father. (This is an unqualified statement.) 7. Each statement should be coded disregarding (a) negation or affirmation and (b) verb tense. Examples: Treat the statement "I am not a mother." as in the same class as "I am a mother." Treat the statement "I was a foreman." as in the same class as "I am a foreman." 8. The distinction between "consensual" and "idiosyncratic" statements can best be understood by reading‘through the list of examples provided. All of the references in the list of idiosyncratic statements are to events, states of mind, or processes which are private, lacking in Specificity, or lacking in general agreement as to meaning. On the other hand, all consensual statements have at least one word which points to something most people would recognize, and have some agreement for the meaning of. 9. Use a 99 to code all columns in the case of a clear refusal. We define a refusal as one of the following: a. I'd rather not answer that question. b. I don't want to answer that question. c. Would you please go to the next question. "I don't know." and "I would rather keep that to myself" are not defined as refusals to the question but as reSponses to the question which should be coded as Idiosyncratic (72) ARBITRARY RULES A. In cases of statements which fit into two or more categories, aSSign the statement to the category closest to the top of the list numerically, with the following exceptions: 1. Any reference to health, whether consensual or not, should be coded as a 70 or 71, depending on the matter of qualification. 2. Any reference to an organized church or religion should be coded as 53 or 52 regardless of what other codable items are mentioned in the statement. 7“ GENERAL CODING INSTRUCTIONS - Who am I? (Continued) 3. Any reference to race, irrespective of other references in the statement which are codable, should be coded 50 or 51. A. General religiosity references frequently include words that refer to kinship, sex, work, etc. However, all religiosity references should be coded as N8. (see examples) Code the statement as it is recorded on the schedule. Do not add or subtract words or impute meanings. The only exception is described in rule C below. In compound sentences with a single stem, treat the clauses as if each were separate statements with the stem attached to each clause. Examples 1. "I am interested in my house, in recreation, and in education." should be treated as the following three statements: a. I am interested in my house. b. I am interested in recreation. c. I am interested in education. 2. "I am an American Negro citizen." should be treated as the following three statements: a. I am an American. b. I am a Negro. c. I am a citizen. 3. "I am a young single girl." should be treated as the following three statements: a. I am young. b. I am single. c. I am a girl. In cases where titles are mentioned which include several words, treat the entire title as the object of the sentence. Examples: 1. I am a U.S. citizen (Do not treat U.S. as qualifying citizen.) 2. I am an executive secretary (Do not treat executive as qualifying secretary) 3. I am a retired boilermaker (Do not treat retired as qualifying boilermaker.) u. I am a successful medical doctor. (Treat successful as qualifying medical doctor) When coding initials (e.g., B.P.O.E., G.O.P., I.0.0.F., etc.) which have not been defined by the interviewer on the schedule, all initials, whether they are recognized or not recognized by the coder, should be coded as 58 or 59 depending upon qualification. Example: "I am a B.P.O.E." (This is an unqualified statement - 58) "I am a good B.P.O.E.” (This is a qualified statement - 59) 75 TST CODE 10 - Age (unqualified) Only include Specific age such as "I am 25 years old.” 11 - 12- 13 - l4 - 15 - 16 - Statements referring to "old" or "young" are NOT to be included in this category but are coded as 11, I am 25 years old Qualified age I am almost 35. I am (a) young (single) (girl) *Note: This is the first example of a sentence in which there is more than 1 codable referent. In this case ”I am young" is coded as ll; "I am single" is coded as 22; and, "I am a girl" is coded—as 2&3 Name This would be the first, last or both names with or without the prefix of Mr., Mrs., or Miss. I am Ardith Weaver I am Willie Ella Barnell My name is Joseph Smith Address This would include only street address and not the name of the community or nation. I live at 712 Creek Road Other Physical Characteristics (Unqualified) -This could include references to height, weight, skin color, eye color, etc. This does not include "colored", "negro", etc., which is coded under 50, 51. I am bald-headed I am 5' 2" tall. I am very tanned. Qualified physical characteristics I am nearly 5'2" tall. My hair is turning gray. Sex (unqualified) This would include the words male or female. It would also include the words; man, woman, boy, girl, lady, and gentlemen. I am a man I am a female I am a woman l7 - 18 - 19 - 20 - 21 - 76 Qualified Sex Reference I am a contented woman I am a nice gentleman I'm a chore woman *I am a man who is loved by his friends *In the fourth statement there I am a good man are two possibilities of I am a self-made man ‘ coding: friends or sex ref. I am a fortunate man categories. Following Rule 1 I am a man able to help my children the statement is pushed up _' I am a free man. to the first of the two I'm the man of the house. possible categories. I am a man who does a lot of public service work free. You are a nice gentle woman that paid me a visit. OPEN OPEN Kinship (unqualified) Excluding marital terms. Including such terms as family, child, parent, relative, daughter, son, father, mother, sister, brother, uncle, aunt, grandfather, grandmother, great aunt, great uncle, godmother, godfather and all in-law referents. Do not count number of children or relatives as a qualification. I am a grandmother have children am a mother have eleven grandchildren am a mother of four daughters am a father to my children have four children -- three in school and one baby. am making a living for my family. HHHHHHH Qualified Kinship. Excluding marital terms. I'm a good relative. I raised a family of three. I come from a good family. I have a very happy family. My family is all gone. can educate my children. am a referee for my daughter. am proud of my kids. am interested in my children' 3 education. hope to have made my folks happy in my life. am a farmer' 3 daughter am not the same as I would be if my mother was living. enjoy being close to my family. am loyal to my family. The funre lies in the hands of mothers 1, as a mother, am reSponsible for the future generation. My family is the most to me. HHHHHHHHH 22 23 - 21+- 77 I am a family man. I am a devoted family man. As a mother, I can't be done without. And my children whom I couldn't do without. I am a person whose parents are both working. I am not the person I would be if I didn't have a family. Marital terms (unqualified) This would include the terms husband and wife, married and single, and those referring to marital status including divorced and separated. am a wife. am single. am divorced. am married. am my husband's wife. HHHHH Qualified Marital Terms I'm a wife of a progressive businessman. I lost my wife. I am proud of my husband. I'm going to be married. I have a nice wife. I am not the person I would be if I wasn't married. I married a farmer. I'm important to my husband. Financially, I like to feel I am helping my husband. Ask my wife, she would say I am a gun nut. I help mate financially. Reference to Friend (unqulaiified) This would include those statements with the word "friend" in them. A statement referring to the reSpondent's own friendly quality would not be included, e.g., "I am friendly" would be coded as 22, I am a friend. I have friends. 25 - Qualified Reference to a Friend 26 I have many friends. I have lots of friends. I am people's friend. I am a friend to other men. I'd like to make more friends than I have. Other (Residual) Primary Terms (unqualified) This would include neighbor, have neighbors, peers, bridge clubs, buddy, pal, colleague, associate, companion. I am a neighbor. I am a companion. 27 28 29 30 31 32 33 39 35 36 37 38 39 no __q——..~ . m*.~. .-.-\._.._ , 78 Qualified Other (Residual) Primary Terms I hope that I am a good neighbor I am a good neighbor I have never had much trouble with neighbors. OPEN OPEN Reference to Community (Unqualified) This would include any reference to the respondent's town, city, community, or state. I live in New York. I am a Californian. I am a Michigander. Qualified reference to community I am from a poor community. I am a community service worker. I am a citizen of a nice community. I am a community worker. I like to mix with community groups. I am a good community supporter. I am an example in the community. I am a civic minded person. I am loyal to my community. I am well-liked in the community. OPEN OPEN OPEN OPEN OPEN OPEN OPEN OPEN Qgpupational Title (Unqualified). This would include all names of specific occupations including the term "laborer", housewife. I'm a dish-washei I am a teacher *I'm a retired housewife I am a chauffer I am a seamstress I'm a painter I am a punch press operator I am a pastor I am a beautician I am a housewife I am a musician I'm a char-woman a“Retired" in conjunction with an occupational title is not, for our purposes, considered as a qualifier of occupation and should be coded as 393 79 I am a post-master I'm a lawn mower I am a banker I am a gardner I'm a filling station operator I'm a knitter I'm a carpenter I am a civil worker *I am a retired truck driver I am a cook I am a farmer I am a Spiritual advisor I am a nurse I am a nursemaid I'm a counselor I am a political worker I'm an entertainer I am a foreman on my job I am an educator I am a religious instructor #1 - Qualified Occupational Title am an enthusiastic advertising man. am a happy housewife am a very average businessman am a good housekeeper hope to become a good author. ran a foster home for five years in New York state. I'm a housewife, all inclusive. I'm a farmer, still able to do a day's work. I don't like housework. I love housework. I'm tops at housework. I just finished some housework. HHHHHH H2 - General Reference to Work (Unqualified). Includes reference to work, worker, retired, unemployed. I work. I am a laborer. I work 10 hours a day. I am a working man. I'm retired. I am employed. I am making a living. 43 - Qualified General Reference to Work I am a worker by trade. I am good at work. I am a good worker. I am a poor working man. I'm dedicated to my work. When I work and am hired to work, I do the best of my ability. I don't Shirk my job. I work everyday. I'm a hard working man. I'm a good employer. I like my work. I am fair and average in my work. I like to work. I am strong in buying and selling property. I enjoy making a living *"Retired" in conjunction with an occupational title is not, for our purposes, considered as a qualifier of occupation and should be coded as NO. un- us- HB - 1+7- 1+8- ug- 80 I live by the sweat of my brow. My job comes first. I have pride in my work. You have to (work) if you want to keep even. Formal Organizations (Unqualified) This would include all educational, professional, civic, social, farm and labor organizations. THIS WOULD NOT INCLUDE ANY RELIGIOUS OR POLITICAL ORGANIZATIONS. All initials should be coded as 58 or 59 depending on the matter of qualification. -—- I'm a Rotarian. I am a member of the Eastern Star. I'm a Mason. I am a member of organizations. I'm a class Sponsor. I am a Boy Scout Committeeman. goalified Formal Organizations. Excluding Religious and Political Orgs. I am a loyal unionist. Reference to Education (Unqualified). Excluding occupational titles such as teacher, instructor, sponsor, advisor, etc. I am a student. I would say I am a student. anlified Reference to Education I enjoy studying. I am a tired Student. I am going to get my BA degree next winter. Religiosity (Qualified or Unqualified) Excluding references to church, —~A formal religion or religious organizations. am one of God's workers put here to make this world better than when I came here. H I try to serve the Lord in every way possible. I am a person who believes in God. . I am important to the Lord as an indiv1dual I was blessed by the Lord in everything. *I am a Christian lady. *Where kinship referents are used here, *I am a child of God. (child, children), and sex referents, *I am one of God's children. (lady) disregard usual coding by kin- I lead a Christian life. ship and sex and place all such I am a creature created by God. referents to religiosity in fig, OPEN 81 50 - Reference to Nation, Race, Ethnic or Language Group (Unqualified). 51 - 52 - 53 - *I enjoy my church work. This would include reference to nationality and citizenship. American, English, Finnish, Negro, White, Spanish (Speaking) etc. Any referrent to being Jewish should be coded as either 52 or 53 depending upon qualifications. '_— -__ I'm English. I am a Negro. I am a white woman *I am a American (Negro) (citizen) *Sentence containing more than 1 *I am a (American) Negro (citizen) codable referent. In each case *I am a (American) (Negro) citizen. here, the code is 50. I am a (middle-class) citizen. .__ Qualified Reference to Nation, Race, Ethnic or Language Grogp I am a good U.S. Citizen I am a common everyday American I am a privileged American I'm glad that I am an American by choice I am a person with opportunities unequaled in U.S. I am as deep as anyone in the progress of our country I am glad to be in America I am a visitor to this country Reference to Religion and Religious Organizations (unqualified) This would include all church or church-related groups, membership and participation in same. Jewish would be included in this categOry. Initials that can not be recognized by the coder should be coded either 60 or 61 depending upon qualification. I am a Catholic I'm a church member I am a church goer I go to church. I am Jewish Qualified Reference to Religion and Religious Organizations I am a good church member I am important to my church I am loyal to my church . *In the fourth item "work" is con- sidered here as a modifier of church. Church is categorized as referring to Religion Organization and it is thus coded as 52 or 53 depending on qualification. It should not be pushed up to #2 or 43. I am free to choose my religion I live by my religious beliefs. I go to church every Sunday. I go to church regularly. 82 SN - Reference to Politics and Political Organizations (Unqualified) This 55 - 56 - 57 - 58 - 59 - 6O - would include reference to political, politics, all political parties, political action groups and affiliated clubs and associations. Initials that can not be recognized by the coder as falling in any category should be coded either as 60 or 61 depending upon qualification. I am a voter. I am a Democrat Qualified Reference to Politics and Political Organizations I'm a poor politician I am an unhappy Democrat I'm interested in national and international concerns I feel myself as much concerned about public life as any other human I am :ble to vote Reference to Socio-Economic Status (Unqualified) This would include working class, lower class, middle class, upper class, rich, poor, wealthy or well-to-do. An evaluation by the respondent of working ability will not be included. I am middle—class I am in the upper-class I am (a) middle-class (citizen) Qualified Reference to Socio-Economic Status. I am making an average income I don't have much money Other Categorical Referents (Unqualified) These would include such referents as sports fan, athelete, lover, member, hobbies. I belong to the AFL-CIO I am a server. I am a stamp collector Qualified Other Categorical Referents I am interested in Sports I like music. My hobbies are gardening I love dogs. My hobbies are bowling I love German shepards. I do serve a lot I'm interested in sports. I'm an outdoor person Other Consensual (unqualified) These would include such referents as: person, individual, human being, myself, me, human, homo sapien. 61 - 62 - 63 - 83 I am an individual I am me Qualified Other Consensual I'm not much of a talker I am law abiding I try to better myself There is no one like me I keep my mouth shut I'm a very conscientious person am a mere individual am my plain old self am an average person am a person brought into the world to do something am a person who lives in the future am a friendly person am a very nice person I'm a very unimportant person I'm an individual with my own beliefs I am a busy person To me I'm just myself I am a person who forgives easily I am a good person I would rather keep this to myself HHHH P£P1P4 I'm a person who enjoys living in the present I am not the person I'd like to be I am a happy person I am a common person I'm a good natured person I am an individual who thinks for himself. I am a good guy Domestic References (Unqualifiedl_ including such statements as I cook good, I can food, I wash the car, etc. am a provider am a homeowner am a breadwinner own my own property don't owe any debts am a home body have no car pay my bills am a tax payer work in the yard hihihihihihihihihihi Qualified Domestic References I don't like to cook I keep my place up so I can be proud of it My home is my haven I love my home I love canning I love food I am interested in my home I like to sew Around the house, I am actually lord and master. 6H 65 66 67 68 69 7O 71 84 OPEN OPEN OPEN OPEN OPEN OPEN References to Health (Unqualified) I am healthy Qualified References to Health I am in good health I am not in good health *I am a sick person It is important to help ill and handicapped people 72 - Idiosyncratic I help those in need I'm not a hypocrite J. I I I am given the opportunity to choose I'm sociable I'm friendly I am one who leads a life morally and financially successful I'm happy am easily satisfied have responsibilities am not enthusiastic am fair have always been pretty happy am different am dissatisfied am contented am not too smart I'm not too dumb I am congenial I am concerned about my position I enjoy making other peOple lead a fuller life I like peOple HHHHHHHHHH have been beat out of some money I I I HHHHHHHHHHHHHHH "Any time there is a reference to health it is coded as 70 or 71 regardless of referents such as person, begin, individual, man, woman, etc. have everything else was lucky in every way I'm not too much really like to be friendly with every- body try to be nice to other people try to treat everyone as I like to be treated like to see everyone happy am concerned for my fellow man have room to improve abide by peace like my comfort am a small amoeba am uncertain am confused am not very confident am honest am upright am square am creative in art have an optimistic attitude am urged to do things I can't and I become frustrated 85 am honest don't steal am agreeable can carry on have things done on time To be honest is always the first thing that comes to mind I have always been helpful I go out of my way to help others I believe that I cast an influence for good I am a very small ingredient of a vast mixture like to do things cooperate with everyone think am a nervous wreck am the same as everybody else am interested in recreation am good to animals think good thoughts have a clean mind I'm morally clean I think on clean and noble things I do not live in the past I am important to life I am glad to be able to do what I am doing I'm glad tole able to be in this apartment HHHHH HHHHHHHHH get emotional about little things have a temper am stubborn am not much of anybody right now don't pattern after anyone else I appreciate it You are bringing me something I feel humble and honored Actually we are so insignificant today I think I have a place no one else can get to which gives me strength do not like to gossip am being of service to others like to be around other people can carry on have my own individuality love helping other children Iam a nobody I am a free soul I am trying to get along with everybody I love to take care of people HHHHH HHHHHH 86 Column Number: 13, 1“ Page Number: a Question Number: 17 Item Description: Now, we have quite a different thing for you to do. Although probably new to you, it is easy and I think you will find it quite enjoyable. Every one we have asked to do this has found it to be interesting. Now let me tell you what we have in mind. Ask this question oijourself. "Who am_I?” Think of as many answers as you can in answer to the question, "Who am I?" In a moment I would like you to give me the answers as if you were giving them to yourself, not to me or anyone else. Take a little time to think about it. Code for first statement (unit of thought). Coder note: This item will hereafter be referred to as "Who am I?”. There will be two columns for each unit of thought (see General Instructions). Codes: 1 31. 10 - Age (unqualified) ll - Qualified age 12 - Name “~_ 13 - Address __g, 14 - Other physical characteristics (unqualified) ___ _—I 15 - Qualified other physical characteristics 16 - Sex (unqualified) I 17 - Qualified sex 18 - 19 - 20 - Kinship (unqualified and excluding marital terms) 21 - Qualified kinship (excluding marital terms) 22 - Marital terms (unqualified) 23 - Qualified marital terms 2“ - Reference to friends (unqualified) 25 - Qualified reference to friends 26 - Other (residual) primary terms (unqualified) 27 Qualified other (residual) primary terms 28 29 3O - Reference to community (unqualified) 31 Qualified reference to community 32 - 33 3M 35 - Column Number: Page Number: Question Number: Item Description: l J H! 1H1!!! 'Ill’ u 36 37 38 39 1+0 111 112 1+3 1111 1+5 116 1+7 1+8 119 50 51 52 53 54 55 56 57 58 59 60 61 62 63 7O 71 72 88 99 87 13, 111 17 "Who am I?" Code for first statement (unit of thought) Occupational title (unqualified) Qualified occupational title General references to work (unqualified) Qualified general references to work Formal organizations (unqualified) Qualified formal organizations References to education (unqualified) Qualified references'to education General religiosity references (qualified and unqualified) Reference to nation, race, ethnic or language group (unqualified) Qualified reference to nation, race, ethnic or language group Reference to religion, religious organizations (unqualified) Qualified reference to religion, religious organizations Reference to politics, political organizations (unqualified) Qualified reference to politics, political organizations Reference to socio-economic status (unqualified) Qualified reference to socio-economic status Other categorical referents (unqualified) Qualified other categorical referents Other consensual (unqualified) Qualified other consensual Domestic references (unqualified) Qualified domestic reference References to health (unqualified) Qualified references to health Other Idiocyncratic (residual) No statement to be coded in these columns Refusal APPENDIX C 150 TST Protocols 89 Section I Protocols Scoring the Same Under All Three Conditions N=1111 *Where no score is noted for a response it is understood to have a perfectly reliable score (10) 9.6 Mean Score I am a mother. I am a wife. I am a woman. 9 I am an artist. 9 I am a friendly woman. 10.0 I am a religious man. I do the best I can. 10.0 I am a beautician. I am a housewife. I am a mother. 9.2 I'm a worker. I'm a good mother. I'm a faithful mother. I am the breadwinner here. 6 I am a clean housekeeper. Disgruntled man. Lost confidence in life. Ambitious for son. 5 Bitter at stupidity of gov't. is: I'm myself. A housewife. A mother. I like fun. 9 I do the best I can, in all I do. A cook. 10.0 ——_ I am a person. I am a mother. I am a wife. Teacher. Companion. Friend. 9O 9.3 Mean Score I am a wife. I am a mother. 7 I am a companion. I am a teacher. I am a nurse. 9 That's about all I can think of. 8.0 8—— I am a good family man. 5 I am law abiding. I am very healthy. 9 No, I have scraped the bottom of the barrel now. 9. 8—_ I'm a business man. I'm a public accountant. 8 I love tax work (income tax). 9 I'm a school board member. 9 I'm a school board member. 9 I love Sports. I think I'm a good father -- (3 children). Like to play golf. 8.5 9__ An honest man. A religious man. 9 Member of Fraternity. 6 Family man. 10.0 _—h—I am me. CD 0 U" Iii.” I53 LOCO I am nothing. I speak what I think. Child of God. Good husband. Good father. Good name. Good friend. Religious. I HHHH HHHHH am a man. am honest. work hard. don't drink. do unto others like I would have them do unto me. am a citizen. am an everyday person. . am trying to make a decent home for my family. am a father. am a taxpayer. 91 10.0 Mean Score Housewife. Mother. I am a wife. A mother. 8 A home-maker. It feels good to be important to those I have chosen to make a part of my life. I'm a mother. I'm a wife. 7 I'm a clean person. 8 I'm a nutritious cook. My church is important. 6 It's important to me to have recreation outside the home. 9 It is necessary for me to have my friends in at least once a week. 9 I help my children with homework. Another important thing is to have a reliable baby sitter. 9.2 15- I'm a housewife 8 mother. 10 9 I'm a cook. 9 I'm a seamstress. 8 I enjoy reading. 9 9. I'm a small gear in a big machine. 8 I'm wrangler - (my nickname). I'm free as heck. 8.8 ___'A mother. A wife. 8 Home maker. Waitress. 6 Gemocratic. (sic) I am an American. I am a Negro. I am healthy. 9 I am honest. I guess the most important my honor. Concern for others. Well I guess answered anotherin general form. Difficult to think of another one. 9 I'm someone who tries to set an example for the youth. 92 8.8 Mean Score am an American. am a father. am a husband. am a leader. am well adjusted. am a thinker. LO HHHHHH 9 10.0 Human being. Negro. Husband. American. 8.2 (.0 am an individual. an talented person. am good housewife. ‘ am good dresser. am good cook. (.0 HHHHH (DOW am a mother. am a wife. am a grandmother. am trying to help people. am making best of my life and ability. hihikihdki am a human king. am an individual. am a creature of God. am a husband. am a father. HHHHH 1010 Mother. Individual. Citizen. * I am a mother. D.K. 1.11.2 Wife. Mother. Home. 10.0 I'm me. A child of God. 9.5 93 Mean Score LO 0 U1 10. I am a housewife. A homemaker. A mother. I am a Baptist. I am a father. A Husband. Teacher. A Friend. A Scholar. A Citizen. A human Being. I'm a man. A son. A brother. A writer. A camera bug. am a Father. am a Husband. try to be a good Church worker. think I am a good provider. try to be a good citizen. am a Grand Father. Piklhihiklhi _ LO .6 An average human being. 2S CCU .K. K Just I am I am I am I do 9 9 a small cog in a big wheel. not too important. not ambitious. a man. my best. 10 .O Steady worker. 8.2 Dependable. Reliable. Young. Fortunate. I am a salesman. I am in the middle class bracket. I am a resident of a very nice neighborhood. Own my home. I am satisfied with what money I'm making. 99 Mean Score Getting along very well with majority of people I know. If I do a job I do it to the best of my ability. to (.0 U1 LDLO '0 CD 0 (A) £0 03me No. No. I am mother. Wife. Mother. Competent. Loved. Country I love. Wonderful future. I am a man. I do my best at my job. I try to help people. None. A mother. An American. I'm a Presbeterian. (sic) I am easy going. I am friendly. I like kids. I like to be by myself. I'm a thinker, not a doer. 95 Section II Protocols Scoring Differently When Qualification Effect* is Removed N = 53 *Qualification effect is the effect caused by disagreement between coders as to the interpretation of a reSponse in regard to qualification, modification or evaluation y§_non-qualification, non-modification or non-evaluation. Below is one example of what is actually happening when qualification effect is removed: ASSIGNED CATEGORIES UNDER REALISTIC CONDITIONS: Response Coder Verbal Number: G A B C D E Score ReSponse: 1. 1&8 N8 118 118 ® ® 8 Christian. 2. 22 22 22 22 22 22 10 Wife. 3. 2O 20 20 2o 20 2O 10 Mother. 1+. 72 58 72 @ 58 @ Leader. 5. 6O 60 Q9 60 60 60 9 Person. 6 . 60 60 at 60 60 60 9 Individual. 7. 1+0 1+0 140 1+0 110 ® 9 Artist. 8. no no 62 62 no 62 (E) Homemaker. 9 l2 l2 12 12 12 12 19_ Mrs. Edward E. Rockelt. 73 8.1 = mean score of protocol ADJUSTED CATEGORIES UNDER HYPOTHESIZED CONDITIONS: ReSponse Coder Verbal Number G A B C D E Score ReSponse: #8 #8 #8 ”8 98 48 10 Christian. 22 22 22 22 22 22 10 Wife. 20 2O 2O 2O 2O 2O 10 Mother. 72 58 72 ® 58 (9 Leader. 60 60 60 60 60 60 10 Person. 60 60 6O 60 60 60 10 Individual. 110 110 110 1+0 110 ® 9 Artist. no no 62 62 1+0 62 @ Homemaker. l2 12 12 l2 l2 12 19_ Mrs. Edward E. Rockelt. 77 8.6 = mean score of protocol coco-q Oiuus'o>k>pu 0 Note that in reSponse 5. and 6. the qualification effect may be attributed to one coder mistakenly identifying the reSponse as qualified or modified. In reSponse l. the effect is due to a disagreement between coders as to Whether the response is a "general religiosity reference" (category #8) or a "reference to religion, religious organizations" (category 52). .The disagreement probably hinges upon the capitalization of the word Christian. See page #1 for a second example. l! 96 The following discrete code categories were lumped together and con- sidered to be equivalent categories for rescoring under the hypothetical condition of overlooking qualification or evaluation of the response. In all cases qualification and nonqualification categories have been lumped together. In addition, general and Specific categories have also been lumped together for the subjects of "occupation-work" (categories u0,ui,u2,u3) and "religion-religious" (categories #8,52,53). (10,11) (14,15) (16,17) (20,21) (22,23) (24,25) (26,27) (30,31) (no,ui,u2, us) (uu,u5) (N6,u7) (M8,52,53) (50,51) (54,55) (56,57) (58,59) (60,61) (62,63) (70,71) #NR“ No Coding Effects Removed, QR = Qualification Effect Removed. +No score notation, in QR denotes no change in score under this condition. #NR #QR .M 9.2 Mean Score + I'm dependable. 10 My work is of excellent quality. a) 0| 6 10 I have a good business. 7 I make a good living for my family. My family is happy. 8.7 9.0 17— — Mother. 10 Housewife. 8 Constant driver. 7 8 Help husband in newspaper work. 9 10 Good Christian, I hope, I try. 8 9 8.7 9.0 15——- —_—- Trucking contractor. 10 Interest's in family. 8 10 Father of married children. 8 Trying to do and make a success. 9 Not fulfilling ambition as liked. 7 Too much gone from income before personal use. 6.9 7.0 7 '__. I am supporter of my family. 10 A good father. 10 A good husband. 2 5 Give them an education. 3 A good helper. 8.0 10.0 9 10 I am a religious leader. 9 10 Being a parent. 6 10 Maintaining a home. re 0 0 CD I p... O (I) o m l 00 O 0') °’ l £0 01 8.8 10 10 8.6 10 IO 97 Mean Score I am a usher of the church. I am a grandmother. I am a wife. I am a day hand. I'm a mother. Good person. Good neighbor. Interior decorator. Beautifier. I am a husband and a father. I am a provider of a family. A citizen of a free country. No. I'm a father. I'm a husband. I'm a religious man. I'm an office worker. I'm a helper to other people - I try to get them out of trouble. I'm employed and I hate to have to work. I'm a good church member. I sing in the choir. I get along fine with everyone. Christian. Wife. Mother. Leader. Person. Individual. Artist. Homemaker. Mrs. Edward E. Rockelt. I am a healthy person. I am a Negro. I am an honest person. 98 #913. 10.0 Mean Score 10 I am a Christian. 10 I am an honest Person. I am a church goer. I am a Negro. £5. I am a husband. 10 An officer in the Navy. That would about do it. No. 2e. I'm Mrs. David Lambiotte. 10 I am Marcias Mother. I am a woman. I am an American. I guess that is about all of me there is. 9.7 '__- I'm just an ordinary guy. 10 I'm an American, a Hoosier. 10 10 I'm a good barber 8 I'm not conceited. 10 10 I'm a Quaker, belong to Friends Church. 10 8 I'm a family man 8 a child of God. 10 21.6. That I am a good mother. 10 That I keep my home up. I have raised 5 fine children. I am a happy person. My health is good. 10 I am a grandmother. 8 I am a person that has the love of her family. 9.3 IO_' I am just a retired person. Ive lived my life. That's all. 9.8 _——' I am an individual. I am a housewife. I am very optimistic I am a pretty happy-go-lucky person. 9 I am a foreigner. I am not too hard to get along with. (.0 O O OGDLOO C) CD CD 0 [—1 |~--l o o LDLDMQ (Doom-F; \ILDmmO \) O 0.) H F‘ U'lt—IOLDCDO 'IO' 10 10 10 99 Mean Score I'm an average male. Content being the head of the family. I'm satisfied in type work I'm doing. I'm a good husband. I'm a good father. I am a housewife. I am a mother of five children. I am a Christian. I am free to do whatever I want. I am happy. We are financially solvent. I am engaged to be married. Breadwinner for 3 people. I am a middle income bracket person. That's all. None whatsoever. I am a retired professional man. I was successful. I am an individual. I am a citizen of the United States. I am a deep thinker. My name, of course. My health. Love. Tried to live right. Nice. Lived a Christian Life. I am a mother. I live alone. Do my own housework. I have been a good mother. I have worked hard. I try to help others. Try to Family (loyal). To my Church loyal. 100 Mean Score 10.0 10 10 00 I am a Church Deacon. I raised 5 boys and u girls. I can still do a good days work. I goes to Church every Sunday. H am a father of 6 children. am a husband. I am a plumber. H A child of God. A citizen of U.S. A person with opportunities unequaled An average American. An average job. An average family. I live by the sweat of my brow. I enjoy making a living. I work for a living. I get along with others. I am pretty well satisfied. I am old. I've raised 2 boys. I'm raising another. Use to live on a farm. Got my education the hard way. Fathers. Dishwasher. Lawnmower. Grandfather. mason. Good neighbor. am a mother. am a housewife. have a lot of reSponsibility. don't care for Sports. work too hard in house. don't work in yard. I'm not lazy. I love to sleep. Pihihihihihi except in U.S. \1 O O 101 I'm trying to live a Christian life. I work outside of home. I I I am a church worker. I am active in community. P I'm a human being. I'm able to carry my home. I am proudin have children. I am an individual. An average person. Try to be a Good Neighbor. C00perative Person. I am a helper of people. I am head of a family. I am a church member. My name is Viola Gustafson. I am raising 2 children. I must make sure the children have enough clothes on their back. QB. 9.5 Mean Score 10 I'm honest. I'm a mother. Housewife. I'm not lazy. 9.0 am a mother. am a wife. 6 OTOA. 9.8 10 I'm able to work. 9 10 I am a citizen. 10.0 10 Cheerful person. 10 Good wife. 8.5 10 None. 8.2 9 7 8.7 8 Im an uncertain person. Im a good mother. But a very poor housekeeper. to eat and 102 93 9.5 Mean Score I guess I just live a Clean life. 10 I like my neighbors. 10 And my family. I like my car. 9.8 '_—’ I am a good husband. A good worker. Agreeable. Not quarrelsome. Not in good health. 10 Hope a good neighbor. 9.6 _—_- I am a very contented person. I am fortunate to have such a wonderful family. 10 I am not very tall. I enjoy being myself. I wouldn't want to change. 10.0 Glad to be a citizen of the U.S. I'm a carpenter. 10 Happy to have opportunity to have a home 8 work. 8.8 _—— I'm a woman. I'm a housewife. I'm a mother. I'm an individual. I am part of a group. 10 I am a sister. 10 I am a daughter. I am a consumer. I am a homemaker. I am a cook. 9.1 10_ I've kept my home together. I've kept my job. 10 I've kept my children together. I've kept my family as a family. I've always kept my temper. I was never frivolous with a dollar. I gave my boy the best education you could get. I have a good wife. 103 NR QR §;0_ 8.8 Mean Score 10 Good mother. 10 Good wife. 5 9 Overseeing the home. 5 Good living women - all .7 9.2 To“ "- A wife. 10 A mother. 10 A teacher. 9 10 Community leader. 7 9 Sportswoman. 6 Poor housekeeper. 9.2 9.8 9“" —- Idea of God. 9 10 Wife. 10 Painter. 9 10 Lover of Animals. 9 10 Lover of flowers. 9.0 10.0 10 Mother. 10 Worker. 10 Chinese. 6 10 Going to be a Grandmother. 8.8 9.0 ID_- -—_’ I'm a worker. 10 I'm an American. 9 10 I'm a music lover. 10 I'm a husband. 5 No. 9.u 9.8 I0_' -—_' A human bang. 10 I try to be a nice person. 10 Generous to a fault. 10 Helpful. 7 9 me and my husband. 6.7 7.0 com-cl 01! There's nothing more important to me than whats I live all alone. I am an old man. my life I've tried to do what is right. between 109 Section III Protocols Scoring Differently When Accidental Effect* of Splitting is Removed N = 17 *Accidental effect of Splitting Should not be confused with the main effect of Splitting. If we were treating the main effect here, we would arbitrarily remove or add code categories to the real judgements of non- conforming coders. Thus, if four of six coders had Split a verbal response into two or more codable reSponses, we would arbitrarily add some sort of code categories to the judgements of the two nonconforming coders. Like- wise, if four of six coders had not Split a verbal reSponse into two or more codable responses, we would arbitrarily subtract code categories from the judgements of the two non-conforming coders. Such a procedure was not feasible, fer at least three reasons; (1) often the disagreement on splitting is not modal, but rather an even Split, artifact of an even number of coders; (2) There is little basis for deciding which of two code categories should be removed from a coders judgement, unless it is done on the basis of conforming to a modal category; and (3) There is even less basis for deciding which of many code categories should be added to a coders judgement, unless it is done on the basis of conforming to the single category already assigned to the verbal reSponse, as when a single code of "12" is changed to two codes of the category "12" by the same coder. Therefore, removal of the accidental effect of Splitting does not necessitate adding or subtracting or changing the categories of different codes. The latter method was used when the qualification effect was removed in Appendix C, SeCtion II. What is involved here is a rearrangement of the existing categories in the code matrix, so that reliability may be gauged upon a comparison of the categories assigned by all coders to the same reSponse. We will, in effect, be Straightening out the reSponse rows across the code matrix by leaving some blank gaps within the code matrix where no Split was made by five or fewer coders, and, therefore, no category was assigned by five or fewer coders. Following is an example of what happens. (see page ”3 for another example) ASSIGNED CATEGORY PLACEMENT UNDER REAL CONDITIONS: (ReSponse Number not used to avoid confusion) ReSponse Coder Verbal Number. A B C D E Score Response: 0'! 17 17 17 m 17 8 Iamanoldwoman. (E 25 25 25 £13 25 8 I've got lots of friends. 12;? 72 72 72 013 5 I like people. 72 72 72 72 72 72 10 D.K. I'm awfully dumb. 72 O72 72 720 33?" 7.8 = mean score of protocol 105 ADJUSTED CATEGORY PLACEMENT UNDER HYPOTHETICAL CONDITIONS: (Response number refers to verbal response) Coder Verbal ReSponse C D E Score Response: Number G A B 1 CD % 8 _I am an old woman. (I) .17 17 l7 l7 8 2 5 25 25 25 25 25 10 I've got lots of friends. 3 72 72 72 72 72 9 I like people. 72 72 é é 72 72 10 D.K. I'm awfully dumb. 8 _818 = mean score of protocol This example is particularly helpful in showing how the rescoring procedure to test this hypothesis can create a larger number of scores for the protocol. Here we see that the N of scores increases from five to six. The reason for this is that adjustment within the code matrix to straighten response-rows necessitated eXpansion of the matrix. To be exact, two coders, (A and E) did not Split any responses, two coders (G and D) split only the first reSponse, and the remaining two coders (B and C) Split the last reSponse only. If any of the six coders had, under the realistic con- dition, Split both the first and last reSponse, there would have been six codes for the realistic condition, and therefore, no difference in the N of scores under the two differing conditions. +NR = No coding effect removed, SR = Accidental effect of Splitting removed, R# = reSponse number of verbal reSponse. For the NR scores it is not useful to match verbal reSponse to scores, by number. % = No score notation denotes no change in score. 5%: 113* 5.1:" 8.3 9.0 Mean Score 1 10 8 I'm a good mother. 2 3 8 That I am able to hold a position at my age. 9 5 3 9 10 I am an efficient worker. a 9 10 I am a grandmother. 5 9 10 I am a religious person. 6 9 10 I am a good citizen. 7.2 8.6 1 —9_- -—_- Let's pass that up. I'm not very cooperative. 7 8 2 7 8 Just a teacher. 3 5 9 The community thinks we're just a servant. u 8 9 So you can write your own criticism. 03M 0": £114:wa 4:0)er O) can)». 0‘4: If: \J o m | (bl—'Uimm O rateco OO. O 0. L0 CD 0 w (I) 0 U1 wonoococnmgppmp 00 0| ‘0 OWPLOI-Jtoo - o o l. a: o) ¢>F4e1~1a <3c>c>c>Eq CO e w I ‘9CD1D1mld-oto o1 10 106 Mean Score I am an old woman. I've got lots of friends. I like people. D.K. I'm awfully dumb. A good mother. Important to my husband. A good housekeeper, I hope. A good Church member. Important to my community. I am a good daughter. I am a reasonably successful secretary. I am a good and loyal friend. I could be more active in community life, but I have a busy schedule. I am a little cog in the wheel. I'm very happy person. I'm healthy. I'm young. I have high hopes. Hope to have a healthy 8 happy family. Very lucky to have a wonderful husband. Wonderful mother 8 father in law. Wonderful mother and step-father. am a widow. have a great burden in life. have always sacrificed my life for my family. have raised my own children and now must raise grandchildren. I have lived a clean life above reproach. HHHH I'm a woman. I'm a mother. I hope I'm useful. I think I am useful. I'm needed by others. I help - or try to help others. #0501: tutor—a 01 MI—J 4': MP to 0301 HO) O. U" 0 \I (DNOJMLDl-‘m O C» mmmztomqqmtol ...a O 0 CD (I) @010) l-‘i-J 000 \1 L0 coucoHHi—‘cn £00100 ' COO- 00 co (OI-'l-‘(D 00 L080 10 10 107 Mean Score I'm an American citizen. I believe in fairness. I try to deal fair with everybody. I can get credit anywhere I've been very fair. I'm just a common-ordinary fellow. I never had more than a country school education. I like football, basketball. I like to drink, smoke. am a father 8 mother. H like women. like to go to parties. don't like onions. don't like the Chicago Bears 8 L.A. Dodgers. HHHH I am an American citizen. am a Husband. am unemployed. HH Father. Citizen, a damn good one too. Good Blacksmith. Good husband. Good gardner. A mother. I am a middle class citizen. I'm a Negro. I'm a nurse. I'm concerned about my position. Very small ingredient of a vast mixture. That Speck ingredient is as small as a grain of salt Of itself in itself its insignificient. (sic) But in the whole matter it can influence the taste of the finished product. Important to my family. Important to my country. dams-mm... 1‘: M O) m 108 NR §B_ L1 L7 MmSwm 9 I'm a good human being. 10 I enjoy life. 6 5 I appreciate many things. 9 Art, 5 7 Music, 6 8 People and, 5 6 Nature. 8.0 8.0 9——' -_—' I'm not what I'd like to be. 10 I'm not as intelligent as I'd like to be. 7 I don't persue educating myself to the degree I feel I should. 5 I neglect myself physically - in appearance. 6 I think of myself as a person with tremendous am't of love. 9 Emotional. 9 Concern for others 8 the world esp. children. 9 9 5 9 Someone who can't say no if it hurts others. 9 5 Sometimes I think I have a lot of knowledge 9 8 8 sometimes I think I'm a phoney. 8.6 8.u 10 I'm a relatively unimportant person who likes 9 7 to do things for others. 10 I think it's important to watch out for others as well as ourselves. 7 10 I have a good sence (sic) of humor and that's 7 5 important. 109 Section IV Protocols Scoring Differently When Accidental Effect* of Splitting is Removed And When Qualification Effect* is Removed N = 37 *See Appendix C, Section III for an explanation of the rescoring process in removing accidental effect of Splitting. See Appendix C, Section II for an explanation of the rescoring process in removing qualification effect. Following are examples which Should clarify the rescoring done in this section: ASSIGNED CATEGORY UNDER REAL CONDITIONS: (Response number is meaningless) Response Coder Verbal ,, Number G A B C D E Score ReSponse: @ 72 72 72 72 7 I am a guy with a lot of problems. M3 #3 43 43 M3 9 I am concerned about finding a good job. H #7 72 M7 72 #3 #3 have reasonable experience but do not #1 #6 91 46 have a college degree. 22 #0 22 #0 am ex claim insurance adjuster. MO 22 6') 22 22 23 22 am single 8 I wish I was married. 23 62) 23 23 like people. 72 72 72 6)) 72 Enjoy working with people. 113 ® 03 113 (3) 1+3 I like work that is worth while. 72 72 72 #3 u #3 I like to do something that is 72 G) 72 ReSponse Coder Verbal beneficial to other people. Number G A B C D E Score Response: HHH m 010)“) \Iximq O 01 = mean score of protocol. am an American citizen. am honest. am good. work every day. stay home on the week ends. go to church on Sunday. H O 50 50 50 50 50 50 50 50 72 50 72 72 72 72 72 72 72 72 H lemme) HHHHHH .1:- air» .9 = mean score of protocol. 110 ADJUSTED CATEGORY UNDER HYPOTHESIZED CONDITION: REMOVAL OF QUALIFICATION EFFECT (ReSponse number meaningless) Response Coder Verbal Number G A B C E Score Response: D (ED 72 72 72 Q 72 7 I am a guy with a lot of problems. 43 43 43 43 75) 43 9 I am concerned about finding a good job. 47 72 47 72 43 43 @ I have reasonable experience but do not 46 46 46 5 have a college degree. 8 40 g 40 g 40 5 I am ex claim insurance adjuster. 22 22 22 22 22 9 I am Single 8 I wish I was married. 8 23 ® 23 23 23 8 I like people. 72 72 72 Q 72 7 ‘Enjoy working with people. 113 ® 113 113 as 113 7 I like work that is worth while. 72 72 72 43 43 43 I like to do something that is 72 43 (f§> 72 Egg beneficial to other people 9 76' 6.5 = mean score of protocol Response Coder Verbal Number G A B C D E Score ReSponse: 50 50 50 50 50 50 10 I am an American citizen. 50 50 72 50 72 72 G I am honest. 72 72 72 72 72 72 10 I am good. 72 72 113 72 113 113 @ I work every day. 43 43 ® 43 43 7 I stay home on the week ends. .@ 53 ® 53 53 4 I go to church on Sunday. 53 53 53 Q 49 7.0 = mean score of protocol When qualification effect is removed, there seems to be disagreement between coders on the first protocol whether (1) "do not" in reSponse . three is a qualifier, (2) "ex" in reSponse four is a qualifier, (3) "wish" in response five is a qualifier, (4) ”worth while" in response eight 13 a qualifier. In the second protocol there seems to be disagreement between coders on whether (1) "every day" in reSponse four is a qualifier, and (2) "on Sunday" in response six is a qualifier. 111 ADJUSTED CATEGORY PLACEMENT UNDER HYPOTHESIZED CONDITION: REMOVAL OF ACCIDENTAL EFFECT OF SPLITTING (Response number refers to verbal reSponse) Response Coder Verbal Number G A B C D E Response: U) 0 O 3 l. (:3) 9 I am a guy with a lot of problems. @ 72 72 72 72 72 9 2 43 43 43 43 43 43 10 I am concerned about finding a good job. 3. 72 72 43 43 ‘8 I have reasonable experience but do not 47 46 47 46 46 47 ‘3 Have college degree. 4 41 4O 41 4O 40 41 ‘3 I am ex claim insurance adjuster. 5 22 22 22 22 22 22 10 I am single 8 I wish I was married. ® 23 23 23 23. 8 6. 72 72 72 72 a) 72 9 I like people. 7 113 ® 113 113 113 8 Enjoy working with people. 8 43 43 43 4 43 9 I like work that is worth while. 9. 72 72 72 ® (1% 72 _7_ I like to do something that is 94 beneficial to other people 7.8 = mean score of protocol Response Coder Verbal Number' G A B C D E Score ReSponse: l. 50 50 50 50 50 50 10 I am an American citizen. 50 50 50 © 72 72 72 72 72 72 10 72 72 72 72 72 10 3 72 @11 11311311 999999 I am honest. I am good. I work every day. I stay home on the week ends. I go to church on Sunday. 070')th .4 = mean score of protocol When accidental effect of Splitting is removed, we can see that there is, in the first protocol, disagreement between coders on whether to Split (1) "I am a guy with a lot of problems" into the two reSponseS, "I am a guy, and I have a lot of problems", and (2) "I have reasonable experience but do not have college degree" into two clauses; but there is no disagreement that "I am Single 8 I wish I was married." should be Split into two clauses. In the second protocol there is only disagreement on whether :0 Split the response "I am an American citizen." into the two responses, I am an American, and I am a citizen." However, this disagreement is so near the beginning of the protocol that it creates a good deal of acc1dental effect. In the fellowing material, NR = no coding effects remoyed, QR = qualification effect removed, SR = accidental effect of Splitting removed. Notice that none of these results amounts to the removal of both qualifi- cation effect and accidental effect of Splitting in the same operation. R# = reSponse number, which refers to the verbal response and can be a1 meaningfully linked to the score only in the condition in which accident effect of Splitting is removed (SR). 112 +When there is no score notation in column QR or SR, it denotes that no score occurred due to the rescoring. I am dependable and steady. I try to do the best for my community and country. I do my job as well as I can. I want my wife and children to respect-— one so I do try to have high standards of behavior. I have a lot of duties to perform. I like social life - lodge work. I am a guy with a lot of problems. I am concerned about finding a good job. I have reasonable experience but do not have college degree. I am ex claim insurance adjuster. I am single 8 I wish I was married. Enjoy working with people. I like work that is worth while. I like to do something that is beneficial to other I am an American citizen. work every day. stay home on the week ends. go to church on Sunday. I am most important to my family. I hold a place in my church, I feel important. I love my neighbors. change in 59.189.13.93. 6.3 7.2 7.7 Mean Score l 10 + + 9 2 5 9 8 I'm a family man. 3 7 9 5 4 4 7 10 5 2 5 5 4 5 5 6 8 9 4.8 6.2 5.2 l 5 2 5 l 4 3 5 9 My family. 4 8 9 Nothing else. 5.5 6.5 7.9 1—7—"T 9 2 3 10 3 2 5 3 2 5 6 4 7 9 6 5 5 8 10 7 8 6 7 9 I like people. 7 6 8 8 2 3 9 9 9 8 people. 5.9 7. 7.4 170—— 6 2 10 I am honest. 3 5 6 10 I am good. 4 4 7 8 I 5 l 4 O I 6 5 6 8 I 5.0 6. 6.6 1—9—‘_ 2 8 10 4 5 5 3 2 7 6 4 2 5 I'm too much fat. [—4 \lmUl-FCO M 030'!sz c'oahaha 113 1.1.13. 9.13. SR 6;9_ Z;Z_ 7.1 Mean Score 8 Outside of just an ordinary home maker. 10 Mother. 9 And citizen. 2 5 I try to work in school affairs. 6 10 And a few fraternal organizations. 9 Well, when I was working in the biz (sic) world I felt-- 5 6 I should try to be sincere 8 make myself valuable—— 4 9 9 5 8 not loaf on the job. 7.7 8.6 8.3 g —9_- -__' -——- I am a chauffer, bookkeeper (sic), laundress, E‘- 9 cook, housekeeper. 1 9 1; 9 ... 5 9 6 I am just a wife and mother. i i 5 9 6 7 9 Don't know any more answers. 7 9 Just don't know. 6.8 7.5 8.5 9 10 A human being as everyone. 6 I am an honest man who never went to jail. 7 6 10 I know how to work. 2 7 8 I am not lazy. 8 I like my neighbors. 8 10 I think I am a friend to everyone. 8 10 I like to work. 8.3 9.0 8.7 _5_- _IT_ -___ I am a good provider for my family. 10 9 7 9 10 I am a father. 9 8 I am a christian (sic). 9 I like people. 9 10 I am a Negro. 9 10 I am a man. 7.5 7.8 7.0 —5_- —9- —__' I'm just me. 8 5 Absolutely. 8 9 Refused - even—- 9 after probing. N) 014-? 114 Mean Score 15989.1: 6.3 6.8 6.8 10 10 7 9 7 9 1 4 10 7 1 l 10 2 4 9 2 9 5 4.5 5.2 8.2 6 10 6 10 2 9 2 4 5 10 6 10 8.7 9.0 9.3 8 10 9 9 9 10 8 9 9 9 9 10 6.5 7.0 8.3 9 9 2 5 10 5 5 8 9 5.6 6.2 7.5 8 8 4 7 9 l 4 9 4 9 1 4 ‘4 10 8 4 9 8 9 6 Your attitude. The way I carry myself thru life, my job, etc. I love my family 8 want to provide for them any way I can. I'm a Negro and proud of it because its Gods (sic) intention. Refusal. I am an individual common everyday person. Try to live up to the laws of America 8 pay my bills. Try to bring up my family 8 get as meddui (sic) as you I can. I am a mother 8 happy to be one. I like people. D.K. -- I am a wife. I can't think of anything more. I'm a helper to my family. I like people. I'm a working mother. I'm really not much. I'm just a poor working gal. I'm just an ordinary person. I go to work, come home attend church. I like to travel. I enjoy reading when I have the time. (JON 0.) \ICDUI-FODN M U‘l-F’ Z 70 01 01 ° (DUIUTOOMG) 0 O) M IQCVQ-Sl’l-‘QQI \l (I) GO «compact—11a \1 M ..l Q-PCDIDMOI—‘F' (D O \JHQQQOO O (D O (.0 I8 (OLDKDCD 115 Mean Score First of all a mother, and still needed by family 8 husband. I'm a companion to my husband. I try to be a good homemaker. Nothing else. ' If I was well I would like to work. I'm retired was 2 weeks in hOSpital have ulserated (sic) foot. If nobody bothers me I'm O.K. I don't like to nag or fight. I'm a farmer. A father. I'm a boater - sailor when I have time. I'm a director of many organizations. I'm a bridge player. But I work late most nights. I have 29 grandchildren so a Grandmother. And mother. A wife. And the official telephone answerer here. I'm quite active in church. And I love music. And sing in the choir. I'm very congenial I think. I'm understanding. I'm very unselfish I think family comes before me. I always try to look out for my fellow man. I like to apply the Golden Rule. I try to do things other people ask me to do. 01-5300 m>ka 4:0.) EB. Q_R if: §_:_l_ 6.3 6.11 l T '9— 1 7 8 7 1 9 8 9 9 g_:_l_ 9.3 9.3 9 '15— ‘— 9 9 10 9 9 9 10 9 10 6.8 7.2 7.5 "'5" T — 5 6 8 9 10 8 7 7.2 7.7 8.5 "10 —‘ ‘— 5 9 8 5 9 7 10 8 9 4.8 5.6 7.7 T _‘ ‘5— 6 8 6 10 5 10 l 11 5 2 9 11 8 8 5 - 7 - 10 9.0 9.3 9.5 10 8 10 9 9 10 9 10 9 10 116 Mean Score I am a person who would like to become head of a large business firm. I am a person who believes a happy marriage is the basis for success. I am a person who believes one Should have a Supreme Being in which to believe. I like to enjoy life. I am a citizen of the United States. I am a good understanding honest woman. I am religious. I am respectable. An old woman. I live alone. I like to be alone. I'm a believer of God. A wife that loves her family. I give untireing (sic) of my service to others. Good American. Don't know. I am responsible for my family's health 8 well- being. I am a mother. I am a wife. I am able to sew and knit for my family. I am said to be a good shopper as I know prices. I participate in Sports with other women. I am a man. A Christian man. A husband. A farmer. Father. 81;. 1': 8.2 1 “1‘0— 2 10 3 10 11 10 5 10 7 6 7 7 2 8 8 7 9 9 9 7.7 l "9 9 2 10 3 7 2 11 9 8.0 l "‘9’ 2 9 3 9 11 9 5 7 7 6 9 7 9 8 7 9 5 lo - 8.5 1 —8- 2 10 10 3 6 8.5 l .10 5 U'I-F'OJM }_1 O 10 10 10 117 Mean Score am a happy mother. am a good wife. am helpful to other people. am an eager student. am an eager outdoor Sports woman. Pihikihiki H am a good houseekeeper. enjoy gardening. I am a good needleworker, sewing and knitting and crocheting. H I enjoy the arts, (painting in oils). Old every day worker. I have a family. Father and Husband. No. am nobody. intend to be somebody. would like to make other people happy. am a person of reSponsibility. would like to have a large family about 12 kids. am sure of myself. am confident with respect to work. have reSpect from my friends. feel a little unsettled as far as my family is concerned. I have a friendly relationship with neighbors. HHHHH Pihihihi am another individual. I have certain rights and opportunities. H I have been fortunate in life. I'm just me - a person. I'm satisfied with my way of life. I get along ok. D.K. D.K. «Cle-J C501 wlvra 01 ; wlo1a Q0301; R51. 014: 118 Mean Score 111191398 7.1 7.7 7.7 7‘6““— 10 5 6 7 9 9 5 8 7 8 6.4 7.7 7.2 ‘IO_—- 10 8 10 2 9 7 1 9 7 10 - 2 7.9 8.4 8.6 10 8 10 9 10 4 5 9 5 7 9 8 9 9 6.7 6.8 7.5 10 9 7 8 5 8 5 9 4 5 1 8.6 9.9 8.7 "1‘0""— 10 9 7 5 8 10 9 10 I am the man of the house. I made a change for better instead of worse. I try to provide for my family. I like to help my neighbors 8 friends. H like to help myself last. am a Christian. H am a housewife. am a mother. am a part-time worker. am a baby sitter sometimes for friends or relatives. HHHH I am married. N. F. I. I'm Bernard Perry. I'm Father of 5 children. I'm a married man. I'm average fellow. Right now I've got a h--- of a cold. Had Pneumonia last year 8-- I'm afraid of this cough getting to be Pneumonia again. Honesty, Truthful, Sincere. Love for home 8 country. My occupation. I try to be a good mother. I feel I am a good wife. Fianancially1(sic), I like to feel I help my husband. I love helping other children. We ran a-- foster home for 5 years in New York State. I believe when I work and am hired to work I-- do the best of my ability. In other words, I-- don't Shirk my job. Illll1111111111111111111111111111Ill 31293101714