LIBRA R Y Michigan S are University Mail OVERDUE FINES: . ‘ 25¢ per do per item .1 4‘3“}; 5 momma LIBRARY MATERIALS: ,. ~ ‘3. , ,y ’ Place in book return to remove . I ‘3',” . charge from circulation records leafiot’s WOO THE EFFECTS OF FIXED AND ASCENDING CRITERIA ON ACHIEVEMENT, ATTITUDE AND STUDY EFFICIENCY IN MASTERY LEARNING By James Anthony D'Albro A DISSERTATION Submitted to Michigan State University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY Department of Administration and Higher Education 1980 ABSTRACT THE EFFECTS OF FIXED AND ASCENDING CRITERIA ON ACHIEVEMENT, ATTITUDE AND STUDY EFFICIENCY IN MASTERY LEARNING By James Anthony D'Albro Selection of the most appropriate criterion scores for criterion-referenced testing under mastery learning is uncertain because there are no procedures for sorting through the many approaches for setting such scores. Implementation strategies for nearly all the approaches are also lacking. This research was directed toward solving the problem of setting the criterion in the mastery learning strategy. Thus. the overall purpose of this research was to identify a criterion which yielded the best achievement throughout a quarter course in greenhouse management while requiring the least amount of study time and maintaining the best student attitudes. A different criterion level was set in each of three 50-minute mastery classes. The criteria were used in James Anthony D'Albro conjunction with a mastery learning strategy. The criteria were 80% fixed, 90% fixed, and ascending (80% for the first unit and increasing 5% each unit test until 90% is reached, additional units were graded at 90% of total points). A fixed criterion was one which had the same standard applied to each of the five unit tests in the quarter. The textual material of the mastery strategy in greenhouse management was divided into sections containing: instruction for completion, objectives for each unit, a set of review questions. and the lectures given by the instructor. Mastery in this research was defined by three elements: instruction, grades, and testing. In order to reach mastery of a unit of study, the students had to attain the minimum criterion set for each instructional unit in a given treatment group. Whenever the criterion was not met on the first attempt of any test, the student was provided with additional instructional assistance and permitted to attempt mastery a second time. There was a total of five unit tests, each test being given at the end of a two week unit of instruction. The control group received the same statement of objectives as the mastery groups. They were lectured on each unit of study, and were given the same test questions as the mastery groups. James A. D'Albro The effect of the treatments was measured by a multiple choice achievement test, an attitude scale, and the total time spent on study as reported by students. The groups taught under the mastery strategy attained a significantly higher level of achievement than the control. The study also showed that setting a higher criterion or gradually raising the criterion did not yield higher achievement than a lower criterion. The (90%) fixed criterion produced less efficient study scheduling than other criteria without any gain in achievement over a lower criterion. The ascending criterion did not produce the increase in achievement over the other criterion groups that might have been expected. Student's attitudes toward the course were also less positive when they were pushed to meet higher levels of criterion. The findings suggested that the 80% fixed criterion could produce the best learning while maintaining the most productive student attitudes. Moreover, this criterion yielded the best student learning in the least amount of total study time over the quarter. DEDICATION To My Wife D'Anne For her patience, understanding, and constant support ACKNOWLEDGEMENTS The author wishes to acknowledge the assistance and advice given to this research by the following committee members: Dr. Van C. Johnson Dr. Max R. Raines Dr. J. Lee Taylor Dr. Stephen L. Yelon The author wishes to express his sincere thanks to Dr. Stephen Yelon who spent a great deal of time editing and suggesting areas of improvement. A special recognition is extended to Dr. Robert Smidt for his assistance on the use of statistics and computers for this research. Also, Mr. Charles Strong is recognized for his assistance on the technical writing of this dissertation. iii TABLE OF CONTENTS Page LIST OF TABLES . . . . . . . . . . . . . . . . . . . vii LIST OF FIGURES. . . . . . . . . . . . . . . . . . . ix Chapter I I INTRODUCTION. I I I I I I I I I I I I I I I Statement of the Problem. . . . . The Instructional Philosophy of Mastery Learning . . . . . . . . . Purpose of the Research . . . . Importance of the Research. . . . . . . 9 Research Questions. . . . . . . . . . . 11 Research Hypotheses . . . . . . . . . . 11 Hypotheses Regarding Achievement. . . . 12 Hypotheses Regarding Attitude . . . . . 15 Hypotheses Regarding Time Spent on Instruction. I I I I I I I I I I I I 18 Overview of Literature Survey - . . . . - 22 II. LITERATURE SURVEY . . . . . . . . . . . . . 23 Introduction. . . . . . . . . . . . . . . 23 Research Regarding Mastery Learning . . 25 Strategies Used to Set Criteria . . . . 27 Factors to Consider for Setting Criteria . . . . . . . . . . . . 33 Setting the Level of the Criterion. . . . 34 Summary of Research Regarding Mastery Learning . . . . . . . . . . . . . . 37 Study Time Needed to Attain Criterion . . 41 Summary of Study Time Needed to Attain Criterion. . . . . . . . . . . . . 44 Student Attitudes and Learning for Mastery. . . . . . . . . . . . . . . . 44 Summary of Literature on Student Attitudes I I I I I I I I I I I I I I “8 Summary of Literature Survey and Relation to Research Questions. . . . . . . . . 48 iv Chapter IIII PILOT STUDYI I I I I I I I I I I I I I I I IntrOduCtion I I I I I I I I I I I I I I Population and Sample. . . . . . . . . . Course Material and Instruction. . . . . Course Evaluation. . . . . . . Other Types of Study Aids Used in the Course. . . . . . . Validity of the Achievement Test . . Item Analysis of Unit Tests. . . . . Reliability of the Achievement Test. . Summary and Conclusion of the Validity and Reliability of the Achievement TeStI I I I I I I I I I I I I I I I Development and Assessment of the Attitude MeaSLlre I I I I I I I I I I I I I I I IV. RESEARCH DESIGN AND PROCEDURES . . . . . . Introduction . . . . . . . Experimental Design. . . . . . . . . . . Independent Variables. . . . . . . . . Control of Internal Validity of Treatments. . . . . . . . . . . . . Dependent Variables. . . . . . . . . . Procedures . . . . . . . . . . . . . . . Population and Sample. . . . . . . . . Treatments . . . . . . . . . . . . . . Instrumentation and Data Collection. . . Data Analysis. . . . . . . . . . . . . . V. ANALYSIS OF THE RESULTS. . . . . . . . . . Introduction . . . . . . . . . . . . . . Analysis of Covariates . . . . . . . . . Grade Point Average. . . . . . . . . . Statistical Analysis of the Chi Square Test of the Elective-Required Covariate . . . . . . . . Regression Analysis for the Age Covariate . . . . . . . . . . . . . Summary of Analysis of Covariates. . . . Interaction of Criterion by Repeated Measures on Mean Achievement . . . . . Interaction of Criterion by Repeated Measures on Mean Attitude. . . . . . . Interaction of Criterion by Repeated Measures on Mean Study Time. . . . . . Summary. . . . . . . Limitations of the Results on Attitude and Study Time . . . . . . . . . . . . 93 95 96 96 102 106 110 114 Page Chapter VI. DISCUSSION, CONCLUSIONS AND RECOMMENDATIONS. 115 Introduction- - - - - - - - - . . . - - - 115 Experimental Design . . . . . . . . . . 118 The Sample of the Research. . . . . . . 119 Method of Data Collection . . . . . 122 The Importance of the Covariables . 123 Discussion of the Analyses of the Results 124 Discussion on the Results of Achievement 124 Discussion on the Results of Attitude . . 127 Discussion on the Results of Total Study Time I I I I I I I I I I I I I I I I I 13]- Conclusions . . . . . . . . . . . . . 137 Summary of Conclusions. . . . . . . . . 139 Recommendations and Further Questions . . I40 APPENDICES I I I I I I I I I I I I I I I I I I I I I 143 Appendix A. Course Objectives . . . . . . . . . 143 Appendix B. Edward's Criteria for Selecting Attitude Statements . . . . . . . . 152 Appendix C. Attitude Survey A . . . . . . . . . 154 Attitude Survey B . . . . . . . . . 156 BIBLI OGRAPHY I I I I I I I I I I I I I I I I I I I I 158 vi LIST OF TABLES Page Table 1. Table of specifications for direct assessment of objectives covered on five unit achievement tests in the pilot study. See Appendix A for number corresponding to objective. . . . . . . . . . . . . 58 Table 2. Item analysis in % correct for each question for all unit tests in Greenhouse Management. *Items of less than 50% correct response were revised . . . . . . . . . . . . . . . 62 Table 3. Reliability coefficients calculated by the Livingston formula for criterion-referenced tests. Values are shown for each unit test taken by students during the pilot study . . . . . . . . . . 66 Table 4. The variable matrix is shown. The multiple dependent measures are shown for each time for each experimental variable . . . . . . . . . . . . . . . 76 Table 5. Prerequisite profile of students subjected to the criterion treatments . . . . . . . . . . . . 82 Table 6. Additional course work taken by the students in the stated criterion treatments. . . . . . . . . 82 Table 7. Analysis of variance for grade point average 0 o o o o o o o o o o e o o o e e o o e o e 92 Table 8. Means of the four experimental groups on grade point average . . . . . . . . . . . . . . . . 93 Table 9. Summary of data from the four experimental groups based on a 7:3 ratio (e1ective:required). Ratio was obtained when the pilot study was made. . 94 Table 10. Summary of data from the four experimental groups based on 41:15 ratio of the observed totals. 94 Table 11. Statistics for regression analysis for the age covariate . . . . . . . . . . . . . . . . . . . 95 Table 12. Univariate results for the criterion by repeated measures interaction on achievement. . . . 98 Table 13. Mean achievement scores for each unit test for the groups under study. . . . . . . . . . . . . 99 vii Page Table 14. Univariate ANOVA for the comparison of the mean achievement of the criterion groups to the contrOlI I I I I I I I I I I I I I I I I I I I I loo Table 15. Univariate ANOVA for the comparison of the mean achievement of the ascending criteria to the fixed criteria . . . . . . . . . . . . . . . . . . . 101 Table 16. Mean attitude scores for each unit test for the groups under study . . . . . . . . . . . . . 105 Table 17. Mean study time in hours for each unit for the groups under study . . . . . . . . . . . . . . . 110 viii LIST OF FIGURES Page Figure 1. Mean attitude score plotted over time for each group under study. . . . . . . . . . . . 103 Figure 2. Mean study time plotted over time for . . 109 each group under study. . . . . . . . . . . . ix CHAPTER I INTRODUCTION Statement of the Problem Educators are faced with the same methods of teaching and evaluating students that they used in the past. Typically a lecture hall is used to assemble a class. The instructor faces the class and begins an hour of talking about a subject. The instructor then designs a test to determine what has been remembered and/or understood. The test is graded with the intention that most students get a 'C' grade. The assignment of grades is based on the following assumptions: if students are normally distributed with respect to aptitude for some subject and all students are given exactly the same instruction, then achievement measured at the completion of the subject will be normally distributed. Aleamoni (March 1979) states that a distribution of student grades follows a normal curve so that there are 3 percent A's, 13 percent B's, 68 percent 0'3, 13 percent D's and 3 percent F's. But Block (1971) suggests that American education must turn away from this traditional method of teaching and evaluating. Schools must provide successful and rewarding learning experiences for 2 most students, not just a few. He suggests that criterion- referenced testing under the mastery strategy offers the greatest potential for students. Mastery learning (Block, 1971) offers a powerful new approach to teaching which can provide almost all students with the successful and rewarding learning experiences now allowed to only a few. Block (1971) suggests that 75 to 90 percent of the students can reach the same high level of achievement as the top 25 percent do under traditional group-based instructional methods. Group-based instruction is teaching of a group of students at a set hour in a set room. Further, the mastery approach includes procedures that are primarily designed for use in the group—based instructional situation, where the time allowed for learning is relatively fixed (Bloom,l968). Bloom's mastery strategy minimizes the time a student needs to learn. Therefore, most students can master the material within the calendar instructional time available. The next section explains the essential philosophy of the mastery strategy. It also discusses some of the important features of this innovative method. The Instructional Philosophy of Mastery Learning Mastery learning is an instructional philosophy and an associated set of ideas about instruction. This philosophy asserts that under appropriate instructional conditions most students can learn what they are taught (Block, 1974). There are several procedures in the mastery learning strategy which have made the above mentioned philosophy a reasonable instructional approach (Bloom, 1974). First, the idea of mastery does not require a normal distribution of grades from A to F. Instead, it suggests that each student should be given enough time to master the subject matter being taught. Thus, most students can get an 'A' grade. Second, and more central to the mastery learning strategy is the use of feedback and corrective procedures. Bloom (1974) has stated that there are a variety of procedures to provide practice and feedback. They are tests, homework and workbooks. Brief diagnostic progress tests has proved to be the most useful. The test shows what the student has learned from a chapter, a unit or some other learning sequence. It also is valuable feedback to the student and the instructor on what aspects of the learning unit are weak, needing correction and further study. Third, the diagnostic test must have some value which defines competence or mastery. The instructor can select a certain number of points correct out of a total or a percentage grade to define the needed level of competence. The benefits of declaring points or a percent is that it clearly relates achievement to the degree of mastery of what is set out to be learned. It provides a standard 4 measure of achievement. Consequently, students are not competing against one another. The manner by which levels of mastery have been determined is arbitrary. Herein lies a problem. The philosophy has been converted into procedures for grading and collecting of data upon which to set a grade. As yet there is no sound basis for deciding whether or not the student can be considered a 'master'. A master is a student who has met or exceeded the criterion score set by the instructor and therefore has learned to a sufficient degree. The evidence for a decision of mastery must be made on the basis of grades. Therefore, the problem is the setting of a certain criterion to determine grades. The criterion represents the absolute performance standard against which the sufficiency of each student's learning can be evaluated and graded (Block, 1971). This standard should indicate the specific amount of skills a student must show before he or she can be judged to have mastered the skills taught. The standard also indicates how well the student has learned. In that respect, the instructor knows exactly how much each student has learned. This is unlike a relative standard of grading which judges students in relation to others and not in relation to the course content. An example of an absolute standard is given by Bormuth (1970) and Glaser and Nitko (1970). They state that criterion referenced tests are absolute and that these 5 standards define what proportion of a well defined body of content and behavior the student is expected to learn. Thus, a test with a standard of 85% suggests that a student must show competency of the content to that level. According to Block, (1970) evaluators have ignored the problem of defining the criterion in an objective way. They have developed increasingly sophisticated data gathering instruments and procedures. But not one valid technique has been stated for defining the criterion in an objective manner. Hence, the degree of mastery of many students is being misjudged. For example, suppose a student's learning is misjudged due to a poor criterion. The student may have to review material already learned and in so doing waste valuable study time. This time could have been spent studying material of a more advanced stage. A continuation of this defect in evaluation may eventually lead the student to a poor attitude toward a subject, a major and even to school itself. As Block (1970) states, this is most unfortunate because accurate indications of the sufficiency of a student's learning are crucial to his/her cognitive and attitudinal outcomes. Hambleton et a1. (1978) in their review of criterion levels state that the matter of the determination of criterion scores seems unclear. Further, they state that there are no procedures for sorting through the numerous approaches for determining the criterion scores in order to 6 select the most appropriate one for a given situation. Implementation strategies for nearly all of the approaches are also lacking. Hence, if we are to solve the problem of setting the criterion in the mastery learning strategy, a full scale directed effort must be undertaken to research the problem further. This research attempts to partially solve that problem. In conclusion, the best available information on setting a criterion suggests that criteria are set in an arbitrary manner. It is up to an instructor to determine the level of the criteria. Therefore, this research addresses itself to the problem of the determination of a criterion score used in the mastery strategy. The research effort seeks to find an empirical basis for the criterion. The score can then be implemented when the mastery strategy is used. With this intention, the following purposes are stated. Purpose of the Researgh Instructors have selected their criteria for mastery without an explicit theory or any evidence suggesting that those chosen over others are superior in fostering student development (Block, 1970). Block (1970) goes on to say, therefore, it is entirely possible that criteria selected may not represent the best learning and attitudes as other criteria. The research proposed has been developed to correct the major problem of setting a criterion within the mastery 7 strategy. Based on this research some criteria may be decided as a result of evidence. Four steps were taken within this research to remedy the problem of setting a criterion. First, students' achievement results from criterion referenced tests were logged. These tests were diagnostic tests taken during the course. Second, the setting of the criterion was based on a test performance which produced the highest achievement. Therefore, the instructor can evaluate what has been learned and how well it has been learned. Third, the time to attain the criterion was within a period allocated for the course. In addition, the time to learn the content was efficient for the student. Fourth, the attainment of high levels of achievement on criterion tests did not sacrifice the attitude of the student toward the course or major. In an effort to correct the problem of setting a criterion, the following purposes of this research are stated: 1. It is expected that having a criterion as a goal will be of a benefit to students to achieve each unit of study in the most efficient manner. To accomplish this one may use fixed or ascending criteria to evaluate the performance of students on various units of study. Fixed criteria are absolute standards set prior to testing. They do not change from one test to the next. For example, an 8 80% criterion is the standard which all students are judged on each unit test. In the case of an ascending criterion, a first test may be assigned an absolute standard of 80%. Each successive test will have a new and higher percent standard by which achievement is judged. It was hoped that by setting criteria on each test we would be able to identify levels which, when maintained throughout the learning, encourage students to learn adequately and score well on a criterion measure (Bormuth, 1969 and Block, 1970). Thus, the attainment of the criterion will indicate to the instructor that most course objectives were learned by the students. In addition, the instructor will be assured that the student has acquired a sufficient amount of course information. Therefore, the student will be judged competent in the subject. 2. The positive attitude of students may increase or decrease in response to the difficulty of a criterion. For example, a very high criterion of 90% may force students to reach that level. But, it may also cause the attitude of the student to decrease significantly. Since attitude may change, it will be the purpose of the research to identify a criterion which when used throughout the course, will produce the best student attitude toward the course. 3. Since the efficiency of learning can be interpreted as the total time it takes to learn a skill or series of skills in a unit of study, it was also the purpose of this research to manipulate the criterion or standard 9 used to judge students to see if students can be made to reduce their time to learn over the duration of the course. In summary, the intention of the research was to establish a criterion which can produce the greatest efficiency and achievement without a sacrifice of the student's attitude toward the course. In addition, the aim of high achievement will also foster greater efficiency of study as manifested by time invested in the learning of content. If these objectives are attained, then a partial answer will be given as to a basis for setting the criterion under the mastery strategy. Finally, Block (1970) suggests that if instructors can choose adequate performance levels, then educational 'programs might become more effective. Sound criteria make this possible. So that the research is not understated, the following statements are made to emphasize its importance. Importance of the Study This study was important for the following three reasons. First, the study provided a basis of setting criterion levels for a mastery program. Thus, one can base a criterion on experimental evidence. Through the implementation of an improved method of criterion selection, the student may be able to attain high levels of achievement. Second, the proper selection of criterion is keyed closely to the attitudes of the student. There must be knowledge of a criterion which can produce a high level of 10 achievement without sacrificing the attitude of the student. Block (1972) states that "the attitudinal changes which do occur raises the important question of whether in pushing some students to attain very high levels of performance throughout their learning we are not, in fact, promoting their intellectual development at the expense of their feelings toward the material learned." Lastly, the criterion selected must be attainable within the time allowed for the course. In order to meet the objective of an attainable criterion, the student must be molded into more efficient behavior. It is suggested that there may be particular criterion levels whose attainment early in the sequence will progressively increase the amount of material achieved per time later_in the sequence. Students who learn under the mastery strategy, therefore, may eventually be able to achieve their required criterion level in the same amount of instructional time that should ordinarily be expected of students who learn under non-mastery conditions. Non-mastery refers to group- based instruction whereby a student is instructed and evaluated on his/her performance relative to others in the class, and where curves are established to define a spread or range of scores from'A'to'F'. 11 Research Questions The need for this research has been stated in the above sections. In general, the research is addressed to the kinds of variations of presenting the criterion to students in order to yield the greatest achievement. In addition, in what ways can criteria be presented to students so that they maintain a high achievement and a positive attitude toward a subject? Lastly, the cognitive learning of the course content should be done with a minimum amount of time. In an effort to provide a comprehensive answer to a selection of a criterion, the research seeks to investigate the following questions: 1. Does one criterion produce more achievement than another? 2. Does one criterion produce better student attitude than another? 3. Does one criterion produce more efficient study scheduling than another? Research Hypotheses From the research questions for this study, the following hypotheses were drawn. In each case, the selection of criterion was tested for its effect on achievement, attitude and total instructional time needed to learn each unit of study. 12 Hypotheses RegardingiAchievement The overall hypothesis of the interaction of the treatments and time for the achievement dependent variable was: There will be an interaction between treatments and time for mean achievement. The direction and the magnitude of the interaction for each treatment group is stated below, 1. The ascending criterion group will have a progressively higher score on achievement for each unit test over the period of the quarter. It is expected that early success on unit tests should motivate students to succeed later in the quarter when the criterion is at its highest level of 90%. 2. The 90% fixed criterion group will have a progressively lower score on achievement for each unit test over the period of the quarter. This group is required to reach a high level of achievement from the beginning of the quarter. We should expect early frustration in an attempt to attain this high level. There may be a loss of motivation to succeed to a high level later in the quarter if early failures are encountered. 3. The 80% fixed criterion group will have the next lowest but a moderately stable score on mean achievement for each unit test over the period of the quarter. The relative ease of reaching a low level of criterion should produce little change in achievement score from unit to unit. 4. The control group will have the lowest mean achievement score of any group for each unit test over the 13 period of the quarter. The students of this group are graded on a straight percent, 90%, 80%, 70%, 60% and 50% of the total score. With only one testing at each unit test, we should expect a distribution of scores of'A'to'FY. Furthermore, the distribution should produce an average grade of 'C'. This outcome is unlike criterion-referenced testing. Testing is done to bring most students to the highest level of achievement. Alternative learning aids are used to assist learning when mastery is not reached. Retesting is used to re-evaluate student learning. Students of the control group are not given a chance at remediation and retesting. 5. The ascending criterion referenced group, the 80% and 90% fixed criterion-referenced group will receive a higher score on a measure of achievement than students in the control class. This difference will exist because a greater number of students will attain mastery of each of the tests taken. This is so because students under the mastery teaching method are required to attain mastery or reach the prescribed criterion for each unit of study before advancing to the next unit. Therefore, more students should reach an 'A' under the criterion based grading. 6. The ascending criterion group will receive a higher mean achievement score than a fixed criterion group of 80% or 90%. The ascending criterion starts at 80% and increased by 5% in each successive unit until 90% is reached. It then remains stable at 90%. The fixed criterion (80% or 14 90%) remains the same for each test. The ascending criterion is predicted to be better than the 90% level because students will find it easier to achieve mastery early in the quarter. It is assumed that the early success creates a positive attitude toward the subject under this condition. To know that one can pass the tests early should motivate the students to work hard to pass tests with higher achievement levels later. Also, the maintenance of a particular high performance level later in the subject is less threatening and approached with an expectation of success. The 80% fixed criterion group should be equal to the ascending group early in the sequence. As the ascending group finds it more difficult later in the sequence, the mean difference on the achievement will become more apparent. The ascending group will score significantly higher than the 80% group since the level of achievement for the 80% group is lower. 7. The 90% fixed criterion group will get a significantly lower score on achievement than the ascending group. From the beginning, the level of achievement is set very high. Students will probably feel that this is an unreasonable expectation to meet. They will probably have much frustration in an attempt to score to a high level set for the course. Under these conditions, we can expect students to become discouraged early in the quarter. The early disappointment over failure to score properly will probably discourage students to try to score higher 15 in the quarter. 8. The 80% fixed criterion group will get a significantly lower achievement score than the ascending group. Since the achievement level of the 80% group is set so low for the whole term the students need not score as high as the ascending group to obtain mastery or an 'A' grade. Finally, the 80% fixed criterion group will receive a lower mean achievement score than the 90% fixed criterion group. A 80% criterion from start to finish is set so low that students of this criterion do not have to score as high as the 90% criterion group. Therefore, the average achievement score should be much different. Hypotheses Regarding Attitude The overall hypothesis of the interaction of the treatments and time for the attitude dependent variable was: There will be interaction between treatments and time for mean attitude. The direction and the magnitude of the interaction for each treatment group is stated below. 1. The ascending criterion group will have the most positive mean attitude over time. This group is expected to have a progressively more positive attitude because they will attain the stated criterion for the earlier unit tests without much difficulty. The early success should motivate students to succeed later when the unit tests have become more difficult to attain. The continued success should 16 produce the higher positive attitude toward the course later in the quarter. 2. The 80% fixed criterion group will have the next most positive attitude over time. The relative ease of attaining criterion for each unit should produce a high positive attitude. The attitude change is expected to be moderate over time since the level of criterion can be attained without much effort. 3. The 90% fixed criterion group will have a progressively more negative attitude over the period of the quarter. While the attitude of the 90% group may start as positive as the other groups, the attitude is expected to become more negative as the criterion level continues to be difficult to attain. The author is not suggesting that it is impossible to attain very high levels of achievement but that in pushing students to do so may produce a negative response in attitude toward the course over a period of time. 4. The control group will have the most negative attitude toward the course over the period of the quarter. This group should have the most difficult time of any group in trying to succeed in a course which has a straight percentage method of grading. The students have no opportunity for remediation and retesting. We should expect the greatest frustration over difficulties in attaining a desirable score on each unit test. 5. The mastery students under criterion referenced testing will have a higher mean score on a measure of 1? attitude than students in the control class. One would expect a more positive attitude under mastery since those students would be given more opportunity to attain a high level of achievement. One should also expect students to be threatened less by a course which does not seek to promote a standard distribution of grades from 'A' to 'F'. 6. The ascending criterion group will have a significantly higher mean score on a measure of attitude than the 80% or 90% fixed criterion groups. Early success on the unit criterion-referenced tests of the ascending group will build confidence. In addition, it is predicted that students achieving early in the quarter are likely to report that they are learning well. Thus, it is reasonable to suggest that a positive attitude toward learning will stimulate students to master the next higher criterion level. Students will not turn away from the subject matter as they might if success on tests is hard or impossible to attain from the beginning. 7. Students in the 90% fixed criterion group will have a significantly lower mean score on a measure of attitude than the ascending group. The score which this group must attain from the beginning is set very high. There will probably be a great deal of frustration in trying to reach such a high level of achievement on unit tests. The frustration will probably be increased by the restudy and retesting that must be done in an attempt to succeed on the achievement tests. Because of this, we can expect students 18 to become much more negative in attitude earlier in the quarter. Since the high standard is fixed until the end of term, the attitude will remain negative. 8. Students in the 80% fixed criterion group will have a significantly lower score on the measure of attitude than the ascending group. The ease at which the 80% group can achieve each unit test will probably cause no significant change in attitude toward the course. The ascending group will also have success on the same unit test as the 80% group when the criterion is low. But, the ascending group is faced with more difficult levels of criterion late in the quarter. Therefore, the early success should motivate students to try harder to succeed later. As the ascending group continues to achieve on its tests, a more positive attitude should be noticed. Lastly, the 80% fixed criterion group will have a significantly higher mean score on the measure of attitude than the 90% fixed criterion group. While one might be able to push students to attain a very high level of performance throughout their learning, students of the 90% group may develop a negative attitude toward the material learned. This is why we should expect significantly higher positive attitude at the lower criterion. Hypotheses Regarding Time Spent On Instruction The overall hypothesis of the interaction of the treatments and the time for the time spent on studies dependent variable was: There will be an interaction between 19 treatments and time for the mean time spent on studies. The direction and magnitude of the interaction for each treatment group is stated below. 1. The ascending criterion group will spend progressively less time per unit on studies over the period of the quarter. In the beginning of the course, students of the ascending group will have an opportunity to adjust to the course when the criterion is low. Once they have established themselves under the lower criteria, the study time of students will be less and less for each successive unit. It is suggested that students who are successful early in the quarter will not find it necessary to over- study to ensure that learning is complete. 2. The 80% fixed criterion group will spend an equal amount of time per unit on studies over the period of the quarter. A criterion of 80% is easy to reach. Once students realize that it takes little time and effort to obtain the desired score for each unit, they will spend an equal amount of time on their study of each unit to assure success. 3. The 90% fixed criterion group will spend progressively more time per unit on studies over the period of the quarter. A Criterion of 90% for each unit test is a very difficult standard to reach. If students are to assure themselves of reaching the desired criterion, a greater amount of time must be spent on studies. Any failure to reach the stated criterion level for a unit test will 20 indicate to the student that one must study more for the next unit test to achieve a criterion of 90%. The students will tend to add much more study time in an attempt to make sure that they have learned. Therefore, this pattern of study habit is very inefficient. 4. The control group will spend the most time per unit on studies over the period of the quarter. The opportunity to do very well in a course will depend upon performance on each test. The understanding of the content must be complete on the first attempt of each test. There is no chance for remediation and retesting. In an effort to be as complete as possible on the understanding of the course content, the students of the control group will spend a great deal of time in learning. This situation will produce inefficient study scheduling by students. 5. The criterion-referenced groups will spend significantly less mean time on study than the control group. The time spent on study can be considered a measure of study efficiency. When there is less study time spent to master a particular unit, the time spent on study is considered more efficient. There may be a particular criterion level whose attainment early in the sequence of study will progressively increase the amount of material achieved per unit time later in the course. Thus, students learning under criterion-referenced testing may be spending less time in study for tests. The students of the control group should be expected to spend a constant amount of time 21 through their learning. Consequently, the time spent on study will be greater for the control group than the criterion-referenced groups. 6. The ascending criterion group will spend significantly less mean time on study than the 80% or 90% fixed criterion groups. Students of the ascending group will have an opportunity to adjust to the course early in the quarter. Once they have established themselves under the lower criterion, their study time will be less and less for each successive unit. In further support of the hypothesis, it is suggested that students who find success early in the course will not find it necessary to over-study to make sure that they have learned. The overall effect will be to shape the student into an efficient pattern of studying. Therefore, it is predicted that the ascending criterion offers the greatest opportunity to provide students with greater efficiency of study. This is accomplished by a gradual incline to a more difficult criterion. 7. The 90% fixed criterion group will spend significantly more mean time on study than the ascending group. Students in the group with 90% fixed criterion will have a very high level of achievement. Therefore, it can be expected that there will be more time spent on initial learning. Additional time will also be needed for re—study and re-take tests. Overall, the time to learn to an adequate level will be greater than the ascending group. 22 8. The 80% fixed criterion group will spend significantly more mean time on instruction than the ascending group. Since the level of achievement for the 80% group is fixed at 80% of the total points, we can expect students to spend the same amount of study time for each unit during the quarter. The ascending group is expected to decrease in study time over the same quarter of instruction as the 80% fixed group. Therefore, the mean score on study time will be greater for the 80% fixed group. Overview of Literatgre Survey The next chapter will review the literature of mastery as it specifically relates to research on mastery learning and to the criterion setting procedures. Since this chapter has involved attitudes and the study time of the student, it is relevant to explore the literature of mastery learning with regard to these subjects. This will be done to develop background of data for specific criterion setting research procedures which produce best learning in a relatively short amount of time without sacrificing the attitude of the student. CHAPTER II LITERATURE SURVEY Introduction There are many innovative possibilities to foster the learning process. Mastery learning is a specific method of the general mastery strategy which can be used to implement this process. The general mastery strategy is defined by two essential features. One, the course content is segmented into a number of relatively short, self contained units. Students are tested on each unit. Second, students are expected and required to meet a predetermined criterion or level of mastery before progressing to the next unit and its test. The basic assumption of the mastery approach is that almost all students can and will learn. To meet the assumption, a set of procedures have been established. The first is that mastery entails the formulation of a set of instructional objectives that all students are expected to achieve to a particular mastery performance standard. The second procedure is the breakdown of a course into a sequence of smaller learning units where each unit typically covers several course objectives. 23 24 The third procedure is the construction of brief progress tests called formative evaluation instruments for all learning units. These tests are typically ungraded, but in some cases they may be used as the basis for the final grade. The resultant grade indicates whether the student has or has not achieved the course objectives to the appropriate level. The final procedure is the preparation of alternative learning materials for students who have not attained mastery of the objectives of the unit. These alternatives teach the objectives in a way different than the teacher's lecture presentation. The procedures used in this thesis were those reviewed by Block (1971). Briefly, Block reports that a subject is chosen and broken down in a specified number of units. Preferably, the subject is one requiring convergent thinking; that is, it has a definite body of knowledge upon which a group of experts can agree. Objectives are specified in a behavioral sense so students know what is expected. Ideally, the units of study build on one another. In some cases, courses may not be in a hierarchical order but are broken into units by subtopic. The students are asked to master each unit of study at a specific criterion level. The grading is, therefore, absolute in that it depends upon a level of attainment of criterion and not the class average or a curve generated from relative groups of students. When students do not reach mastery, a wide range 25 of procedures is initiated to help students study the same unit material and correct deficiencies. Students are then allowed to retake a unit test for mastery. The various unit tests represent formative tests; that is, tests which are not used for grade but tests which are used to inform the student of deficient areas. The summative test is used at the end of the course to put together all that has been mastered. This is the test for a grade. Bloom (1971) states that this method of learning for mastery has allowed up to 90% of the students in a particular class to achieve an 'A' grade. Since this thesis centered around mastery learning and the use of criteria, the concept of mastery learning was reviewed in some detail. Research Regarding Mastery Leagnigg Block (1971) reviewed the results from approximately 40 major studies on mastery learning. All these studies have been done under actual school conditions. They have involved all levels of education and in subjects ranging from arithmetic to philosophy to physics. Block states that these major studies have shown that 90 percent of the mastery learning students have achieved as well as 20 percent of the non-mastery learning students. Several other studies not reviewed by Block (1971) are reported below in this review. In 1968, Amthor compared two classes of a course in descriptive geometry at the college level. Both classes 26 were presented with identical instruction but differed in the type of evaluation or learning strategy used. One class was taught the content of the course in lecture. They were tested on the content one time and awarded letter grades 'A' through 'F'. The students of the other class were taught under a mastery learning strategy. This strategy was explained earlier in this chapter. The results of Amthor's study were reported in terms of the number of students who received a grade of 'A' in each of the classes. The results show that 23 of the 29 students (about 80%) received a grade of 'A' for the mastery learning treatment while only 11 of the 63 (17.46%) received an 'A' in the 'A' through 'F' non- mastery graded system. Foth (1973) reports that an improved version of his mastery learning program in soil science at Michigan State University produced a grade of 'B' or better for 90% of the students; 70% achieved a grade of 'A'. In general, research by Foth found that between 70% and 80% of students received an 'A' instead of the 95% proposed by Bloom and Block. In further support of mastery learning, Wentling (1973) finds that high school students enrolled in General Automobile Mechanics obtain significantly higher mean achievement scores for both immediate achievement (test a day later) and retention (same test given three weeks later). A study conducted by Johnson, Gnagey and Chesbro (1970) contradicts the research of Foth, Amthor and Wentling. They used the mastery method whereby students were tested over the materials covered in lectures, texts 27 and outside readings. One group had to make a score of 80% on weekly quizzes for mastery. They were required to retest if unsuccessful until they passed. A second group was given the same four 60-item unit exams and a comprehensive final examination and assigned letter grades on the first try. A third group received no weekly test but spent the time discussing the material. None of the groups showed any increase in learning as reflected by examinations covering the material. The students were alike in their learning. The research of Johnson et al. was the only research which contradicted the positive results of mastery learning. Nevertheless, the evidence is over- whelmingly weighted in a positive direction for improved achievement under the mastery learning procedure when all of its aspects are used to teach a course. Strategies Used To Set Criteria The literature of the past 20 years has not reported much research on the basis for setting of criterion levels. Instead, it has produced a controversy on the validity of setting criteria. The controversy has centered around Ebel's (1971) objection on the general meaningfulness of criteria of achievement. He states that criteria must not represent the interests, values and standards of just one teacher, but they usually do. This is true because teachers have not taken the time to come to a consensus about criteria. Therefore, according to Ebel, they lack validity and useful 28 meaning. Block's (1971) rebuttal to Ebel is not strong and direct. Instead, he contends that the setting of an absolute level insures that each student completes his/her learning before advancing to new information. How high a level of achievement or what knowledge is to be acquired is not answered. With the exception of experimental papers by Block (1970) and Carlson and Minke (1975), much of the rationale used to set criterion levels for mastery learning has been subjective. In this regard Bloom (1971) states that a necessary condition for mastery is the setting of absolute performance standards. Block (1971, 1974) remarks that there are no hard and fast objective rules for setting criteria. But criteria must be set to use as the basis for grades in order to reflect attainment of those standards. One broad suggestion regarding a strategy used to set criteria (Bloom, 1971, Block, 1971 and Millman, 1973) is to set realistic performance standards for each school or group in cooperation with teachers and administrators. The teachers and administrators would inspect test items to determine the minimum number of items that students must answer correctly in order to be considered in a "mastery state." A variation of this suggestion is proposed by Millman (1973). Test items are sorted into meaningful clusters. The clusters may correspond to the objectives of the course. Experts in the field determine the criterion score for each cluster of items. "Mastery status" could 29 be assumed for students whose test performance on test items in each cluster met or exceeded the corresponding criterion score. Another educational approach for setting the criterion is Millman's (1973) approach. Millman (1973) suggests two procedures. One deals with setting the criterion so that a predetermined percentage of a group of students pass. This procedure is inconsistent with the philosophy of mastery. The philosophy asserts that students should be encouraged to achieve optimum learning of the stated course objectives. A second procedure is to administer a test to students who have already mastered the material. The criterion is chosen as the raw score corresponding to a chosen percentile score. Hambleton et al. (1978) state that this procedure has its limitation but they do not state why it is limited. A third approach for setting criteria is that grades for the following year might be based on grading standards arrived at the previous year if parallel examinations are used. Specifically, Block (1971) states that scores which earned students learning under non-mastery condition 'A's' and 'B's' might be useful mastery grading standards. Based on Block's suggestion, Hapkiewicz and Foth (1973) have reported that scores which earn students 'A's' or 'B's' in a previous term when grades were assigned on a curve were specified as the standard for students in mastery learning courses. A scale was developed from previous course grades 3c in Soil Science 210 at Michigan State University (Hapkiewicz and Foth, 1973). The scale was more rigorous than most previous scales used since no one received a grade point average of 4.0 with less than 88%: whereas some students received a 4.0 with only 84% when grades were based on a curve under the non-mastery system. A fourth approach is suggested by Hambleton et a1. (1978). They claim that, in general, criterion scores probably should be based on psychological and educational considerations, but in some instances statistical considerations can be brought to bear on the problem of setting a criterion score. Several statistical procedures, which they reviewed, are stated in the following paragraphs. Huynh and Perney (in press-see Hambleton et a1. 1978) suggest a method of estimating criterion scores. Test performance data for a group of students on a series of unit tests plus test scores from a "referral task" are needed to start their algorithm for criterion score and domain score estimation. 0n the basis of an initial classification of students into mastery states, determined by data obtained from the referral task, a score and domain scores for the last unit in the sequence can be obtained. The criterion score and mastery determination from the last unit will then serve as the "referral task" data for the second to last unit. The process is continued until scores and domain score estimates are available for all students on each of the unit tests. 31 According to Hambleton et a1. (1978) the practical value of Huynh and Perneys' method of estimating criterion scores is unknown. The method of Huynh and Perney appears to have several problems. It assumes all items in a unit test to have equal difficulty. It requires the existence of an independent measure of performance, to which they referred in their work as a "referral task." There must be the proper sequencing of units. Also, there is a subjective assignment of students into mastery states based on the referral task. Berk (1976) proposes a relatively simple procedure for selecting a criterion score. The method requires the selection of instructed and uninstructed groups of students. Instructed students are those who have received "effective" instruction on an objective to be assessed. Effective instruction involves a qualitative judgement about the mastery of an objective by students. Uninstructed students are those who have not received instruction on an objective. They are also tested to see if they have mastered the objective. Generally, the distribution of instructed and uninstructed student scores, ranging from zero to i, where i is the number of items on the test, can be divided by a series of criterion scores into two general categories: masters and non-masters. According to Berk, a criterion which produces the greatest frequency count of students at or above the criterion identifies the groups generally 32 considered masters. Those below the criterion are considered non-masters. Since it is assumed that the students of the instructed group are 'true masters“, these students are, specifically put into two classes: True masters (TM), and false non-masters (FN). Similarly, the students in the uninstructed group are classified as false masters (FM) and true non-masters (TN). The classification just mentioned is expressed in a box form below. Criterion Classification Instructed (I) Uninstructed (U) Predicted :1; Masters Type II Error .23 (PM=TM + FM) True Masters False Masters H+>0 (TM) (FM) ocsua 13.3 no -H#4§ 3E3: Type I Error 3:;“5 Predicted False Nonmasters True Nonmasters 90 >30 :wxa >=o >90 me me hfla me me nap-«g HHE HHEIJ HHE HID-lg 326-! 3:9 LEE-IE :29 LEE-I 06H QBH mea 09H UBH