A comparison between the vertical scaling of tests sensitive to multiple dimensions using common-item and common-group designs
Three item response theory (IRT) linking designs were compared using real testing data from an English as a second language (ESL) exam program: common-item, common-group, and a combination of the two (referred to as common-common). The linking was treated as "vertical scaling" rather than "equating" for two reasons. First, the test was designed to measure three different traits of English ability, and multidimensional IRT and factor analysis of the testing data confirmed that the test was multidimensional. Second, the two test forms were not at the same difficulty level: their average difficulty parameters differed by about 0.5, 1.0, or 1.5 standard units. The effects of test length and of differences in average difficulty level were also analyzed. For practical reasons, the anchor test used in the common-item linking design could not represent all the dimensions of the test forms.
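As a rough illustration of what a common-item link involves, the sketch below applies the mean/sigma method to anchor-item difficulty estimates. The abstract does not say which linking method the thesis used, so the choice of mean/sigma is an assumption, and the function names and all numbers are hypothetical.

```python
from statistics import mean, pstdev


def mean_sigma_constants(b_anchor_base, b_anchor_new):
    """Return (A, B) so that b_base is approximately A * b_new + B.

    Both arguments hold difficulty estimates for the SAME anchor items,
    calibrated separately on the base form and on the new form.
    """
    if len(b_anchor_base) != len(b_anchor_new) or len(b_anchor_base) < 2:
        raise ValueError("need at least two matched anchor items")
    sd_new = pstdev(b_anchor_new)
    if sd_new == 0:
        raise ValueError("anchor difficulties on the new form have no spread")
    slope = pstdev(b_anchor_base) / sd_new
    intercept = mean(b_anchor_base) - slope * mean(b_anchor_new)
    return slope, intercept


def rescale_item(a, b, slope, intercept):
    """Place a new-form item's discrimination and difficulty on the base scale."""
    return a / slope, slope * b + intercept


def rescale_theta(theta, slope, intercept):
    """Place a new-form ability estimate on the base scale."""
    return slope * theta + intercept


# Hypothetical anchor difficulties: the new-form calibration sits 0.5 units
# below the base-form calibration for the same three items.
b_new = [-1.0, 0.0, 1.0]
b_base = [-0.5, 0.5, 1.5]
A, B = mean_sigma_constants(b_base, b_new)
a_linked, b_linked = rescale_item(1.2, 0.0, A, B)
```

Because the constants are estimated only from the anchor items, an anchor that misses some dimensions of the full forms (the practical limitation noted in the abstract) can bias the resulting scale.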
- In Collections: Electronic Theses & Dissertations
- Copyright Status: In Copyright
- Material Type: Theses
- Authors: Yu, Jing
- Date Published: 2007
- Subjects: Educational tests and measurements; English language--Study and teaching--Foreign speakers; Item response theory
- Program of Study: Counseling, Educational Psychology, and Special Education
- Degree Level: Doctoral
- Language: English
- Pages: ix, 115 pages
- Permalink: https://doi.org/doi:10.25335/5ra9-5723