Technical Manual - Renaissance Learning

STAR Early Literacy Technical Manual

Psychometric Research Supporting STAR Early Literacy Enterprise

Item Calibration

The objectives of the Calibration Study were to:

- Collect sufficient response data to allow IRT item parameters to be estimated for all 2,929 STAR Early Literacy items.
- Conduct preliminary research into the psychometric reliability of STAR Early Literacy tests, using a test-retest design.
- Assess the degree of relationship between STAR Early Literacy scores and a standardized reading achievement test.

In support of the first objective, provisions were made during forms design to facilitate expressing all IRT item parameters on a common scale. To that end, some of the test items were used as “anchor items”: items common to two or more forms that are used to facilitate linking all items to the common scale. Two kinds of anchoring were used: 1) horizontal (form-to-form) anchoring, and 2) vertical (level-to-level) anchoring.

Horizontal anchoring: The purpose of horizontal anchoring is to place all items at a given level on the same scale, regardless of differences among the forms at that level. To accomplish that, several items appeared in all forms at a given level. These horizontal anchor items were chosen to be representative of the seven content domains and to be appropriate for the grade level.

Vertical anchoring: The purpose of vertical anchoring is to place items at adjacent levels on the same scale. To accomplish that, a number of items were administered at each of two adjacent levels: A and B, or B and C. As much as possible, the vertical anchor items were chosen to be appropriate at both the lower and higher levels at which they were used.

Table 12 depicts the distribution of the three types of items within STAR Early Literacy calibration test forms. The distribution differs from one level to another. The three item types are horizontal anchor items, vertical anchor items, and unique (non-anchor) items.

Table 12: Number of Anchor Items and Unique Items in Each 40-Item Test Form, by Level

Item Type                  Level A        Level B      Level C
                           (Pre-K & K)    (Grade 1)    (Grades 2 & 3)
Horizontal anchor items     5              7            5
Vertical anchor items       5             11            6
Unique items               30             22           29
Total                      40             40           40
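
The counts in Table 12 can be checked with a few lines of Python. This is only an illustrative sketch: the dictionary layout and the names FORM_COMPOSITION and anchor_share are assumptions made for this example, not part of the manual. It confirms that each level's item types sum to the 40-item form length and reports what share of each form consists of anchor items.

```python
# Item counts per calibration form, transcribed from Table 12.
# The structure and names below are hypothetical, chosen for illustration.
FORM_COMPOSITION = {
    "A": {"horizontal_anchor": 5, "vertical_anchor": 5, "unique": 30},
    "B": {"horizontal_anchor": 7, "vertical_anchor": 11, "unique": 22},
    "C": {"horizontal_anchor": 5, "vertical_anchor": 6, "unique": 29},
}
FORM_LENGTH = 40


def anchor_share(level):
    """Fraction of a form at the given level made up of anchor items."""
    counts = FORM_COMPOSITION[level]
    anchors = counts["horizontal_anchor"] + counts["vertical_anchor"]
    return anchors / FORM_LENGTH


for level, counts in FORM_COMPOSITION.items():
    # Every form must contain exactly 40 items.
    assert sum(counts.values()) == FORM_LENGTH
    print(f"Level {level}: {anchor_share(level):.1%} of items are anchors")
```

Level B carries the largest share of anchor items, which is consistent with its role as the level that shares vertical anchors with both Level A and Level C.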
For reliable IRT scale linking, it is important for anchor items to be representative of the content of the tests they are used to anchor. To that end, the distribution of anchor items was approximately proportional to the distribution of items among the domains and skills summarized in “Content and Item Development” on page 15.

To accomplish the second objective of the Calibration Study, many of the participating students were asked to take two STAR Early Literacy tests so that the correlation of their scores on two occasions could be used to evaluate the retest reliability of STAR Early Literacy tests over a short time interval. Topics related to reliability are described in “Reliability and Measurement Precision” on page 44.

To accomplish the third objective, a subsample of the grade 1, 2 and 3 students also took a computer-adaptive STAR Reading 2.x assessment to provide a basis for evaluating the degree of correlation between STAR Early Literacy and reading ability. Statistical results are presented in “Validity” on page 55.

Statistical Analysis: Fitting the Rasch IRT Model to the Calibration Data

With the response data from the Calibration Study in hand, the first order of business was to calibrate the items and score the students’ tests. This was done using the “Rasch model,” an IRT model that expresses the probability of a correct answer as a function of the difference between the locations of the item and the student on a common scale. Rasch model analysis was used to determine the value of a “difficulty parameter” for every item, and to assign a score to every student. In the analysis, a number of statistical measures of item quality and model fit were calculated for each item.

Item parameter estimation and IRT scoring were accomplished using WINSTEPS, a commercially available Rasch model analysis software package. WINSTEPS is capable of Rasch analysis of multiple test forms simultaneously. Using this capability, three item parameter estimation analyses were conducted. All Level B test forms were analyzed first, and the resulting scale was used as the reference scale for the other forms. Following that, separate analyses were conducted of the Level A and Level C forms. In each of the last two analyses, the parameters of anchor items common to Level B were held fixed at the values obtained from the Level B analysis. This had the effect of placing all Level A and Level C item parameters on the Level B scale. [1]

[1] All 246 test forms contained a number of anchor items. At each of the three levels, a small set of items specific to that level was common to all of the forms; these “horizontal anchors” served to link all forms at a given level to a common scale. Additionally, every form contained some items in common with forms from adjacent levels; these “vertical anchors” served to link the scales of Levels A and C to the reference scale based on Level B.
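
The idea of holding anchor parameters fixed can be illustrated with a small Python sketch. It assumes the standard logistic form of the Rasch model, P(correct) = 1 / (1 + exp(-(theta - b))), and a deliberately simplified two-step procedure: student abilities are estimated from the anchor items only, with their difficulties held fixed, and the difficulties of new items are then estimated from those abilities. This is not the WINSTEPS estimation algorithm, and every function name, difficulty value, and response pattern below is hypothetical.

```python
import math


def rasch_probability(theta, b):
    """Probability of a correct answer for ability theta and item difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def estimate_ability(responses, difficulties, iterations=25, bound=6.0):
    """Newton-Raphson ML estimate of one student's ability, difficulties fixed."""
    theta = 0.0
    for _ in range(iterations):
        probs = [rasch_probability(theta, b) for b in difficulties]
        residual = sum(x - p for x, p in zip(responses, probs))
        information = sum(p * (1.0 - p) for p in probs)
        # Clamp so all-correct or all-incorrect patterns stay finite.
        theta = max(-bound, min(bound, theta + residual / information))
    return theta


def estimate_difficulty(responses, abilities, iterations=25, bound=6.0):
    """Newton-Raphson ML estimate of one item's difficulty, abilities fixed."""
    b = 0.0
    for _ in range(iterations):
        probs = [rasch_probability(theta, b) for theta in abilities]
        residual = sum(x - p for x, p in zip(responses, probs))
        information = sum(p * (1.0 - p) for p in probs)
        # More correct answers than expected means an easier item: lower b.
        b = max(-bound, min(bound, b - residual / information))
    return b


def calibrate_new_items(anchor_responses, anchor_difficulties, new_item_responses):
    """Place new items on the anchors' scale (simplified two-step sketch)."""
    abilities = [estimate_ability(r, anchor_difficulties) for r in anchor_responses]
    return [estimate_difficulty(col, abilities) for col in new_item_responses]


# Hypothetical anchor difficulties, standing in for values fixed by a
# reference-scale analysis.
ANCHOR_DIFFICULTIES = [-1.0, 0.0, 1.0]

# Hypothetical responses of six students to the three anchor items (1 = correct).
ANCHOR_RESPONSES = [
    [1, 0, 0], [1, 0, 0], [0, 1, 0],
    [1, 1, 0], [1, 0, 1], [1, 1, 0],
]

# Hypothetical responses of the same six students to two new items,
# one list per item.
NEW_ITEM_RESPONSES = [
    [1, 1, 0, 1, 1, 1],  # answered correctly by five of six students
    [0, 0, 0, 1, 0, 0],  # answered correctly by one of six students
]

easy_b, hard_b = calibrate_new_items(
    ANCHOR_RESPONSES, ANCHOR_DIFFICULTIES, NEW_ITEM_RESPONSES
)
print(f"easier item difficulty: {easy_b:.2f}")
print(f"harder item difficulty: {hard_b:.2f}")
```

Because abilities are computed only from items whose difficulties are fixed, the new difficulty estimates are expressed on the same scale as the anchors. That is the effect the manual describes for the Level A and Level C analyses, although production software estimates item and person parameters jointly rather than in two separate passes.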