MAKING VALID AND RELIABLE DECISIONS IN ... - CCSSO projects

More documents

Recommendations

Info

Reliability of Assessment Results: The degree to which the scores of every individual are consistent overrepeated applications of a measurement procedure and, hence, are dependable and repeatable; the degreeto which scores are free of errors of measurement.Sample: A sample is a selection of a specified number of entities called sampling units (test takers, items,etc.) from a larger specified set of possible entities, called the population.Sampling: The selection of a sample.School Report Cards: Reports that provide information about schools, as a whole, rather than aboutindividual students. For example, they may include information about the number of students who scoreat the proficient level on State tests, information about the number of teachers teaching in their areas ofprimary training, as well as information about attendance, retention, and discipline referrals. In somecases, the data on school report cards are used to make programmatic decisions about schools or todetermine whether they meet accreditation criteria, for example.Secure Forms of Assessments: Refers to the need to keep high-stakes tests safeguarded so that all studentshave equal exposure to the test materials and equal opportunities for success. If test security is violated,then some students can be placed at an unfair advantage or disadvantage. When this happens, thevalidity of high-stakes tests is violated.Stakeholders: Persons holding a vested interest in the outcomes of the assessment program. These likelyinclude parents, students, educators, and taxpayers.Standard Assessment: Refers to the administration of an assessment in the prescribed, standard way,without the use of accommodations or modifications.Standard Error of Measurement: The average amount that scores in a distribution differ from thecorresponding true scores for a specified group of test takers.Standards-Based Tests: A type of criterion-referenced test. They consist of items that reflect a preestablishedset of content standards. Results are then interpreted against a set of criteria orperformance standards.Technically Sound Accountability Systems: Systems that are defensible, reliable, and valid for thepurposes for which they are used, fair, and unbiased.Test Forms: Parallel or alternate versions of a test that are considered interchangeable, in that they measurethe same constructs, are intended for the same purposes, and are administered using the same directions.True Scores: In classical test theory, the average of the scores that would be earned by an individual on anunlimited number of perfectly parallel forms of the same test. In item response theory, the error-freevalue of test taker proficiency.Valid: Refers to the degree to which a test measures what it purports to measure. See Validity.Validity of a Test:(1) An overall evaluation of the degree to which accumulated evidence and theory support specificinterpretations of test scores(2) The extent to which a test measures what its authors or users claim it measures(3) The appropriateness of the inferences that can be made on the basis of test resultsValidity of the Accountability System: An accountability system can be said to have validity when theevidence is judged to be strong enough to support the inferences that:• The components of the system are aligned to the purposes, and are working in harmony tohelp the system accomplish those purposes; and• The system is accomplishing what was intended (and did not accomplish what was not intended.)104 Making Valid and Reliable Decisions in Determining AYP
AccountabilitySystems &ReportingCASCOMPREHENSIVEASSESSMENTSYSTEMSforTITLE IState Collaborative on Assessmentand Student Standards
Page 3 and 4:
CHIEFSTATESCHOOLofCOUNCILOFFICERSMA
Page 6 and 7:
INTER-COMPONENT PROBLEMS: MAL-ALIGN
Page 8 and 9:
percentage increases over time. Man
Page 10 and 11:
The information provided in this pa
Page 12 and 13:
AYP MeasuresRewards and SanctionsEn
Page 14 and 15:
making AYP will be included in the
Page 16 and 17:
performance of subgroups of student
Page 18 and 19:
An aligned system of academic conte
Page 20 and 21:
How to include small schools and sm
Page 22 and 23:
Further, the law, in Section 1116,
Page 24 and 25:
The extent to which LEAs and SEAs w
Page 26 and 27:
20 Making Valid and Reliable Decisi
Page 28 and 29:
the NCLB Act. This is not because t
Page 30 and 31:
whether of a student or of a school
Page 32 and 33:
The solution is not simple. It is a
Page 34 and 35:
How does the measurement correspond
Page 36 and 37:
The information that an accountabil
Page 38 and 39:
Overall GoalsAccountability systems
Page 40 and 41:
Ineffective schools should be ident
Page 42 and 43:
FIGURE 5. A SIMPLIFIED THEORY OF AC
Page 44 and 45:
that definitions of variables and p
Page 46 and 47:
no satisfactory way to compute a co
Page 48 and 49:
classification rules—across schoo
Page 50 and 51:
Occasionally, however, the errors c
Page 52 and 53:
more rudimentary skills. (Or, is th
Page 54 and 55:
Intent of the accountability modela
Page 56 and 57:
ReferencesAmerican Educational Rese
Page 58 and 59:
Hill, R. (2001). Issues related to
Page 60 and 61: 54 Making Valid and Reliable Decisi
Page 62 and 63: Finally, AYP has proven difficult s
Page 64 and 65: accountability model proposed. To t
Page 66 and 67: However, defining a school is not a
Page 68 and 69: FIGURE 9. FACTORS TO CONSIDER AS ST
Page 70 and 71: school population (Crane, 2002). As
Page 72 and 73: If one wants to establish a measure
Page 74 and 75: Using this approach, a State could
Page 76 and 77: strategy on reliability grounds 23
Page 78 and 79: SummaryA school’s results for the
Page 80 and 81: 567). Los Angeles, CA: Center for t
Page 82 and 83: Is the accountability system leadin
Page 84 and 85: not the very next year, with no cha
Page 86 and 87: Minimum “n”Minimum “n” is i
Page 88 and 89: District and State officials may fi
Page 90 and 91: 2. Rolling averages Decision: How c
Page 92 and 93: approach illuminates these issues,
Page 94 and 95: What are the unintended consequence
Page 96 and 97: Further guidance is provided in reg
Page 98 and 99: Accountability Requirement for Engl
Page 100 and 101: “Safe Harbor”Safe harbor” pro
Page 102 and 103: This review process provides “one
Page 104 and 105: will likely have significant impact
Page 106 and 107: Simulation studies to evaluate pote
Page 108 and 109: Assessment: Any systematic method o
show all

MAKING VALID AND RELIABLE DECISIONS IN ... - CCSSO projects

Create successful ePaper yourself

Delete template?

Save as template?