Technical Report - International Military Testing Association
Technical Report - International Military Testing Association
Technical Report - International Military Testing Association
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
-<br />
Preoentation to MTA Panel ConaLdering<br />
Non-EmpF;ical Velidatfan of Teet Items<br />
The fact that validity ia many-faced and difficult to determine<br />
does not lessen the importance of the problem. The test produced by<br />
all services here are of tremendous faport. (The results of theoe tears<br />
determine which enlisted personne? get additional pay for isuparior performence,<br />
and help determine which get promoted to the newt higher pay<br />
grade.) Thus, tha ccoaomic and leadership status and quality of the<br />
enlisted personnel of our Armed Forcer are effected etrongly by 0;1r te8tm.<br />
As e result, the morals and quality of our fighting forces ere directly<br />
Affected by the military testing program--our teeto.<br />
It ie our duty to aek ourselves, “How valid are our tertr? Do our<br />
teat identify the best informed and mst nkilled personnel in the euma<br />
compet~t~va occupa~el area? Do thcytruly FdentFfy the least lnfowed<br />
end lee8 rkilled?” Becauee these evaiuatfon testa do affect tha letdership<br />
and morale of the enlleted structure of our nation’s ff.ghttng forcec,<br />
we are interested fn determtnfng and fmprovfng the validity of our itema,<br />
and thus of our testa.<br />
The California Te8t Bureau has cotegorited the two bAsliC spproachrs<br />
to the detcrmlnAtion of validity according to the chr.;t on page 201.<br />
Thorndike end Hsgen (1961) list tha specific considerationa entering<br />
into evaluationa of test8 as (1) validity, (2) rellabFLlty, and (3) practfciality.<br />
Validity refers to the extent to which t test meaeuree what it<br />
18 intended to meascru. Reliability has to do with the accuracy and precision<br />
of a measurement procedure --conoirtency and reproducibility. Practicelity<br />
in concerned with Marty factora, ruch AP economy, convenience,<br />
and interpretabilfty--whfch determine vhether a teixt is practical for<br />
widespread use.<br />
A reading achievement tert requires people to select certain answer@<br />
to quertlonr. Penci.1 marka on enrwer rheeta determlqs each peraon’r<br />
more. Thio Gcore ir called their reading comprehenrlve #core, but the<br />
Icore fs NOT the compreheneion. It io A rfecord of 8ample~behavior.<br />
Any judgment regarding compreheorion ir an inference from thfc number<br />
of allegedly correct anowers. Its vs:fdity in not self-evident but rnuet<br />
be astsblfshed on the basis of BdequAta evidence.<br />
Thorndike and Hagen (1961) #Late, “A teet may be thought of ao cxraapondlng<br />
to some aspect of human bahavior In three sensea.” To identify<br />
these categories they use the terms (1) “reprerent,” (2) “predict,” and<br />
(3) “sfgnify.” Of these three type6 of evidences of validity, our topic<br />
ir prfnci+lly concerned with the fLrst type, KO wit: How well do our<br />
tests reprerment the standard8 (the practical derfred level of performance)<br />
.<br />
162<br />
_ _ .--