15.11.2012 Views

Technical Report - International Military Testing Association

Technical Report - International Military Testing Association

Technical Report - International Military Testing Association

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

. .<br />

‘-<br />

-.-<br />

.<br />

.I :<br />

.<br />

-..<br />

(. .-<br />

,.’<br />

.‘;. . . .<br />

:<br />

.._<br />

. .<br />

../<br />

./’<br />

:<br />

.<br />

-.<br />

:’<br />

. .<br />

. ‘,\<br />

.’ ‘\<br />

.<br />

,<br />

.<br />

.<br />

.<br />

._<br />

������ ������ �������� ���������� ⌧ ���� ������ � �� �� � ��<br />

��<br />

- ~- . ---. .*._ -..-___ _,- .__^.. ____.<br />

.<br />

.<br />

���<br />

� �<br />

�� � ���� � �������� � � �������<br />

.<br />

.I..-.-..--.---. .._... _ __; _.., ._ ._<br />

c<br />

������� ��<br />

The third general area fall8 into the Human Error category. ~x~plea of<br />

thir type of error will vary from the use of the wrong cnsver key (eaey<br />

to spot vicually) to the inclusion in the ramplt of answer cards from<br />

another examination, or lack of coordination portafning to different<br />

elements of the program.<br />

In short, diacrcpancier do occur in machine-produced item analysis.<br />

and it is generally agreed that human inspection of item analysis rt6UltiB<br />

ir ncca88ary.<br />

Hethodr<br />

At this point it might be useful to conaider scme methods upon which<br />

vieual fnrpection of machine itear � naly8i8 re8ulte may be baaed.<br />

The firrrt method considered might be called the u8e of an outside<br />

criteria. One such outside criterion vould concern the Item difficulty<br />

index or item p-value. The p-value criterion 8tAte8 that a p-value may<br />

not be less than 0.00, nor greater than +l.OO. That In, not leaa than<br />

zero percent of the rample may an8wer an item correctly, nor more than<br />

100 percent. Another outside criterion would 8tate that the item-teat<br />

correlation or r-value (vhich 18 generalized a8 the discrimination index)<br />

may not be lese than -1.00 nor more than +l.OO without further invertigat<br />

ion. Another method of check:ng itan analyrir might be called the<br />

intra-item conristency method. If the item r and p values are bared on<br />

the total sample, and if the correct and incorrect item alternative8<br />

r and y ertimatar are baaed on a high and low group (upper and lower<br />

27% for fnrtrncc) then the correct alternative is being mearur.-d indcpendently<br />

on two overlapping groups to yield similar although not identical<br />

rerults. Empirical studier based on a large number of item8 will yeild<br />

probability table8 for variourr degree8 of diacrepancice betvecn there<br />

tvo ret8 of r and p valuer.<br />

A more involved method of checking item analysis might be termed<br />

the Item history method. This method is rertricted to pre-tested or<br />

control items, and COn8it3t8 of observing change8 in the p and r value8<br />

of the right and wrong alternative8 of an item from one examination<br />

period to another. Although the p-value of an item is canputed independently<br />

of other items, it8 value may change due to change8 in the examiaee<br />

population or varying degrees of item coatpromise or obsolescence. The<br />

r-value vould also be expected to have a history showing a degree of<br />

fluctuation which would be further affected by its dependence on the<br />

action of other itanu in the examination.<br />

Another method of checking item analysis data ir based on trends in<br />

the item results. When the p and r value8 of the examination items arc<br />

viewed am a whole, trendo may become apparent. For instance, if one or<br />

more clusters of negatfve r-values are obrerved wfthout cmpcnsatfng<br />

group8 of high r-values and the rav score standard deviation io adequate<br />

on the ballsi of experience or a rtatistical model, these trend8 may<br />

indicate error8 in the item analysis resultr.<br />

. .<br />

.<br />

1 0 3<br />

I<br />

� �<br />

� �<br />

f

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!