AWEJ Volume 5, Number 4, 2014
Fair, Reliable, Valid: Developing a Grammar Test
Orr

grammatical terminology. Hence, an examinee may get an item wrong not because they are unaware of the grammatical structure, but because they are unaware of the terminology. This would have a damaging effect on the reliability of the test. Moreover, in an open-ended item the target grammatical structure would likely be used in the stem itself and easily duplicated, which would also impact upon the reliability of the test.

The fixed-response format was advantageous for this test. However, much rests on the writing of good items. "One of the greatest limitations of the selected-response formats derives from the creation of flawed selected-response items" (Downing, 2006b, p. 290). As a result, certain good codes of practice were adhered to when writing the items. Firstly, each item had four possible options. Rodriguez (2005) holds that three options are sufficient and that "using more options does little to improve item and test score statistics and typically results in implausible distracters" (p. 11). However, having four options does lower the probability of randomly guessing the correct answer. Moreover, Lord (1977) demonstrated that when reliability is estimated using the Spearman-Brown prophecy formula, it increases as the number of options increases. Having five options would lower the probability of guessing still further; however, as more options are added, it becomes more likely that the options will not be statistically functional (Downing, 2006b). As a result, four options were opted for. Secondly, the options were homogeneous in content. Hence, if the target item was an object pronoun, the three distracters were also object pronouns. The options were also similar in length in order to avoid giving a hint to test-wise examinees. Thirdly, extra care was taken to ensure that the items were independent of one another.
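The Spearman-Brown prophecy formula mentioned above can be illustrated with a short sketch. This is the general form of the formula (predicted reliability when a test's effective length is multiplied by some factor), not Lord's (1977) specific derivation for option counts; the numbers used are hypothetical.

```python
def spearman_brown(reliability: float, length_factor: float) -> float:
    """Spearman-Brown prophecy formula: predicted reliability of a test
    whose effective length is multiplied by length_factor."""
    return (length_factor * reliability) / (1 + (length_factor - 1) * reliability)

# Hypothetical base reliability of 0.70: doubling the effective test
# length raises the predicted reliability.
base = 0.70
print(round(spearman_brown(base, 2), 3))  # 1.4 / 1.7, i.e. 0.824
```

The formula is monotonically increasing in the length factor, which is the sense in which adding functional options (or items) is predicted to raise reliability.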
Thus, the answer to an item could not be found in the stem of another item. Finally, the items did not contain unnecessary or high-level language which could have introduced construct-irrelevant variance into the measurement. This would have affected the fairness of the test. "If some students have an advantage because of factors unrelated to what is being assessed, then the assessment is not fair" (McMillan, 2001, p. 55). As Abedi (2006, p. 383) states, "complex language in the content-based assessment for non-native speakers of English may reduce the validity of inferences drawn about students' content-based knowledge." While every effort was made to write good test items, it is possible that some of the items were flawed. A discussion of the items will follow in the results section. The full test is shown in Appendix 1.

The Outcome Space

The next building block, the outcome space, is concerned with scoring the responses of the examinees. Scoring the test correctly has great implications for the validity of the test. If the test scores are to be valid, a scoring key must be accurately applied to mirror the examinee item responses (Wilson, 2005). The term outcome space was introduced by Marton (1981) to describe a set of categories which are devised to grade examinee responses. As Wilson (2005) notes, "inherent in the idea of categorization is an understanding that the categories that define the outcome space are qualitatively distinct" (p. 63). This is the case even for fixed-response items, such as multiple-choice tests and Likert-style survey questions. With open-ended items it is pertinent to develop an outcome space which provides example item responses belonging to the different categories.
This requires researching the variety of responses that examinees could give. In fixed-response items such as multiple-choice questions and true-false survey questions, however, there are traditionally two categories: one for choosing the correct answer and another for choosing the wrong answer. Yet this is not the only possible way to grade multiple-choice items.

Arab World English Journal
ISSN: 2229-9327
www.awej.org
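The traditional two-category (dichotomous) scoring of multiple-choice items described above can be sketched as the application of a scoring key. The key and responses below are hypothetical, not items from the test in Appendix 1.

```python
def score_test(responses, key):
    """Dichotomous scoring: 1 point when a response matches the key, else 0.
    Unanswered items (None) fall into the 'wrong' category and score 0."""
    item_scores = [1 if r == k else 0 for r, k in zip(responses, key)]
    return item_scores, sum(item_scores)

# Hypothetical answer key for four four-option items, and one
# examinee's responses (the last item left unanswered):
key = ["b", "d", "a", "c"]
responses = ["b", "a", "a", None]
item_scores, total = score_test(responses, key)
print(item_scores, total)  # [1, 0, 1, 0] 2
```

Alternative schemes (e.g. partial credit for near-miss distracters) would replace the binary match with a graded outcome space, which is the possibility the paragraph above alludes to.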