October 2011 Volume 14 Number 4 - Educational Technology ...
from −3 to 3 for any four-option MC item, stands out for its ability to detect partial knowledge and misconceptions. They wrote as follows: “All positive scores represent some degree of partial information and all negative scores represent some degree of misinformation” (p. 35).
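The −3 to +3 range quoted above follows from a Coombs-style elimination-scoring rule. As an illustrative sketch only (not the authors' code; the function name and exact rule are our assumptions), one common formulation awards one point per distractor eliminated and deducts k−1 points if the correct answer is eliminated:

```python
# Illustrative sketch (assumed rule, not the study's implementation) of
# elimination-style scoring for a k-option MC item: +1 for each distractor
# eliminated, minus (k - 1) if the correct answer is eliminated.
# For a four-option item this yields the -3..+3 range described above.

def elimination_score(eliminated, correct, n_options=4):
    """Score one item given the set of options the examinee eliminated."""
    distractors_out = len(eliminated - {correct})  # partial information
    penalty = (n_options - 1) if correct in eliminated else 0  # misinformation
    return distractors_out - penalty

# A four-option item whose correct answer is "B":
print(elimination_score({"A", "C", "D"}, "B"))  # all distractors out: 3
print(elimination_score({"A"}, "B"))            # partial information: 1
print(elimination_score({"B"}, "B"))            # misinformation: -3
```

Under this rule every positive score reflects correctly eliminated distractors and every negative score reflects elimination of the key, matching the quoted interpretation.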
The study reported in this paper investigated the effect of CAAS using NRET as a scoring method for MC tests. The results were based on the analyses performed on the data collected for this study, and there are a number of limitations. First, this study used one group of students who sat for the final test under the NRET test instructions, and the NR scores were derived from those same responses. This approach of calculating different scores from one test administered under one common set of test instructions has been employed by past researchers such as Kansup (1973) and Holmes (2002). The observed score on any item is influenced by many variables, such as guessing, testing situation, content, scoring, administration, and examinee behavior. With this approach, many of these variables can be held constant and scoring errors can be minimized. However, further studies could be conducted with two groups of students of similar ability sitting the final test, one under NRET and the other under NR. Then not only could the reliability of NRET be compared with that of NR, but the significance of any difference in reliability could also be tested. Second, NRET should also be compared with ET. Third, mathematics was the subject matter, and the study involved only Form Two students in Malaysia. Further studies conducted across different subjects and age groups could help to clarify the generalizability of these findings. Fourth, the scoring methods should also be compared using different sample distributions, such as non-normal distributions with different values of skewness and kurtosis. Through such studies, clearer differences between the scoring methods may emerge. Last, but not least, this study was conducted using CAAS. Technology in assessment often has unforeseen, negative, and unintended social consequences. Thus, further study is needed to examine the impact of these issues on examinees' behavior.
Conclusion
The results showed the feasibility of adopting NRET to replace the conventional NR. First, the analyses performed using the IRT models, and of the performance of the low-ability students on the three most difficult items, showed that guessing was minimal under the NRET method. These findings are consistent with those of Swineford and Miller (1953) and Traub and Hambleton (1972), in which guessing was minimal under the penalty method. Second, the analyses of the NR and NRET scoring taxonomies, done for the whole test, for the items answered correctly, and for the items answered incorrectly, showed that NRET could detect full knowledge, partial knowledge, absence of knowledge, partial misconception, and full misconception. These results are similar to those of Bradbard and Green (1986), Bradbard et al. (2004), and Chang et al. (2007), in that there is evidence that guessing is reduced and partial knowledge can be detected. Third, the analyses performed on different subtests of the final test showed that the NRET scores were consistently more reliable than the NR scores. This finding is consistent with the suggestion by Ma (2004) that when test items are scored dichotomously, potentially useful information about an individual's level of proficiency contained in the other response options is lost, and the precision of measurement is thereby reduced. The results of this study are similar to those of studies comparing ET with NR (Collet, 1971; Hakstian & Kansup, 1975; Traub & Fisher, 1977). Although the results of NRET and ET were comparable, NRET has an added advantage over ET. According to Jaradat and Tollefson (1988), ET instructions are confusing despite prior practice: it is contradictory for students to be taught to solve for the correct answer but assessed on their ability to identify incorrect answers. The test instructions of NRET, on the other hand, resemble one of the most commonly used test-taking strategies, in which students first eliminate the obviously incorrect options before choosing the answer.
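The five-level taxonomy that NRET can detect (full knowledge through full misconception) can be read as a partition of the item-score range. The exact cutoffs are not given in this excerpt, so the following is only one plausible mapping, assuming item scores in the −3 to +3 range discussed earlier:

```python
# Hypothetical mapping (cutoffs assumed, not taken from the paper) from an
# item score in the -3..+3 range to the five knowledge levels that the
# NRET scoring taxonomy distinguishes.

def knowledge_level(score):
    """Classify a single NRET item score into a knowledge level."""
    if score == 3:
        return "full knowledge"        # all distractors eliminated
    if score > 0:
        return "partial knowledge"     # some distractors eliminated
    if score == 0:
        return "absence of knowledge"  # nothing eliminated
    if score > -3:
        return "partial misconception" # key eliminated, some distractors too
    return "full misconception"        # only the key eliminated
```

Under such a scheme, positive scores map to degrees of partial information and negative scores to degrees of misinformation, consistent with the interpretation quoted earlier in the paper.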
NRET delivered through CAAS allows administrators or teachers to control the students' responses. There may be several reasons for students not to follow the NRET test instructions. First, they may not have understood the instructions. Second, they may have forgotten them and lapsed into the traditional response mode. Third, they may have chosen not to comply with the NRET testing mode. Regardless of the reason for noncompliance, CAAS can ensure that students follow the required instructions. In addition, CAAS allows speedy calculation of item scores and total scores, and it creates an opportunity to provide feedback on performance after each item, which can help students understand the reward or penalty associated with each response strategy. Thus, CAAS using NRET has the potential to resolve the problems of guessing and of failing to detect partial knowledge and misconceptions that are common with the NR method.
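The enforcement role described above can be sketched as follows. This is a minimal sketch of an assumed design, not the study's actual CAAS; the function and callback names are ours. The key idea is that the interface requires the elimination step first and only offers the remaining options for the final answer, so an examinee cannot lapse into the traditional single-pick mode, and feedback is given immediately after each item:

```python
# Minimal sketch (assumed design, not the study's CAAS) of enforcing the
# NRET response mode: eliminate first, then answer from what remains,
# with immediate per-item feedback.

def administer_item(options, correct, eliminate, choose):
    """Run one NRET item; eliminate/choose are callbacks for student input."""
    eliminated = set(eliminate(list(options)))
    remaining = [o for o in options if o not in eliminated]
    # Enforcement: the final answer can only be chosen from the options
    # left after elimination, guaranteeing compliance with NRET mode.
    answer = choose(remaining) if remaining else None
    # Immediate feedback after the item, as the text describes.
    feedback = "correct" if answer == correct else "incorrect"
    return answer, feedback

# A student eliminates "A" and "D", then picks from the remaining options:
answer, feedback = administer_item(
    ["A", "B", "C", "D"], "B",
    eliminate=lambda opts: ["A", "D"],
    choose=lambda remaining: remaining[0],
)
print(answer, feedback)  # B correct
```

In a real system the callbacks would be replaced by interface events, but the control flow — elimination gating the answer, then per-item feedback — is the point being illustrated.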