April 2012 Volume 15 Number 2 - Educational Technology & Society


Experiment 2 and results

In this experiment, the performance of nine adaptive testing algorithms (Diagnosys, Diagnosys+MDI, Diagnosys+PI, OT, OT+MDI, OT+PI, IRS, IRS+MDI, and IRS+PI) was evaluated on the same data set as in Experiment 1. The use of test items and the prediction accuracy were obtained by five-fold cross-validation, and the results are presented in Table 6. For example, at a threshold of 0.01, the prediction accuracies of Diagnosys, Diagnosys+MDI, and Diagnosys+PI were 0.956, 0.996, and 0.992, respectively, and the corresponding numbers of test items used were 32.64, 34.70, and 34.53.

A Wilcoxon signed-ranks test was used to compare the performance of the nine models (Table 7). In Table 7, “Diag+MDI to Diag” indicates a comparison between Diagnosys with MDI and the original Diagnosys. Because the thresholds differ across algorithms, the performance differences among Diag+MDI, OT+MDI, and IRS+MDI were not tested. The results are as follows.

1. For Diagnosys, OT, and IRS alike, the Wilcoxon signed-ranks tests in Table 7 show that the variants with the most-difficult-item method (MDI) and with the prerequisite method (PI) both achieve higher prediction accuracy than the original adaptive testing algorithm (z = −3.422 to −3.409, p = 0.001). In addition, Diagnosys+MDI outperforms Diagnosys+PI (z = −3.415, p = 0.001), OT+MDI outperforms OT+PI (z = −3.066, p = 0.002), and IRS+MDI outperforms IRS+PI (z = −3.482, p < 0.001). Overall, adaptive testing algorithms with the MDI method perform better than those with the PI method.

2. In Table 6, at the same (or almost the same) prediction accuracy, the proposed KSAT+MDI and KSAT+PI algorithms use fewer test items than the original KSAT algorithms.
For example, when the prediction accuracies of Diagnosys, Diagnosys+MDI, and Diagnosys+PI are 0.945, 0.943, and 0.945, respectively, the corresponding numbers of test items used are 31.85, 24.23, and 30.81. When the prediction accuracies of OT, OT+MDI, and OT+PI are all 0.991, the corresponding numbers of test items used are 27.27, 26.25, and 26.18. When the prediction accuracies of IRS, IRS+MDI, and IRS+PI are all 0.991, the corresponding numbers of test items used are 27.75, 26.38, and 27.47 (see the grayed cells).

3. In Table 6, OT+MDI and OT+PI use fewer test items than Diagnosys+MDI, Diagnosys+PI, IRS+MDI, and IRS+PI at the same prediction accuracy. The only exception is at a prediction accuracy of 0.997, where OT+PI uses 31.46 test items and IRS+MDI uses 30.82 (see the bold cells).

Table 6. The prediction accuracy and use of test items (in brackets) of Diagnosys, OT, and IRS with MDI or PI

Diag threshold   Diag            Diag+MDI        Diag+PI
0.01             0.956 (32.64)   0.996 (34.70)   0.992 (34.53)
0.015            0.945 (31.85)   0.993 (34.50)   0.992 (34.45)
0.02             0.935 (31.02)   0.990 (34.25)   0.985 (34.05)
0.025            0.927 (30.22)   0.986 (33.92)   0.978 (33.58)
0.03             0.918 (29.31)   0.983 (33.50)   0.969 (32.93)
0.035            0.920 (29.50)   0.982 (33.43)   0.966 (32.62)
0.04             0.912 (28.14)   0.975 (32.43)   0.945 (30.81)
0.045            0.908 (27.20)   0.972 (31.83)   0.928 (29.31)
0.05             0.896 (23.55)   0.958 (28.52)   0.903 (24.97)
0.055            0.893 (22.40)   0.954 (27.36)   0.899 (23.76)
0.06             0.883 (19.27)   0.943 (24.23)   0.886 (20.39)
0.065            0.877 (17.56)   0.938 (22.55)   0.880 (18.71)
0.07             0.873 (16.37)   0.934 (21.40)   0.876 (17.52)
0.075            0.863 (13.57)   0.924 (18.63)   0.866 (14.73)
0.08             0.861 (12.70)   0.922 (17.76)   0.864 (13.85)

OT threshold     OT              OT+MDI          OT+PI
0.01             0.995 (29.98)   0.998 (31.51)   0.997 (31.46)
0.015            0.991 (27.27)   0.996 (29.18)   0.995 (29.16)
0.02             0.985 (24.41)   0.991 (26.25)   0.991 (26.18)
0.025            0.979 (22.14)   0.987 (24.07)   0.986 (23.94)
0.03             0.972 (19.42)   0.981 (21.59)   0.980 (21.42)
0.035            0.966 (17.06)   0.976 (19.13)   0.975 (19.01)
0.04             0.960 (15.59)   0.972 (17.68)   0.969 (17.43)
0.045            0.955 (14.50)   0.967 (16.54)   0.966 (16.32)
0.05             0.949 (13.30)   0.962 (15.43)   0.962 (15.27)
0.055            0.943 (11.95)   0.958 (14.23)   0.956 (14.06)
0.06             0.938 (11.07)   0.954 (13.31)   0.954 (13.29)
0.065            0.933 (9.99)    0.951 (12.45)   0.950 (12.36)
0.07             0.929 (9.36)    0.947 (11.79)   0.947 (11.70)
0.075            0.925 (8.73)    0.944 (11.10)   0.943 (11.03)
0.08             0.920 (8.18)    0.941 (10.67)   0.939 (10.46)

IRS threshold    IRS             IRS+MDI         IRS+PI
0.58             0.991 (27.75)   0.997 (30.82)   0.996 (30.68)
0.56             0.987 (26.44)   0.996 (30.16)   0.995 (30.03)
0.54             0.985 (25.20)   0.995 (28.89)   0.993 (28.73)
0.52             0.981 (23.50)   0.992 (27.59)   0.991 (27.47)
0.50             0.977 (21.82)   0.991 (26.38)   0.989 (26.21)
0.48             0.974 (20.79)   0.988 (25.17)   0.987 (25.04)
0.46             0.969 (19.47)   0.985 (23.85)   0.983 (23.73)
0.44             0.967 (18.52)   0.983 (22.86)   0.982 (22.72)
0.42             0.960 (16.47)   0.978 (20.62)   0.976 (20.41)
0.40             0.956 (15.17)   0.974 (19.29)   0.973 (19.21)
0.38             0.949 (13.54)   0.968 (17.38)   0.967 (17.20)
0.36             0.945 (12.66)   0.964 (16.42)   0.963 (16.23)
0.34             0.936 (10.99)   0.959 (14.52)   0.957 (14.34)
0.32             0.930 (9.97)    0.953 (13.34)   0.949 (13.09)
0.30             0.925 (9.04)    0.949 (12.21)   0.945 (11.92)

Note: Diag refers to Diagnosys.

Table 7. The results of Wilcoxon signed-ranks tests among the nine models

Comparison               Z value
Diag+MDI to Diag         −3.422*
Diag+PI to Diag          −3.422*
OT+MDI to OT             −3.409*
OT+PI to OT              −3.410*
IRS+MDI to IRS           −3.411*
IRS+PI to IRS            −3.409*
Diag+MDI to Diag+PI      −3.415*
OT+MDI to OT+PI          −3.066*
IRS+MDI to IRS+PI        −3.482*

Note: * indicates statistical significance at the 0.01 level.

Evaluation of the KSAT system

To evaluate the performance of the KSAT system, an experiment was conducted to measure the use of test items and the prediction accuracy when the computerized adaptive test is administered. One hundred and twenty-three students from fifth-grade classes of Taiwanese elementary schools participated. The procedure was as follows. First, all students took a knowledge-structure-based adaptive test (KSAT) based on the OT algorithm (threshold = 0.05). The test covered the triangle unit, as mentioned above. Then, once a student had completed the adaptive portion of the test, the system administered the remaining items of the 35-item test so that the prediction accuracy could be computed. Finally, the use of test items and the prediction accuracy were computed; they were 11.42 items and 93%, respectively.
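
The two quantities reported above can be illustrated with a minimal sketch for a single examinee. It assumes, because this passage does not define the measure, that prediction accuracy is the share of all items whose response (observed for administered items, inferred for skipped ones) matches the examinee's actual response. The function name, the argument names, and the six-item example are hypothetical and are not taken from the KSAT system.

```python
from typing import Dict, List, Set, Tuple


def evaluate_adaptive_run(
    actual: List[int],
    administered: Set[int],
    predicted: Dict[int, int],
) -> Tuple[int, float]:
    """Return (items_used, prediction_accuracy) for one examinee.

    actual       -- 0/1 responses to every item, known once the
                    remaining items have also been administered
    administered -- indices of the items given in the adaptive portion
    predicted    -- inferred 0/1 response for each skipped item
    """
    n_items = len(actual)
    matches = 0
    for i in range(n_items):
        # Assumption: an administered item is observed directly, so it
        # always matches; a skipped item is judged by its inferred response.
        guess = actual[i] if i in administered else predicted[i]
        matches += int(guess == actual[i])
    return len(administered), matches / n_items


# Hypothetical six-item test: three items administered, three inferred.
used, accuracy = evaluate_adaptive_run(
    actual=[1, 1, 0, 1, 0, 0],
    administered={0, 2, 4},
    predicted={1: 1, 3: 0, 5: 0},
)
print(used, round(accuracy, 3))
```

Averaging both values over all examinees would give figures of the kind reported here. If accuracy were instead defined over the skipped items only, the denominator would change accordingly.
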
The results show that the KSAT system can reduce the use of test items while still diagnosing the cognitive status of examinees precisely. These results are consistent with those of the OT case in the previous simulation experiment (use of test items: 13; prediction accuracy: 95%). This indicates that the two interfaces, paper-based and computer-based, do not affect examinees’ performance in adaptive tests.

To explore the performance of OT+MDI and OT+PI in this system, the same data set was used to simulate the OT+MDI and OT+PI processes, which was possible because the responses to all 35 items were available. In the simulation, the use of test items and the prediction accuracy were (11.3, 94%) for OT+MDI and (11.2, 94%) for OT+PI. This implies that OT+MDI and OT+PI perform better than the original OT, which is similar to the results of Experiment 2.

Discussions and conclusions

In this paper, traditional item ordering theories (OT and IRS), which have been used to develop instruction sequences or learning progress indices, were applied to develop computerized adaptive testing processes. The performance of the adaptive testing algorithms based on the item structures constructed by OT, IRS, Diagnosys, and domain experts was evaluated. The experimental results yield three findings. First, the OT-based and IRS-based KSAT algorithms provide better prediction accuracy while using fewer test items. Second, the OT-based KSAT algorithm is less sensitive to the training sample size. Third, the estimation error of the OT method is smaller than that of the other methods, which means that the diagnostic results estimated by the OT-based KSAT are more stable. From the theoretic view of OT,