13.07.2015 Views

Sweating the Small Stuff: Does data cleaning and testing ... - Frontiers

Sweating the Small Stuff: Does data cleaning and testing ... - Frontiers

Sweating the Small Stuff: Does data cleaning and testing ... - Frontiers

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Kasper <strong>and</strong> ÜnlüAssumptions of factor analytic approachesTable 1 | Theoretical values of skewness <strong>and</strong> kurtosis for z i (fourfactors).Table 2 | Theoretical values of skewness <strong>and</strong> kurtosis for z i (eightfactors).z iLatent variablez iLatent variableNormal aSlightlyskewed bStronglyskewed cNormal aSlightlyskewed bStronglyskewed c√β1i β 2i√β1i β 2i√β1i β 2i√β1i β 2i√β1i β 2i√β1i β 2iz 1 0 3 −0.060 3 −0.599 4.202z 2 0 3 −0.049 3 −0.488 3.914z 3 0 3 −0.038 3 −0.377 3.649z 4 0 3 −0.027 3 −0.272 3.420z 5 0 3 −0.018 3 −0.179 3.240z 6 0 3 −0.010 3 −0.102 3.114z 7 0 3 −0.049 3 −0.488 3.914z 8 0 3 −0.038 3 −0.377 3.649z 9 0 3 −0.027 3 −0.272 3.420z 10 0 3 −0.018 3 −0.179 3.240z 11 0 3 −0.010 3 −0.102 3.114z 12 0 3 −0.005 3 −0.047 3.041z 13 0 3 −0.027 3 −0.272 3.420z 14 0 3 −0.027 3 −0.272 3.420z 15 0 3 −0.018 3 −0.179 3.240z 16 0 3 −0.010 3 −0.102 3.114z 17 0 3 −0.010 3 −0.102 3.114z 18 0 3 −0.005 3 −0.047 3.041z 19 0 3 −0.027 3 −0.272 3.420z 20 0 3 −0.018 3 −0.179 3.240z 21 0 3 −0.018 3 −0.179 3.240z 22 0 3 −0.010 3 −0.102 3.114z 23 0 3 −0.005 3 −0.047 3.041z 24 0 3 −0.005 3 −0.047 3.041aµ 1m = 0, µ 2m = 1, µ 3m = 0, <strong>and</strong> µ 4m = 3.bµ 1m = 0, µ 2m = 1, µ 3m = −0.20, <strong>and</strong> µ 4m = 3.cµ 1m = 0, µ 2m = 1, µ 3m = −2, <strong>and</strong> µ 4m = 9.z 1 0 3 −0.060 3 −0.599 4.202z 2 0 3 −0.049 3 −0.488 3.914z 3 0 3 −0.038 3 −0.377 3.649z 4 0 3 −0.049 3 −0.488 3.914z 5 0 3 −0.049 3 −0.488 3.914z 6 0 3 −0.038 3 −0.377 3.649z 7 0 3 −0.049 3 −0.488 3.914z 8 0 3 −0.038 3 −0.377 3.649z 9 0 3 −0.027 3 −0.272 3.420z 10 0 3 −0.038 3 −0.377 3.649z 11 0 3 −0.038 3 −0.377 3.649z 12 0 3 −0.038 3 −0.377 3.649z 13 0 3 −0.038 3 −0.377 3.649z 14 0 3 −0.027 3 −0.272 3.420z 15 0 3 −0.027 3 −0.272 3.420z 16 0 3 −0.027 3 −0.272 3.420z 17 0 3 −0.027 3 −0.272 3.420z 18 0 3 −0.018 3 −0.179 3.240z 19 0 3 −0.018 3 −0.179 3.240z 20 0 3 −0.010 3 −0.102 3.114z 21 0 3 −0.010 3 −0.102 3.114z 22 0 3 −0.010 3 −0.102 3.114z 23 0 3 −0.010 3 −0.102 3.114z 24 0 3 −0.005 3 −0.047 3.041aµ 1m = 0, µ 2m = 1, µ 3m = 0, <strong>and</strong> µ 4m = 3.bµ 1m = 0, µ 2m = 1, µ 3m = −0.20, <strong>and</strong> µ 4m = 3.cµ 1m = 0, µ 2m = 1, µ 3m = −2, <strong>and</strong> µ 4m = 9.are 12 conditions <strong>and</strong> for every condition 100 <strong>data</strong> sets weresimulated.Each of <strong>the</strong> generated 1,200 <strong>data</strong> sets were analyzed using allof <strong>the</strong> models of principal component analysis, exploratory factoranalysis (ML estimation), <strong>and</strong> principal axis analysis altoge<strong>the</strong>rwith a varimax rotation (Kaiser, 1958). 8 For any <strong>data</strong> set undereach model, <strong>the</strong> factors, <strong>and</strong> hence, <strong>the</strong> numbers of retained factorswere determined by applying <strong>the</strong> following three extractioncriteria or approaches: <strong>the</strong> Kaiser-Guttman criterion, <strong>the</strong> scree test,<strong>and</strong> <strong>the</strong> parallel analysis procedure. 9Table 3 | Summary of <strong>the</strong> simulation design <strong>and</strong> number of generated<strong>data</strong> sets.SamplesizeNumber offactorsLatent variable distributionNormal Slightly skewed Strongly skewed200 4 100 100 1008 100 100 100600 4 100 100 1008 100 100 1008 Because of rotational indeterminacy in <strong>the</strong> factor analysis approach (e.g., Maraun,1996), <strong>the</strong> results are as much an evaluation of varimax rotation as <strong>the</strong>y are anevaluation of <strong>the</strong> manipulated variables in <strong>the</strong> study. For more information, seeSection 7.9 The Kaiser-Guttman criterion is a poor way to determine <strong>the</strong> number of factors.However, due to <strong>the</strong> fact that none of <strong>the</strong> existing studies has investigated <strong>the</strong> estimationaccuracy of this criterion when <strong>the</strong> latent ability distribution is skewed, we havedecided to include <strong>the</strong> Kaiser-Guttman criterion in our study. This criterion may4.3. EVALUATION CRITERIAThe criteria for evaluating <strong>the</strong> performance of <strong>the</strong> classical factormodels are <strong>the</strong> number of extracted factors (as compared totrue dimensionality), <strong>the</strong> skewness of <strong>the</strong> estimated latent abilityalso be viewed as a “worst performing” baseline criterion, which o<strong>the</strong>r extractionmethods need to outperform, as best as possible.<strong>Frontiers</strong> in Psychology | Quantitative Psychology <strong>and</strong> Measurement March 2013 | Volume 4 | Article 109 | 129

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!