13.07.2015 Views

Sweating the Small Stuff: Does data cleaning and testing ... - Frontiers

Sweating the Small Stuff: Does data cleaning and testing ... - Frontiers

Sweating the Small Stuff: Does data cleaning and testing ... - Frontiers

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Kasper <strong>and</strong> ÜnlüAssumptions of factor analytic approachesaccuracies of <strong>the</strong> factor models can be compared. This is known as“rotational indeterminacy”in <strong>the</strong> factor analysis approach (e.g., seeMaraun, 1996). For more information, <strong>the</strong> reader is also referredto Footnote 8 <strong>and</strong> Section 7.Second, <strong>the</strong> criterion used to determine <strong>the</strong> number of factorsextracted from <strong>the</strong> <strong>data</strong> must be distinguished as well. In practice,not all k or p but instead ˆk < k or p factors with <strong>the</strong> ˆk largest eigenvaluesare extracted. Various procedures are available to determineˆk. Commonly used criteria in educational research are <strong>the</strong> Kaiser-Guttman criterion (Guttman, 1954; Kaiser <strong>and</strong> Dickman, 1959),<strong>the</strong> scree test (Cattell, 1966), <strong>and</strong> <strong>the</strong> method of parallel analysis(Horn, 1965).3. ASSUMPTIONS ASSOCIATED WITH THE CLASSICALFACTOR MODELSThe three models described in <strong>the</strong> previous section in particularassume interval-scaled <strong>data</strong> <strong>and</strong> full rank covariance or correlationmatrices for <strong>the</strong> manifest variables. Typically in <strong>the</strong> exploratoryfactor analysis model, <strong>the</strong> manifest variables y or <strong>the</strong> st<strong>and</strong>ardizedvariables z are assumed to be normally distributed. For <strong>the</strong> PCA<strong>and</strong> PAA models, we additionally want to presuppose – for computationalreasons – that <strong>the</strong> variances of <strong>the</strong> manifest variables aresubstantially large. The EFA <strong>and</strong> PAA models assume uncorrelatedfactor terms <strong>and</strong> uncorrelated error terms (which can be relaxedin <strong>the</strong> framework of structural equation models; e.g., Jöreskog,1966), uncorrelatedness between <strong>the</strong> error <strong>and</strong> latent ability variables,<strong>and</strong> expected values of zero for <strong>the</strong> errors as well as latentability variables.The question now arises whe<strong>the</strong>r <strong>the</strong> assumptions are criticalwhen it comes to educational tests or survey <strong>data</strong>? 43.1. CRITERION-REFERENCED TESTS AND PISA QUESTIONNAIREDATAFrom <strong>the</strong> perspective of applying <strong>the</strong>se models to <strong>data</strong> of criterionreferencedtests, <strong>the</strong> last three of <strong>the</strong> above mentioned assumptionsare less problematic. For a criterion-referenced test, it is importantthat all items of <strong>the</strong> test are valid for <strong>the</strong> investigated content. Assuch, <strong>the</strong> usual way of excluding items from <strong>the</strong> analysis when<strong>the</strong> covariance or correlation matrices are not of full rank does notwork for criterion-referenced tests, because this can reduce contentvalidity of a test. A similar argument applies to <strong>the</strong> assumption of4 Note that, at <strong>the</strong> latent level, <strong>the</strong>re is no formal assumption that <strong>the</strong> latent factors(what we synonymously also want to call “person abilities”) are normally distributed.At <strong>the</strong> manifest level, maximum likelihood estimation (EFA) assumes that <strong>the</strong>observed variables are normal; ULS <strong>and</strong> GLS (EFA), PAA (EFA), <strong>and</strong> PCA do not.The latter two methods only require a non-singular correlation matrix (e.g., seeMacCallum, 2009). However, in applications, for example in empirical educationalresearch, one often assumes that <strong>the</strong> latent ability values follow a normal distributionin <strong>the</strong> population of reference. Moreover, Mattson (1997)’s method describedin Section 4.1 states that <strong>the</strong>re is a connection between <strong>the</strong> manifest <strong>and</strong> latent distributionsin factor analysis. Hence <strong>the</strong> question is what implications one can expect ifthis “implicit assumption” may not be justified. Related to <strong>the</strong> study <strong>and</strong> evaluationof <strong>the</strong> underlying assumptions associated with <strong>the</strong>se classical factor models, thispaper, amongst o<strong>the</strong>rs, shows that <strong>the</strong> <strong>data</strong> re-expression method PCA performssurprisingly well if compared to <strong>the</strong> results obtained based on <strong>the</strong> latent variableapproaches EFA <strong>and</strong> PAA. Moreover, ML <strong>and</strong> PAA estimation procedures for EFAare compared with one ano<strong>the</strong>r, for different degrees of violating <strong>the</strong> normalityassumption at <strong>the</strong> manifest or latent levels.substantially large variances of <strong>the</strong> manifest variables. As Klauer(1987) suggested <strong>and</strong> Sturzbecher et al. (2008) have shown for<strong>the</strong> driving license test in Germany, <strong>the</strong> variances of <strong>the</strong> manifestvariables of criterion-referenced tests are seldom high, <strong>and</strong> in general<strong>the</strong> <strong>data</strong> obtained from those tests may lead to extracting toofew dimensions. However, for <strong>the</strong> analysis of criterion-referencedtests, <strong>the</strong> assumption of interval-scaled <strong>data</strong> <strong>and</strong> <strong>the</strong> assumptionof normality of <strong>the</strong> manifest test <strong>and</strong> latent ability scores are evenmore problematic. Data from criterion-referenced tests are rarelyinterval-scaled – instead <strong>the</strong> items of criterion-referenced tests areoften dichotomous (Klauer,1987). For criterion-referenced tests,itis plausible to have skewed (non-symmetric) test <strong>and</strong> ability scoredistributions, because criterion-referenced tests are constructedto assess whe<strong>the</strong>r a desired <strong>and</strong> excessive teaching goal has beenachieved or not. In o<strong>the</strong>r words, <strong>the</strong> tested population is explicitly<strong>and</strong> intensively trained regarding <strong>the</strong> evaluated ability, <strong>and</strong> so it isra<strong>the</strong>r likely that most people will have high values on <strong>the</strong> manifesttest score as well as latent ability (e.g., see <strong>the</strong> German drivinglicense test; Sturzbecher et al., 2008).The assumption of interval-scaled <strong>data</strong> <strong>and</strong> <strong>the</strong> normalityassumption for <strong>the</strong> manifest test <strong>and</strong> latent ability scores mayalso be crucial for <strong>the</strong> scaling of cognitive <strong>data</strong> in PISA (OECD,2012; Chap. 9 <strong>the</strong>rein). In PISA, <strong>the</strong> generated students’ scores areplausible values. These are r<strong>and</strong>omly drawn realizations basicallyfrom a multivariate normal distribution (as <strong>the</strong> prior) of latentability values (person ability is modeled as a r<strong>and</strong>om effect, alatent variable), in correspondence to a fitted item response <strong>the</strong>orymodel (Adams et al., 1997) giving <strong>the</strong> estimated parameters of <strong>the</strong>normal distribution. The mean of <strong>the</strong> multivariate normal distributionis expressed as linear regression of various direct manifestregressors (e.g., administered test booklet, gender) <strong>and</strong> indirect“latent” or complex regressors obtained by aggregating over manifest<strong>and</strong> latent context or background variables (e.g., indicatorsfor economic, social, <strong>and</strong> cultural status) in a principal componentanalysis. The component scores used in <strong>the</strong> scaling model as <strong>the</strong>indirect “latent” regressors are extracted, in <strong>the</strong> purely computationalsense, to account for approximately 95% of <strong>the</strong> total variancein all <strong>the</strong> original variables. The background variables may be categoricalor dummy-coded <strong>and</strong> may not be measured at an intervalscale (nor be normally distributed). So as we said before, if onecan assume that <strong>the</strong> distribution of latent background informationrevealed through questionnaires is skewed, we can expect that<strong>the</strong> empirical distribution of background information computedby principal component analysis is likely to be biased compared to<strong>the</strong> true distribution. This is suggested by <strong>the</strong> results of our simulationstudy. The bias of <strong>the</strong> empirical distribution in turn mayresult in biasing <strong>the</strong> regression expression for <strong>the</strong> mean. Therefore,special caution has to be taken regarding possible violationsof those assumptions, <strong>and</strong> a minimum of related sensitivity analysesare required <strong>and</strong> necessary in order to control for <strong>the</strong>ir potentialeffects.3.2. HISTORICAL REMARKSThe primary aim is to review results of previous studies focusingon <strong>the</strong> impact of violations of model assumptions. As to ourknowledge, such studies did not systematically vary <strong>the</strong> distributionsof <strong>the</strong> factors (in <strong>the</strong> case of continuous <strong>data</strong> as well) <strong>and</strong><strong>Frontiers</strong> in Psychology | Quantitative Psychology <strong>and</strong> Measurement March 2013 | Volume 4 | Article 109 | 125

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!