Sweating the Small Stuff: Does data cleaning and testing ... - Frontiers

More documents

Recommendations

Info

Nimon et al.The assumption of reliable datathat consider variable relationships and identified techniques thatapplied researchers can use to fully understand the impact of measurementerror on their data. We synthesized RG literature andproposed a reporting methodology that can improve the qualityof future RG studies as well as substantive studies that they mayinform.In a perfect world, data would be perfectly reliable andresearchers would not have worry to what degree their analyseswere subject to nuisance correlations that exist in sample data.However, in the real-world, measurement error exists, can besystematic, and is unavoidably coupled with sampling error. Assuch, researchers must be aware of the full impact that measurementerror has on their results and do all that they can a priorito select instruments that are likely to yield appropriate levels ofreliability for their given sample. If considering these factors, wecan better inform our collective research and move the fields ofeducation and psychology forward by meticulously reporting theeffects of reliability in our data.REFERENCESAllen, M. J., and Yen, W. M. (1979).Introduction to Measurement Theory.Monterey, CA: Brooks/Cole.American Educational Research Association.(2006). Standards for reportingon empirical social scienceresearch in AERA publications.Educ. Res. 35, 33–40.American Psychological Association.(2009a). Publication Manual ofthe American Psychological Association:Supplemental Material: WritingClearly and Concisely, Chap.3. Available at: http://supp.apa.org/style/pubman-ch03.00.pdfAmerican Psychological Association.(2009b). Publication Manual of theAmerican Psychological Association,6th Edn. Washington, DC: AmericanPsychological Association.Anastasi, A., and Urbina, S. (1997).Psychological Testing, 7th Edn.Upper Saddle River, NJ: PrenticeHall.Bandura, A. (2006). “Guide for constructingself-efficacy scales,” in Self-Efficacy Beliefs of Adolescents, Vol. 5,eds F. Pajares and T. Urdan (Greenwich,CT: Information Age Publishing),307–337.Barrett, P. (2001). Assessing theReliability of Rating Data. Availableat: http://www.pbarrett.net/presentations/rater.pdfBeck, A. T. (1990). Beck Anxiety Inventory(BAI). San Antonio, TX: ThePsychological Corporation.Beck, A. T. (1996). Beck DepressionInventory-II (BDI-II). San Antonio,TX: The Psychological Corporation.Benefield, D. (2011). SAT Scores byState, 2011. Harrisburg, PA: CommonWealth Foundation. Availableat: http://www.commonwealthfoundation.org/policyblog/detail/sat-scores-by-state-2011Charles, E. P. (2005). The correctionfor attenuation due to measurementerror: clarifying concepts, and creatingconfidence sets. Psychol. Methods10, 206–226.Cohen, J. (1960). A coefficient of agreementfor nominal scales. Educ. Psychol.Meas. 20, 37–46.College Board. (2011). Archived SATData and Reports. Available at:http://professionals.collegeboard.com/data-reports-research/sat/archivedCrocker, L., and Algina, J. (1986).Introduction to Classical and ModernTest Theory. Belmont, CA:Wadsworth.Cronbach, L. J. (1951). Coefficient alphaand the internal structure of tests.Psychometrika 16, 297–334.Cronbach, L. J. (2004). My currentthoughts on coefficient alpha andsuccessor procedures. Educ. Psychol.Meas. 64, 391–418. (editorial assistanceby Shavelson, R. J.).Dillon, W. R., and Bearden, W. (2001).Writing the survey question to capturethe concept. J. Consum. Psychol.10, 67–69.Dimitrov, D. M. (2002). Reliability:arguments for multiple perspectivesand potential problems with generalizabilityacross studies. Educ. Psychol.Meas. 62, 783–801.Dozois, D. J. A., Dobson, K. S., andAhnberg, J. L. (1998). A psychometricevaluation of the Beck DepressionInventory –II. Psychol. Assess.10, 83–89.Faul, F., Erdfelder, E., Lang, A., andBuchner, A. (2007). G*Power 3:a flexible statistical power analysisprogram for the social, behavioral,and biomedical sciences. Behav. Res.Methods 39, 175–191.Fleiss, J. L. (1981). Statistical Methods forRates and Proportions, 2nd Edn. NewYork: John Wiley.Harman, H. H. (1967). Modern FactorAnalysis. Chicago: University ofChicago Press.Henson, R. K. (2001). Understandinginternal consistency reliability estimates:a conceptual primer on coefficientalpha. Meas. Eval. Couns. Dev.34, 177–189.Hogan, T. P., Benjamin, A., and Brezinski,K.L. (2000). Reliability methods:a note on the frequency of use of varioustypes. Educ. Psychol. Meas. 60,523–531.Hulin,C.,Netemeyer,R.,and Cudeck,R.(2001). Can a reliability coefficientbe too high? J. Consum. Psychol. 10,55–58.Hunter, J. E., and Schmidt, F. L. (1990).Methods of Meta-Analysis: CorrectingError and Bias in Research Findings.Newbury Park, CA: Sage.Kieffer, K. M., Reese, R. J., and Thompson,B. (2001). Statistical techniquesemployed in AERJ and JCP articlesfrom 1988 to 1997: a methodologicalreview. J. Exp. Educ. 69,280–309.Leach, L. F., Henson, R. K., Odom,L. R., and Cagle, L. S. (2006). Areliability generalization study ofthe Self-Description Questionnaire.Educ. Psychol. Meas. 66, 285–304.Lorenzo-Seva, U., Ferrando, P. J., andChico, E. (2010). Two SPSS programsfor interpreting multipleregression results. Behav. Res. Methods42, 29–35.Magnusson, D. (1967). Test Theory.Reading, MA: Addison-Wesley.Marat, D. (2005). Assessing mathematicsself-efficacy of diverse studentsform secondary schools inAuckland: implications for academicachievement. Issues Educ. Res. 15,37–68.Marsh, H. W. (1989). Age and sexeffects in multiple dimensions ofself-concept: preadolescence to earlyadulthood. J. Educ. Psychol. 81,417–430.Miller, C. S., Shields, A. L., Campfield,D., Wallace, K. A., andWeiss, R. D. (2007). Substance usescales of the Minnesota MultiphasicPersonality Inventory: an explorationof score reliability via metaanalysis.Educ. Psychol. Meas. 67,1052–1065.Muchinsky,P. M. (1996). The correctionfor attenuation. Educ. Psychol. Meas.56, 63–75.Nimon, K., and Henson, R. K. (2010).Validity of residualized variablesafter pretest covariance corrections:still the same variable? Paper Presentedat the Annual Meeting of theAmerican Educational Research Association,Denver, CO.Nunnally, J. C. (1967). PsychometricTheory. New York: McGraw-Hill.Nunnally, J. C. (1978). PsychometricTheory, 2nd Edn. New York:McGraw-Hill.Nunnally, J. C., and Bernstein, I. H.(1994). Psychometric Theory, 3rdEdn. New York: McGraw-Hill.Onwuegbuzie, A. J., Roberts, J. K., andDaniel, L. G. (2004). A proposednew “what if” reliability analysis forassessing the statistical significanceof bivariate relationships. Meas.Eval. Couns. Dev. 37, 228–239.Osborne, J., and Waters, E. (2002). Fourassumptions of multiple regressionthat researchers should always test.Pract. Assess. Res. Eval. 8. Availableat: http://PAREonline.net/getvn.asp?v=8&n=2Pajares, F., and Graham, L. (1999).Self-efficacy, motivation constructs,and mathematics performanceof entering middle school students.Contemp. Educ. Psychol. 24,124–139.Pedhazur, E. J. (1997). Multiple Regressionin Behavioral Research: Explanationand Prediction, 3rd Edn. FortWorth, TX: Harcourt Brace.Pintrich, P. R., Smith, D. A. F., Garcia,T., and McKeachie, W. J. (1991).A Manual for the Use of the MotivatedStrategies for Learning Questionnaire(MSLQ). Ann Arbor: Universityof Michigan, National Centerfor Research to Improve PostsecondaryTeaching and Learning.Popham, W. J. (2000). Modern EducationalMeasurement: Practical Guidelinesfor Educational Leaders, 3rdEdn. Needham, MA: Allyn &Bacon.Public Agenda. (2011). State-by-StateSAT and ACT Scores. Availableat: http://www.publicagenda.org/charts/state-state-sat-and-act-scoresRodriguez, M. C., and Maeda, Y.(2006). Meta-analysis of coefficientalpha. Psychol. Methods 11,306–322.Rosenthal, R. (1979). The file drawerproblem and tolerance for nullresults. Psychol. Bull. 86, 638–641.Rosenthal, R. (1995). Writing metaanalyticreviews. Psychol. Bull. 118,183–192.www.frontiersin.org April 2012 | Volume 3 | Article 102 | 51
Nimon et al.The assumption of reliable dataSchmidt, F. L., and Hunter, J. E. (1977).Development of a general solutionto the problem of validitygeneralization. J. Appl. Psychol. 62,529–540.Shavelson, R., and Webb, N. (1991).Generalizability Theory: A Primer.Newbury Park, CA: Sage.Shields, A. L., and Caruso, J. C. (2004).A reliability induction and reliabilitygeneralization study of the CageQuestionnaire. Educ. Psychol. Meas.64, 254–270.Spearman, C. (1904). The proofand measurement of associationbetween two things. Am.J.Psychol.15, 72–101.Stemler, S. (2004). A comparison ofconsensus, consistency, and measurementapproaches to estimatinginterrater reliability. Pract. Assess.Res. Eval. 9. Available at: http://PAREonline.net/getvn.asp?v=9&n=4Thompson, B. (2000). “Ten commandmentsof structural equationmodeling,” in Reading andUnderstanding More MultivariateStatistics, eds L. Grimm and P.Yarnold (Washington, DC: AmericanPsychological Association),261–284.Thompson, B. (2003a). “Guidelinesfor authors reporting score reliabilityestimates,” in Score Reliability:Contemporary Thinking onReliability Issues, ed. B. Thompson(Newbury Park, CA: Sage),91–101.Thompson, B. (2003b). “A brief introductionto generalizability theory,”in Score Reliability: ContemporaryThinking on Reliability Issues, ed.B. Thompson (Newbury Park, CA:Sage), 43–58.Thompson, B., and Vacha-Haase,T. (2000). Psychometrics isdatametrics: the test is not reliable.Educ. Psychol. Meas. 60,174–195.Trafimow,D.,and Rice,S. (2009). Potentialperformance theory (PPT):describing a methodology for analyzingtask performance. Behav. Res.Methods 41, 359–371.Vacha-Haase, T. (1998). Reliability generalization:exploring variance inmeasurement error affecting scorereliability across studies. Educ. Psychol.Meas. 58, 6–20.Vacha-Haase, T., Henson, R. K., andCaruso, J. C. (2002). Reliabilitygeneralization: moving towardimproved understanding and use ofscore reliability. Educ. Psychol. Meas.62, 562–569.Vacha-Haase, T., Kogan, L, R., andThompson, B. (2000). Sample compositionsand variabilities in publishedstudies versus those in testmanuals: validity of score reliabilityinductions. Educ. Psychol. Meas. 60,509–522.Vacha-Haase, T., Ness, C., Nillson,J., and Reetz, D. (1999).Practices regarding reporting ofreliability coefficients: a review ofthree journals. J. Exp. Educ. 67,335–341.Vacha-Haase, T., and Thompson, B.(2011). Score reliability: a retrospectivelook back at 12 yearsof reliability generalization studies.Meas. Eval. Couns. Dev. 44,159–168.Wetcher-Hendricks, D. (2006). Adjustmentsto the correction forattenuation. Psychol. Methods 11,207–215.Wilkinson, L., and APA Task Force onStatistical Inference. (1999). Statisticalmethods in psychology journals:guidelines and explanation. Am. Psychol.54, 594–604.Yetkiner, Z. E., and Thompson, B.(2010). Demonstration of how scorereliability is integrated into SEM andhow reliability affects all statisticalanalyses. Mult. Linear Regress.Viewp.26, 1–12.Zientek, L. R., Capraro, M. M., andCapraro, R. M. (2008). Reportingpractices in quantitativeteacher education research: onelook at the evidence cited in theAERA panel report. Educ. Res. 37,208–216.Zientek, L. R., and Thompson, B.(2009). Matrix summaries improveresearch reports: secondary analysesusing published literature. Educ. Res.38, 343–352.Zimmerman, D. W. (2007). Correctionfor attenuation with biased reliabilityestimates and correlated errorsin populations and samples. Educ.Psychol. Meas. 67, 920–939.Zimmerman, D. W., and Williams,R. H. (1977). The theory of testvalidity and correlated errors ofmeasurement. J. Math. Psychol. 16,135–152.Conflict of Interest Statement: Theauthors declare that the research wasconducted in the absence of any commercialor financial relationships thatcould be construed as a potential conflictof interest.Received: 30 December 2011; paper pendingpublished: 26 January 2012; accepted:19 March 2012; published online: 12 April2012.Citation: Nimon K, Zientek LR andHenson RK (2012) The assumption ofa reliable instrument and other pitfallsto avoid when considering the reliabilityof data. Front. Psychology 3:102. doi:10.3389/fpsyg.2012.00102This article was submitted to Frontiersin Quantitative Psychology and Measurement,a specialty of Frontiers in Psychology.Copyright © 2012 Nimon, Zientek andHenson. This is an open-access articledistributed under the terms of the CreativeCommons Attribution Non CommercialLicense, which permits noncommercialuse, distribution, and reproductionin other forums, provided theoriginal authors and source are credited.Frontiers in Psychology | Quantitative Psychology and Measurement April 2012 | Volume 3 | Article 102 | 52
Page 2 and 3: FRONTIERS COPYRIGHTSTATEMENT© Copy
Page 4 and 5: Table of Contents05 Is Data Cleanin
Page 7 and 8: OsborneAssumptions and data cleanin
Page 9 and 10: ORIGINAL RESEARCH ARTICLEpublished:
Page 12 and 13: Hoekstra et al.Why assumptions are
Page 20 and 21: García-PérezStatistical conclusio
Page 30 and 31: Sheng and ShengEffect of non-normal
Page 42 and 43: REVIEW ARTICLEpublished: 12 April 2
Page 44 and 45: Nimon et al.The assumption of relia
Page 56 and 57: TressoldiPower replication unreliab
Page 58 and 59: TressoldiPower replication unreliab
Page 62 and 63: FinchModern methods for the detecti
Page 72 and 73: MINI REVIEW ARTICLEpublished: 28 Au
Page 74 and 75: NimonStatistical assumptionsand Del
Page 76 and 77: NimonStatistical assumptionsFor exa
Page 78 and 79: Kraha et al.Interpreting multiple r
Page 94 and 95: SmithsonComparing moderation of slo
Page 102 and 103:
REVIEW ARTICLEpublished: 01 March 2
Page 104 and 105:
Flora et al.Factor analysis assumpt
Page 106 and 107:
Page 108 and 109:
Page 110 and 111:
Page 112 and 113:
Page 114 and 115:
Page 116 and 117:
Page 118 and 119:
Page 120 and 121:
Page 122 and 123:
Page 124 and 125:
Kasper and ÜnlüAssumptions of fac
Page 126 and 127:
Page 128 and 129:
Page 130 and 131:
Page 132 and 133:
Page 134 and 135:
Page 136 and 137:
Page 138 and 139:
Page 140 and 141:
Page 142 and 143:
Page 144 and 145:
Lages and JaworskaHow predictable a
Page 146 and 147:
Page 148 and 149:
Page 150 and 151:
Page 152 and 153:
Cummiskey et al.Testing assumptions
Page 154 and 155:
Page 156 and 157:
Page 158:
show all

Sweating the Small Stuff: Does data cleaning and testing ... - Frontiers

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?