13.07.2015 Views

Sweating the Small Stuff: Does data cleaning and testing ... - Frontiers

Sweating the Small Stuff: Does data cleaning and testing ... - Frontiers

Sweating the Small Stuff: Does data cleaning and testing ... - Frontiers

SHOW MORE
SHOW LESS
  • No tags were found...

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

TressoldiPower replication unreliabilitypower = 0.9 with α = 0.05 given <strong>the</strong> observed r<strong>and</strong>om ESs, wasestimated.Statistical power was calculated using <strong>the</strong> software G ∗ Power(Faul et al., 2007).COMMENTThe results are quite clear: apart from <strong>the</strong> unconscious semanticpriming for semantic categorization, where <strong>the</strong> number of participantsin a typical experiment is sufficient to obtain a statisticalpower above 0.90, for all remaining phenomena, to achieve thislevel of power, it is necessary to increase <strong>the</strong> number of participantsin a typical study, from a minimum of seven participants for<strong>the</strong> unconscious semantic priming for lexical decision <strong>and</strong> namingto around 3400 to investigate NLP using <strong>the</strong> forced-choice withnormal state of consciousness protocol.GENERAL DISCUSSIONThe response to <strong>the</strong> question posed in <strong>the</strong> introduction, as towhe<strong>the</strong>r <strong>the</strong>re are elusive phenomena or an elusive power to detect<strong>the</strong>m, is quite clear. If <strong>the</strong>re are clear estimates of ESs from <strong>the</strong>evidence of <strong>the</strong> phenomenon derived from a sufficient numberof studies analyzed meta-analytically <strong>and</strong> <strong>the</strong>ir values are moderateor low, it is m<strong>and</strong>atory to increase <strong>the</strong> number of participantsto achieve a statistical power of 0.90, with <strong>the</strong> inevitable consequenceof investing more time <strong>and</strong> money into each study beforeinterpreting <strong>the</strong> results as support for reality or unreality of aphenomenon.Are <strong>the</strong>re alternatives to this obligation? Yes, <strong>and</strong> we briefly illustratesome of <strong>the</strong>se, also providing references for those interestedin using <strong>the</strong>m.CONFIDENCE INTERVALSIn line with <strong>the</strong> statistical reform movement (i.e., Cumming, 2012),in <strong>the</strong> APA manual (American Psychological Association, APA,2010), <strong>the</strong>re are <strong>the</strong> following statistical recommendations “Alternatively,(to <strong>the</strong> use of NHST) use calculations based on a chosentarget precision (confidence interval width) to determine samplesizes. Use <strong>the</strong> resulting confidence intervals to justify conclusionsconcerning ESs (e.g., that some effect is negligibly small) p. 30.”EQUIVALENCE TESTINGEquivalence tests are inferential statistics designed to provideevidence for a null hypo<strong>the</strong>sis. Like effect tests, <strong>the</strong> nil–null iseschewed in equivalence <strong>testing</strong>. However unlike st<strong>and</strong>ard NHST,equivalence tests provide evidence that <strong>the</strong>re is little differenceor effect. A significant result in an equivalence test means that<strong>the</strong> hypo<strong>the</strong>sis that <strong>the</strong> effects or differences are substantial can berejected. Hence, equivalence tests are appropriate when researcherswant to show little difference or effect (Levine et al., 2008).EVALUATING INFORMATIVE HYPOTHESESEvaluating specific expectations directly produces more usefulresults than sequentially <strong>testing</strong> traditional null hypo<strong>the</strong>ses againstcatch-all rivals. Researchers are often interested in <strong>the</strong> evaluationof informative hypo<strong>the</strong>ses <strong>and</strong> already know that <strong>the</strong> traditionalnull hypo<strong>the</strong>sis is an unrealistic hypo<strong>the</strong>sis. This presupposes thatprior knowledge is often available; if this is not <strong>the</strong> case, <strong>testing</strong> <strong>the</strong>traditional null hypo<strong>the</strong>sis is appropriate. In most applied studies,however, prior knowledge is indeed available in <strong>the</strong> form ofspecific expectations about <strong>the</strong> ordering of statistical parameters(Kuiper <strong>and</strong> Hoijtink, 2010; Van de Schoot et al., 2011).BAYESIAN APPROACHAno<strong>the</strong>r alternative is to ab<strong>and</strong>on <strong>the</strong> frequentist approach <strong>and</strong>use a Bayesian one (Wagenmakers et al., 2011). With a Bayesianapproach <strong>the</strong> problem of statistical power is substituted with parameterestimation <strong>and</strong>/or model comparison (Kruschke, 2011). In<strong>the</strong> first approach, assessing null values, <strong>the</strong> analyst simply setsup a range of c<strong>and</strong>idate values, including <strong>the</strong> null value, <strong>and</strong> usesBayesian inference to compute <strong>the</strong> relative credibility of all <strong>the</strong>c<strong>and</strong>idate values. In <strong>the</strong> model comparison approach, <strong>the</strong> analystsets up two competing models of what values are possible.One model posits that only <strong>the</strong> null value is possible whereas <strong>the</strong>alternative model posits that a broad range of o<strong>the</strong>r values is alsopossible. Bayesian inference is used to compute which model ismore credible, given <strong>the</strong> <strong>data</strong>.FINAL COMMENTIs <strong>the</strong>re a chance to ab<strong>and</strong>on “The Null Ritual” in <strong>the</strong> near future<strong>and</strong> to think of science as cumulative knowledge? The answer is“yes”if we approach scientific discovery thinking meta-analytically(Cumming, 2012), that is, simply reporting observed (st<strong>and</strong>ardized)ES <strong>and</strong> <strong>the</strong> corresponding confidence intervals, both whenNHST is refuted <strong>and</strong> when it is not refuted (Nickerson, 2000;American Psychological Association, APA, 2010) without drawingdichotomous decisions. The statistical approaches listed above aregood tools to achieve this goal.How many editors <strong>and</strong> reviewers are committed to pursuing it?ACKNOWLEDGMENTSSuggestions <strong>and</strong> comments by <strong>the</strong> reviewers were greatly appreciatedfor improving <strong>the</strong> clarity <strong>and</strong> quality of <strong>the</strong> paper. ProofReading Service revised <strong>the</strong> English.REFERENCESAlcock, J. E. (2003). Give <strong>the</strong> nullhypo<strong>the</strong>sis a chance: reasons toremain doubtful about <strong>the</strong> existenceof PSI. J. Conscious. Stud. 10, 29–50.American Psychological Association.(2010). Publication manual of <strong>the</strong>American Psychological Association,6th Edn, Washington, DC: AmericanPsychological Association.Bezeau, S., <strong>and</strong> Graves, R. (2001). Statisticalpower <strong>and</strong> effect sizes of clinicalneuropsychology research. J. Clin.Exp. Neuropsychol. 23, 399–406.Bouwmeester, D., Pan, J. W., Mattle,K., Eibl, M., Weinfurter, H., <strong>and</strong>Zeilinger, A. (1997). Experimentalquantum teleportation. Nature 390,575–579.Busemeyer, J. R., Pothos, E. M., Franco,R., <strong>and</strong> Trueblood, J. S. (2011). Aquantum <strong>the</strong>oretical explanation forprobability judgment errors. Psychol.Rev. 118, 2, 193–218.Cohen, J. (1992). A power primer. Psychol.Bull. 112, 1,155–1,159.Cohen, J. (1994). The earth isround (p < .085). Am. Psychol.49, 997–1003.Cumming, G. (2012). Underst<strong>and</strong>ing <strong>the</strong>New Statistics: Effect Sizes, ConfidenceIntervals, <strong>and</strong> Meta-Analysis.New York: Routledge.Dijksterhuis, A., Bos, M. W., Nordgren,L. F., <strong>and</strong> Van Baaren, R. B.(2006). On making <strong>the</strong> right choice:<strong>the</strong> deliberation-without attentioneffect. Science 311, 1005–1007.Faul, F., Erdfelder, E., Lang, A.-G., <strong>and</strong>Buchner, A. (2007). G∗Power 3:a flexible statistical power analysisprogram for <strong>the</strong> social, behavioral,<strong>and</strong> biomedical sciences. Behav. Res.Methods 39, 175–191.Genovese, M. (2005). Research on hiddenvariable <strong>the</strong>ories, a review ofrecent progresses. Phys. Rep. 413,319–396.<strong>Frontiers</strong> in Psychology | Quantitative Psychology <strong>and</strong> Measurement July 2012 | Volume 3 | Article 218 | 57

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!