A COMPENDIUM OF SCALES for use in the SCHOLARSHIP OF TEACHING AND LEARNING
compscalesstl
compscalesstl
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Summative Assessments<br />
Summative Rubrics<br />
Although APA lists several standardized assessments designed to measure various aspects of<br />
communication (e.g., Collegiate Assessment of Academic Proficiency Writ<strong>in</strong>g Essay Test and<br />
Writ<strong>in</strong>g Skills Test, WorkKeys Foundational Skills Assessment, and Collegiate Level Assessment),<br />
and <strong>the</strong> Association of American Colleges and Universities (AACU) provides a Written<br />
Communication rubric (https://www.aacu.org/value/rubrics/written-communication), none<br />
focus on communication <strong>in</strong> psychology, or more globally, on scientific writ<strong>in</strong>g. Although<br />
writ<strong>in</strong>g, specifically scientific writ<strong>in</strong>g, is typical <strong>in</strong> natural and social science curricula, few<br />
standardized rubrics to evaluate such work have been developed, tested, and made accessible.<br />
In this section, we identify empirically-tested rubrics <strong>for</strong> summative <strong>use</strong> <strong>in</strong> scientific writ<strong>in</strong>g that<br />
can be easily accessed and implemented or adapted <strong>for</strong> classroom <strong>use</strong> <strong>in</strong> psychology.<br />
Stellmack, Konheim-Kalkste<strong>in</strong>, Manor, Massay, and Schmitz (2009) created and evaluated <strong>the</strong><br />
reliability and validity of a rubric <strong>use</strong>d to score APA-style research papers. The researchers<br />
foc<strong>use</strong>d on only <strong>the</strong> Introduction section of <strong>the</strong> rubric; however, updated and expanded rubrics<br />
<strong>for</strong> all sections (e.g., Introduction, Method, and Results/Discussion) are available at<br />
http://www.psych.umn.edu/psylabs/acoustic/rubrics.htm. Students were <strong>in</strong>structed to write<br />
an APA-style Introduction <strong>in</strong>corporat<strong>in</strong>g five sources. The rubric encompasses eight dimensions<br />
that are fundamental aspects of this type of writ<strong>in</strong>g assignment: APA <strong>for</strong>matt<strong>in</strong>g, literature<br />
review, purpose of study, study description and hypo<strong>the</strong>sis, overall organization/logical flow,<br />
sources, scientific writ<strong>in</strong>g style, and composition/grammar/word choice. Students earn up to 3<br />
po<strong>in</strong>ts <strong>for</strong> each dimension, and examples of <strong>the</strong> grade levels with<strong>in</strong> each dimension are<br />
provided. For example, if ei<strong>the</strong>r <strong>the</strong> study description or <strong>the</strong> hypo<strong>the</strong>sis is miss<strong>in</strong>g, <strong>the</strong>n 1 po<strong>in</strong>t<br />
would be earned <strong>for</strong> <strong>the</strong> study description and hypo<strong>the</strong>sis dimension. Alternatively, if both <strong>the</strong><br />
description and hypo<strong>the</strong>sis are provided, but ei<strong>the</strong>r is unclear, <strong>the</strong>n 2 po<strong>in</strong>ts would be earned.<br />
Scores <strong>for</strong> <strong>the</strong> Introduction section range from 0 to 24.<br />
Stellmack et al. (2009) evaluated <strong>the</strong> <strong>in</strong>terrater and <strong>in</strong>trarater reliability of this rubric. For three<br />
graders, <strong>in</strong>terrater reliability was def<strong>in</strong>ed both liberally (i.e., scores <strong>for</strong> a dimension were with<strong>in</strong><br />
1 po<strong>in</strong>t across <strong>the</strong> three graders) and conservatively (i.e., scores <strong>for</strong> a dimension were equal<br />
across <strong>the</strong> three graders). Agreement ranged from .90 (liberal) to .37 (conservative). For<br />
<strong>in</strong>trarater reliability, graders re-evaluated a subset of <strong>the</strong>ir orig<strong>in</strong>al papers 2 weeks after <strong>the</strong><br />
<strong>in</strong>itial assessment. Agreement ranged from .98 (liberal) to .78 (conservative). In such<br />
consistency estimates, values at or above .70 are viewed as acceptable (Stemler, 2004).<br />
Beyond psychology-based writ<strong>in</strong>g, <strong>the</strong> Biology Thesis Assessment Protocol (BioTAP) provides a<br />
systematic method to assess scientific writ<strong>in</strong>g <strong>in</strong> <strong>the</strong> biological sciences, and <strong>the</strong> components of<br />
<strong>the</strong> rubric are applicable across natural and social science discipl<strong>in</strong>es and could be adapted <strong>for</strong><br />
<strong>use</strong> with<strong>in</strong> psychology. Reynolds, Smith, Moskovitz, and Sayle (2009) <strong>use</strong>d BioTAP to evaluate<br />
undergraduate <strong>the</strong>ses <strong>in</strong> biology. The rubric has three categories: higher-order writ<strong>in</strong>g issues<br />
(e.g., “Does <strong>the</strong> <strong>the</strong>sis make a compell<strong>in</strong>g argument <strong>for</strong> <strong>the</strong> significance of <strong>the</strong> student’s<br />
research with<strong>in</strong> <strong>the</strong> context of <strong>the</strong> current literature?”), mid- and lower-order writ<strong>in</strong>g issues (“Is<br />
59