Evaluation and testing in nursing education - Springer Publishing
Evaluation and testing in nursing education - Springer Publishing
Evaluation and testing in nursing education - Springer Publishing
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Oermann title pps 8/31/05 4:11 PM Page 1<br />
<strong>Evaluation</strong> <strong>and</strong> Test<strong>in</strong>g<br />
<strong>in</strong> Nurs<strong>in</strong>g Education<br />
Second Edition
Marilyn H. Oermann, PhD, RN, FAAN, is a<br />
Professor <strong>in</strong> the College of Nurs<strong>in</strong>g, Wayne<br />
State University, Detroit, Michigan. She is author/co-author<br />
of 11 nurs<strong>in</strong>g <strong>education</strong> books<br />
<strong>and</strong> many articles on cl<strong>in</strong>ical evaluation, teach<strong>in</strong>g<br />
<strong>in</strong> nurs<strong>in</strong>g, <strong>and</strong> writ<strong>in</strong>g for publication as a<br />
nurse educator. She is the Editor of the Annual<br />
Review of Nurs<strong>in</strong>g Education <strong>and</strong> Journal of Nurs<strong>in</strong>g<br />
Care Quality. Dr. Oermann lectures widely<br />
on teach<strong>in</strong>g <strong>and</strong> evaluation <strong>in</strong> nurs<strong>in</strong>g.<br />
Kathleen B. Gaberson, PhD, RN, CNOR, is<br />
Professor, Chair of the Department of Nurs<strong>in</strong>g<br />
Education, <strong>and</strong> Director of Nurs<strong>in</strong>g at Shepherd<br />
University, Shepherdstown, WV. She has over<br />
30 years of teach<strong>in</strong>g experience <strong>in</strong> graduate <strong>and</strong><br />
undergraduate nurs<strong>in</strong>g programs <strong>and</strong> has presented,<br />
written, <strong>and</strong> consulted extensively on<br />
evaluation <strong>and</strong> teach<strong>in</strong>g <strong>in</strong> nurs<strong>in</strong>g <strong>education</strong>.<br />
She is Research Section Editor of the AORN<br />
Journal.
Oermann title pps 8/31/05 4:11 PM Page 2<br />
<strong>Evaluation</strong> <strong>and</strong> Test<strong>in</strong>g<br />
<strong>in</strong> Nurs<strong>in</strong>g Education<br />
Second Edition<br />
Marilyn H. Oermann, PhD, RN, FAAN<br />
Kathleen B. Gaberson, PhD, RN, CNOR
Copyright © 2006 by Spr<strong>in</strong>ger Publish<strong>in</strong>g Company, Inc.<br />
All rights reserved<br />
No part of this publication may be reproduced, stored <strong>in</strong> a retrieval system, or<br />
transmitted <strong>in</strong> any form or by any means, electronic, mechanical, photocopy<strong>in</strong>g,<br />
record<strong>in</strong>g, or otherwise, without the prior permission of Spr<strong>in</strong>ger Publish<strong>in</strong>g<br />
Company, Inc.<br />
Spr<strong>in</strong>ger Publish<strong>in</strong>g Company, Inc.<br />
11 West 42nd Street<br />
New York, NY 10036<br />
Acquisitions Editor: Ruth Chasek<br />
Production Editor: Pam Lankas<br />
Cover design by Joanne Honigman<br />
Typeset by International Graphic Services, Inc., Newtown, PA<br />
0607080910/54321<br />
Library of Congress Catalog<strong>in</strong>g-<strong>in</strong>-Publication Data<br />
Oermann, Marilyn H.<br />
<strong>Evaluation</strong> <strong>and</strong> <strong>test<strong>in</strong>g</strong> <strong>in</strong> nurs<strong>in</strong>g <strong>education</strong> / Marilyn H. Oermann,<br />
Kathleen B. Gaberson. — 2nd ed.<br />
p. ; cm. — (Teach<strong>in</strong>g of nurs<strong>in</strong>g series)<br />
Includes bibliographical references <strong>and</strong> <strong>in</strong>dex.<br />
ISBN 0-8261-9951-8<br />
1. Nurs<strong>in</strong>g—Exam<strong>in</strong>ations. 2. Nurs<strong>in</strong>g—Ability <strong>test<strong>in</strong>g</strong>. 3. Nurs<strong>in</strong>g —Study<br />
<strong>and</strong> teach<strong>in</strong>g—<strong>Evaluation</strong>. I. Gaberson, Kathleen B. II. Title. III. Series:<br />
Spr<strong>in</strong>ger series on the teach<strong>in</strong>g of nurs<strong>in</strong>g (unnumbered)<br />
[DNLM: 1. Education, Nurs<strong>in</strong>g—st<strong>and</strong>ards—United States. 2. Educational Measurement—methods—United<br />
States. 3. <strong>Evaluation</strong> Studies—United States.<br />
WY 18 O29e 2006]<br />
RT73.7.O47 2006<br />
610.73'071'1—dc22 2005023071<br />
Pr<strong>in</strong>ted <strong>in</strong> the United States of America by Sheridan Books, Inc.
To our parents:<br />
Laurence <strong>and</strong> Dorothy Haag<br />
Homer <strong>and</strong> Dorothy Bollen<br />
<strong>and</strong> families:<br />
David, Eric, <strong>and</strong> Ross Oermann<br />
Paul Gaberson <strong>and</strong> Matthew Ammon
Contents<br />
Preface<br />
ix<br />
1. <strong>Evaluation</strong>, Measurement, <strong>and</strong> the Educational Process 1<br />
2. Qualities of Effective Measurement Instruments 23<br />
3. Plann<strong>in</strong>g for Classroom Test<strong>in</strong>g 39<br />
4. Selected-Response Test Items: True-False <strong>and</strong> Match<strong>in</strong>g 59<br />
5. Selected-Response Test Items: Multiple-Choice <strong>and</strong><br />
Multiple-Response 69<br />
6. Constructed-Response Test Items: Completion<br />
(Fill-<strong>in</strong>-the-Blank) <strong>and</strong> Essay 91<br />
7. <strong>Evaluation</strong> of Higher-Level Learn<strong>in</strong>g, Problem Solv<strong>in</strong>g, <strong>and</strong><br />
Critical Th<strong>in</strong>k<strong>in</strong>g: Context-Dependent Item Sets <strong>and</strong> Other<br />
<strong>Evaluation</strong> Strategies 111<br />
8. Test Construction <strong>and</strong> Preparation of Students for Licensure<br />
<strong>and</strong> Certification Exam<strong>in</strong>ations 137<br />
9. Assembl<strong>in</strong>g <strong>and</strong> Adm<strong>in</strong>ister<strong>in</strong>g Tests 155<br />
10. Scor<strong>in</strong>g <strong>and</strong> Analyz<strong>in</strong>g Tests 171<br />
11. <strong>Evaluation</strong> of Written Assignments 185<br />
12. Cl<strong>in</strong>ical <strong>Evaluation</strong> 199<br />
13. Cl<strong>in</strong>ical <strong>Evaluation</strong> Methods 213<br />
14. Social, Ethical, <strong>and</strong> Legal Issues 251<br />
15. Interpret<strong>in</strong>g Test Scores 263<br />
16. Grad<strong>in</strong>g 277<br />
17. Program <strong>Evaluation</strong> 303<br />
Appendix A Examples of Rat<strong>in</strong>g Forms for Cl<strong>in</strong>ical <strong>Evaluation</strong> 321<br />
Appendix B Codes of Ethics <strong>in</strong> Test<strong>in</strong>g <strong>and</strong> 365<br />
Educational Measurement<br />
Appendix C 9 Pr<strong>in</strong>ciples of Good Practice for Assess<strong>in</strong>g 383<br />
Student Learn<strong>in</strong>g<br />
Index 387<br />
vii
Preface<br />
All teachers at some time or another need to measure <strong>and</strong> evaluate<br />
learn<strong>in</strong>g. The teacher may write test items, prepare tests <strong>and</strong> analyze their results,<br />
develop rat<strong>in</strong>g scales <strong>and</strong> cl<strong>in</strong>ical evaluation methods, <strong>and</strong> plan other strategies<br />
for assess<strong>in</strong>g learn<strong>in</strong>g <strong>in</strong> the classroom, cl<strong>in</strong>ical practice, onl<strong>in</strong>e <strong>and</strong> distance<br />
<strong>education</strong> courses, <strong>and</strong> other sett<strong>in</strong>gs. Often teachers are not prepared to carry<br />
out these tasks as part of their <strong>in</strong>structional role. This second edition of <strong>Evaluation</strong><br />
<strong>and</strong> Test<strong>in</strong>g <strong>in</strong> Nurs<strong>in</strong>g Education is a resource for teachers <strong>in</strong> nurs<strong>in</strong>g <strong>education</strong><br />
programs <strong>and</strong> health care agencies <strong>and</strong> a textbook for graduate students prepar<strong>in</strong>g<br />
for teach<strong>in</strong>g roles, for nurses <strong>in</strong> cl<strong>in</strong>ical practice who teach others <strong>and</strong> are<br />
responsible for evaluat<strong>in</strong>g their learn<strong>in</strong>g <strong>and</strong> performance, <strong>and</strong> for other health<br />
professionals <strong>in</strong>volved <strong>in</strong> evaluation, measurement, <strong>and</strong> <strong>test<strong>in</strong>g</strong>. While the examples<br />
of test items <strong>and</strong> other types of evaluation methods provided <strong>in</strong> this book<br />
are nurs<strong>in</strong>g-oriented, they are easily adapted to assessment <strong>and</strong> evaluation <strong>in</strong><br />
other health fields.<br />
The purposes of the book are to describe concepts of evaluation, measurement,<br />
<strong>and</strong> <strong>test<strong>in</strong>g</strong> <strong>in</strong> nurs<strong>in</strong>g <strong>education</strong>; qualities of effective measurement <strong>in</strong>struments;<br />
how to plan for classroom <strong>test<strong>in</strong>g</strong>, assemble <strong>and</strong> adm<strong>in</strong>ister tests, <strong>and</strong><br />
analyze test results; how to write all types of test items <strong>and</strong> develop evaluation<br />
strategies; methods for evaluat<strong>in</strong>g higher level cognitive skills <strong>and</strong> critical th<strong>in</strong>k<strong>in</strong>g;<br />
how to evaluate written assignments <strong>in</strong> nurs<strong>in</strong>g; cl<strong>in</strong>ical evaluation <strong>and</strong><br />
how to evaluate cl<strong>in</strong>ical performance; social, ethical, <strong>and</strong> legal issues associated<br />
with evaluation <strong>and</strong> <strong>test<strong>in</strong>g</strong>; grad<strong>in</strong>g; <strong>and</strong> program evaluation. The content is<br />
useful for teachers <strong>in</strong> any sett<strong>in</strong>g who are <strong>in</strong>volved <strong>in</strong> evaluat<strong>in</strong>g others, whether<br />
they are students, nurses, or other types of health personnel.<br />
Chapter 1 addresses the purposes of evaluation <strong>and</strong> measurement <strong>in</strong> nurs<strong>in</strong>g<br />
<strong>education</strong> <strong>and</strong> provides a framework for evaluat<strong>in</strong>g students <strong>and</strong> other learners.<br />
Differences between formative <strong>and</strong> summative evaluation <strong>and</strong> between norm<strong>and</strong><br />
criterion-referenced measurement are explored. S<strong>in</strong>ce effective evaluation<br />
requires a clear description of what <strong>and</strong> how to evaluate, the chapter describes<br />
the use of objectives as a basis for develop<strong>in</strong>g test items <strong>and</strong> evaluation strategies,<br />
provides examples of objectives at different taxonomic levels, <strong>and</strong> describes how<br />
test items would be developed at each of these levels. Some teachers, however,<br />
ix
x<br />
PREFACE<br />
do not use objectives as the basis for <strong>test<strong>in</strong>g</strong> <strong>and</strong> evaluation but <strong>in</strong>stead develop<br />
test items <strong>and</strong> other evaluation methods from the content of the course. For<br />
this reason chapter 1 also <strong>in</strong>cludes an explanation of how to develop test items<br />
us<strong>in</strong>g this process.<br />
In chapter 2, qualities of effective measurement <strong>in</strong>struments are discussed.<br />
The concepts of validity <strong>and</strong> reliability <strong>and</strong> their effects on the <strong>in</strong>terpretive<br />
quality of measurement results are described <strong>in</strong> the chapter. Teachers must<br />
gather evidence to support their <strong>in</strong>ferences about scores obta<strong>in</strong>ed on a measure.<br />
Although this evidence traditionally has been classified as content, criterionrelated,<br />
<strong>and</strong> construct validity, a newer underst<strong>and</strong><strong>in</strong>g of validity centers on<br />
construct validity as a unitary concept. New ways of th<strong>in</strong>k<strong>in</strong>g about reliability<br />
<strong>and</strong> its relationship to validity are expla<strong>in</strong>ed. Also discussed <strong>in</strong> chapter 2 are<br />
important practical considerations that might affect the choice or development<br />
of tests <strong>and</strong> other <strong>in</strong>struments.<br />
Chapter 3 describes the steps <strong>in</strong>volved <strong>in</strong> plann<strong>in</strong>g for test construction,<br />
enabl<strong>in</strong>g the teacher to make good decisions about what <strong>and</strong> when to test; test<br />
length; difficulty of test items; item formats; <strong>and</strong> scor<strong>in</strong>g procedures. An important<br />
focus of the chapter is how to develop a test bluepr<strong>in</strong>t <strong>and</strong> then use it<br />
for writ<strong>in</strong>g test items; examples are provided to clarify this process for the reader.<br />
Broad pr<strong>in</strong>ciples important <strong>in</strong> develop<strong>in</strong>g test items regardless of the specific<br />
type are described <strong>in</strong> the chapter.<br />
There are different ways of classify<strong>in</strong>g test items. One way is to group them<br />
accord<strong>in</strong>g to how they are scored—objectively or subjectively. Another way is<br />
to group them by the type of response required of the test-taker, which is how<br />
we organized the chapters. Selected-response items require the test-taker to select<br />
the correct or best answer from options provided by the teacher. These items<br />
<strong>in</strong>clude true-false, match<strong>in</strong>g exercises, multiple-choice, <strong>and</strong> multiple-response.<br />
Constructed-response items ask the test-taker to supply an answer rather than<br />
choose from options already provided. Constructed-response items <strong>in</strong>clude completion<br />
<strong>and</strong> essay (short <strong>and</strong> extended). Chapters 4 through 6 discuss these<br />
test items.<br />
A true-false item consists of a statement that the student judges as true or<br />
false. In some forms, students also correct the response or supply a rationale as<br />
to why the statement is true or false. True-false items are most effective for recall<br />
of facts <strong>and</strong> specific <strong>in</strong>formation but may also be used to test the student’s<br />
comprehension of the content. Chapter 4 describes how to construct truefalse<br />
items <strong>and</strong> different variations, for example, correct<strong>in</strong>g false statements or<br />
provid<strong>in</strong>g a rationale for the response, which allows the teacher to evaluate if<br />
the learner underst<strong>and</strong>s the content. Chapter 4 also expla<strong>in</strong>s how to develop<br />
match<strong>in</strong>g exercises. These consist of two parallel columns <strong>in</strong> which students<br />
match terms, phrases, sentences, or numbers from one column to the other.
Preface<br />
xi<br />
Pr<strong>in</strong>ciples for writ<strong>in</strong>g each type of item are presented, accompanied by sample<br />
items.<br />
In chapter 5 the focus is on writ<strong>in</strong>g multiple-choice <strong>and</strong> multiple-response<br />
items. Multiple-choice items, with one correct answer, are used widely <strong>in</strong> nurs<strong>in</strong>g<br />
<strong>and</strong> other fields. This format of test item <strong>in</strong>cludes an <strong>in</strong>complete statement or<br />
question, followed by a list of options that complete the statement or answer<br />
the question. Multiple-response items are designed similarly, although more than<br />
one answer may be correct. Both of these formats of test items may be used for<br />
evaluat<strong>in</strong>g learn<strong>in</strong>g at the recall, comprehension, application, <strong>and</strong> analysis levels,<br />
mak<strong>in</strong>g them adaptable for a wide range of content <strong>and</strong> learn<strong>in</strong>g outcomes.<br />
There are three parts <strong>in</strong> a multiple-choice item, each with its own set of pr<strong>in</strong>ciples<br />
for development: (a) stem, (b) answer, <strong>and</strong> (c) distractors. In chapter 5 we<br />
discuss how to write each of these parts <strong>and</strong> provide many examples. Multipleresponse<br />
items are now <strong>in</strong>cluded on the NCLEX® Exam<strong>in</strong>ation as one of the<br />
types of alternate item formats; we have a section <strong>in</strong> chapter 5 on how to write<br />
these items.<br />
With selected-response items the test-taker chooses the correct or best<br />
answer from the options provided by the teacher. In contrast, with constructedresponse<br />
items, the test-taker supplies an answer rather than select<strong>in</strong>g from the<br />
options already provided. Constructed-response items <strong>in</strong>clude completion, also<br />
referred to as fill-<strong>in</strong>-the-blank, <strong>and</strong> essay. Completion items can be answered by<br />
a word, phrase, or number. These are some of the alternate item formats used<br />
on the NCLEX® Exam<strong>in</strong>ation; c<strong>and</strong>idates may be asked to perform a calculation<br />
<strong>and</strong> type <strong>in</strong> the number or to put a list of responses <strong>in</strong> proper order. In this<br />
chapter we describe how to write fill-<strong>in</strong>-the-blank <strong>and</strong> short-answer items. We<br />
also expla<strong>in</strong> how to develop <strong>and</strong> score essay items. With essay items, students<br />
construct responses based on their underst<strong>and</strong><strong>in</strong>g of the content. Essay items<br />
provide an opportunity for students to select content to discuss, present ideas<br />
<strong>in</strong> their own words, <strong>and</strong> develop an orig<strong>in</strong>al <strong>and</strong> a creative response to a question.<br />
We provide an extensive discussion on scor<strong>in</strong>g essay responses.<br />
There is much discussion <strong>in</strong> nurs<strong>in</strong>g <strong>education</strong> about develop<strong>in</strong>g higherlevel<br />
cognitive skills <strong>and</strong> critical th<strong>in</strong>k<strong>in</strong>g ability of students. With higher-level<br />
th<strong>in</strong>k<strong>in</strong>g, students apply concepts, theories, <strong>and</strong> other forms of knowledge to<br />
new situations; use that knowledge to solve patient <strong>and</strong> other types of problems;<br />
<strong>and</strong> arrive at rational <strong>and</strong> well thought-out decisions about actions to take. The<br />
ma<strong>in</strong> pr<strong>in</strong>ciple <strong>in</strong> evaluat<strong>in</strong>g higher-level learn<strong>in</strong>g is to develop test items <strong>and</strong><br />
other evaluation strategies that require students to apply knowledge <strong>and</strong> skills<br />
<strong>in</strong> a new situation; the teacher can then assess if the students are able to use<br />
what they have learned <strong>in</strong> a different context. Chapter 7 presents strategies for<br />
evaluat<strong>in</strong>g higher levels of learn<strong>in</strong>g <strong>in</strong> nurs<strong>in</strong>g. Context-dependent item sets or<br />
<strong>in</strong>terpretative exercises are discussed as one format of <strong>test<strong>in</strong>g</strong> appropriate for
xii<br />
PREFACE<br />
assess<strong>in</strong>g higher-level cognitive skills. Suggestions for develop<strong>in</strong>g them are presented<br />
<strong>in</strong> the chapter, <strong>in</strong>clud<strong>in</strong>g examples of different items. Other methods for<br />
evaluat<strong>in</strong>g cognitive skills <strong>in</strong> nurs<strong>in</strong>g also are presented <strong>in</strong> the chapter: case<br />
method <strong>and</strong> study, unfold<strong>in</strong>g cases, discussions us<strong>in</strong>g higher-level <strong>and</strong> Socratic<br />
question<strong>in</strong>g, debate, multimedia, <strong>and</strong> short written assignments.<br />
Chapter 8 focuses on develop<strong>in</strong>g test items that prepare students for licensure<br />
<strong>and</strong> certification exam<strong>in</strong>ations. The chapter beg<strong>in</strong>s with an explanation of the<br />
NCLEX® Exam<strong>in</strong>ation test plans <strong>and</strong> implications for nurse educators. Examples<br />
are provided of items written at different cognitive levels, thereby avoid<strong>in</strong>g tests<br />
that focus only on recall <strong>and</strong> memorization of facts. The chapter also describes<br />
how to write questions about the nurs<strong>in</strong>g process <strong>and</strong> provides sample stems for<br />
use with those items. The types of items presented <strong>in</strong> the chapter are similar to<br />
those found on the NCLEX® Exam<strong>in</strong>ation <strong>and</strong> many certification tests. By<br />
<strong>in</strong>corporat<strong>in</strong>g these items on tests <strong>in</strong> nurs<strong>in</strong>g courses, students acquire experience<br />
with this type of <strong>test<strong>in</strong>g</strong> as they progress through the program, prepar<strong>in</strong>g them<br />
for tak<strong>in</strong>g licensure <strong>and</strong> certification exam<strong>in</strong>ations as graduates.<br />
Chapter 9 expla<strong>in</strong>s how to assemble <strong>and</strong> adm<strong>in</strong>ister a test. In addition to<br />
prepar<strong>in</strong>g a test bluepr<strong>in</strong>t <strong>and</strong> skillful construction of test items, the f<strong>in</strong>al appearance<br />
of the test <strong>and</strong> the way <strong>in</strong> which it is adm<strong>in</strong>istered can affect the validity<br />
of its results. In chapter 9, test design rules are described; suggestions for reproduc<strong>in</strong>g<br />
the test, ma<strong>in</strong>ta<strong>in</strong><strong>in</strong>g test security, adm<strong>in</strong>ister<strong>in</strong>g it, <strong>and</strong> prevent<strong>in</strong>g cheat<strong>in</strong>g<br />
are presented <strong>in</strong> the chapter as well. We also <strong>in</strong>cluded a section on adm<strong>in</strong>ister<strong>in</strong>g<br />
tests <strong>in</strong> an onl<strong>in</strong>e environment. As more courses <strong>and</strong> programs are offered through<br />
distance <strong>education</strong>, teachers are faced with how to prevent cheat<strong>in</strong>g on an<br />
assessment when they cannot directly observe their students; we discuss different<br />
approaches that can be used for this purpose.<br />
After adm<strong>in</strong>ister<strong>in</strong>g the test, the teacher needs to score it, <strong>in</strong>terpret the<br />
results, <strong>and</strong> then use the results to make varied decisions. Chapter 10 discusses<br />
the processes of obta<strong>in</strong><strong>in</strong>g scores <strong>and</strong> perform<strong>in</strong>g test <strong>and</strong> item analysis. It also<br />
suggests ways <strong>in</strong> which teachers can use posttest discussions to contribute to<br />
student learn<strong>in</strong>g <strong>and</strong> seek student feedback that can lead to test item improvement.<br />
The chapter beg<strong>in</strong>s with a discussion of scor<strong>in</strong>g tests, <strong>in</strong>clud<strong>in</strong>g weight<strong>in</strong>g<br />
items <strong>and</strong> correct<strong>in</strong>g for guess<strong>in</strong>g, then proceeds to item analysis. How to calculate<br />
the difficulty <strong>in</strong>dex <strong>and</strong> discrim<strong>in</strong>ation <strong>in</strong>dex <strong>and</strong> analyze each distractor are<br />
described; perform<strong>in</strong>g an item analysis by h<strong>and</strong> is expla<strong>in</strong>ed with an illustration<br />
for teachers who do not have computer software for this purpose. Teachers often<br />
debate the merits of adjust<strong>in</strong>g test scores by elim<strong>in</strong>at<strong>in</strong>g items or add<strong>in</strong>g po<strong>in</strong>ts<br />
to compensate for real or perceived deficiencies <strong>in</strong> test construction or performance.<br />
We discuss this <strong>in</strong> the chapter <strong>and</strong> provide guidel<strong>in</strong>es for faculty <strong>in</strong><br />
mak<strong>in</strong>g these decisions. A section of the chapter also presents suggestions <strong>and</strong><br />
examples of develop<strong>in</strong>g a test item bank. Many publishers also offer test item
Preface<br />
xiii<br />
banks that relate to the content conta<strong>in</strong>ed <strong>in</strong> their textbooks; we discuss why<br />
faculty need to be cautious about us<strong>in</strong>g these items for their own exam<strong>in</strong>ations.<br />
Through papers <strong>and</strong> other written assignments, students develop an underst<strong>and</strong><strong>in</strong>g<br />
of the content they are writ<strong>in</strong>g about <strong>and</strong> improve ability to communicate<br />
their ideas <strong>in</strong> writ<strong>in</strong>g. Written assignments with feedback from the teacher<br />
help students improve their writ<strong>in</strong>g ability, an important outcome <strong>in</strong> any nurs<strong>in</strong>g<br />
program from the beg<strong>in</strong>n<strong>in</strong>g level through graduate study. Chapter 11 provides<br />
guidel<strong>in</strong>es for develop<strong>in</strong>g <strong>and</strong> evaluat<strong>in</strong>g written assignments <strong>in</strong> nurs<strong>in</strong>g courses.<br />
The chapter <strong>in</strong>cludes criteria for evaluat<strong>in</strong>g papers, an example of a scor<strong>in</strong>g<br />
rubric, <strong>and</strong> suggestions for assess<strong>in</strong>g <strong>and</strong> grad<strong>in</strong>g written assignments.<br />
Through cl<strong>in</strong>ical evaluation, the teacher arrives at judgments about learners’<br />
competencies—their performance <strong>in</strong> practice. Chapter 12 describes the process<br />
of cl<strong>in</strong>ical evaluation <strong>in</strong> nurs<strong>in</strong>g. It beg<strong>in</strong>s with a discussion on the outcomes<br />
of cl<strong>in</strong>ical practice <strong>in</strong> nurs<strong>in</strong>g programs <strong>and</strong> then presents essential concepts<br />
underly<strong>in</strong>g cl<strong>in</strong>ical evaluation. In this chapter we discuss fairness <strong>in</strong> evaluation,<br />
the stress experienced by learners <strong>in</strong> cl<strong>in</strong>ical practice <strong>and</strong> the relationship of<br />
this stress to evaluation, how to build feedback <strong>in</strong>to the evaluation process, <strong>and</strong><br />
how to determ<strong>in</strong>e what to evaluate <strong>in</strong> cl<strong>in</strong>ical courses.<br />
Chapter 13 builds on concepts of cl<strong>in</strong>ical evaluation exam<strong>in</strong>ed <strong>in</strong> the<br />
preced<strong>in</strong>g chapter. Many evaluation methods are available for assess<strong>in</strong>g competencies<br />
<strong>in</strong> cl<strong>in</strong>ical practice. We discuss observation <strong>and</strong> record<strong>in</strong>g observations<br />
<strong>in</strong> anecdotal notes, checklists, <strong>and</strong> rat<strong>in</strong>g scales; simulations <strong>in</strong>clud<strong>in</strong>g patient<br />
simulators, st<strong>and</strong>ardized patients, <strong>and</strong> structured cl<strong>in</strong>ical exam<strong>in</strong>ations; written<br />
assignments useful for cl<strong>in</strong>ical evaluation such as journals, nurs<strong>in</strong>g care plans,<br />
concept maps, case analyses, <strong>and</strong> short papers; portfolio assessment <strong>and</strong> how to<br />
set up a portfolio system for cl<strong>in</strong>ical evaluation, <strong>in</strong>clud<strong>in</strong>g an electronic portfolio;<br />
<strong>and</strong> other methods such as conference, group projects, <strong>and</strong> self-evaluation. The<br />
chapter <strong>in</strong>cludes a sample form for evaluat<strong>in</strong>g student participation <strong>in</strong> cl<strong>in</strong>ical<br />
conferences <strong>and</strong> a rubric for peer evaluation of participation <strong>in</strong> group projects.<br />
Because most nurs<strong>in</strong>g <strong>education</strong> programs use rat<strong>in</strong>g scales for cl<strong>in</strong>ical evaluation,<br />
we have collected a variety of cl<strong>in</strong>ical evaluation forms from associate degree,<br />
baccalaureate, <strong>and</strong> graduate nurs<strong>in</strong>g programs for review by readers; these are<br />
found <strong>in</strong> Appendix A.<br />
Chapter 14 explores social, ethical, <strong>and</strong> legal issues associated with <strong>test<strong>in</strong>g</strong><br />
<strong>and</strong> evaluation. Social issues such as test bias, grade <strong>in</strong>flation, effects of <strong>test<strong>in</strong>g</strong><br />
on self-esteem, <strong>and</strong> test anxiety are discussed. Ethical issues <strong>in</strong>clude privacy <strong>and</strong><br />
access to test results. By underst<strong>and</strong><strong>in</strong>g <strong>and</strong> apply<strong>in</strong>g codes for the responsible<br />
<strong>and</strong> ethical use of tests, teachers can assure the proper use of assessment procedures<br />
<strong>and</strong> the valid <strong>in</strong>terpretation of test results. We <strong>in</strong>clude several of these codes <strong>in</strong><br />
the Appendices. We also discuss selected legal issues associated with <strong>test<strong>in</strong>g</strong>.<br />
In chapter 15, the discussion focuses on how to <strong>in</strong>terpret the mean<strong>in</strong>g of<br />
test scores. Basic statistical concepts are presented <strong>and</strong> used for criterion- <strong>and</strong><br />
norm-referenced <strong>in</strong>terpretations of teacher-made <strong>and</strong> st<strong>and</strong>ardized test results.
xiv<br />
PREFACE<br />
Grad<strong>in</strong>g is the use of symbols, such as the letters A through F or pass-fail,<br />
to report student achievement. Grad<strong>in</strong>g is for summative purposes, <strong>in</strong>dicat<strong>in</strong>g<br />
how well the student met the outcomes of the course <strong>and</strong> cl<strong>in</strong>ical practicum.<br />
To represent valid judgments about student achievement, grades should be based<br />
on sound evaluation practices, reliable test results, <strong>and</strong> multiple evaluation<br />
measures. Chapter 16 exam<strong>in</strong>es the uses of grades <strong>in</strong> nurs<strong>in</strong>g programs, criticisms<br />
of grades, types of grad<strong>in</strong>g systems, assign<strong>in</strong>g letter grades, select<strong>in</strong>g a grad<strong>in</strong>g<br />
framework, how to calculate grades with each of these frameworks, <strong>and</strong> how to<br />
use a spreadsheet application <strong>and</strong> course management system for calculat<strong>in</strong>g<br />
grades. We also discuss grad<strong>in</strong>g cl<strong>in</strong>ical practice, us<strong>in</strong>g pass-fail <strong>and</strong> other systems<br />
for grad<strong>in</strong>g, <strong>and</strong> provide guidel<strong>in</strong>es for the teacher when students have the<br />
potential for fail<strong>in</strong>g a cl<strong>in</strong>ical practicum.<br />
Program evaluation is the process of judg<strong>in</strong>g the worth or value of an<br />
<strong>education</strong>al program. With the dem<strong>and</strong> for high quality programs, the development<br />
of newer models for the delivery of higher <strong>education</strong>, such as Web-based<br />
<strong>in</strong>struction, <strong>and</strong> public calls for accountability, there has been a greater emphasis<br />
on systematic <strong>and</strong> ongo<strong>in</strong>g program evaluation. Thus, chapter 17 presents an<br />
overview of program evaluation models <strong>and</strong> discusses evaluation of selected<br />
program components, <strong>in</strong>clud<strong>in</strong>g curriculum, outcomes, <strong>and</strong> teach<strong>in</strong>g.<br />
The authors wish to acknowledge Ruth Chasek for her encouragement,<br />
patience, <strong>and</strong> editorial assistance; Pamela Lankas, for her assistance dur<strong>in</strong>g production;<br />
<strong>and</strong> Nancy A. Wilmes, Librarian at the Science <strong>and</strong> Eng<strong>in</strong>eer<strong>in</strong>g Library,<br />
Wayne State University, who assisted us <strong>in</strong> locat<strong>in</strong>g <strong>in</strong>formation <strong>and</strong> checked<br />
many references for us. We also thank Spr<strong>in</strong>ger Publish<strong>in</strong>g Company for its<br />
cont<strong>in</strong>ued support of nurs<strong>in</strong>g <strong>education</strong>.<br />
Marilyn H. Oermann<br />
Kathleen B. Gaberson
Chapter 1<br />
<strong>Evaluation</strong>, Measurement, <strong>and</strong> the<br />
Educational Process<br />
In all areas of nurs<strong>in</strong>g <strong>education</strong> <strong>and</strong> practice, evaluation is an important<br />
process to measure learn<strong>in</strong>g <strong>and</strong> other outcomes, judge performance, determ<strong>in</strong>e<br />
competence to practice, <strong>and</strong> arrive at other decisions about students <strong>and</strong> nurses.<br />
<strong>Evaluation</strong> is <strong>in</strong>tegral to monitor<strong>in</strong>g the quality of <strong>education</strong>al <strong>and</strong> health care<br />
programs. By evaluat<strong>in</strong>g outcomes achieved by students, graduates, <strong>and</strong> clients,<br />
the effectiveness of programs can be measured <strong>and</strong> decisions can be made about<br />
needed improvements.<br />
<strong>Evaluation</strong> provides a means of ensur<strong>in</strong>g accountability for the quality of<br />
<strong>education</strong> <strong>and</strong> services provided. Nurses, similar to other health professionals,<br />
are accountable to their clients <strong>and</strong> society <strong>in</strong> general for meet<strong>in</strong>g their health<br />
needs. Along the same l<strong>in</strong>e, nurse educators are accountable for the quality of<br />
teach<strong>in</strong>g provided to learners, outcomes achieved, <strong>and</strong> overall effectiveness of<br />
programs that prepare graduates to meet the health needs of society. Educational<br />
<strong>in</strong>stitutions are also accountable to their govern<strong>in</strong>g bodies <strong>and</strong> society <strong>in</strong> terms<br />
of educat<strong>in</strong>g graduates for present <strong>and</strong> future roles. Through evaluation, nurs<strong>in</strong>g<br />
faculty <strong>and</strong> other health professionals can measure the quality of their teach<strong>in</strong>g<br />
<strong>and</strong> programs as well as document outcomes for others to review. All educators,<br />
regardless of the sett<strong>in</strong>g, need to be knowledgeable about the evaluation process,<br />
measurement, <strong>test<strong>in</strong>g</strong>, assessment, <strong>and</strong> other related concepts.<br />
EVALUATION<br />
<strong>Evaluation</strong> is the process of collect<strong>in</strong>g <strong>and</strong> <strong>in</strong>terpret<strong>in</strong>g <strong>in</strong>formation for mak<strong>in</strong>g<br />
judgments about student learn<strong>in</strong>g <strong>and</strong> achievement, cl<strong>in</strong>ical performance, employee<br />
competence, <strong>and</strong> <strong>education</strong>al programs, among others. In nurs<strong>in</strong>g educa-<br />
1
2 EVALUATION AND TESTING IN NURSING EDUCATION<br />
tion, evaluation typically takes the form of judg<strong>in</strong>g student atta<strong>in</strong>ment of the<br />
<strong>education</strong>al objectives <strong>and</strong> goals <strong>in</strong> the classroom <strong>and</strong> the quality of student<br />
performance <strong>in</strong> the cl<strong>in</strong>ical sett<strong>in</strong>g. Educational experiences produce changes <strong>in</strong><br />
the learner; evaluation provides the means to assess those changes. With this<br />
evaluation, learn<strong>in</strong>g outcomes are measured, further <strong>education</strong>al needs are identified,<br />
<strong>and</strong> additional <strong>in</strong>struction can be provided to assist students <strong>in</strong> their learn<strong>in</strong>g<br />
<strong>and</strong> <strong>in</strong> develop<strong>in</strong>g competencies for practice. Similarly, evaluation of employees<br />
provides <strong>in</strong>formation on their performance at varied po<strong>in</strong>ts <strong>in</strong> time as a basis<br />
for judg<strong>in</strong>g their competence.<br />
<strong>Evaluation</strong> extends beyond a test score or cl<strong>in</strong>ical rat<strong>in</strong>g. In evaluat<strong>in</strong>g<br />
learners, teachers judge the merits of the learn<strong>in</strong>g <strong>and</strong> performance based on<br />
the data. <strong>Evaluation</strong> <strong>in</strong>volves mak<strong>in</strong>g value judgments about learners; <strong>in</strong> fact,<br />
value is part of the word evaluation. Questions such as “How well did the student<br />
perform?” <strong>and</strong> “Is the student competent <strong>in</strong> cl<strong>in</strong>ical practice?” are answered by<br />
the evaluation process. The teacher collects <strong>and</strong> analyzes data about the student’s<br />
performance, then makes a value judgment about the quality of that performance.<br />
In terms of <strong>education</strong>al programs, evaluation <strong>in</strong>cludes collect<strong>in</strong>g <strong>in</strong>formation<br />
prior to develop<strong>in</strong>g the program, dur<strong>in</strong>g the process of program development to<br />
provide a basis for ongo<strong>in</strong>g revision, <strong>and</strong> after implement<strong>in</strong>g the program to<br />
determ<strong>in</strong>e its effectiveness. With program evaluation, faculty collect data about<br />
their students, alumni, curriculum, <strong>and</strong> other dimensions of the program for the<br />
purpose of document<strong>in</strong>g the program outcomes <strong>and</strong> for mak<strong>in</strong>g sound decisions<br />
about curriculum revision. As faculty measure outcomes for accreditation <strong>and</strong><br />
evaluate their courses <strong>and</strong> curriculum, they are engag<strong>in</strong>g <strong>in</strong> program evaluation.<br />
While many of the concepts described <strong>in</strong> this book are applicable to program<br />
evaluation, the focus <strong>in</strong>stead is on evaluat<strong>in</strong>g learners, <strong>in</strong>clud<strong>in</strong>g students <strong>in</strong> all<br />
types <strong>and</strong> levels of nurs<strong>in</strong>g programs <strong>and</strong> nurses <strong>in</strong> health care sett<strong>in</strong>gs. The term<br />
“students” is used broadly to reflect both of these groups of learners.<br />
<strong>Evaluation</strong> <strong>and</strong> Instruction<br />
Through evaluation the teacher determ<strong>in</strong>es the progress of students toward<br />
meet<strong>in</strong>g the objectives <strong>and</strong> develop<strong>in</strong>g cl<strong>in</strong>ical practice competencies <strong>and</strong> their<br />
achievement of them. The data that the teacher collects, through classroom<br />
<strong>test<strong>in</strong>g</strong>, observations <strong>in</strong> cl<strong>in</strong>ical practice, <strong>and</strong> other assessment strategies, provide<br />
the basis for further <strong>in</strong>struction. Figure 1.1 demonstrates the relationship between<br />
evaluation <strong>and</strong> <strong>in</strong>struction. The objectives specify the <strong>in</strong>tended learn<strong>in</strong>g outcomes;<br />
these may be designated for atta<strong>in</strong>ment <strong>in</strong> the classroom, cl<strong>in</strong>ical sett<strong>in</strong>g,<br />
learn<strong>in</strong>g laboratory, onl<strong>in</strong>e environment, or other sett<strong>in</strong>g. Follow<strong>in</strong>g assessment<br />
of learner needs to determ<strong>in</strong>e gaps <strong>in</strong> learn<strong>in</strong>g <strong>and</strong> cl<strong>in</strong>ical competency, the
<strong>Evaluation</strong>, Measurement, <strong>and</strong> Education 3<br />
FIGURE 1.1<br />
Relationship of evaluation <strong>and</strong> <strong>in</strong>struction.<br />
teacher selects teach<strong>in</strong>g methods <strong>and</strong> plans cl<strong>in</strong>ical activities to meet those<br />
needs. This phase of the <strong>in</strong>structional process <strong>in</strong>cludes develop<strong>in</strong>g a plan for<br />
learn<strong>in</strong>g, select<strong>in</strong>g learn<strong>in</strong>g activities, <strong>and</strong> teach<strong>in</strong>g learners <strong>in</strong> varied sett<strong>in</strong>gs.<br />
The rema<strong>in</strong><strong>in</strong>g components of the <strong>in</strong>structional process relate to evaluation.<br />
With formative evaluation, dur<strong>in</strong>g this process, the teacher assesses student<br />
progress toward meet<strong>in</strong>g the objectives <strong>and</strong> demonstrat<strong>in</strong>g competency <strong>in</strong> cl<strong>in</strong>ical<br />
practice. This type of evaluation is displayed with a feedback loop to <strong>in</strong>struction.<br />
Formative evaluation provides <strong>in</strong>formation about further learn<strong>in</strong>g needs of students<br />
<strong>and</strong> where additional <strong>in</strong>struction is needed. This feedback to students<br />
re<strong>in</strong>forces successful learn<strong>in</strong>g <strong>and</strong> identifies learn<strong>in</strong>g errors that need correction<br />
(L<strong>in</strong>n & Gronlund, 2000). Summative evaluation, at the end of the <strong>in</strong>struction,<br />
determ<strong>in</strong>es if the objectives have been achieved <strong>and</strong> competencies developed.<br />
This model of teach<strong>in</strong>g has been called the cont<strong>in</strong>uous feedback model<br />
because it <strong>in</strong>corporates formative evaluation for provid<strong>in</strong>g feedback, focused
4 EVALUATION AND TESTING IN NURSING EDUCATION<br />
<strong>in</strong>struction to fill gaps <strong>in</strong> learn<strong>in</strong>g so students achieve mastery, <strong>and</strong> summative<br />
measures that are usually limited <strong>in</strong> number (Mertler, 2003). Another model<br />
connect<strong>in</strong>g evaluation <strong>and</strong> <strong>in</strong>struction is the time-restricted model <strong>in</strong> which<br />
evaluation occurs only after a segment of <strong>in</strong>struction is completed. In this model<br />
teachers present new content <strong>and</strong> guide learners <strong>in</strong> acquir<strong>in</strong>g new cl<strong>in</strong>ical knowledge<br />
<strong>and</strong> skills, students complete related learn<strong>in</strong>g activities, <strong>and</strong> students then<br />
are evaluated periodically, for example, at midterm <strong>and</strong> the end of the term. A<br />
third model, <strong>in</strong>tegrated assessment, focuses on problem solv<strong>in</strong>g <strong>and</strong> critical th<strong>in</strong>k<strong>in</strong>g<br />
as the outcomes, with the teacher facilitat<strong>in</strong>g <strong>and</strong> guid<strong>in</strong>g learn<strong>in</strong>g rather<br />
than present<strong>in</strong>g <strong>in</strong>formation to a group of learners (Mertler). <strong>Evaluation</strong> is<br />
<strong>in</strong>tegrated throughout the <strong>in</strong>struction as a means of assess<strong>in</strong>g students’ ability<br />
to problem solve. In this model students learn ma<strong>in</strong>ly <strong>in</strong> small groups.<br />
Formative <strong>Evaluation</strong><br />
<strong>Evaluation</strong> fulfills two major roles <strong>in</strong> the classroom <strong>and</strong> cl<strong>in</strong>ical sett<strong>in</strong>g: formative<br />
<strong>and</strong> summative. Formative evaluation is feedback to learners about their progress<br />
<strong>in</strong> meet<strong>in</strong>g the objectives <strong>and</strong> develop<strong>in</strong>g competencies for practice. It occurs<br />
throughout the <strong>in</strong>structional process <strong>and</strong> provides a basis for determ<strong>in</strong><strong>in</strong>g where<br />
further learn<strong>in</strong>g is needed. As such, formative evaluation is diagnostic, provid<strong>in</strong>g<br />
<strong>in</strong>formation to the teacher <strong>and</strong> learner about the progress be<strong>in</strong>g made <strong>in</strong> meet<strong>in</strong>g<br />
the learn<strong>in</strong>g objectives.<br />
In the classroom, formative <strong>in</strong>formation may be collected by teacher observation<br />
<strong>and</strong> question<strong>in</strong>g of students, diagnostic quizzes, small group activities,<br />
written assignments, <strong>and</strong> other activities that students complete <strong>in</strong> <strong>and</strong> out of<br />
class. In cl<strong>in</strong>ical practice, formative evaluation is an <strong>in</strong>tegral part of the <strong>in</strong>structional<br />
process. The teacher cont<strong>in</strong>ually makes observations of students as they<br />
learn to provide patient care, questions them about their underst<strong>and</strong><strong>in</strong>g <strong>and</strong><br />
cl<strong>in</strong>ical decisions, discusses these observations <strong>and</strong> judgments with them, <strong>and</strong><br />
guides them <strong>in</strong> how to improve performance. With formative evaluation the<br />
teacher gives feedback to learners about their progress <strong>in</strong> achiev<strong>in</strong>g the goals of<br />
cl<strong>in</strong>ical practice <strong>and</strong> how they can further develop their knowledge <strong>and</strong> skills.<br />
Anecdotal notes are commonly used to record the teacher’s observations <strong>and</strong><br />
judgments <strong>and</strong> to communicate them to the students.<br />
Consider<strong>in</strong>g that the purpose of formative evaluation is to provide feedback,<br />
typically formative evaluation is not graded. S<strong>in</strong>ce formative evaluation is “directed<br />
toward improv<strong>in</strong>g learn<strong>in</strong>g <strong>and</strong> <strong>in</strong>struction, the results typically are not<br />
used for assign<strong>in</strong>g course grades” (L<strong>in</strong>n & Gronlund, 2000, p. 41). Teachers<br />
should remember that the purpose of formative evaluation is to assess where<br />
further learn<strong>in</strong>g is needed <strong>and</strong> to guide cont<strong>in</strong>ued teach<strong>in</strong>g <strong>and</strong> learn<strong>in</strong>g.
<strong>Evaluation</strong>, Measurement, <strong>and</strong> Education 5<br />
Summative <strong>Evaluation</strong><br />
Summative evaluation, on the other h<strong>and</strong>, is end-of-<strong>in</strong>struction evaluation designed<br />
to determ<strong>in</strong>e what the student has learned <strong>in</strong> the classroom or cl<strong>in</strong>ical<br />
sett<strong>in</strong>g. Summative evaluation provides <strong>in</strong>formation on the extent to which<br />
objectives were achieved, not on the progress of the learner <strong>in</strong> meet<strong>in</strong>g them.<br />
As such, summative evaluation occurs at the end of the learn<strong>in</strong>g process, for<br />
<strong>in</strong>stance, the end of a course, to determ<strong>in</strong>e the student’s grade <strong>and</strong> certify<br />
competence. Although formative evaluation occurs throughout the learn<strong>in</strong>g<br />
experience, for example, daily, summative evaluation is conducted on a periodic<br />
basis, for <strong>in</strong>stance, every few weeks or at the midterm <strong>and</strong> f<strong>in</strong>al evaluation<br />
periods. This type of evaluation is “f<strong>in</strong>al” <strong>in</strong> nature <strong>and</strong> serves as a basis for<br />
grad<strong>in</strong>g <strong>and</strong> other high-stakes decisions.<br />
Summative evaluation typically assesses broader content areas than formative<br />
evaluation, which tends to be more specific <strong>in</strong> terms of the content evaluated.<br />
Methods commonly used for summative evaluation <strong>in</strong> the classroom <strong>in</strong>clude<br />
tests, term papers, <strong>and</strong> other types of projects. In cl<strong>in</strong>ical practice, rat<strong>in</strong>g scales,<br />
written assignments, portfolios, projects completed about cl<strong>in</strong>ical experiences,<br />
<strong>and</strong> other performance measures may be used.<br />
Both formative <strong>and</strong> summative evaluation are essential components of most<br />
nurs<strong>in</strong>g courses. However, because formative evaluation represents feedback to<br />
learners with the goal of improv<strong>in</strong>g learn<strong>in</strong>g, it should be a major part of any<br />
nurs<strong>in</strong>g course. By provid<strong>in</strong>g feedback on a cont<strong>in</strong>ual basis <strong>and</strong> l<strong>in</strong>k<strong>in</strong>g that<br />
feedback with further <strong>in</strong>struction, the teacher can assist students <strong>in</strong> develop<strong>in</strong>g<br />
the knowledge <strong>and</strong> skills they lack.<br />
MEASUREMENT<br />
Measurement is the process of assign<strong>in</strong>g numbers to represent student achievement<br />
or performance accord<strong>in</strong>g to certa<strong>in</strong> rules, for <strong>in</strong>stance, answer<strong>in</strong>g 85 out<br />
of 100 items correctly on a test. Measurement answers the question “how much?”<br />
(L<strong>in</strong>n & Gronlund, 2000). In contrast to evaluation, measurement does not<br />
imply value judgments about the quality of the results. Measurement is important<br />
for describ<strong>in</strong>g the achievement of learners on nurs<strong>in</strong>g <strong>and</strong> other tests, but not<br />
all outcomes important <strong>in</strong> nurs<strong>in</strong>g practice can be measured by <strong>test<strong>in</strong>g</strong>. Many<br />
outcomes are evaluated qualitatively through other means, such as observations<br />
of performance.<br />
While measurement <strong>in</strong>volves assign<strong>in</strong>g numbers to reflect learn<strong>in</strong>g, these<br />
numbers <strong>in</strong> <strong>and</strong> of themselves have no mean<strong>in</strong>g. Scor<strong>in</strong>g 15 on a test means<br />
noth<strong>in</strong>g unless it is referenced or compared with other students’ scores or to a
6 EVALUATION AND TESTING IN NURSING EDUCATION<br />
predeterm<strong>in</strong>ed st<strong>and</strong>ard. Perhaps 15 was the highest or lowest score on the test<br />
compared with other students. Or, the student might have set a personal goal<br />
of achiev<strong>in</strong>g 15 on the test; thus meet<strong>in</strong>g this goal is more important than how<br />
others scored on the test. Another <strong>in</strong>terpretation is that a score of 15 might<br />
reflect the st<strong>and</strong>ard expected of this particular group of learners. Hav<strong>in</strong>g a<br />
reference po<strong>in</strong>t with which to compare a particular test score is essential to give<br />
it mean<strong>in</strong>g <strong>and</strong> to <strong>in</strong>terpret the score.<br />
In cl<strong>in</strong>ical practice, how does a learner’s performance compare with that<br />
of others <strong>in</strong> the group? Does the learner meet the cl<strong>in</strong>ical objectives <strong>and</strong> possess<br />
certa<strong>in</strong> competencies regardless of how other students <strong>in</strong> the group perform<br />
<strong>in</strong> cl<strong>in</strong>ical practice? Answers to these questions depend on the basis used for<br />
<strong>in</strong>terpret<strong>in</strong>g cl<strong>in</strong>ical performance, similar to <strong>in</strong>terpret<strong>in</strong>g test scores.<br />
Norm-Referenced Interpretation<br />
There are two ma<strong>in</strong> ways of <strong>in</strong>terpret<strong>in</strong>g test scores <strong>and</strong> other types of evaluation<br />
conducted <strong>in</strong> the cl<strong>in</strong>ical sett<strong>in</strong>g, norm- <strong>and</strong> criterion-referenced. In normreferenced<br />
<strong>in</strong>terpretation, test scores <strong>and</strong> other evaluation data are referenced<br />
to a norm group. Norm-referenced <strong>in</strong>terpretation compares a student’s test scores<br />
with those of others <strong>in</strong> the class or with some other relevant group. The student’s<br />
score may be described as below or above average or at a certa<strong>in</strong> rank <strong>in</strong> the<br />
class. Norm-referenced <strong>in</strong>terpretations are limited because they do not <strong>in</strong>dicate<br />
what the student can <strong>and</strong> cannot do (Oosterhof, 2001).<br />
In cl<strong>in</strong>ical sett<strong>in</strong>gs, norm-referenced <strong>in</strong>terpretations compare the student’s<br />
cl<strong>in</strong>ical performance with those of a group of learners, <strong>in</strong>dicat<strong>in</strong>g that the student<br />
has more or less cl<strong>in</strong>ical competence than others <strong>in</strong> the group. A cl<strong>in</strong>ical evaluation<br />
<strong>in</strong>strument <strong>in</strong> which student performance is rated on a scale of below- to<br />
above-average reflects a norm-referenced system.<br />
Criterion-Referenced Interpretation<br />
Criterion-referenced <strong>in</strong>terpretation, on the other h<strong>and</strong>, <strong>in</strong>volves <strong>in</strong>terpret<strong>in</strong>g<br />
scores based on preset criteria, not <strong>in</strong> relation to the group of learners. With<br />
this type of measurement, an <strong>in</strong>dividual score is compared to a preset st<strong>and</strong>ard<br />
or criterion. The concern is how well the student performed <strong>and</strong> what the student<br />
can do regardless of the performance of other learners. Criterion-referenced<br />
<strong>in</strong>terpretations may (a) describe the specific learn<strong>in</strong>g tasks a student can perform,<br />
e.g., def<strong>in</strong>e medical terms; (b) <strong>in</strong>dicate the percentage of tasks performed or<br />
items answered correctly, e.g., def<strong>in</strong>e correctly 80% of the terms, <strong>and</strong> (c) compare<br />
performance aga<strong>in</strong>st a set st<strong>and</strong>ard <strong>and</strong> decide whether the student met that
<strong>Evaluation</strong>, Measurement, <strong>and</strong> Education 7<br />
st<strong>and</strong>ard, e.g., met the medical term<strong>in</strong>ology competency (L<strong>in</strong>n & Gronlund,<br />
2000, p. 43). Criterion-referenced <strong>in</strong>terpretation determ<strong>in</strong>es how well the student<br />
performed at the end of the <strong>in</strong>struction <strong>in</strong> comparison with the objectives <strong>and</strong><br />
competencies to be achieved.<br />
With criterion-referenced cl<strong>in</strong>ical evaluation, student performance is compared<br />
aga<strong>in</strong>st preset criteria. In some nurs<strong>in</strong>g courses these criteria are the cl<strong>in</strong>ical<br />
objectives to be met <strong>in</strong> the course. Other courses <strong>in</strong>dicate competencies to be<br />
demonstrated <strong>in</strong> cl<strong>in</strong>ical practice, which are then used as the st<strong>and</strong>ards for<br />
evaluation. Rather than compar<strong>in</strong>g the performance of the student to others <strong>in</strong><br />
the group, <strong>and</strong> <strong>in</strong>dicat<strong>in</strong>g that the student was above or below the average of<br />
the group, <strong>in</strong> criterion-referenced cl<strong>in</strong>ical evaluation, performance is measured<br />
aga<strong>in</strong>st the objectives or competencies to be demonstrated. The concern with<br />
criterion-referenced cl<strong>in</strong>ical evaluation is whether students achieved the cl<strong>in</strong>ical<br />
objectives or demonstrated the competencies, not how well they performed <strong>in</strong><br />
comparison to the other students.<br />
TESTING<br />
Tests are one form of measurement. A test is a set of questions to which the<br />
student responds <strong>in</strong> written or oral form. With tests the questions are adm<strong>in</strong>istered<br />
dur<strong>in</strong>g a fixed period of time under comparable conditions for all students (L<strong>in</strong>n &<br />
Gronlund, 2000). While students often dread tests, <strong>in</strong>formation from tests enables<br />
faculty to make important decisions about students, the <strong>in</strong>struction, <strong>and</strong><br />
programs.<br />
First, test results provide a basis for grad<strong>in</strong>g. For some nurs<strong>in</strong>g courses, tests<br />
may be the only source of <strong>in</strong>formation for determ<strong>in</strong><strong>in</strong>g grades, but <strong>in</strong> most<br />
courses, other means for grad<strong>in</strong>g student performance also are used, such as<br />
papers, projects, <strong>and</strong> other assignments. Tests are typically the primary means<br />
for arriv<strong>in</strong>g at grades <strong>in</strong> nurs<strong>in</strong>g courses, particularly <strong>in</strong> undergraduate programs.<br />
Second, tests may be used to identify the student’s knowledge <strong>and</strong> skills<br />
prior to the <strong>in</strong>struction, which enables the teacher to gear the <strong>in</strong>struction to<br />
the learner’s needs. Mertler (2003) referred to this pre<strong>test<strong>in</strong>g</strong> as diagnostic<br />
assessment. The test results <strong>in</strong>dicate gaps <strong>in</strong> learn<strong>in</strong>g that should be addressed<br />
first <strong>and</strong> knowledge <strong>and</strong> skills already acquired. With this <strong>in</strong>formation teachers<br />
can plan their <strong>in</strong>struction better. When teachers are work<strong>in</strong>g with large groups<br />
of students, it is difficult to gear the <strong>in</strong>struction to meet each student’s needs.<br />
However, the teacher can use diagnostic tests to reveal content areas for which<br />
<strong>in</strong>dividual learners may be lack<strong>in</strong>g <strong>and</strong> then suggest remedial learn<strong>in</strong>g activities.<br />
Third, tests are used for selection <strong>and</strong> fourth, placement of students <strong>in</strong><br />
nurs<strong>in</strong>g programs. In some nurs<strong>in</strong>g programs, students take tests for admission
8 EVALUATION AND TESTING IN NURSING EDUCATION<br />
to the program, provid<strong>in</strong>g data for faculty to accept or reject applicants. For<br />
example, the National League for Nurs<strong>in</strong>g (NLN) offers pre-admission exam<strong>in</strong>ations<br />
for registered nurse (RN) <strong>and</strong> licensed practical nurse (LPN) programs.<br />
Test results provide norms that allow comparison of the applicant’s performance<br />
with those of other applicants (National League for Nurs<strong>in</strong>g, 2003). Tests also<br />
may be used to place students <strong>in</strong>to appropriate courses. Placement tests, taken<br />
after the <strong>in</strong>dividual has been admitted, provide data for determ<strong>in</strong><strong>in</strong>g which<br />
courses students should complete <strong>in</strong> their programs of study. For example, a<br />
diagnostic test of math skills may determ<strong>in</strong>e whether a nurs<strong>in</strong>g student is required<br />
to take a medication dosage calculation course.<br />
Fifth, test results provide clues as to content areas that students learned or did<br />
not learn <strong>in</strong> a course. With this <strong>in</strong>formation, faculty can modify the <strong>in</strong>struction to<br />
better meet student learn<strong>in</strong>g needs. Student responses to higher-level test questions,<br />
such as ones measur<strong>in</strong>g critical th<strong>in</strong>k<strong>in</strong>g, may suggest changes <strong>in</strong> teach<strong>in</strong>g<br />
strategies to encourage development of cognitive skills rather than memorization<br />
of facts <strong>and</strong> pr<strong>in</strong>ciples.<br />
Last, <strong>test<strong>in</strong>g</strong> may be an <strong>in</strong>tegral part of the curriculum <strong>and</strong> program evaluation<br />
<strong>in</strong> a school of nurs<strong>in</strong>g. Students may complete tests to measure program<br />
outcomes rather than to document what was learned <strong>in</strong> a course. Test results<br />
for this purpose often suggest areas of the curriculum for revision.<br />
ASSESSMENT<br />
Even though a test is a one-time measure, <strong>education</strong>al assessment <strong>in</strong>volves the<br />
collection of data about learners <strong>and</strong> programs over a period of time. Through<br />
assessment the teacher exam<strong>in</strong>es students’ performance over time, for example,<br />
cover<strong>in</strong>g the length of a semester or a cl<strong>in</strong>ical rotation. Assessment <strong>in</strong>cludes a<br />
variety of techniques to obta<strong>in</strong> <strong>in</strong>formation about students’ performance (L<strong>in</strong>n &<br />
Gronlund, 2000). These techniques <strong>in</strong>clude not only tests but also other strategies<br />
for evaluat<strong>in</strong>g learn<strong>in</strong>g such as papers, other types of written assignments, projects,<br />
portfolios, <strong>and</strong> conferences, among others.<br />
Assessment is broader <strong>and</strong> more <strong>in</strong>clusive than measurement <strong>and</strong> <strong>test<strong>in</strong>g</strong>.<br />
Assessment <strong>in</strong>volves all of the strategies planned for determ<strong>in</strong><strong>in</strong>g student performance<br />
<strong>in</strong> a course. Through assessment teachers systemically collect data about<br />
performance <strong>in</strong> the classroom, learn<strong>in</strong>g laboratory, <strong>and</strong> cl<strong>in</strong>ical sett<strong>in</strong>g, provid<strong>in</strong>g<br />
<strong>in</strong>formation for mak<strong>in</strong>g decisions about course grades <strong>and</strong> cl<strong>in</strong>ical competence.<br />
OBJECTIVES FOR EVALUATION AND TESTING<br />
Objectives play a role <strong>in</strong> teach<strong>in</strong>g students <strong>in</strong> varied sett<strong>in</strong>gs <strong>in</strong> nurs<strong>in</strong>g. They<br />
provide guidel<strong>in</strong>es for student learn<strong>in</strong>g <strong>and</strong> <strong>in</strong>struction <strong>and</strong> a basis for evaluat<strong>in</strong>g
<strong>Evaluation</strong>, Measurement, <strong>and</strong> Education 9<br />
learn<strong>in</strong>g. The objectives represent the outcomes of learn<strong>in</strong>g; these outcomes may<br />
<strong>in</strong>clude the acquisition of knowledge, development of values, <strong>and</strong> performance of<br />
psychomotor <strong>and</strong> technological skills. <strong>Evaluation</strong> serves to determ<strong>in</strong>e the extent<br />
<strong>and</strong> quality of the student’s learn<strong>in</strong>g <strong>in</strong> relation to these outcomes. This does<br />
not mean that the teacher is unconcerned about learn<strong>in</strong>g that occurs <strong>and</strong> is not<br />
expressed as outcomes. Many students will acquire knowledge, values, <strong>and</strong> skills<br />
beyond those expressed <strong>in</strong> the objectives, but the evaluation methods planned<br />
by the teacher focus on the preset outcomes to be met by students.<br />
To develop test items, cl<strong>in</strong>ical evaluation strategies, <strong>and</strong> other evaluation<br />
methods, teachers needs a clear description of what to evaluate. The knowledge,<br />
values, <strong>and</strong> skills to be evaluated are specified by the outcomes of the course<br />
<strong>and</strong> cl<strong>in</strong>ical practicum. These provide the basis for evaluat<strong>in</strong>g learn<strong>in</strong>g <strong>in</strong> the<br />
classroom, practice laboratories, <strong>and</strong> cl<strong>in</strong>ical sett<strong>in</strong>g.<br />
Writ<strong>in</strong>g Objectives<br />
In develop<strong>in</strong>g <strong>in</strong>structional objectives, there are two important dimensions. The<br />
first is the actual technique for writ<strong>in</strong>g objectives <strong>and</strong> the second is decid<strong>in</strong>g on<br />
their complexity. The predom<strong>in</strong>ant format for writ<strong>in</strong>g objectives <strong>in</strong> past years<br />
was to develop a highly specific objective that <strong>in</strong>cluded (a) a description of the<br />
learner, (b) behaviors the learner would exhibit at the end of the <strong>in</strong>struction,<br />
(c) conditions under which the behavior would be demonstrated, <strong>and</strong> (d) the<br />
st<strong>and</strong>ard of performance. An example of this format for an objective is: Given a<br />
written nurs<strong>in</strong>g assessment, the student identifies <strong>in</strong> writ<strong>in</strong>g two nurs<strong>in</strong>g diagnoses<br />
with support<strong>in</strong>g rationale. This objective <strong>in</strong>cludes the follow<strong>in</strong>g components:<br />
Learner:<br />
Behavior:<br />
Conditions:<br />
St<strong>and</strong>ard:<br />
Student<br />
Identifies <strong>in</strong> writ<strong>in</strong>g nurs<strong>in</strong>g diagnoses<br />
Given a written nurs<strong>in</strong>g assessment<br />
Two nurs<strong>in</strong>g diagnoses must be identified with support<strong>in</strong>g rationale.<br />
It is clear from this example that specific <strong>in</strong>structional objectives are too prescriptive<br />
for use <strong>in</strong> nurs<strong>in</strong>g. The complexity of learn<strong>in</strong>g expected <strong>in</strong> a nurs<strong>in</strong>g program<br />
makes it difficult to use such a system for specify<strong>in</strong>g the objectives. L<strong>in</strong>n <strong>and</strong><br />
Gronlund (2000) also suggested that highly specific objectives are concerned<br />
ma<strong>in</strong>ly with simple knowledge <strong>and</strong> skills, mak<strong>in</strong>g them <strong>in</strong>appropriate for nurs<strong>in</strong>g<br />
courses that require problem solv<strong>in</strong>g <strong>and</strong> critical th<strong>in</strong>k<strong>in</strong>g. In addition, they<br />
limit flexibility <strong>in</strong> plann<strong>in</strong>g <strong>in</strong>structional methods <strong>and</strong> <strong>in</strong> develop<strong>in</strong>g evaluation<br />
techniques. For these reasons, a general format for writ<strong>in</strong>g objectives is sufficient
10 EVALUATION AND TESTING IN NURSING EDUCATION<br />
to express the learn<strong>in</strong>g outcomes <strong>and</strong> to provide a basis for evaluat<strong>in</strong>g learn<strong>in</strong>g<br />
<strong>in</strong> nurs<strong>in</strong>g courses.<br />
General objectives <strong>in</strong>clude the behavior the learner will exhibit as a result<br />
of the <strong>in</strong>struction <strong>and</strong> content to which that behavior relates. A general objective<br />
similar to the earlier outcome is: The student identifies nurs<strong>in</strong>g diagnoses based<br />
on the assessment. With this example, the components would be:<br />
Learner:<br />
Behavior:<br />
Content:<br />
Student<br />
Identifies<br />
Nurs<strong>in</strong>g diagnoses from the assessment.<br />
This general objective, which is open-ended, provides flexibility for the teacher<br />
<strong>in</strong> develop<strong>in</strong>g <strong>in</strong>struction to meet it <strong>and</strong> for evaluat<strong>in</strong>g student learn<strong>in</strong>g. The<br />
outcome could be met <strong>and</strong> evaluated <strong>in</strong> the classroom through varied activities<br />
<strong>in</strong> which students analyze assessment data, presented as part of a lecture, <strong>in</strong> a<br />
written case study, or <strong>in</strong> a videotape, <strong>and</strong> identify nurs<strong>in</strong>g diagnoses. Students<br />
might work <strong>in</strong> groups <strong>in</strong> or out of class, review<strong>in</strong>g various assessments <strong>and</strong><br />
discuss<strong>in</strong>g possible diagnoses. Or, they might complete a simulated experience<br />
<strong>in</strong> which they collect data <strong>and</strong> identify nurs<strong>in</strong>g diagnoses. In the cl<strong>in</strong>ical sett<strong>in</strong>g,<br />
patient assignments, conferences, discussions with students, <strong>and</strong> reviews of patient<br />
records provide other strategies for develop<strong>in</strong>g an ability to derive nurs<strong>in</strong>g<br />
diagnoses <strong>and</strong> for evaluat<strong>in</strong>g student competency. Generally stated objectives,<br />
therefore, provide sufficient guidel<strong>in</strong>es for <strong>in</strong>struction <strong>and</strong> evaluation of student<br />
learn<strong>in</strong>g.<br />
Gronlund (2000) identified two criteria to be met when writ<strong>in</strong>g objectives.<br />
First, the objective should specify the behavior that the student should demonstrate<br />
at the end of the <strong>in</strong>struction, not what the teacher does. Second, the<br />
behavior should be objective <strong>and</strong> measurable so students can demonstrate what<br />
they have learned <strong>and</strong> teachers can measure their performance.<br />
The objectives are important <strong>in</strong> develop<strong>in</strong>g evaluation methods that measure<br />
the behavior <strong>and</strong> content area <strong>in</strong>tended by the objective. In evaluat<strong>in</strong>g the<br />
sample objective cited earlier, the method selected—for <strong>in</strong>stance, a test—needs<br />
to exam<strong>in</strong>e student ability to identify nurs<strong>in</strong>g diagnoses from assessment data.<br />
The objective does not specify the number of nurs<strong>in</strong>g diagnoses, type of client<br />
problem, complexity of the assessment data, or other variables associated with<br />
the cl<strong>in</strong>ical situation; there is opportunity for the teacher to develop various<br />
types of test questions <strong>and</strong> evaluation measures as long as they ask the learner<br />
to identify nurs<strong>in</strong>g diagnoses based on the given assessment.<br />
Clearly written objectives guide the teacher <strong>in</strong> select<strong>in</strong>g evaluation methods.<br />
The behavior <strong>in</strong>dicated by the objective suggests evaluation methods, such as<br />
tests, observations <strong>in</strong> the cl<strong>in</strong>ical sett<strong>in</strong>g, written assignments, <strong>and</strong> others. When
<strong>Evaluation</strong>, Measurement, <strong>and</strong> Education 11<br />
tests are chosen as the method, the objective <strong>in</strong> turn suggests the type of test<br />
question, for <strong>in</strong>stance, true-false, multiple-choice, or essay. In addition to guid<strong>in</strong>g<br />
decisions about evaluation methods, the objective gives clues to faculty about<br />
teach<strong>in</strong>g methods <strong>and</strong> learn<strong>in</strong>g activities to assist students <strong>in</strong> meet<strong>in</strong>g it. For<br />
the sample objective, teach<strong>in</strong>g methods might <strong>in</strong>clude: read<strong>in</strong>gs, lecture, discussion,<br />
case study, simulation, role play, videotape, <strong>in</strong>teractive video, cl<strong>in</strong>ical<br />
practice, post-cl<strong>in</strong>ical conference, <strong>and</strong> other methods that present assessment<br />
data <strong>and</strong> ask students to identify varied nurs<strong>in</strong>g diagnoses.<br />
Objectives that are useful for test construction <strong>and</strong> for design<strong>in</strong>g other<br />
evaluation methods meet three general pr<strong>in</strong>ciples. First, the behavior, or action<br />
verb, should be measurable. The behavior represents the outcome expected of<br />
the learner at the end of the <strong>in</strong>struction. Terms such as identify, describe, <strong>and</strong><br />
analyze are specific <strong>and</strong> may be measured; words such as underst<strong>and</strong> <strong>and</strong> know, <strong>in</strong><br />
contrast, represent a wide variety of behaviors, some simple <strong>and</strong> others complex,<br />
mak<strong>in</strong>g these terms difficult to evaluate. The student’s knowledge might range<br />
from identify<strong>in</strong>g <strong>and</strong> nam<strong>in</strong>g through synthesiz<strong>in</strong>g <strong>and</strong> evaluat<strong>in</strong>g. Sample behaviors<br />
are presented <strong>in</strong> Table 1.1.<br />
Second, the objectives should be as general as possible to allow for their<br />
achievement with varied course content. For <strong>in</strong>stance, <strong>in</strong>stead of stat<strong>in</strong>g that<br />
the student identifies physiological nurs<strong>in</strong>g diagnoses from the assessment of<br />
acutely ill patients, <strong>in</strong>dicat<strong>in</strong>g that the learner identifies nurs<strong>in</strong>g diagnoses from<br />
assessment data provides more flexibility for the teacher <strong>in</strong> design<strong>in</strong>g evaluation<br />
strategies that reflect different types of diagnoses from varied data sets presented<br />
<strong>in</strong> the course.<br />
Third, the teach<strong>in</strong>g method should be omitted from the objective to provide<br />
greater flexibility <strong>in</strong> how the <strong>in</strong>struction is planned. For example, <strong>in</strong> the objective<br />
“Uses effective communication techniques <strong>in</strong> a simulated client-nurse <strong>in</strong>teraction,”<br />
the teacher is limited to evaluat<strong>in</strong>g communication techniques through<br />
simulations rather than through <strong>in</strong>teractions the student might have <strong>in</strong> the<br />
cl<strong>in</strong>ical sett<strong>in</strong>g. The objective would be better if stated as “Uses effective communication<br />
techniques with clients.”<br />
In test construction <strong>and</strong> evaluation, both the behavior to be achieved, for<br />
example, lists <strong>and</strong> applies, <strong>and</strong> the content area to which that behavior relates,<br />
such as pa<strong>in</strong> management, are important. With clear <strong>and</strong> measurable objectives,<br />
teachers have a sound framework for develop<strong>in</strong>g the evaluation strategies for<br />
the course. Airasian (2000) suggested an easy to use model for writ<strong>in</strong>g objectives:<br />
Model:<br />
Example:<br />
The student will [measurable behavior] [content].<br />
The student will [describe] [classifications for asthma severity.]<br />
A model such as this facilitates the development of test items <strong>and</strong> other evaluation<br />
strategies.
TABLE 1.1 Sample Verbs for Taxonomic Levels<br />
Cognitive Doma<strong>in</strong> Affective Doma<strong>in</strong> Psychomotor Doma<strong>in</strong><br />
Knowledge Receiv<strong>in</strong>g Imitation<br />
Def<strong>in</strong>e Name<br />
Acknowledge Follow example of<br />
Identify Recall<br />
Ask Imitate<br />
Label State<br />
Reply<br />
Manipulation<br />
List Show awareness of<br />
Assemble<br />
Comprehension Respond<strong>in</strong>g Carry out<br />
Defend Expla<strong>in</strong><br />
Act will<strong>in</strong>gly Follow procedure<br />
Describe Give examples<br />
Assist<br />
Precision<br />
Differentiate Interpret<br />
Is will<strong>in</strong>g to<br />
Demonstrate Draw conclusions Select<br />
Support Is accurate <strong>in</strong><br />
skill<br />
Summarize<br />
Application Respond<br />
Articulation<br />
Apply Produce<br />
Seek opportunities<br />
Carry out accurately<br />
Demonstrate use of Relate<br />
Valu<strong>in</strong>g <strong>and</strong> <strong>in</strong> reasonable<br />
Modify Solve<br />
Accept time frame<br />
Operate Use<br />
Assume responsibility Is skillful<br />
Predict<br />
Participate <strong>in</strong><br />
Naturalization<br />
Analysis Respect<br />
Is competent<br />
Analyze Differentiate<br />
Support<br />
Carry out<br />
Breaks down Identify<br />
Value<br />
competently<br />
Compare Relate<br />
Organization of Values Integrate skill with<strong>in</strong><br />
Contrast Select<br />
Argue care<br />
Detect<br />
Debate<br />
12
TABLE 1.1 (cont<strong>in</strong>ued)<br />
Cognitive Doma<strong>in</strong> Affective Doma<strong>in</strong> Psychomotor Doma<strong>in</strong><br />
Synthesis<br />
Compile Plan Declare<br />
Construct Produce Defend<br />
Design Revise Take a st<strong>and</strong><br />
Develop Synthesize<br />
Devise Write<br />
Generate<br />
<strong>Evaluation</strong><br />
Appraise Evaluate<br />
Assess Judge<br />
Critique Justify<br />
Discrim<strong>in</strong>ate Support<br />
Characterization by Value<br />
Act consistently<br />
St<strong>and</strong> for<br />
13
14 EVALUATION AND TESTING IN NURSING EDUCATION<br />
TAXONOMIES OF OBJECTIVES<br />
The need for clearly stated objectives becomes evident when the teacher translates<br />
them <strong>in</strong>to test items <strong>and</strong> other evaluation methods. Test items need to<br />
adequately measure the behavior <strong>in</strong> the objective, for <strong>in</strong>stance, identify, describe,<br />
apply, <strong>and</strong> analyze, as it relates to the content area. Objectives may be written<br />
to reflect three doma<strong>in</strong>s of learn<strong>in</strong>g, each with its own classification or taxonomic<br />
system: cognitive, affective, <strong>and</strong> psychomotor. A taxonomy is a classification<br />
system that places an objective with<strong>in</strong> a broader system or scheme. While learn<strong>in</strong>g<br />
<strong>in</strong> nurs<strong>in</strong>g ultimately represents an <strong>in</strong>tegration of these doma<strong>in</strong>s, it is valuable<br />
<strong>in</strong> plann<strong>in</strong>g <strong>in</strong>struction <strong>and</strong> particularly <strong>in</strong> develop<strong>in</strong>g evaluation measures, for<br />
the doma<strong>in</strong>s to be considered separately.<br />
Cognitive Taxonomy<br />
The cognitive doma<strong>in</strong> deals with knowledge <strong>and</strong> <strong>in</strong>tellectual skills. Learn<strong>in</strong>g<br />
with<strong>in</strong> this doma<strong>in</strong> <strong>in</strong>cludes the acquisition of facts <strong>and</strong> specific <strong>in</strong>formation<br />
underly<strong>in</strong>g the practice of nurs<strong>in</strong>g; concepts, theories, <strong>and</strong> pr<strong>in</strong>ciples about<br />
nurs<strong>in</strong>g; <strong>and</strong> cognitive skills such as decision mak<strong>in</strong>g, problem solv<strong>in</strong>g, <strong>and</strong><br />
critical th<strong>in</strong>k<strong>in</strong>g. The most widely used cognitive taxonomy was developed years<br />
ago, <strong>in</strong> 1956, by Bloom <strong>and</strong> associates. It provides for six levels of cognitive<br />
learn<strong>in</strong>g, <strong>in</strong>creas<strong>in</strong>g <strong>in</strong> complexity: knowledge, comprehension, application, analysis,<br />
synthesis, <strong>and</strong> evaluation. This hierarchy suggests that knowledge, such as<br />
recall of specific facts, is less complex <strong>and</strong> dem<strong>and</strong><strong>in</strong>g <strong>in</strong>tellectually than the<br />
higher levels of learn<strong>in</strong>g. <strong>Evaluation</strong>, the most complex level, requires judgments<br />
based on varied criteria. For each of the levels, except for application, Bloom,<br />
Englehart, Furst, Hill, <strong>and</strong> Krathwohl (1956) identified sublevels.<br />
One advantage <strong>in</strong> consider<strong>in</strong>g this taxonomy when writ<strong>in</strong>g objectives <strong>and</strong><br />
test items is that it encourages the teacher to th<strong>in</strong>k about higher levels of learn<strong>in</strong>g<br />
expected as a result of the <strong>in</strong>struction. If the course goals reflect application of<br />
concepts <strong>in</strong> cl<strong>in</strong>ical practice, use of theories <strong>in</strong> patient care, <strong>and</strong> critical th<strong>in</strong>k<strong>in</strong>g<br />
outcomes, these higher levels of learn<strong>in</strong>g should be reflected <strong>in</strong> the objectives <strong>and</strong><br />
evaluation rather than focus<strong>in</strong>g only on the recall of facts <strong>and</strong> other <strong>in</strong>formation.<br />
In us<strong>in</strong>g the taxonomy, the teacher decides first on the level of cognitive<br />
learn<strong>in</strong>g <strong>in</strong>tended <strong>and</strong> then develops objectives <strong>and</strong> evaluation methods for that<br />
particular level. Decisions about the taxonomic level at which to gear <strong>in</strong>struction<br />
<strong>and</strong> evaluation depend on the teacher’s judgment <strong>in</strong> consider<strong>in</strong>g the background<br />
of the learner; placement of the course <strong>and</strong> learn<strong>in</strong>g experiences with<strong>in</strong> the<br />
curriculum to provide for the progressive development of knowledge, skills, <strong>and</strong><br />
values; <strong>and</strong> complexity of the behavior <strong>and</strong> content <strong>in</strong> relation to the time
<strong>Evaluation</strong>, Measurement, <strong>and</strong> Education 15<br />
allowed for teach<strong>in</strong>g. If the time for teach<strong>in</strong>g <strong>and</strong> evaluation is limited, the<br />
objectives may need to be written at a lower level. The taxonomy provides a<br />
cont<strong>in</strong>uum for educators to use <strong>in</strong> plann<strong>in</strong>g <strong>in</strong>struction <strong>and</strong> carry<strong>in</strong>g out evaluation,<br />
beg<strong>in</strong>n<strong>in</strong>g with recall of facts <strong>and</strong> <strong>in</strong>formation <strong>and</strong> progress<strong>in</strong>g toward<br />
underst<strong>and</strong><strong>in</strong>g, us<strong>in</strong>g concepts <strong>and</strong> theories <strong>in</strong> practice, analyz<strong>in</strong>g situations,<br />
synthesiz<strong>in</strong>g from different sources to develop new products, <strong>and</strong> evaluat<strong>in</strong>g<br />
materials <strong>and</strong> situations based on <strong>in</strong>ternal <strong>and</strong> external criteria.<br />
A description <strong>and</strong> sample objective for each of the six levels of learn<strong>in</strong>g<br />
<strong>in</strong> the cognitive taxonomy follow. While sublevels have been established for<br />
these levels, except for application, only the six major levels are essential to<br />
guide the teacher for <strong>in</strong>structional <strong>and</strong> evaluation purposes.<br />
1. Knowledge: Recall of facts <strong>and</strong> specific <strong>in</strong>formation. Memorization of<br />
specifics.<br />
The student def<strong>in</strong>es the term systole.<br />
2. Comprehension: Underst<strong>and</strong><strong>in</strong>g. Ability to describe <strong>and</strong> expla<strong>in</strong> the<br />
material.<br />
The learner describes the circulation through the heart.<br />
3. Application: Use of <strong>in</strong>formation <strong>in</strong> a new situation. Ability to use<br />
knowledge <strong>in</strong> a new situation.<br />
The student applies concepts of ag<strong>in</strong>g <strong>in</strong> develop<strong>in</strong>g <strong>in</strong>terventions for<br />
the elderly.<br />
4. Analysis: Ability to break down material <strong>in</strong>to component parts <strong>and</strong><br />
identify the relationships among them.<br />
The student analyzes the organizational structure of the community<br />
health agency <strong>and</strong> its impact on client services.<br />
5. Synthesis: Ability to develop a new product. Comb<strong>in</strong><strong>in</strong>g elements to<br />
form a new product.<br />
The student develops a plan for deliver<strong>in</strong>g services to elderly persons<br />
who are home-bound.<br />
6. <strong>Evaluation</strong>: Judgments about value based on <strong>in</strong>ternal <strong>and</strong> external<br />
criteria. Extent to which materials <strong>and</strong> objects meet criteria.<br />
The learner evaluates nurs<strong>in</strong>g research based on predeterm<strong>in</strong>ed criteria.<br />
This taxonomy is useful <strong>in</strong> develop<strong>in</strong>g test items because it helps the teacher<br />
gear the item to a particular cognitive level. For example, if the objective focuses<br />
on application, the test question should measure whether the student can use<br />
the concept <strong>in</strong> a new situation, which is the <strong>in</strong>tent of learn<strong>in</strong>g at that level.<br />
However, the taxonomy alone does not always determ<strong>in</strong>e the level of complexity
16 EVALUATION AND TESTING IN NURSING EDUCATION<br />
of the item because one other consideration is how the <strong>in</strong>formation was presented<br />
<strong>in</strong> the <strong>in</strong>struction. For example, a test item at the application level requires use<br />
of previously learned concepts <strong>and</strong> theories <strong>in</strong> a new situation. Whether or not<br />
the situation is new for each student, however, is not known. Some students<br />
may have had cl<strong>in</strong>ical experience with that situation or been exposed to it<br />
through another learn<strong>in</strong>g activity. As another example, a question written at<br />
the comprehension level may actually be at the knowledge level if the teacher<br />
used that specific explanation <strong>in</strong> class <strong>and</strong> students only need to recall it to<br />
answer the item.<br />
Mertler (2003) suggested a modification of the six levels of the taxonomy<br />
<strong>in</strong>to a two-category system: lower- <strong>and</strong> higher-level cognitive behaviors. Lowerlevel<br />
behaviors focus on memorization, recall of facts, <strong>and</strong> underst<strong>and</strong><strong>in</strong>g of<br />
content; this level captures the knowledge <strong>and</strong> comprehension level of Bloom’s<br />
taxonomy. Higher-level behaviors extend beyond comprehension to <strong>in</strong>clude<br />
development of th<strong>in</strong>k<strong>in</strong>g skills, from application through evaluation. A twolevel<br />
system works well <strong>in</strong> nurs<strong>in</strong>g because it helps avoid teach<strong>in</strong>g <strong>and</strong> evaluat<strong>in</strong>g<br />
lower-level behaviors only.<br />
Affective Doma<strong>in</strong><br />
The affective doma<strong>in</strong> relates to the development of values, attitudes, <strong>and</strong> beliefs<br />
consistent with st<strong>and</strong>ards of professional nurs<strong>in</strong>g practice. Developed by Krathwohl,<br />
Bloom, <strong>and</strong> Masia (1964), the taxonomy of the affective doma<strong>in</strong> <strong>in</strong>cludes<br />
five levels organized hierarchically based on the pr<strong>in</strong>ciple of <strong>in</strong>creas<strong>in</strong>g <strong>in</strong>volvement<br />
of the learner <strong>and</strong> <strong>in</strong>ternationalization of a value. Krathwohl et al. (1964)<br />
believed that objectives for learn<strong>in</strong>g could be specified <strong>in</strong> the affective doma<strong>in</strong><br />
<strong>and</strong> <strong>in</strong>struction provided to assist the learner <strong>in</strong> develop<strong>in</strong>g a value system that<br />
guides decisions <strong>and</strong> behaviors. The pr<strong>in</strong>ciple on which the affective taxonomy<br />
is based relates to the movement of learners from mere awareness of a value,<br />
for <strong>in</strong>stance, confidentiality, to <strong>in</strong>ternalization of that value as a basis for their<br />
own behavior.<br />
There are two important dimensions <strong>in</strong> evaluat<strong>in</strong>g affective outcomes. The<br />
first relates to the student’s knowledge of the values, attitudes, <strong>and</strong> beliefs that<br />
are important <strong>in</strong> guid<strong>in</strong>g decisions <strong>in</strong> nurs<strong>in</strong>g. Prior to <strong>in</strong>ternaliz<strong>in</strong>g a value <strong>and</strong><br />
us<strong>in</strong>g it as a basis for decision mak<strong>in</strong>g <strong>and</strong> behavior, the student needs to know<br />
what are important values <strong>in</strong> nurs<strong>in</strong>g. There is a cognitive base, therefore, to<br />
the development of a value system. <strong>Evaluation</strong> of this dimension focuses on<br />
acquisition of knowledge about the values, attitudes, <strong>and</strong> beliefs consistent with<br />
professional nurs<strong>in</strong>g practice. A variety of test items <strong>and</strong> evaluation methods<br />
are appropriate to assess this knowledge base.
<strong>Evaluation</strong>, Measurement, <strong>and</strong> Education 17<br />
The second dimension of affective evaluation focuses on whether or not<br />
students have accepted these values, attitudes, <strong>and</strong> beliefs <strong>and</strong> are <strong>in</strong>ternaliz<strong>in</strong>g<br />
them to form a system for their own decision mak<strong>in</strong>g <strong>and</strong> behavior. <strong>Evaluation</strong><br />
at these higher levels of the affective doma<strong>in</strong> is more difficult because it requires<br />
observation of student behavior over time to determ<strong>in</strong>e if there is commitment<br />
to act accord<strong>in</strong>g to professional values. Test items are not appropriate for these<br />
levels as the teacher is concerned with the use of values <strong>in</strong> practice <strong>and</strong> motivation<br />
to carry them out consistently <strong>in</strong> patient care.<br />
A description <strong>and</strong> sample objective for each of the five levels of learn<strong>in</strong>g<br />
<strong>in</strong> the affective taxonomy follow. While sublevels have been established for<br />
each of these levels, only these five major categories are essential to guide the<br />
teacher for <strong>in</strong>structional <strong>and</strong> evaluation purposes:<br />
1. Receiv<strong>in</strong>g: Awareness of values, attitudes, <strong>and</strong> beliefs important <strong>in</strong><br />
nurs<strong>in</strong>g practice. Sensitivity to a client, cl<strong>in</strong>ical situation, problem.<br />
The student expresses an awareness of the need for ma<strong>in</strong>ta<strong>in</strong><strong>in</strong>g confidentiality<br />
of patient <strong>in</strong>formation.<br />
2. Respond<strong>in</strong>g: React<strong>in</strong>g to a situation. Respond<strong>in</strong>g voluntarily to a given<br />
phenomenon reflect<strong>in</strong>g a choice made by the learner.<br />
The student shares will<strong>in</strong>gly feel<strong>in</strong>gs about car<strong>in</strong>g for a dy<strong>in</strong>g patient.<br />
3. Valu<strong>in</strong>g: Internalization of a value. Acceptance of a value <strong>and</strong> commitment<br />
to us<strong>in</strong>g it as a basis for behavior.<br />
The learner supports the rights of clients to their own life styles <strong>and</strong><br />
decisions about care.<br />
4. Organization: Development of a complex system of values. Organization<br />
of a value system.<br />
The learner forms a position about issues surround<strong>in</strong>g cost <strong>and</strong> quality<br />
of care.<br />
5. Characterization by a value: Internalization of a value system provid<strong>in</strong>g<br />
a philosophy for practice.<br />
The learner acts consistently to <strong>in</strong>volve clients <strong>and</strong> families <strong>in</strong> decision<br />
mak<strong>in</strong>g about care.<br />
Psychomotor Doma<strong>in</strong><br />
Psychomotor learn<strong>in</strong>g <strong>in</strong>volves the development of skills <strong>and</strong> competency <strong>in</strong> the<br />
use of technology. This doma<strong>in</strong> <strong>in</strong>cludes activities that are movement oriented,<br />
requir<strong>in</strong>g some degree of physical coord<strong>in</strong>ation. Motor skills have a cognitive<br />
base, which are the pr<strong>in</strong>ciples underly<strong>in</strong>g the skill. They also have an affective
18 EVALUATION AND TESTING IN NURSING EDUCATION<br />
component reflect<strong>in</strong>g the values of the nurse while carry<strong>in</strong>g out the skill, for<br />
<strong>in</strong>stance, respect<strong>in</strong>g the client while perform<strong>in</strong>g the procedure (Oermann, 2004).<br />
Different taxonomies have been developed for the evaluation of psychomotor<br />
skills. One taxonomy useful <strong>in</strong> nurs<strong>in</strong>g <strong>education</strong> specifies five levels <strong>in</strong> the<br />
development of psychomotor skills. The lowest level is imitation learn<strong>in</strong>g; here<br />
the learner observes a demonstration of the skill <strong>and</strong> imitates that performance.<br />
In the second level, the learner performs the skill follow<strong>in</strong>g written guidel<strong>in</strong>es.<br />
By practic<strong>in</strong>g skills the learner ref<strong>in</strong>es the ability to perform them without errors<br />
(precision) <strong>and</strong> <strong>in</strong> a reasonable time frame (articulation) until they become a<br />
natural part of care (naturalization) (Dave, 1970; Gaberson & Oermann, 1999).<br />
A description of each of these levels <strong>and</strong> sample objectives follows.<br />
1. Imitation: Performance of skill follow<strong>in</strong>g demonstration by teacher or<br />
through multimedia. Imitative learn<strong>in</strong>g.<br />
The student follows the example for chang<strong>in</strong>g a dress<strong>in</strong>g.<br />
2. Manipulation: Ability to follow <strong>in</strong>structions rather than need<strong>in</strong>g to<br />
observe the procedure or skill.<br />
The student suctions a patient accord<strong>in</strong>g to the accepted procedure.<br />
3. Precision: Ability to perform skill accurately, <strong>in</strong>dependently, <strong>and</strong> without<br />
us<strong>in</strong>g a model or set of directions.<br />
The student takes vital signs accurately.<br />
4. Articulation: Coord<strong>in</strong>ated performance of a skill with<strong>in</strong> a reasonable<br />
time frame.<br />
The learner demonstrates skill <strong>in</strong> suction<strong>in</strong>g patients with vary<strong>in</strong>g<br />
health problems.<br />
5. Naturalization: High degree of proficiency. Integration of skill with<strong>in</strong><br />
care.<br />
The learner competently carries out skills needed for care of technologydependent<br />
children <strong>in</strong> their homes.<br />
<strong>Evaluation</strong> methods for psychomotor skills provide data on knowledge of<br />
the pr<strong>in</strong>ciples underly<strong>in</strong>g the skill <strong>and</strong> ability to carry out the procedure <strong>in</strong><br />
simulated sett<strong>in</strong>gs <strong>and</strong> with clients. Most of the evaluation of performance is<br />
done <strong>in</strong> the cl<strong>in</strong>ical sett<strong>in</strong>g <strong>and</strong> learn<strong>in</strong>g laboratory; however, test items may be<br />
used for evaluat<strong>in</strong>g pr<strong>in</strong>ciples associated with perform<strong>in</strong>g the skill.<br />
Integrated Framework<br />
One other framework that could be used to classify objectives was developed by<br />
L<strong>in</strong>n <strong>and</strong> Gronlund (2000). This framework <strong>in</strong>tegrates the cognitive, affective,
<strong>Evaluation</strong>, Measurement, <strong>and</strong> Education 19<br />
<strong>and</strong> psychomotor doma<strong>in</strong>s <strong>in</strong>to one list <strong>and</strong> can be easily adapted for nurs<strong>in</strong>g<br />
<strong>education</strong>:<br />
1. Knowledge (knowledge of terms, facts, concepts, <strong>and</strong> methods)<br />
2. Underst<strong>and</strong><strong>in</strong>g (underst<strong>and</strong><strong>in</strong>g concepts; methods; written materials,<br />
graphs, <strong>and</strong> data; <strong>and</strong> problems)<br />
3. Application (of knowledge <strong>and</strong> problem-solv<strong>in</strong>g skills)<br />
4. Th<strong>in</strong>k<strong>in</strong>g skills (critical th<strong>in</strong>k<strong>in</strong>g)<br />
5. General skills (psychomotor, communication, <strong>and</strong> other skills), <strong>and</strong><br />
6. Attitudes (<strong>and</strong> values reflect<strong>in</strong>g st<strong>and</strong>ards of nurs<strong>in</strong>g practice).<br />
L<strong>in</strong>n <strong>and</strong> Gronlund <strong>in</strong>clude three other categories <strong>in</strong> their framework: <strong>in</strong>terests,<br />
appreciations, <strong>and</strong> adjustments, but these are not as applicable to evaluation <strong>in</strong><br />
nurs<strong>in</strong>g as are the other six areas.<br />
USE OF OBJECTIVES FOR EVALUATION AND TESTING<br />
As described earlier, the taxonomies provide a framework for the teacher to<br />
plan <strong>in</strong>struction <strong>and</strong> design evaluation at different levels of learn<strong>in</strong>g, from simple<br />
to complex <strong>in</strong> the cognitive doma<strong>in</strong>, from awareness of a value to develop<strong>in</strong>g<br />
a philosophy of practice based on <strong>in</strong>ternalization of a value system <strong>in</strong> the affective<br />
doma<strong>in</strong>, <strong>and</strong> <strong>in</strong>creas<strong>in</strong>g psychomotor competency, from imitation of the skill to<br />
performance as a natural part of care. These taxonomies are of value <strong>in</strong> design<strong>in</strong>g<br />
the evaluation protocol to gear tests <strong>and</strong> other evaluation methods to the level<br />
of learn<strong>in</strong>g anticipated from the <strong>in</strong>struction. If the outcome of learn<strong>in</strong>g is application,<br />
then test items also need to be at the application level. If the outcome of<br />
learn<strong>in</strong>g is valu<strong>in</strong>g, then the evaluation methods need to exam<strong>in</strong>e students’<br />
behaviors over time to determ<strong>in</strong>e if they are committed to practice reflect<strong>in</strong>g<br />
these values. If the outcome of skill learn<strong>in</strong>g is precision, then evaluation strategies<br />
need to focus on accuracy <strong>in</strong> performance, not the speed with which the<br />
skill is performed. The taxonomies, therefore, provide a useful framework to<br />
assure that test items <strong>and</strong> evaluation methods are at the appropriate level for<br />
the <strong>in</strong>tended learn<strong>in</strong>g outcomes.<br />
In develop<strong>in</strong>g test items <strong>and</strong> other types of evaluation methods, the teacher<br />
identifies first the objective to be evaluated, then designs test items or other<br />
evaluation strategies to measure that objective. The objective specifies the behavior,<br />
at a particular taxonomic level, to be evaluated <strong>and</strong> the content area to which<br />
it relates. For the objective, Identifies characteristics of premature ventricular<br />
contractions, the test item would exam<strong>in</strong>e student ability to recall these charac-
20 EVALUATION AND TESTING IN NURSING EDUCATION<br />
teristics. The behavior, identifies, at the knowledge level, <strong>in</strong>dicates that recall of<br />
facts is needed to answer the question, not comprehension or use of this knowledge<br />
<strong>in</strong> new client situations. The content area, characteristics of premature<br />
ventricular contractions, <strong>in</strong>dicates the content to be tested.<br />
Some teachers choose not to use objectives as the basis for <strong>test<strong>in</strong>g</strong> <strong>and</strong><br />
evaluation <strong>and</strong> <strong>in</strong>stead develop test items <strong>and</strong> other evaluation methods from<br />
the content of the course. With this process the teacher identifies explicit<br />
content areas to be evaluated; test items then sample knowledge of this content.<br />
If us<strong>in</strong>g this method, the teacher should refer to the course outcomes for decisions<br />
about the level of complexity of the test items <strong>and</strong> methods of evaluation.<br />
Throughout this book, multiple types of test items <strong>and</strong> other evaluation<br />
strategies are presented. It is assumed that these items were developed from<br />
specific outcomes or objectives, or from explicit content areas. Regardless of<br />
whether the teacher uses objectives or content doma<strong>in</strong>s as the framework for<br />
<strong>test<strong>in</strong>g</strong> <strong>and</strong> evaluation, test items <strong>and</strong> other strategies should evaluate the learn<strong>in</strong>g<br />
outcome <strong>in</strong>tended from the <strong>in</strong>struction. This outcome specifies a behavior to<br />
be evaluated, at a particular level of complexity <strong>in</strong>dicated by the taxonomic<br />
level, <strong>and</strong> a content area to which it relates. The behavior <strong>and</strong> content area<br />
provide the framework for develop<strong>in</strong>g test items <strong>and</strong> other evaluation methods.<br />
SUMMARY<br />
<strong>Evaluation</strong> is an <strong>in</strong>tegral part of the <strong>in</strong>structional process <strong>in</strong> nurs<strong>in</strong>g. Through<br />
evaluation, the teacher makes important decisions about the extent <strong>and</strong> quality<br />
of learn<strong>in</strong>g. <strong>Evaluation</strong> fulfills two major roles <strong>in</strong> the classroom <strong>and</strong> cl<strong>in</strong>ical<br />
sett<strong>in</strong>g: formative <strong>and</strong> summative.<br />
Measurement is the process of assign<strong>in</strong>g numbers to represent student<br />
achievement accord<strong>in</strong>g to certa<strong>in</strong> rules. It answers the question “How much?”<br />
There are two ma<strong>in</strong> ways of <strong>in</strong>terpret<strong>in</strong>g test scores <strong>and</strong> other types of evaluation<br />
conducted <strong>in</strong> the cl<strong>in</strong>ical sett<strong>in</strong>g: norm- <strong>and</strong> criterion-referenced. In normreferenced<br />
<strong>in</strong>terpretation, test scores <strong>and</strong> other evaluation data are referenced<br />
to a norm group. The scores are <strong>in</strong>terpreted by compar<strong>in</strong>g them to those of other<br />
<strong>in</strong>dividuals. Norm-referenced cl<strong>in</strong>ical evaluation compares students’ cl<strong>in</strong>ical performance<br />
with those of a group of learners, <strong>in</strong>dicat<strong>in</strong>g that the learner has more<br />
or less cl<strong>in</strong>ical competence than other students. Criterion-referenced <strong>in</strong>terpretation,<br />
on the other h<strong>and</strong>, <strong>in</strong>volves <strong>in</strong>terpret<strong>in</strong>g scores based on preset criteria, not<br />
<strong>in</strong> relation to a group of learners. With criterion-referenced cl<strong>in</strong>ical evaluation,<br />
student performance is compared with a set of criteria to be met. A test, which<br />
is one form of measurement, is a set of items each with a correct answer.
<strong>Evaluation</strong>, Measurement, <strong>and</strong> Education 21<br />
Objectives play a role <strong>in</strong> teach<strong>in</strong>g <strong>and</strong> evaluat<strong>in</strong>g students <strong>in</strong> varied sett<strong>in</strong>gs<br />
<strong>in</strong> nurs<strong>in</strong>g. They provide guidel<strong>in</strong>es for student learn<strong>in</strong>g <strong>and</strong> <strong>in</strong>struction <strong>and</strong><br />
serve as a basis for evaluat<strong>in</strong>g learn<strong>in</strong>g. The objectives represent the outcomes<br />
of learn<strong>in</strong>g; these outcomes may <strong>in</strong>clude the acquisition of knowledge, development<br />
of values, <strong>and</strong> performance of psychomotor <strong>and</strong> technological skills. <strong>Evaluation</strong><br />
serves to determ<strong>in</strong>e the extent of the student’s learn<strong>in</strong>g <strong>in</strong> relation to these<br />
outcomes. Each of the doma<strong>in</strong>s of learn<strong>in</strong>g has its own classification or taxonomic<br />
system, which is of value <strong>in</strong> gear<strong>in</strong>g <strong>test<strong>in</strong>g</strong> <strong>and</strong> evaluation to the level of learn<strong>in</strong>g<br />
anticipated from the <strong>in</strong>struction. While different methods have been proposed<br />
for develop<strong>in</strong>g test items <strong>and</strong> other evaluation strategies, the important pr<strong>in</strong>ciple<br />
is that they relate to the learn<strong>in</strong>g outcomes.<br />
REFERENCES<br />
Airasian, P. W. (2000). Assessment <strong>in</strong> the classroom: A concise approach (2nd ed.). Boston:<br />
McGraw-Hill.<br />
Bloom, B. S., Englehart, M. D., Furst, E. J., Hill, W. H., & Krathwohl, D. R. (1956).<br />
Taxonomy of <strong>education</strong>al objectives: The classification of <strong>education</strong>al goals. H<strong>and</strong>book I:<br />
Cognitive doma<strong>in</strong>. White Pla<strong>in</strong>s, NY: Longman.<br />
Dave, R. H. (1970). Psychomotor levels. In R. J. Armstrong (Ed.), Develop<strong>in</strong>g <strong>and</strong> writ<strong>in</strong>g<br />
behavioral objectives. Tucson, AZ: Educational Innovators.<br />
Gaberson, K. B., & Oermann, M. H. (1999). Cl<strong>in</strong>ical teach<strong>in</strong>g strategies <strong>in</strong> nurs<strong>in</strong>g <strong>education</strong>.<br />
New York: Spr<strong>in</strong>ger.<br />
Gronlund, N. E. (2000). How to write <strong>and</strong> use <strong>in</strong>structional objectives (6th ed.). Upper<br />
Saddle River, NJ: Prentice Hall.<br />
Krathwohl, D., Bloom, B., & Masia, B. (1964). Taxonomy of <strong>education</strong>al objectives, H<strong>and</strong>book<br />
II: Affective doma<strong>in</strong>. New York: Longman.<br />
L<strong>in</strong>n, R. L., & Gronlund, N. E. (2000). Measurement <strong>and</strong> assessment <strong>in</strong> teach<strong>in</strong>g (8th ed.).<br />
Upper Saddle River, NJ: Prentice Hall.<br />
Mertler, C. A. (2003). Classroom assessment. Los Angeles: Pyrczak.<br />
Oermann, M. H. (2004). Basic skills for teach<strong>in</strong>g <strong>and</strong> the advanced practice nurse. In<br />
L. Joel (Ed.), Advanced practice nurs<strong>in</strong>g: Essentials for role development (pp. 398–429).<br />
Philadelphia: F. A. Davis.<br />
Oosterhof, A. (2001). Classroom applications of <strong>education</strong>al measurement (3rd ed.). Upper<br />
Saddle River, NJ: Prentice Hall.