DATA MANAGEMENT: KEEPING A RESEARCH NOTEBOOK, FILES AND RELATED ISSUES
DATA MANAGEMENT: KEEPING A RESEARCH NOTEBOOK, FILES AND RELATED ISSUES
DATA MANAGEMENT: KEEPING A RESEARCH NOTEBOOK, FILES AND RELATED ISSUES
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
PERCSS<br />
Program for the Ethical and Responsible Conduct of Science and<br />
Scholarship<br />
<strong>DATA</strong><br />
<strong>MANAGEMENT</strong>: <strong>KEEPING</strong> A<br />
<strong>RESEARCH</strong> <strong>NOTEBOOK</strong>, <strong>FILES</strong><br />
<strong>AND</strong> <strong>RELATED</strong> <strong>ISSUES</strong><br />
Steven Brody, MD<br />
Department p of Internal Medicine<br />
Washington University School of Medicine<br />
1
Good record keeping promotes<br />
accountability t bilit and d integrity i t it iin researchh<br />
Necessary for data analysis, publication,<br />
collaboration, peer review among<br />
others.<br />
Necessary ecessa y to suppo support t intellectual te ectua<br />
property claims.<br />
Can defend you against false allegations<br />
of research misconduct.<br />
2
Good record keeping,<br />
contin continued… ed<br />
Important for the care of human<br />
subjects.<br />
necessary for members of your group<br />
or o any a y ot other e researcher esea c e to be ab able e to<br />
duplicate your work, when needed.<br />
Required by NIH and other granting<br />
institutions to meet policies and<br />
standards for the conduct of good<br />
science<br />
3
The lab notebook notebook: Overview O e ie<br />
It is a record of both physical and<br />
mental activity.<br />
Good records are complete, accurate<br />
and a d understandable u de sta dab e to ot others. e s<br />
Each member of the research team is<br />
responsible for his/her records records.<br />
The PI has the final responsibility for<br />
the laboratory records. records<br />
4
CCurie’s ie’s Notebook<br />
Ca. 1900<br />
5
PProf. of X’s Notebook<br />
Ca. 2000<br />
6
Wh What t iis a LLab b NNotebook? t b k?<br />
Complete record of procedures,<br />
reagents, data, and thoughts to pass<br />
on t to other th researchers h<br />
Why experiments were initiated, how performed,<br />
and results, results comments<br />
Place to compile data/charts/photos/ideas<br />
Place of clues, , to troubleshoot problems p<br />
Place to observe whole picture and think<br />
Legal document, to prove patents<br />
Defense against accusations of fraud or lawsuits<br />
7
TpeandFomat<br />
Type and Format…<br />
Ultimately dictated by your PI<br />
Type yp Advantages g Drawbacks<br />
Bound book<br />
(stitched)<br />
No lost sheets<br />
Proof against Fraud<br />
Entered as done<br />
No logical order<br />
Duplicate “Bound” Double record. Easy Diff. To Read.<br />
Book (yellow sheets) copies<br />
Not true duplicate<br />
Loose Leaf<br />
Order by expts Lost sheets<br />
(folders/notebooks) Easy recording Authenticity prob.<br />
Computer p<br />
Easy y to read Lost data<br />
(ELN)<br />
Easy calculations Authenticity prob<br />
8
Physical Characteristics of a<br />
Good Notebook Notebook…<br />
Large >= 8.5x11 at least<br />
Bound (stitched) pages to ensure integrity<br />
Numbered pages<br />
Number the book<br />
White gridded<br />
Acid free paper p p (30 ( years) y )<br />
Name<br />
Dates<br />
PI<br />
Book No.<br />
Duplicate pages (differing opinions)<br />
Written in Pen (gel pens are good). good) Felt tip is bad. bad<br />
9
What pen is best? Roller tip in this test:<br />
http://www.swarthmore.edu/NatSci/cpurrin1/images/penexp2.jpg10
PPreparing epa ing a Ne New Lab Notebook<br />
Create a table of contents in the first 2<br />
pages<br />
List experiments by:<br />
Title<br />
Date<br />
Lab Book Number/Page Number<br />
11
For Every Experiment, Record:<br />
Start Date: on all pages<br />
Title<br />
Why: Brief statement of purpose, rationale<br />
How: Description/protocol with ref to origin<br />
Calculations, concentrations, dilutions, etc.,<br />
Include sketches, diagrams<br />
What Happened:<br />
All (protocol changes; on floor, kicked)<br />
Taped/stabled in Information<br />
Data<br />
Associated information: Samples, storage<br />
llocation, ti DData t Files, Fil Disks, Di k Images I<br />
What It means: Interpretation (and comments)<br />
What’s Next<br />
12
Attaching mate materials ials<br />
Computer generated data<br />
Photographic data<br />
Printed graphs (make as you go)<br />
Datasheet templates<br />
Product labels<br />
Who provided plasmids, etc.<br />
Notes (or pasted copies) of discussions, conversations,<br />
emails, readings related to exp’t design or goals<br />
Archive locations of plasmids, plasmids probes, probes etc. etc<br />
X-rays and other large items may be kept in a<br />
separate folder if they don’t fit in the lab notebook.<br />
13
Referencing attached data and<br />
samples<br />
Name<br />
Expt/Dates<br />
Book#/page#<br />
Name<br />
Expt/Dates<br />
Book#/page#<br />
Name<br />
Expt/Datees<br />
Book#/pagge#<br />
Label Notebooks, Boxes with Lab Book Book#/Page#<br />
14
Also record… eco d<br />
Discussions with others (time, names,<br />
issues). Good for future credit.<br />
Correspondence with colleagues (phone<br />
calls calls, ee-mails) mails).<br />
Names of individuals who have played<br />
any role in your research. research Technicians<br />
in core laboratories are also important<br />
15
Date<br />
Title<br />
Times<br />
Data<br />
Calculations<br />
Interpretation<br />
Future studies<br />
Correction<br />
16<br />
Taken from the online Linus Pauling Research Notebooks, http://osulibrary.orst.edu/specialcollections/rnb/index.html
FFrequently eq entl Fo Forgotten gotten Info Info…<br />
Serum lot number<br />
Antibody titer<br />
Oth Other people l involved i l d<br />
Centrifuge model,<br />
speed, temp<br />
Incubation time<br />
# washes<br />
Tube size and type<br />
Unexpected delays<br />
Growth medium used<br />
Buffer pH<br />
CCalculations l l ti<br />
Initial # cells<br />
Age/passage culture<br />
% Gel<br />
Growth stage g Bact.<br />
Condition of cells used<br />
17
The impo importance tance of timing timing…<br />
Always record, update, review…<br />
Record as you go<br />
Input paper towel and post-it info ASAP!<br />
At the LATEST, insert data the next day!<br />
Do a weekly checkup<br />
1 hour to review<br />
Make sure everything is attached securely, all<br />
summaries written written, future directions written written,<br />
record in table of contents<br />
Retractions have been associated with untimely<br />
record keeping<br />
18
If you o make a mistake mistake…<br />
They should be marked through with a<br />
single line and a full explanation of the<br />
error provided.<br />
Never e e ob obliterate te ate mistakes sta es with t ink oor<br />
cover them with corrective fluid.<br />
To a casual observer these actions may<br />
suggest improper reasons.<br />
19
Characteristics Helpful for Filing a<br />
PPatent t t<br />
Patents on intellectual property p p y are awarded not by y the first to<br />
file, but “first to invent” – first to record!<br />
Sign and date each entry<br />
HHave an iindependent d d witness i sign i and d ddate each h entry<br />
Witness cannot be co-inventor and must understand data<br />
Do not change entries (make a new entry and cross-reference<br />
tto prior i entry) t )<br />
Use past tense<br />
Never remove original pages or attachments<br />
RRecord d all ll di discussions/meetings/ideas i / ti /id relevant l t tto th the project j t<br />
Record as much detail as possible<br />
http://www.invention-protection.com/ip/publications/docs/A_Primer_On_Lab_Notebooks.html<br />
http://www http://www.uspto.gov/web/offices/pac/doc/general/<br />
uspto gov/web/offices/pac/doc/general/<br />
20
Paper vs. Electronic Lab<br />
Notebook:<br />
Paper notebooks have limited capacity<br />
Raw data is massive<br />
Data capture is archaic<br />
RRaw printouts i t t can no llonger be b stored t d<br />
Data formats are restricted<br />
2-D gels, photomicrographs<br />
RRecord-keeping d k i is i tedious t di<br />
Repetitive protocols lost, require manual entries<br />
Searching is difficult<br />
Context of work is often lost<br />
No connection to other people, projects, labs<br />
Paper records no longer legally required<br />
p g g y q<br />
Electronic records are legal<br />
21
Data Flow in the Laboratory<br />
Lab Automation<br />
& Robotics<br />
Equipment<br />
Interfacing<br />
Chromatography<br />
Data Systems<br />
DData t<br />
Analysis<br />
Data<br />
Warehousing<br />
Laboratory<br />
Laboratory Information<br />
Management<br />
Data<br />
IInstruments t t<br />
SSystems t (LIMS) Mi Mining i<br />
Electronic<br />
Laboratory<br />
Notebooks<br />
Data Acquisition q Information Processingg Knowledge g Management g<br />
22
Collabo Collaborative ati e ELN<br />
scientist<br />
writer<br />
clinician<br />
statistician<br />
research<br />
specialist<br />
lab<br />
manager<br />
nurse<br />
data<br />
encoder<br />
lab<br />
technician<br />
23
Demands for ELN:<br />
21 CFR part pa t 11*<br />
FDA rule initiated August 1997<br />
Sets standards for electronic submission<br />
Electronic records<br />
Thoroughly validated<br />
Automatic audit trails<br />
Results can be recreated<br />
Electronic signatures<br />
Unique identity<br />
Linked to e-record<br />
No grandfather clause<br />
*Code of Federal regulations Title 21, Part 11, Electronic Signatures<br />
http://www.accessdata.fda.gov/scripts/cdrh/cfdocs/cfcfr/cfrsearch.cfm?cfrpart=11<br />
24
Software for ELN<br />
25
How long to keep notebooks<br />
and records? eco ds?<br />
NIH policy mandates 3 years after the<br />
end of the project p j (grant (g funding g period) p )<br />
FDA policy mandates 10 years after use<br />
P Patent t t policy li mandates d t 23 years after ft<br />
issue of the patent.<br />
The organization with the longest policy<br />
has priority p y<br />
26
Who owns notebooks and<br />
data?<br />
When NIH awards a research grant to a<br />
university, any and all data collected as<br />
part of that funded project are owned<br />
by the grantee institution.<br />
Data books of all investigators (PI,<br />
postdocs, grad g ad students, stude ts, technicians)<br />
tec c a s)<br />
are the property of the institution.<br />
27
Who owns notebooks and<br />
data?<br />
The principal investigator is the<br />
steward.<br />
If the PI resigns or moves to another<br />
institution: st tut o<br />
Equipment and all data belong to the initial<br />
institution.<br />
Permission can be obtained to transfer the<br />
award, equipment q p and data.<br />
28
Resea Research ch data do not incl include: de<br />
Preliminary analyses<br />
Drafts of papers<br />
Plans for future research<br />
Peer reviews<br />
Physical objects (samples, tapes)<br />
Trade secrets or commercial<br />
information<br />
29
Applicabilit Applicability of rules les<br />
Data produced with Federal support that are<br />
cited publicly and officially.<br />
Data collected by institutions of higher education,<br />
hospitals, and non-profit institutions<br />
Does not apply l to state and d local l l govt<br />
Does not apply to commercial organizations<br />
Applies to new and competing awards made after<br />
the effective date of the amendment (11/8/1999)<br />
30
PPublic blic Access to Data<br />
The Office of Management and Budget<br />
(OMB) revised the regulations regarding<br />
public access to experimental data in<br />
1999<br />
The public can obtain access to<br />
experimental data through the Freedom<br />
of Information Act (FOIA)<br />
31
What happens if the PI<br />
refuses ef ses to compl comply? ?<br />
This is view as a material failure to<br />
comply with the terms of the award<br />
NIH would initiate appropriate<br />
enforcement e o ce e t acto action<br />
This could include withholding future<br />
support to the institution<br />
32
Consequences of Research<br />
Mi Misconduct d t<br />
Debarment from eligibility to receive Federal<br />
ffunds d<br />
Prohibition from service on advisory<br />
committees, peer review committees, or as<br />
consultants<br />
Certification of information sources that is<br />
forwarded by the institution<br />
Certification of data by the institution<br />
Supervision by the institution<br />
Submission of a correction or retraction of a<br />
published article<br />
http://ori.dhhs.gov/html/misconduct/administrative_actions.asp<br />
33
Best practice principles for<br />
leade leaders s of research esea ch ggroups: o ps<br />
Set standards and provide examples of<br />
good record keeping practices for the<br />
group<br />
Provide o de too tools s for o good record eco d keeping eep g<br />
Provide training for the group and<br />
review records<br />
Clarify data and research ownership and<br />
access rights<br />
34
Institutional Best Practice<br />
PPrinciples: inciples WUSTL<br />
“It is the policy of WUSTL to publish<br />
research results and to foster the use of<br />
University inventions, discoveries and and<br />
other works for the common good good. ”<br />
Intellectual Property Policy<br />
http://www.wustl.edu/policies/intelprop.html<br />
35
With care and attention to our<br />
work o k we e sho should ld avoid… a oid<br />
RETRACTION<br />
“We wish to retract our report (Science 2004;<br />
303 303, 371) iin which hi h we reported d that h ββ-N N<br />
acetylglucosamine-serine can be<br />
bi biosynthetically th ti ll incorporated i t d at t a defined d fi d site it in i<br />
myoglobin in E. coli. …<br />
36
RETRACTION<br />
RETRACTION…<br />
Regrettably Regrettably, through no fault of the authors, authors<br />
the lab notebooks are no longer available to<br />
replicate the original experimental conditions, conditions<br />
and we are unable to introduce this amino acid<br />
into myoglobin with the information and<br />
reagents currently in hand…”<br />
ZZ. Zhang et al al., UT at Austin Austin.<br />
Science 2009; 326, 326 1187. 1187<br />
37
Notebook Ethics<br />
"Many people say that it is the intellect<br />
which makes a great scientist. They are<br />
wrong: it is character." -- Albert<br />
Einstein<br />
38
Resea Research ch Integ Integrity it<br />
The h NIH Office Off of f Research h Integrity defines d f<br />
misconduct in science as:<br />
“Fabrication, ab cat o , falsification, a s cat o , pplagiarism, ag a s , oor ot other e<br />
practices that seriously deviate from those that<br />
are commonly accepted within the scientific<br />
community. This does not include honest error or<br />
hhonest t diff differences iin interpretations i t t ti or jjudgments d t<br />
of data.” (Federal Register 54:32446-32451, Aug<br />
8, 1989)<br />
“Th “The iintegrity t it of f research h ddepends d on<br />
accurate, detailed, organized, complete, and<br />
accessible data” Office of Research Integrity<br />
39
Tips to PPreserve ese e Data Integ Integrity it<br />
Never, e e , eever, e , remove e o e a page<br />
Fill consecutive pages<br />
Cross out unused parts p of pages p g<br />
Record all info as accurately as possible.<br />
Do NOT omit any result, no matter how odd.<br />
Cross out mistakes lightly (might need to<br />
recover)<br />
Write llegibly bl<br />
Put a full date (international date problems…)<br />
with month spelled out out.<br />
40
Mo More e Helpf Helpful l Tips<br />
The institution owns “your” your notebook<br />
Do NOT remove your notebook from the lab<br />
( (unless l this thi is i an acceptable t bl lab l b practice) ti )<br />
You may get permission to take copies at<br />
graduation graduation, but do not take original pages<br />
Do NOT read another person’s notebook<br />
without permission (even the PI won won’tt look at<br />
advanced researcher’s notebooks secretly).<br />
Should be kept for at least 5 years<br />
41
Data manipulation<br />
manip lation<br />
“Data may be excluded from the<br />
experimental i l results l only l if if you have h<br />
a sound reason to do so!”<br />
Mother Nature<br />
42
misleading presentation of<br />
ddata t ffrom ddata t management t<br />
errors<br />
Number of animals vs. number of<br />
determinations<br />
Non-standard normalization<br />
Not showing entire gel<br />
Varying Varying exposures of images<br />
INSIST ON VIEWING THE RAW <strong>DATA</strong>!<br />
43
As photographed<br />
Brightness Brightness-<br />
Contrast<br />
Adjustment<br />
44
As photographed<br />
Erasing<br />
45
The The Journal Journal of of Cell Cell Biology:<br />
Biology:<br />
• revokes the acceptance of about 1% of its<br />
papers due to inappropriate image<br />
image<br />
manipulation<br />
• 25% of accepted accepted p papers p p have at least 1 figure g<br />
with undocumented manipulation<br />
“If you misrepresent your data you are deceiving<br />
people.”<br />
Rossner, M. (2006). The Scientist 20:24 20:24-25.<br />
25.<br />
46
What can I do?<br />
Nature, instructions to authors (2006)<br />
• List all image acquisition and processing tools and<br />
software<br />
•Document key y image-gathering g g g setting g and<br />
manipulations in the Supplemental materials<br />
• Clearly demarcate borders between images collected at<br />
different times<br />
• Avoid use of touch up tools or deliberately obscuring<br />
parts of an image<br />
• Processing is acceptable bl only l if f applied l d across the h<br />
entire image<br />
•Be prepared p p to deliver the original, g , unprocessed p images g<br />
to the editor<br />
48
Slide sources: so ces<br />
•Gail Gail P. P Taylor PhD and Angelina<br />
Hoefle, UT San Antonio<br />
•Carmen S. S Dence MS, MS Washington<br />
University<br />
•Edward Ed Edward d Krug K PhD PhD Medical M di l College C ll<br />
of South Carolina<br />
•Douglas Perry PhD,<br />
IU School of Informatics<br />
49
Acknowledgements and<br />
Refe References ences<br />
Kathy Barker, At the Bench: A laboratory Navigator. Cold Spring<br />
Harbor Laboratory yppress.<br />
1998.<br />
Guidelines for Keeping a Laboratory Record. David Caprette, Rice<br />
University. http://www.ruf.rice.edu/~bioslabs/tools/notebook/notebook.html<br />
Guidelines for Keeping a Laboratory Notebook. Colin Purrington,<br />
Swarthmore Univ Univ.<br />
http://www.swarthmore.edu/NatSci/cpurrin1/notebookadvice.htm<br />
Laboratory Record Keeping. Todd E. Garabedian, Nature<br />
Biotechnology v. 15 (August 1997) pp.799-800<br />
http://biotech http://biotech.about.com/gi/dynamic/offsite.htm<br />
about com/gi/dynamic/offsite htm<br />
Office of Research Integrity, U.S. Dept. of Health and Human<br />
Services http://www.unh.edu/rcr/<br />
Responsible Conduct of Research Online Study Guide. Julie<br />
Si Simpson, UUniversity i it of f New N Hampshire H hi http://www.unh.edu/rcr/<br />
Francis L. Macrina, Scientific Integrity: An Introductory Text with<br />
Cases. ASM Press. 2000.<br />
Academic Research Record Record-keeping: keeping: Best Practices for Individuals Individuals,<br />
Group Leaders and Institutions. Academic Med. 81, 42-47; 2006,<br />
50
QQuestions estions<br />
51