21.07.2013 Views

Departments of Chemistry and Mathematics MSc Chemoinformatics

Departments of Chemistry and Mathematics MSc Chemoinformatics

Departments of Chemistry and Mathematics MSc Chemoinformatics

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

C h e m o i n f o r m a t i c s - the application <strong>of</strong> mathematical <strong>and</strong> computational techniques to the analysis <strong>of</strong><br />

chemical data<br />

<strong>Departments</strong> <strong>of</strong> <strong>Chemistry</strong><br />

<strong>and</strong> <strong>Mathematics</strong><br />

<strong>MSc</strong><br />

in<br />

<strong>Chemoinformatics</strong><br />

Student H<strong>and</strong>book<br />

2010-11


CONTENTS<br />

1. Welcome<br />

2. Purpose <strong>of</strong> this h<strong>and</strong>book<br />

3. <strong>Chemoinformatics</strong><br />

4. Being an <strong>MSc</strong> student<br />

5. Nature <strong>of</strong> the <strong>MSc</strong> course <strong>and</strong> the assessments<br />

6. Assessment for the <strong>MSc</strong> degree<br />

7. The course structure<br />

8. Module descriptions<br />

9. H<strong>and</strong>ing in written assignments, practical assignments <strong>and</strong> the dissertation<br />

10. Lateness penalties<br />

11. Size restrictions<br />

12. Disclosure <strong>of</strong> marks/feedback meetings<br />

13. Marking criteria<br />

14. Mark record sheets <strong>and</strong> course transcripts<br />

15. Appeals procedure<br />

16. Academic misconduct: University guidelines<br />

17. The York Award<br />

17. Careers<br />

18. Facilities<br />

Library<br />

Computing<br />

Photocopying<br />

Telephones<br />

19. The Department <strong>of</strong> <strong>Mathematics</strong><br />

20. Staff involved with the <strong>MSc</strong> <strong>and</strong> their research interests<br />

21. Supervisors<br />

22. Welfare Support Services<br />

23. Contact Details<br />

24. Presentations<br />

25. Dissertation Guidelines


1. Welcome<br />

First <strong>of</strong> all, welcome (or welcome back!) to the <strong>Mathematics</strong> department at York.<br />

2. Purpose <strong>of</strong> this h<strong>and</strong>book<br />

This h<strong>and</strong>book is designed to give you information about the<br />

<strong>MSc</strong> in <strong>Chemoinformatics</strong>.<br />

Our intention is to make the course structure <strong>and</strong> the assessment process as clear as possible.<br />

If you have any queries about the course as a whole please contact:<br />

For General Queries Nick Page: njp503@york.ac.uk<br />

For Examination <strong>and</strong> Assessment Queries Assessment secretary: maths-exams@york.ac.uk<br />

For Queries about Computing Facilities Henning Bostelmann: hb540@york.ac.uk<br />

For Queries about External Placement Julie Wilson: julie@ysbl.york.ac.uk<br />

This h<strong>and</strong>book also contains a variety <strong>of</strong> other information concerning the University, Careers<br />

Issues, The Department <strong>of</strong> <strong>Mathematics</strong>, Library <strong>and</strong> other facilities; <strong>and</strong> also briefly lists<br />

opportunities for developing scientific <strong>and</strong> personal contacts.<br />

You should make sure that you have the following documents, from which further information<br />

can be obtained:<br />

Graduate Prospectus<br />

Ordinances <strong>and</strong> Regulations <strong>of</strong> the University<br />

For general enquiries the staff <strong>of</strong> the university Graduate Schools Office, which is situated in<br />

the Student Services Building, between Computing Service <strong>and</strong> Vanbrugh College, will<br />

always be glad to help you if they can, or to tell you where to go next. The telephone number<br />

for enquiries is 4684. The Assistant Registrar responsible for graduate students is Philip<br />

Simison (telephone number 2141). Other information is available on the World Wide Web at<br />

http://www.york.ac.uk/admin/gso<br />

Your supervisor will be able to help with queries on the course <strong>and</strong> the general interests <strong>of</strong><br />

graduate students in the university are looked after by the Students' Union <strong>and</strong> the Graduate<br />

Students' Association.<br />

3. <strong>Chemoinformatics</strong><br />

<strong>Chemoinformatics</strong> is a multi-disciplinary science at the interface <strong>of</strong> chemistry, mathematics<br />

<strong>and</strong> computer science, that involves the organisation, retrieval, management <strong>and</strong> processing <strong>of</strong><br />

chemical information. New technology, advances in methodology <strong>and</strong> increased


computational power have led to an enormous increase in the amount <strong>of</strong> data generated in<br />

analytical <strong>and</strong> computational chemistry. The manipulation, interpretation <strong>and</strong> presentation <strong>of</strong><br />

chemical information underpin contemporary scientific research <strong>and</strong> the chemical,<br />

biochemical <strong>and</strong> pharmaceutical industries are increasingly reliant upon intelligent data<br />

management <strong>and</strong> analysis. Industry today requires people adept at statistical treatment <strong>of</strong> data,<br />

data analysis, database design, information management <strong>and</strong> computer programming who also<br />

know their chemistry.<br />

The need for mathematically trained scientists to deal with this wealth <strong>of</strong> data has been<br />

recognised <strong>and</strong> chemoinformatics has been identified as an EPSRC industrial sector priority.<br />

However this is not yet reflected in the postgraduate courses available <strong>and</strong> chemoinformatics<br />

training is usually limited to a single module within a theoretical chemistry course. Courses at<br />

UMIST <strong>and</strong> Sheffield have been set up recently in response to an EPSRC initiative; the<br />

department <strong>of</strong> Information studies at Sheffield <strong>of</strong>fers an <strong>MSc</strong> in <strong>Chemoinformatics</strong> <strong>and</strong> the<br />

department <strong>of</strong> <strong>Chemistry</strong> at UMIST <strong>of</strong>fers an <strong>MSc</strong> entitled Cheminformatics. Both <strong>of</strong> these<br />

courses are targeted towards chemistry graduates.<br />

The course is suitable for mathematics <strong>and</strong> computer science graduates with an interest in the<br />

application <strong>of</strong> mathematical <strong>and</strong> computational techniques to chemical problems as well as<br />

mathematically–inclined students with a chemistry, physics or biology background.<br />

The proposed programme exploits the skills <strong>and</strong> interests in the departments <strong>of</strong> <strong>Mathematics</strong><br />

<strong>and</strong> <strong>Chemistry</strong> <strong>and</strong> addresses the need for interdisciplinary skills. It connects the advances in<br />

mathematics <strong>and</strong> statistics with the needs <strong>of</strong> industry <strong>and</strong> the life sciences while providing the<br />

students with the chance to develop generic skills <strong>and</strong> to undertake a substantial research<br />

project to be carried out either at the University or in industry. The course is appropriate for<br />

those students wishing to pursue research in this area <strong>and</strong> for those seeking a career in industry<br />

with employment opportunities likely in the pharmaceutical, chemical <strong>and</strong> information<br />

industries. The job market in chemoinformatics is buoyant as judged by the number <strong>of</strong><br />

advertisements for vacancies. It is anticipated that the majority <strong>of</strong> students will attend full time<br />

for a year, but there is flexibility to allow part-time study over two years with a variable<br />

programme structure.<br />

If you have ideas for a placement then please do follow up those ideas <strong>and</strong> also keep us<br />

informed. In any case you should read the Careers advice in section 19 <strong>of</strong> this document.<br />

4. Being an <strong>MSc</strong> student<br />

You will probably find that your life as a graduate student is rather different from what you<br />

were used to as an undergraduate. However, the differences are unpredictable <strong>and</strong> will vary<br />

with the individual. The following are some <strong>of</strong> the possibilities that you should be prepared<br />

for.


You may already be expecting a discontinuity in the nature <strong>and</strong> the amount <strong>of</strong> mathematics<br />

that you have to learn. This is similar to the discontinuity between school <strong>and</strong> university<br />

mathematics. You may be used to acquiring mathematical knowledge only from lectures; if<br />

so, you will have to acquire the habit <strong>of</strong> reading mathematical books <strong>and</strong> papers. You should<br />

try to develop the skill <strong>of</strong> being selective in the way you cover material; it is <strong>of</strong>ten not<br />

necessary to know every detail <strong>of</strong> every pro<strong>of</strong>, <strong>and</strong> even when you do need to underst<strong>and</strong> the<br />

details it can be helpful to start with a preliminary reading in which you concentrate on<br />

definitions <strong>and</strong> the statements <strong>of</strong> theorems, skipping the pro<strong>of</strong>s.<br />

You are one <strong>of</strong> a group <strong>of</strong> students working in the same area <strong>and</strong> if you tended to work on<br />

your own as an undergraduate, you might find that the <strong>MSc</strong> is a more social activity than you<br />

had expected. It can be very helpful <strong>and</strong> enjoyable to discuss problems <strong>and</strong> to share ideas with<br />

others. This is still true even if you are the sort <strong>of</strong> person who needs to be on their own to<br />

think effectively. You can still pick up the bones <strong>of</strong> an idea or a problem from conversation,<br />

without feeling that you really underst<strong>and</strong> it, <strong>and</strong> then go away <strong>and</strong> think about it by yourself<br />

until you do underst<strong>and</strong> it. It might be only after doing this that you have some contribution to<br />

make to a discussion. On the other h<strong>and</strong>, it can be very helpful in the development <strong>of</strong> your<br />

own ideas to talk about them to other people. Even if they don't respond, the act <strong>of</strong> expressing<br />

your ideas can help to clarify them to yourself. Unnoticed mistakes <strong>of</strong>ten come to light this<br />

way.<br />

If you are to get the maximum benefit from the course, there are two important don'ts to bear<br />

in mind: never be afraid to admit your ignorance, <strong>and</strong> never be afraid <strong>of</strong> asking a silly<br />

question. These are easy to state, <strong>and</strong> hard to follow. But you should try.<br />

Notice Boards<br />

The notice boards in the corridors show a lot <strong>of</strong> information that may be <strong>of</strong> interest to you.<br />

There is also a graduate notice-board: you should get into the habit <strong>of</strong> looking at this regularly.<br />

Seminars<br />

Seminars <strong>and</strong> other contributions from visiting speakers will be arranged specially for the<br />

<strong>MSc</strong>, <strong>and</strong> you will be notified <strong>of</strong> those.<br />

Also the department regularly invites mathematicians from other places to give seminars on<br />

their work; these usually take place on Wednesday afternoons at 4:30 pm in a room to be<br />

advertised. Tea is served beforeh<strong>and</strong>, at 4 pm (probably in either Goodricke Senior Common<br />

Room or in the Departmental C<strong>of</strong>fee Room G109). A list <strong>of</strong> each term's seminars is produced<br />

at the start <strong>of</strong> the term. Your supervisor will alert you to speakers who may be relevant to<br />

your interests. You should also see the departmental research list at<br />

http://www.york.ac.uk/depts/maths/research/welcome.htm


5. Nature <strong>of</strong> the <strong>MSc</strong> course <strong>and</strong> the assessments<br />

The course comprises:<br />

Lectures, seminars, computer workshops, a dissertation, a short presentation summarising<br />

your dissertation <strong>and</strong> (if possible) a placement.<br />

Placements<br />

These are not guaranteed for everyone but they are intended to be an important part <strong>of</strong> the<br />

course. If you do go on a placement outside the University <strong>of</strong> York, the dissertation should<br />

ideally be written around an industrially-inspired project while on that placement.<br />

Placement opportunities are currently being sought <strong>and</strong> an up-to-date list <strong>of</strong> opportunities<br />

currently available will be maintained by Dr Julie Wilson <strong>and</strong> circulated from time to time.<br />

Lectures<br />

Outline syllabuses for those modules in the Autumn Term are available from the lecturer<br />

responsible for teaching the module <strong>and</strong> some cases on a website (to be announced).<br />

Seminars, laboratory workshops<br />

Seminars <strong>and</strong> laboratory workshops will be an integral part <strong>of</strong> the lecture courses.<br />

Assessments<br />

The assessments comprise:<br />

marked written assignments,<br />

marked practical assignments or projects,<br />

an assessment <strong>of</strong> the dissertation, <strong>and</strong><br />

formal examinations.<br />

Assessments arising from the marked written <strong>and</strong> practical assignments will accumulate<br />

throughout the first two terms <strong>of</strong> the course. For some courses there will be a formal closed<br />

examination. A brief summary <strong>of</strong> the assessments is given below.<br />

Written assignments <strong>and</strong> practical assignments<br />

These will be marked <strong>and</strong> returned by the lecturer giving the course.<br />

Marking guidelines<br />

General guidelines for marking Masters work are given below. You should also read the<br />

assessment details given in the module synopses <strong>of</strong> your course outline.


Project reports<br />

Abstract (5%) Introduction (20%) Methods (20%) Results <strong>and</strong> Analysis<br />

(25%)<br />

80– 100 Excellent.<br />

Correct length.<br />

Informative,<br />

Grammatical<br />

<strong>and</strong> concise.<br />

70 – 79 Very good.<br />

Correct length.<br />

Informative <strong>and</strong><br />

Mainly<br />

grammatical<br />

<strong>and</strong> concise.<br />

60 – 69 Good.<br />

Some what less<br />

Grammatical<br />

<strong>and</strong> concise.<br />

Very clearly defines the<br />

question. Fully<br />

evaluates the current<br />

knowledge<br />

through appropriate<br />

references. Critical,<br />

thoughtful <strong>and</strong> incisive.<br />

Defines <strong>and</strong> evaluates<br />

the question well<br />

through a good selection<br />

<strong>of</strong> references. A good<br />

summary <strong>of</strong> the subject.<br />

Defines <strong>and</strong> evaluates<br />

the question well with a<br />

reasonable selection <strong>of</strong><br />

references but with a<br />

less well constructed<br />

storyline than for above.<br />

Very clear <strong>and</strong> precise<br />

description <strong>of</strong> very well<br />

chosen <strong>and</strong> cleverly<br />

designed methods.<br />

An elegantly designed<br />

programme with good quality<br />

documentation.<br />

Clear <strong>and</strong> precise description<br />

<strong>of</strong> well-chosen <strong>and</strong> well<br />

designed methods.<br />

An elegantly designed<br />

programme with good quality<br />

documentation.<br />

Reasonable description <strong>of</strong><br />

methods but less well<br />

chosen or designed than for<br />

above. A well designed <strong>and</strong><br />

documented programme.<br />

Excellent <strong>and</strong> entirely<br />

appropriate analysis<br />

Excellent description that<br />

goes beyond the obvious.<br />

Programme works <strong>and</strong><br />

goes beyond specification<br />

requirements.<br />

Very good <strong>and</strong> appropriate<br />

analysis with sensible<br />

deductions. Programme<br />

works <strong>and</strong><br />

specification requirements<br />

clearly met.<br />

Good analysis but may be<br />

inappropriate in places.<br />

Misses some evidence.<br />

Programme mainly<br />

worked.<br />

Discussion (25%) Reference list (5%)<br />

Clever, complete <strong>and</strong><br />

wise discussion <strong>of</strong><br />

results in relation to those<br />

<strong>of</strong> others.<br />

Excellent link to aims <strong>of</strong> the<br />

project <strong>and</strong><br />

impressive further ideas.<br />

Good discussion <strong>of</strong> results<br />

in relation to those <strong>of</strong><br />

others. Good<br />

link to aims <strong>of</strong> the project<br />

<strong>and</strong> sound further ideas.<br />

Reasonable discussion<br />

<strong>of</strong> results in relation to<br />

those <strong>of</strong> others.<br />

Reasonable link to aims<br />

<strong>of</strong> the project but some<br />

impractical further ideas.<br />

Appropriate number <strong>of</strong><br />

references that are well<br />

formatted, <strong>and</strong><br />

consistently cited.<br />

Appropriate balance <strong>of</strong><br />

general <strong>and</strong> specific<br />

references <strong>and</strong> evidence<br />

<strong>of</strong><br />

extensive literature<br />

searching.<br />

Good number <strong>of</strong><br />

references<br />

which are well formatted,<br />

<strong>and</strong><br />

consistently cited.<br />

Appropriate balance <strong>of</strong><br />

general <strong>and</strong> specific<br />

references <strong>and</strong> evidence<br />

<strong>of</strong><br />

some literature searching<br />

Reasonable number <strong>of</strong><br />

references which are<br />

mainly well formatted, <strong>and</strong><br />

consistently cited.<br />

Inappropriate balance <strong>of</strong><br />

general <strong>and</strong> specific<br />

references. Limited<br />

evidence<br />

<strong>of</strong> literature searching.


50 – 59 Reasonable.<br />

Some<br />

sentences<br />

wordy or<br />

ungrammatical.<br />

Adequate<br />

description <strong>of</strong><br />

methods but<br />

unclear in<br />

places.<br />

40 – 49 Weak.<br />

More than 20<br />

words too<br />

long/short.<br />

Wordy <strong>and</strong><br />

ungrammatical.<br />

0 – 39 Distinctly<br />

inadequate or<br />

absent<br />

Adequately defines the<br />

question through<br />

adequate references<br />

with a poorly structured<br />

storyline.<br />

Poorly defines the<br />

question by not placing<br />

it in context by the use<br />

<strong>of</strong> a poor selection <strong>of</strong><br />

references. Weak<br />

storyline <strong>and</strong> limited<br />

underst<strong>and</strong>ing.<br />

The purpose <strong>of</strong> the<br />

research is very unclear.<br />

Very poor choice <strong>of</strong><br />

references. Lack <strong>of</strong><br />

underst<strong>and</strong>ing.<br />

Adequate selection or<br />

design <strong>of</strong> methods but some<br />

poor choices.<br />

A reasonably well-designed<br />

programme.<br />

Unclear description <strong>of</strong><br />

methods that were poorly<br />

chosen or poorly designed.<br />

Reasonable attempt at<br />

programme design.<br />

Very unclear description <strong>of</strong><br />

methods. Very limited<br />

selection <strong>of</strong> poorly chosen<br />

methods. Poorly designed<br />

programme.<br />

Adequate but somewhat<br />

inappropriate analysis.<br />

Programme mainly<br />

worked.<br />

Inadequate or mainly<br />

inappropriate analysis.<br />

Programme doesn't meet<br />

several <strong>of</strong> the<br />

specifications or only<br />

partially works.<br />

Very weak or no real<br />

analysis<br />

which was not appropriate.<br />

Programme doesn't meet<br />

many <strong>of</strong> the specifications.<br />

Adequate discussion <strong>of</strong><br />

results in relation to<br />

those <strong>of</strong> others. Limited link<br />

to aims <strong>of</strong> the project <strong>and</strong><br />

impractical<br />

further ideas.<br />

Inadequate discussion <strong>of</strong><br />

results in relation to those<br />

<strong>of</strong> others.<br />

Inadequate or no link to<br />

aims <strong>of</strong> the project with<br />

impractical or no further<br />

ideas.<br />

Very weak discussion<br />

with little reference to<br />

the results <strong>of</strong> others <strong>and</strong> no<br />

link to the aims <strong>of</strong> the<br />

project.<br />

Reasonable number <strong>of</strong><br />

references with format<br />

<strong>and</strong> citation weaker than<br />

above.<br />

Does not have the<br />

appropriate balance <strong>of</strong><br />

general <strong>and</strong> specific<br />

references. Little<br />

evidence <strong>of</strong> literature<br />

searching.<br />

Inappropriate number <strong>of</strong><br />

references with poor<br />

format<br />

<strong>and</strong> citation Does not<br />

have the appropriate<br />

balance <strong>of</strong> general <strong>and</strong><br />

specific references.<br />

Distinctly inadequate or<br />

absent.


Report presentation<br />

80 – 100<br />

70 – 79<br />

60 – 69<br />

50 – 59<br />

40 – 49<br />

0 – 39<br />

Beautifully written. Very clear. Very few spelling or grammatical errors which are minor. Very<br />

attractive layout which is clear <strong>and</strong> entirely appropriate. Excellent use <strong>of</strong> figures <strong>and</strong> tables<br />

which are referenced in the text. Figures easy to underst<strong>and</strong> <strong>and</strong> with complete legends.<br />

Consistence in style. Appropriate appendices.<br />

Well written <strong>and</strong> fairly clear. A few spelling or grammatical errors. Reasonably attractive.<br />

Possibly some inconsistency in style but generally nice layout. Good figures <strong>and</strong> tables which<br />

are referenced in the text. Most <strong>of</strong> the figures easy to underst<strong>and</strong>. Appropriate appendices.<br />

Adequately written but with some laboured or poorly written sections. Adequate use <strong>of</strong> figures but which may be<br />

hard to underst<strong>and</strong> <strong>and</strong>/or untidy in places. Some spelling or grammatical errors. Inconsistency in style. Possibly<br />

some unnecessary appendices which those that should be in the main body <strong>of</strong> the work.<br />

Poorly written <strong>and</strong> rather scruffy. Many spelling or grammatical errors. Inconsistency in style.<br />

Little thought to layout. Inadequate use <strong>of</strong> figures which are hard to underst<strong>and</strong>, untidy <strong>and</strong> may not be referred to in<br />

the text. Inappropriate use <strong>of</strong> appendices.<br />

Poorly written <strong>and</strong> seriously flawed use <strong>of</strong> English. Many spelling <strong>and</strong> grammatical errors which make the work hard<br />

to underst<strong>and</strong>. Poor layout. Figures inappropriate or absent, difficult to underst<strong>and</strong>, very untidy <strong>and</strong> not referred to in<br />

text. Inappropriate use <strong>of</strong> appendices.<br />

Very poorly written <strong>and</strong> seriously flawed use <strong>of</strong> English which makes the work nearly<br />

impossible to underst<strong>and</strong>. Many spelling <strong>and</strong> grammatical errors. Very poor layout. Figures<br />

inappropriate or absent, difficult to underst<strong>and</strong>, very untidy <strong>and</strong> not referred to in text.<br />

Inappropriate use <strong>of</strong> appendices.


Supervisors Assessment <strong>of</strong> Attitude <strong>and</strong> Approach<br />

80 – 100<br />

70 – 79<br />

60 – 69<br />

50 – 59<br />

40 – 49<br />

0 – 39<br />

A great deal <strong>of</strong> initiative <strong>and</strong> maturity shown with minimal supervision required.<br />

Highly capable with a real interest in developing technical <strong>and</strong>/or research skills as<br />

appropriate. Worked very hard <strong>and</strong> was very willing to be involved in all aspects <strong>of</strong><br />

the work. Where relevant, very good team member with excellent interactions with<br />

members <strong>of</strong> the lab. Very well organised <strong>and</strong> efficient.<br />

Good initiative <strong>and</strong> maturity shown with limited supervision required. Capable <strong>and</strong><br />

interested in developing technical <strong>and</strong>/or research skills as appropriate. Worked hard<br />

<strong>and</strong> was willing to be involved in all aspects <strong>of</strong> the work. Where relevant, good team<br />

member with good interactions with members <strong>of</strong> the lab. Well organised <strong>and</strong><br />

efficient.<br />

Reasonably good initiative <strong>and</strong> maturity shown but requiring more supervision <strong>and</strong><br />

guidance than above. Reasonably capable <strong>and</strong> interested in developing technical<br />

<strong>and</strong>/or research skills as appropriate. Worked fairly hard <strong>and</strong> was willing to be<br />

involved in most <strong>of</strong> the aspects <strong>of</strong> the work.<br />

Where relevant, reasonable team member with acceptable interactions with<br />

members <strong>of</strong> the lab.<br />

Reasonably well organised <strong>and</strong> efficient.<br />

Required some direction <strong>and</strong> help with the work <strong>and</strong> solving problems. Adequately<br />

capable <strong>and</strong> interested. Student worked sufficiently hard but could have done more.<br />

Where relevant, acceptable team member <strong>and</strong> interactions with members <strong>of</strong> the lab<br />

adequate. Organisation <strong>and</strong> efficiency adequate but could have been better.<br />

Required a lot <strong>of</strong> direction <strong>and</strong> help with the work <strong>and</strong> solving problems. Little effort<br />

or commitment, poor time keeping. Only superficial interest shown. Where relevant,<br />

poor team member. Poor organisation <strong>and</strong> inefficient working.<br />

Required pushing to do the minimum. Inadequate effort <strong>and</strong> commitment. No<br />

interest shown.<br />

Where relevant, no integration with the team. Unorganised <strong>and</strong> inefficient. Failed to<br />

appear with no prior explanation.


Dissertation<br />

Your dissertation will be the main fruit <strong>of</strong> your year <strong>of</strong> work as a graduate student, <strong>and</strong><br />

naturally you will want it to be something you can be proud <strong>of</strong>. (However, you should not let<br />

your own high st<strong>and</strong>ards tempt you to delay the writing <strong>of</strong> your dissertation, or to try to make<br />

it better than it should be for the degree for which you are submitting.) The dissertation will be<br />

written either on placement or at York. There will be an individual at any placement who will<br />

also help supervise the dissertation.<br />

The University's requirements for the format <strong>and</strong> binding <strong>of</strong> dissertations <strong>and</strong> theses are set out<br />

in the pamphlet Regulations relating to Theses, which can be obtained from the Graduate<br />

Office. These regulations are enforced when the dissertation is lodged with the Library. Since<br />

we do not do this the expectations <strong>of</strong> the <strong>Mathematics</strong> department for the <strong>MSc</strong> dissertations<br />

are somewhat more flexible<br />

Three printed copies <strong>of</strong> each dissertation should be h<strong>and</strong>ed in.<br />

These will normally be assessed by the dissertation supervisor <strong>and</strong> a second marker, <strong>and</strong> by<br />

the external examiner. As far as length <strong>of</strong> the dissertation goes:<br />

Your dissertation should normally consist <strong>of</strong> between 50 <strong>and</strong> 80 A4 pages.<br />

A guideline for the marking scheme <strong>of</strong> the dissertations is available at the end <strong>of</strong> this<br />

document as part <strong>of</strong> item 25 on Dissertation Guidelines.<br />

University requirements for the presentation <strong>of</strong> dissertation theses.<br />

Dissertations for the degrees <strong>of</strong> <strong>MSc</strong> shall be presented in accordance with the requirements<br />

given in the University guidelines:<br />

http://www.york.ac.uk/admin/gso/exams/thesis/requirements.htm<br />

The requirements refer primarily to printed copies. Electronic copies shall be identical in<br />

presentation to the printed copies submitted or deposited.<br />

Quality <strong>of</strong> reproduction<br />

All copies shall be legible <strong>and</strong> <strong>of</strong> good print quality.<br />

Text should be set with even or proportionate spacing between words. Word division at the<br />

ends <strong>of</strong> lines should be avoided if possible. In typescript or printout, one-<strong>and</strong>-a-half line<br />

spacing or equivalent shall be used, although double spacing may be used if necessary. Lines<br />

that contain mathematical formulae, diacritical marks or strings <strong>of</strong> capital letters may need<br />

additional space. It should be clear when a new paragraph is starting <strong>and</strong> where matter in the<br />

text is being quoted.<br />

Tables



If there are relatively few tables, each shall appear as near as possible to the first reference to it<br />

in the text. If there are frequent references to tables, or if there are many tables, they may be<br />

collected together at the end <strong>of</strong> the text, possibly as an appendix.<br />

Each table shall, if possible, appear complete on one page. A table shall be neither spaced out<br />

to fill the available space nor reduced to fit a small space. Tables shall normally be in the same<br />

orientation as the main text.<br />

Each table shall have a number <strong>and</strong> title. The number shall precede the title. The title shall<br />

describe the content <strong>of</strong> the table. If a table occupies more than one page, its number shall be<br />

given on each page.<br />

The character size used in tables shall be large enough to allow the table to be reproduced<br />

without risk to legibility. The presentation <strong>of</strong> a series <strong>of</strong> tables shall be consistent in character<br />

size, use <strong>of</strong> space <strong>and</strong> other typographic treatment.<br />

Headings 
<br />

Headings shall be used to indicate the hierarchical structure <strong>of</strong> the text. There shall normally<br />

be not more than four levels, including the chapter headings as the first level. Each level shall<br />

be distinguished from the others by position or typography, or both. The space that precedes<br />

<strong>and</strong> follows a heading shall be not less than the space between paragraphs. Headings shall not<br />

normally be centred (except, possibly, for chapter <strong>and</strong> part headings).<br />

Illustrations 
<br />

An illustration should normally appear near the first reference made to it in the text. The<br />

desirability <strong>of</strong> grouping illustrations at the back <strong>of</strong> a volume or in a separate volume shall be<br />

considered if they need to be compared with one another, are referred to frequently in the text,<br />

or need to be separate because <strong>of</strong> their nature, e.g. their size or method <strong>of</strong> production.<br />

Illustrations shall be <strong>of</strong> a technical quality that reproduces well.<br />

Every illustration, including appendices <strong>and</strong> material that cannot be bound, shall be included<br />

in the list <strong>of</strong> illustrations with page numbers or other identification.<br />

Each label within an illustration shall be either so positioned that the part it applies to cannot<br />

be confused with any other, or linked to the part by a thin line. The lettering shall be large<br />

enough <strong>and</strong> clear enough to remain legible when the illustration is photographed <strong>and</strong><br />

subsequently copied. A short legend shall appear beneath each illustration.<br />

Numbering 
<br />

Arabic numerals shall normally be used for numbering all sequences within a dissertation.<br />

Page numbers shall be visibly clear <strong>of</strong> the text. The pages <strong>of</strong> the thesis or dissertation shall be<br />

numbered in a single sequence beginning with the title page, which shall be counted but not<br />

numbered, <strong>and</strong> including pages that carry tables, illustrations, appendices, etc.


Chapters shall be numbered from the start to the finish <strong>of</strong> the thesis or dissertation.<br />

Appendices shall be numbered in a separate sequence from that used for chapters.<br />

Illustrations shall be numbered consecutively in a single sequence, generally without<br />

distinguishing between different kinds <strong>of</strong> illustration. Tables within the text shall be numbered<br />

consecutively in a single sequence, separate from illustrations.<br />

References
<br />

A bibliographical reference shall be given for every work, published or unpublished, cited in<br />

the text. References may be identified by one <strong>of</strong> two methods, either:<br />

by numbers typed as superscripts, or, if on the line, in brackets, immediately following the<br />

relevant word or phrase in the text; or<br />

by citing the author's name <strong>and</strong> the date <strong>of</strong> publication in brackets immediately following<br />

the relevant word or phrase in the text.<br />

Sequence <strong>of</strong> material
<br />

Material shall be arranged in the following sequence:<br />

Title <strong>and</strong> subtitle. The title page <strong>of</strong> every volume shall give the following information in the<br />

order listed:<br />

the full title <strong>of</strong> the thesis or dissertation <strong>and</strong> any subtitle;<br />

the total number <strong>of</strong> volumes, if more than one, <strong>and</strong> the number <strong>of</strong> the particular volume;<br />

the full name <strong>of</strong> the author, followed, if desired, by any qualifications <strong>and</strong> distinctions;<br />

the qualification for which the thesis or dissertation is submitted (e.g., PhD or MA);<br />

the name <strong>of</strong> the University;<br />

the name <strong>of</strong> the Department or Centre in which the research was conducted;<br />

the month <strong>and</strong> year <strong>of</strong> submission.<br />

Abstract. The abstract shall follow the title page. It shall provide a synopsis <strong>of</strong> the thesis or<br />

dissertation, stating the nature <strong>and</strong> scope <strong>of</strong> work undertaken <strong>and</strong> the contribution made to<br />

knowledge in the subject treated. It shall appear on its own on a single page <strong>and</strong> shall not<br />

exceed 300 words in length. The abstract <strong>of</strong> the thesis or dissertation may, after the award <strong>of</strong><br />

the degree, be published by the University in any manner approved by the Senate, <strong>and</strong> for this<br />

purpose, the copyright <strong>of</strong> the abstract shall be deemed to be vested in the University.<br />

List <strong>of</strong> contents. It shall list in sequence, with page numbers, all relevant sub-divisions <strong>of</strong> the<br />

thesis or dissertation, including the titles <strong>of</strong> chapters, sections <strong>and</strong> subsections; the list <strong>of</strong><br />

references; the bibliography (if any); the list <strong>of</strong> abbreviations <strong>and</strong> other functional parts <strong>of</strong> the<br />

whole thesis or dissertation; appendices (if any); <strong>and</strong> the index (if any). If a thesis or<br />

dissertation consists <strong>of</strong> more than one volume, the contents <strong>of</strong> the whole thesis or dissertation<br />

shall be shown in the first volume <strong>and</strong> the contents <strong>of</strong> subsequent volumes in a separate


contents list in the appropriate volume.<br />

List <strong>of</strong> tables, illustrations, etc (if any).<br />

List <strong>of</strong> accompanying material (if any).<br />

Preface (if any).<br />

Acknowledgements (if any).<br />

Author's declaration. When submitting the thesis or dissertation, the author shall draw<br />

attention to any material contained in it that has been presented before. If the thesis or<br />

dissertation is based on joint research, the nature <strong>and</strong> extent <strong>of</strong> the author's individual<br />

contribution shall be stated. The declaration shall follow the acknowledgement, under a<br />

separate heading.<br />

Text , divided into sections (Introduction; Materials <strong>and</strong> Methods; Results <strong>and</strong> Discussion,<br />

Conclusions).<br />

Appendices (if any). Appendices may consist <strong>of</strong> material <strong>of</strong> considerable length or <strong>of</strong> lists,<br />

documents, commentaries, tables or other evidence that, if included in the main text, would<br />

interrupt its flow. The presentation <strong>of</strong> appendices, including character size, shall be consistent<br />

with the style <strong>of</strong> the main text.<br />

Definitions (if any). Definitions <strong>of</strong> any terms specific to the thesis or dissertation, including<br />

abbreviations <strong>and</strong> codes used in illustrations, shall be given.<br />

Glossary (if any). Terms that require explanation shall be defined in a glossary, which shall<br />

include a key to any abbreviations used. For an abbreviation not in common use, the term shall<br />

be given in full at the first instance followed by the abbreviation in brackets.<br />

List <strong>of</strong> references.<br />

Index (if any).<br />

6. Assessment weightings for the <strong>MSc</strong> degree<br />

Marking <strong>of</strong> examinations is generally out <strong>of</strong> 100 with a pass mark <strong>of</strong> 50%.<br />

Marks from returned assessments are considered to be “un-moderated” <strong>and</strong> are subject to<br />

alteration by examiners until the final examiners‟ meeting at the end <strong>of</strong> the course. An external<br />

examiner assists with the formal assessment process providing an independent view when<br />

borderline cases are considered <strong>and</strong> also giving advice on the whole assessment procedure.<br />

Your overall results will be classified by one <strong>of</strong> the three possible outcomes:


Distinction : 70% or above overall weighted average <strong>and</strong> 70% or above mark on<br />

dissertation.<br />

Pass : 50% - 69% overall weighted average.<br />

Fail: less than 50% overall weighted average.<br />

These criteria state the necessary conditions for a Distinction <strong>and</strong> a Pass. When these<br />

necessary conditions are met, it remains at the discretion <strong>of</strong> the BoE whether or not to award<br />

the respective degree classification.<br />

The criteria for being recommended for a Pass with Distinction are as follows. C<strong>and</strong>idates will<br />

need to:<br />

(i) attain a final mark <strong>of</strong> at least 70%,<br />

(ii) achieve a mark <strong>of</strong> at least 70% in the dissertation, <strong>and</strong><br />

(iii) satisfy the external examiner from their coursework, examination results <strong>and</strong><br />

dissertation that they are <strong>of</strong> a st<strong>and</strong>ard worthy <strong>of</strong> distinction.<br />

The award <strong>of</strong> the <strong>MSc</strong> degree depends upon completion <strong>of</strong> all aspects <strong>of</strong> the taught course<br />

(comprising 100 credits) <strong>and</strong> the completion <strong>of</strong> a dissertation. The dissertation for the <strong>MSc</strong><br />

will comprise 80 credits. The total number <strong>of</strong> credits taken must be 180.<br />

Important: Your performance in the taught part <strong>of</strong> the <strong>MSc</strong> will be evaluated around the end<br />

<strong>of</strong> the Easter Vacation. If you have scored less than 50 in the taught part <strong>of</strong> the course, you<br />

may not be allowed to proceed to the dissertation stage. The <strong>MSc</strong> Steering Committee will<br />

decide if you are able to continue in this case.<br />

Size restrictions<br />

Word counts should be given on all reports, essays or other assessed work that<br />

specifies a word limit. If you do not include a word count on a project report, five marks (<strong>of</strong><br />

one hundred) will be deducted from the presentation component <strong>of</strong> the assessment.<br />

Misrepresentation <strong>of</strong> the word count will be treated as Academic Misconduct. The word count<br />

should include the main text but not the abstract, tables, figures, reference list or appendices.<br />

You should also bear in mind that essays that exceed the word limit may also contain<br />

irrelevant information, repetition <strong>and</strong>/or other features that could affect your mark. Work that<br />

exceeds the specified word length will be penalised. The work will be marked as normal (out<br />

<strong>of</strong> 100), then one mark will be deducted for every 200 words above word limits between 4000<br />

<strong>and</strong> 8000 or every 100 words above word limits <strong>of</strong> 4000 or fewer. The penalty applies only to<br />

that component <strong>of</strong> assessment, for example, a project report that exceeds the word limit, not<br />

the project mark as a whole.<br />

Project reports require an abstract <strong>of</strong> approximately 250 words.<br />

7. The course structure<br />

The course is split into three terms: Autumn, Spring <strong>and</strong> Summer.<br />

In the Autumn term students will take the following taught modules with associated practical<br />

classes:


Research <strong>and</strong> Presentation Skills (10 credits)<br />

Statistics <strong>and</strong> Probability (10 credits)<br />

Introduction to Signal Processing (10 credits)<br />

Introduction to Programming (PYTHON) (10 credits)<br />

Structural Bioinformatics (10 credits)<br />

In the Spring term the courses exp<strong>and</strong> upon the topics <strong>of</strong> the first term including more<br />

advanced theory <strong>and</strong> techniques. The m<strong>and</strong>atory taught modules <strong>and</strong> practical classes are:<br />

Chemometrics (10 credits)<br />

Introduction to Machine Learning (10 credits)<br />

Biocomputing <strong>and</strong> Web Applications (10 credits)<br />

Chemical informatics (10 credits)<br />

In addition students will choose one module from the following:<br />

Current Research Topics (10 credits)<br />

Stochastic Processes (10 credits)<br />

The Summer term is devoted entirely to the work towards the <strong>MSc</strong> dissertation. Placements<br />

may be within the University or be a project supervised by someone from one <strong>of</strong> our industrial<br />

partners.<br />

Abbreviations: JW (Julie Wilson, Maths/<strong>Chemistry</strong>), JL (Jason Levesley, Maths), ZC (Zaq<br />

Coelho, Maths), RL (Richard Law, Biology), GD (Gustav Delius, Maths), GM (Garib<br />

Murshudov, <strong>Chemistry</strong>), IM (Ian McIntosh, Maths), JC (James Cussens, Computer Science),<br />

JP (Jon Pitchford, Biology), RA (Rashid Asher, Computer Science), BW (Bo Wang, Maths),<br />

YZ (Yuriy Zakharov, Electronics), RH (Rod Hubbard, <strong>Chemistry</strong>), KC (Kevin Cowtan,<br />

<strong>Chemistry</strong>).


Module<br />

Title<br />

Transferable <strong>and</strong><br />

Generic Skills<br />

Statistics<br />

<strong>and</strong> Probability<br />

Introduction to Signal<br />

Processing<br />

Introduction to<br />

Programming<br />

(PYTHON)<br />

Structural<br />

Bioinformatics<br />

Chemometrics<br />

Introduction to Machine<br />

Learning<br />

Current Research<br />

Stochastic Processes<br />

Biocomputing <strong>and</strong> Web<br />

Applications<br />

Chemical Informatics<br />

TOTAL 100 *<br />

(*) You should choose 10 credits <strong>of</strong> suitable modules to complement the compulsory ones in<br />

the Spring term.<br />

Do not hesitate to approach the organiser <strong>of</strong> a module if you have a query.<br />

8. Module descriptions<br />

Credits Term Written<br />

Assign-<br />

ments<br />

Projects Exam Taught<br />

by<br />

10 Autumn Yes Yes No JL, ZC,<br />

RL, GD,<br />

10 Autumn Yes Workshop<br />

assessments<br />

10 Autumn Yes No Yes<br />

JP, IM<br />

Yes GM<br />

10 Autumn Yes Yes No JC<br />

YZ<br />

10 Autumn Yes Yes No RH<br />

10 Spring Yes Yes No JW<br />

10 Spring Yes No No JC<br />

10 Spring Yes Yes No JW<br />

10 Spring Yes No Yes JP, BW<br />

10 Spring Yes Yes No KC<br />

10 Spring No No Yes GM<br />

Dissertation 80 Approximately April to August; 50-80 pages


00580026 Transferable & Generic Skills Autumn term, 10 Credits<br />

Module Organiser: Jon Pitchford, Room: Biology S Block, Tel: 8559, e-mail:<br />

jwp5@york.ac.uk<br />

Aims: This module enables the development <strong>of</strong> personal transferable competencies required<br />

for the management role. Particular emphasis is given to the skills needed to function<br />

effectively as a scientist working to apply mathematics to real-world problems. A series <strong>of</strong><br />

interactive sessions in the first week <strong>of</strong> teaching (before formal lectures begin) provides initial<br />

training in the main subjects, <strong>and</strong> these skills are further developed in the other Masters<br />

modules.<br />

Learning objectives: Acquire basic research skills, skills in writing mathematics, team work<br />

skills <strong>and</strong> presentation skills.<br />

Syllabus:<br />

Teambuilding fundamentals;<br />

Group projects <strong>and</strong> presentations;<br />

Literature resources;<br />

Library tour, safety at work;<br />

Typesetting s<strong>of</strong>tware: Micros<strong>of</strong>t Word <strong>and</strong> LaTeX;<br />

Giving talks, writing styles.<br />

Recommended texts: Given during the course <strong>of</strong> the module.<br />

Teaching: Teaching takes place during the orientation week. The remaining credits are<br />

accounted for by the requirement that the students become effective users <strong>of</strong> several technical<br />

s<strong>of</strong>tware packages: G.I.Sci. s<strong>of</strong>tware, Maple, Matlab, R, SPSS, <strong>and</strong> UNIX applications. Most<br />

<strong>of</strong> this learning is undertaken in the students' own time during the Autumn <strong>and</strong> Spring terms,<br />

with guidance from course lecturers, problems classes, web based materials <strong>and</strong> textbooks.<br />

Assessment: There are no assessments for this module.


0580050 Statistics <strong>and</strong> Probability Theory Autumn term, 10 Credits<br />

Module Organiser: Garib Murshudov (<strong>Chemistry</strong>), Room: B/K/065, Tel: 8253, e-mail:<br />

garib@ysbl.york.ac.uk<br />

Aims: Information flow <strong>and</strong> exchange is an integrated part <strong>of</strong> the modern life. Size <strong>and</strong><br />

amount <strong>of</strong> data sets are ever increasing. Data analysis is an important part <strong>of</strong> underst<strong>and</strong>ing a<br />

variety <strong>of</strong> data types. This course aims to introduce the most popular data analytical<br />

techniques <strong>and</strong> their application to students using one <strong>of</strong> the available packages (examples are<br />

given using the package R).<br />

Learning objectives: At the end <strong>of</strong> the course students should be aware <strong>of</strong> the role <strong>of</strong><br />

statistics <strong>and</strong> be familiar with a number <strong>of</strong> practical techniques from data analysis.<br />

Syllabus: In the first part <strong>of</strong> the course, basic statistical terms <strong>and</strong> concepts are introduced. In<br />

the second part, the most widely used data analysis techniques are described <strong>and</strong> illustrated.<br />

Basic Introduction to R: this lecture introduces basic comm<strong>and</strong>s from the statistical<br />

package R.<br />

Role <strong>and</strong> place <strong>of</strong> data analysis: this gives a basic introduction to data analysis.<br />

Maximum likelihood: maximum likelihood <strong>and</strong> related concepts such as observed <strong>and</strong><br />

Fisher information are described.<br />

Elementary hypothesis testing: the idea behind hypothesis testing is explored.<br />

Re-sampling techniques: some <strong>of</strong> the techniques for bias reduction <strong>and</strong> prediction error<br />

estimation are described. These include jackknife, bootstrap, cross-validation.<br />

Linear <strong>and</strong> generalised linear model: classic linear model for normal distribution case<br />

<strong>and</strong> its generalisation for the natural exponential family is described.<br />

Fixed, r<strong>and</strong>om <strong>and</strong> mixed effects models<br />

Elements <strong>of</strong> design <strong>of</strong> experiment<br />

Basics <strong>of</strong> ANOVA: purpose <strong>and</strong> ways <strong>of</strong> analysing ANOVA tables are described.<br />

Log-linear <strong>and</strong> logistic models: cases when data are from Poisson <strong>and</strong> binomial<br />

distribution are described.<br />

Teaching:<br />

Lectures: 20 x 2 hr lectures.<br />

Practicals: 10 x 1 hr practicals.<br />

Private study: 43 hrs.<br />

Assessment: 7 hrs.<br />

Assessment: An all day open (computing) exam in week 10 <strong>of</strong> the Autumn term (90%).<br />

Coursework (10%).<br />

Recommended texts:<br />

Intelligent data Analysis, Berthold M. <strong>and</strong> H<strong>and</strong> D.J.<br />

Introductory Statistics with R, Dalgaard P.<br />

Experimental Design, Cochran W.G. <strong>and</strong> Cox G.M.<br />

Prerequisites: None.<br />

0780170 Introduction to Signal Processing Autumn term, 10 Credits


Module Organiser: Yuriy Zakharov (Electronics), Room: P/K/001, Tel: 2399, e-mail:<br />

yz1@ohm.york.ac.uk<br />

Aims: This module introduces the students to the fundamental concepts <strong>of</strong> signal<br />

processing:analog <strong>and</strong> digital signals <strong>and</strong> systems, Fourier series, sampling, statistical signal<br />

processing <strong>and</strong> parameter estimation.<br />

Learning objectives: At the end <strong>of</strong> this module students are expected to:<br />

Underst<strong>and</strong> signal sampling <strong>and</strong> reconstruction<br />

Analyse continuous <strong>and</strong> discrete-time signals <strong>and</strong> systems in the time <strong>and</strong> frequency<br />

domain<br />

Underst<strong>and</strong> cocepts <strong>of</strong> autocorrelation, convolution <strong>and</strong> linearity<br />

Underst<strong>and</strong> statistical properties <strong>of</strong> signals<br />

Underst<strong>and</strong> principles <strong>of</strong> parameter estimation in noise<br />

Syllabus:<br />

Analogue <strong>and</strong> digital signals<br />

Signal sampling <strong>and</strong> reconstruction<br />

The sampling theorem <strong>and</strong> Nyquist interval<br />

R<strong>and</strong>om processes, probability density function, correlation <strong>and</strong> spectral density<br />

Systems, linearity <strong>and</strong> time-invariance<br />

Impulse <strong>and</strong> frequency responses<br />

Convolution, Fourier series, Discrete Forier Transform (DFT) <strong>and</strong> Fast Fourier Transform<br />

(FFT)<br />

Fundamentals <strong>of</strong> linear parameter estimation <strong>and</strong> spectrum estimation<br />

Least squares <strong>and</strong> maximum liklihood estimates<br />

Teaching:<br />

Teaching:<br />

Lectures: 10 x 2 hr lectures.<br />

Practicals: 2 x 1 hr practicals.<br />

Private study: 75.5 hrs.<br />

Assessment: 2.5 hrs.<br />

Students will receive h<strong>and</strong>outs, tutorial questions <strong>and</strong> revision questions.<br />

Recommended texts:<br />

Lathi, B.P. "Signal Processing <strong>and</strong> Linear Systems", 2003, Oxford University Press,<br />

ISBN 0195219171.<br />

Kay, S.M. "Fundamentals <strong>of</strong> Statistical Signal Processing: Estimation Theory",<br />

Prentice Hall, 1993.<br />

Assessment: This module is assessed by a 2 1/2 hour closed book examination in January.<br />

Prerequisites: Knowledge <strong>of</strong> simple probability theory <strong>and</strong> matrix algebra.<br />

0680102 Introduction to Programming (PYTHON) Autumn term, 10 Credits


Module Organiser: James Cussens(Computer Science), Room: BS/111, Tel: 8396, e-mail:<br />

james.cussens@cs.york.ac.uk<br />

Aims: The main aim <strong>of</strong> the module is to demonstrate the fundamental principles <strong>of</strong> computer<br />

programming, including object-oriented methods. A subsidiary aim is to teach the Python<br />

programming language, particularly for applications in bioinformatics <strong>and</strong> databases.<br />

This module introduces students to computer programming. Students will learn the principles<br />

<strong>of</strong> programming, in particular object-oriented methods. The Python programming language<br />

will be used throughout. Bioinformatics tasks will be used as examples <strong>of</strong> problems soluble by<br />

writing computer programs. Database programming will also form an important component <strong>of</strong><br />

the module.<br />

Learning objectives: At the end <strong>of</strong> this module successful students will: underst<strong>and</strong> the basic<br />

principles <strong>of</strong> computer programming; be able to write computer programs <strong>of</strong> moderate<br />

complexity; write their own programs to solve a range <strong>of</strong> bioinformatics tasks; underst<strong>and</strong> the<br />

principles <strong>of</strong> object-oriented methods; underst<strong>and</strong> how to incorporate third-party code<br />

(particularly Biopython) into their own programs; be able to write programs which interface<br />

with database systems.<br />

Syllabus:<br />

Core Computer Science<br />

1. Introduction to computer programming<br />

2. Variables, expressions <strong>and</strong> statements<br />

3. Functions, conditionals <strong>and</strong> recursion<br />

4. Fruitful functions <strong>and</strong> iteration<br />

5. Strings <strong>and</strong> lists<br />

6. Tuples <strong>and</strong> dictionaries<br />

7. Files <strong>and</strong> exceptions<br />

8. Classes <strong>and</strong> objects<br />

9. Class methods <strong>and</strong> composition<br />

10. Inheritance<br />

Algorithmics<br />

1. Writing loops<br />

2. Dynamic programming<br />

3. Recursion <strong>and</strong> sorting<br />

4. Algorithms on graphs<br />

5. Dijkstra's algorithm<br />

Applications programming<br />

1. Using Biopython<br />

2. Regular expressions<br />

3. Database programming<br />

4. CGI programming


5. Simple GUIs with Tkinter<br />

6. Introduction to XML processing<br />

Practicals<br />

1. Getting started with Python<br />

2. Basic programming<br />

3. Using built-in data types<br />

4. Files, exceptions <strong>and</strong> classes<br />

5. Object oriented programmming<br />

6. Using Biopython<br />

7. Database programming<br />

8. CGI & Tkinter programming<br />

9. XML<br />

Teaching:<br />

Lectures: 21 x 1 hr lectures.<br />

Practicals: 9 x 2 hr practicals.<br />

Private study (including assessment): 61 hrs.<br />

Recommended texts:<br />

How to Think Like a Computer Scientist: Learning with Python, by Allen Downey, Jeff<br />

Elkner <strong>and</strong> Chris Meyers. Freely available from: http://www.greenteapress.com/thinkpython/<br />

Introduction to Programming using Python, by Katja Schuerer, Catherine Letondal <strong>and</strong> Eric<br />

Deveaud. Freely available from: http://www.pasteur.fr/formation/infobio/python<br />

Learning Python, by Mark Lutz & David Ascher, 2nd edition, O'Reilly, 2003, £24.95<br />

Assessment: Open - The assessment involves writing computer programs to solve specified<br />

problems, <strong>and</strong> is done over a period <strong>of</strong> two weeks. The assessment is not divided into sections<br />

assessing distinct aspects <strong>of</strong> the module.<br />

Prerequisites: None.


0281105 Structural Bioinformatics Autumn term, 10 Credits<br />

Module Organiser: Rod Hubbard, Room: B/k/176, Tel: 8267, e-mail: rod@ysbl.york.ac.uk<br />

Aims: To provide an introduction to the fundamental principles <strong>and</strong> patterns in protein<br />

structure, structure determination, <strong>and</strong> sequence-structure-function relationships, <strong>and</strong> also the<br />

basic methods <strong>of</strong> structural bioinformatics - structure prediction, threading <strong>and</strong> homology<br />

modelling. To enable students to use molecular graphics analyses to relate protein structure to<br />

mechanism <strong>and</strong> biological function.<br />

This module provides an introduction to the fundamental principles <strong>of</strong> protein structure,<br />

structure determination methods, sequence-structure-function relationships <strong>and</strong> the various<br />

bioinformatics approaches to analysis <strong>and</strong> interrogation <strong>of</strong> protein structure. The module<br />

contains lectures <strong>and</strong> demonstrations, but the majority <strong>of</strong> the time is spent in computer-based<br />

workshops.<br />

Learning objectives:<br />

A revision <strong>of</strong> basic principles <strong>of</strong> protein structure primary, secondary, super-secondary,<br />

tertiary <strong>and</strong> quarternary structure<br />

An ability to use a molecular graphics system to analyse protein structure <strong>and</strong> to use<br />

classification schemes to characterise protein folds<br />

To have an outline underst<strong>and</strong>ing <strong>of</strong> the key stages in the determination <strong>of</strong> protein structure<br />

by X-ray crystallography <strong>and</strong> NMR methods<br />

An underst<strong>and</strong>ing <strong>of</strong> how protein structure relates to function<br />

The ability to compare protein sequences <strong>and</strong> structures <strong>and</strong> identify how change in<br />

sequence leads to change in structure <strong>and</strong> function<br />

How to construct homology models <strong>of</strong> proteins<br />

To be familiar with threading, fold <strong>and</strong> structure prediction methods, their application <strong>and</strong><br />

limitations<br />

Syllabus:<br />

This module consists <strong>of</strong> a series <strong>of</strong> lectures, demonstrations <strong>and</strong> workshop sessions to<br />

introduce you to the study <strong>and</strong> analysis <strong>of</strong> protein structure.<br />

Each session will be between 3-4 hrs in length, depending on the constraints <strong>of</strong> the timetable.<br />

The following synopsis describes the overall flow <strong>of</strong> topics; detailed division between sessions<br />

will depend on how the course develops<br />

Session 1<br />

You will learn how to use the graphics systems. We will then review the elements <strong>of</strong><br />

protein structure covering very rapidly notions <strong>of</strong> sequence, peptide bond <strong>and</strong> the chemistry <strong>of</strong><br />

the individual amino acids. We will use physical models to emphasise the importance <strong>of</strong> noncovalent<br />

interactions in biological systems (van der Waals, electrostatics, hydrogen bonds) <strong>and</strong><br />

how this gives rise to conformational flexibility in the peptide chain. This principle underlies<br />

the Ramach<strong>and</strong>ran diagram <strong>and</strong> is the origin <strong>of</strong> the different types <strong>of</strong> secondary structure<br />

(helix, sheet <strong>and</strong> turns). You will build a physical model <strong>of</strong> an alpha helix <strong>and</strong> then use the<br />

graphics systems to look at the helix in a real protein.<br />

Session 2


We will begin with a discussion about how computers hold models <strong>of</strong> protein structure <strong>and</strong><br />

then use the graphics systems to perform a simple set <strong>of</strong> calculations to produce the<br />

Ramach<strong>and</strong>ran diagram.<br />

There is then a series <strong>of</strong> worksheets introducing how to look at proteins <strong>and</strong> interpret<br />

function. You will use the graphics systems to produce scientifically meaningful<br />

representations <strong>of</strong> some protein structures. Three families <strong>of</strong> protein will be looked at –<br />

sialidase, anti-lysozyme Fab <strong>and</strong> HIV-protease.<br />

Session 3<br />

We will (in very outline terms) discuss how protein structures are determined by the<br />

techniques <strong>of</strong> nmr <strong>and</strong> x-ray crystallography. This will include a tour <strong>of</strong> the X-ray labs.<br />

The workshop will demonstrate how some <strong>of</strong> the features <strong>of</strong> structures can be influenced by<br />

the experimental methods used to discover them. This will include discussion <strong>of</strong> symmetry,<br />

solvent, disorder, B values, crystal packing <strong>and</strong> how to interpret NMR structures. This will<br />

also introduce you to some <strong>of</strong> the tools on the web for searching public databases <strong>of</strong> protein<br />

information. There will be a short discussion on data formats – heritage, data content <strong>and</strong><br />

issues.<br />

Session 4<br />

Comparison <strong>of</strong> structures – given two proteins, producing a structure <strong>and</strong>/or sequence<br />

based alignment <strong>of</strong> their structures. Short assessed project on a particular protein, relating the<br />

information in a scientific paper to the structure on the screen. This will consolidate the<br />

visualisation <strong>and</strong> comparison work <strong>of</strong> the first set <strong>of</strong> workshops.<br />

Session 5<br />

You will begin with some multiple sequence alignments – comparing the sequences <strong>and</strong><br />

structures <strong>of</strong> series <strong>of</strong> molecules <strong>and</strong> demonstrating the underlying relationship –similarity<br />

<strong>of</strong> sequence means similarity <strong>of</strong> structure.<br />

Session 6<br />

Discussion on progress in structure determination <strong>and</strong> how this has influenced the<br />

underst<strong>and</strong>ing <strong>of</strong> patterns in protein structures. Discussion on secondary structure prediction.<br />

An introduction to homology modelling. Using the relationship between sequence <strong>and</strong><br />

structure to construct models <strong>of</strong> proteins on the basis <strong>of</strong> sequence similarity. Homology<br />

modelling exercise.<br />

Session 7<br />

Presentations <strong>and</strong> tutorials from the Molecular Structures Database (MSD) project at EBI<br />

Session 8<br />

An introduction to the ideas <strong>of</strong> domains <strong>and</strong> folds looking at some protein structures.<br />

Exploring the CATH <strong>and</strong> SCOP databases. Introduction to the ideas <strong>of</strong> threading scoring<br />

potentials <strong>and</strong> the methods used. Threading workshop. Discussion <strong>of</strong> structural genomics.<br />

Session 9 <strong>and</strong> 10<br />

Assessed mini-project. Using available web resources to explore possible sequence


structure-function relationships in a particular protein.<br />

Recommended texts:<br />

Br<strong>and</strong>en <strong>and</strong> Tooze. An introduction to protein structure, ISBN 0815323042, Garl<strong>and</strong><br />

Science<br />

Most biochemistry textbooks, Stryer is recommended<br />

Teaching:<br />

Workshops: 5 x 3 hr workshops.<br />

5 x 4 hr workshops.<br />

Private study (including assessment): 65 hrs.<br />

Note that attendance at workshops is compulsory <strong>and</strong> will be recorded.<br />

Assessment: Two assessments. The first is a mini-project to assess how well students can use<br />

the basic principles <strong>of</strong> structure representation taught in the course to interpret protein<br />

structures. The student produces a short (2 sides <strong>of</strong> A4) report that uses 2-3 molecular graphics<br />

images to describe how aspects <strong>of</strong> the structure <strong>of</strong> the protein(s) provides insight into key<br />

features <strong>of</strong> the mechanism <strong>of</strong> action <strong>and</strong> function <strong>of</strong> the protein. The second is to use a<br />

combination <strong>of</strong> molecular graphics, sequence alignment, structural databases <strong>and</strong>, where<br />

appropriate, homology modelling, to produce a short (4 sides <strong>of</strong> A4) report analysing sequence<br />

structure-function relationships in a set or family <strong>of</strong> proteins.<br />

Prerequisites: None.


03811047 Chemometrics Spring term, 10 Credits<br />

Module Organiser: Julie Wilson (<strong>Mathematics</strong>/<strong>Chemistry</strong>), Room: B/S/108, Tel: 8282, email:<br />

julie@ysbl.york.ac.uk<br />

Aims: This module introduces the concepts <strong>and</strong> methods <strong>of</strong> multivariate data analysis <strong>and</strong><br />

visuualisation commonly applied to biological <strong>and</strong> chemical data.<br />

Learning objectives: At the end <strong>of</strong> this module students are expected to:<br />

be familiar with basic concepts such as similarity, metrics, state spaces <strong>and</strong> ordination;<br />

underst<strong>and</strong> the goals <strong>of</strong> dimensionality reduction <strong>and</strong> clustering<br />

underst<strong>and</strong> the importance <strong>of</strong> pre-processing<br />

be able to perform multivariate analyses (e.g. Principal Component (<strong>and</strong> Coordinate)<br />

Analysis <strong>and</strong> (hierarchical) Cluster Analysis) using the program R<br />

underst<strong>and</strong> the importance <strong>of</strong> visualisation for the interpretation <strong>of</strong> large datasets<br />

Syllabus:<br />

Data pre-processing<br />

Principal component analysis<br />

Factor analysis<br />

Multidimensional scaling<br />

Correspondence analysis<br />

Cluster analysis<br />

Canonical Correlations<br />

Discriminant analysis<br />

Wavelet analysis<br />

evolutionary programming<br />

Teaching:<br />

Lectures: 10 x 2 hr lectures.<br />

Practicals: 4 x 2hr practicals.<br />

Private study (including assessment): 72 hrs.<br />

Recommended texts:<br />

Krzanowski WJ <strong>and</strong> Marriout FHC. (1994) Multivariate analysis. Vol 1 <strong>and</strong> 2. Kendallâ€s<br />

library <strong>of</strong> statistics.<br />

Mardia,KV, Kent, JT <strong>and</strong> Bibby, JM (2003) Multivariate analysis<br />

Assessment: Open - Assessment is via a written assignment done over a period <strong>of</strong> two-weeks.<br />

Prerequisites: Knowledge <strong>of</strong> simple probability theory <strong>and</strong> matrix algebra.


0690502 Introduction to Machine Learning Spring term, 10 Credits<br />

Module Organiser: James Cussens (Computer Science), Room: BS/111, Tel: 8396, e-mail:<br />

james.cussens@cs.york.ac.uk<br />

Aims: The main aim <strong>of</strong> the module is for students to develop the skills <strong>and</strong> underst<strong>and</strong>ing<br />

necessary for the application <strong>of</strong> selected machine learning (ML) <strong>and</strong> statistical techniques in<br />

bioinformatics. The selected techniques are: pr<strong>of</strong>ile HMMs, nearest neighbour, decision trees<br />

<strong>and</strong> artificial neural networks. Another aim is for students to gain a better underst<strong>and</strong>ing <strong>of</strong><br />

probabilistic <strong>and</strong> statistical reasoning so that they can apply maximum likelihood <strong>and</strong><br />

Bayesian approaches to the estimation <strong>of</strong> probabilities. Students should become familiar with<br />

the HMMER pr<strong>of</strong>ile hidden Markov model s<strong>of</strong>tware; <strong>and</strong> to become familiar with the C4.5<br />

system.<br />

This module gives an introduction to the field <strong>of</strong> machine learning for solving biological<br />

problems. Specific topics include: pr<strong>of</strong>ile hidden Markov models, decision tree learning,<br />

nearest neighbour classification <strong>and</strong> artificial neural networks. Pr<strong>of</strong>ile hidden Markov models<br />

are 'compiled' versions <strong>of</strong> multiple sequence alignments <strong>and</strong> can be used for searching<br />

databases <strong>of</strong> sequences. The other three topics are central approaches to machine learning not<br />

just in bioinformatics but also in other areas.<br />

The course will as <strong>of</strong>ten as possible use s<strong>of</strong>tware <strong>and</strong> libraries that are freely available for<br />

download on the Internet. Thus enabling a student to work on or <strong>of</strong>f campus.<br />

Learning objectives: By the end <strong>of</strong> the module you should be able to:<br />

Select appropriate machine learning <strong>and</strong> statistical approaches for specific applications.<br />

Underst<strong>and</strong> core HMM algorithms: computation <strong>of</strong> forward <strong>and</strong> backward probabilities, the<br />

Viterbi algorithm<br />

Be able to use nearest neighbour classification<br />

Be able to use decision trees for classification<br />

Be able to use artificial neural networks for pattern recognition<br />

To be able to apply the HMMER systems<br />

To be able to apply the C4.5 system<br />

To use maximum likelihood estimation <strong>of</strong> probabilities<br />

To use Bayesian approaches to probability estimation<br />

To use constructive induction to extract features from sequences<br />

Syllabus:<br />

Lectures:<br />

Multiple alignments, families <strong>of</strong> sequences <strong>and</strong> pr<strong>of</strong>ile hidden Markov models<br />

Pr<strong>of</strong>ile HMMs for ungapped regions <strong>of</strong> a multiple alignment<br />

Pr<strong>of</strong>ile HMMs for multiple alignments with gaps<br />

Estimation <strong>of</strong> probabilities<br />

Forward <strong>and</strong> backward probabilities in HMMs<br />

The Viterbi <strong>and</strong> Baum-Welch algorithms for HMMs<br />

Significance <strong>of</strong> hits. Local <strong>and</strong> global alignment with HMMER<br />

Introduction to Inductive Learning


Decision Tree Learning<br />

Decision Tree Learning (cont.)<br />

Constructive Induction<br />

Nearest Neighbour Methods<br />

Computing Similarity<br />

Similarity (cont.)<br />

Introduction to pattern recognition<br />

Neural network architectures for pattern recognition<br />

Application: Identification <strong>of</strong> promoter sequences<br />

Application: Protein classifcation rule extraction<br />

Application: Classification <strong>of</strong> microarray gene data<br />

Practicals:<br />

HMMer tutorial<br />

Estimation probabilities<br />

Forward <strong>and</strong> backward probabilities<br />

The Viterbi algorithm<br />

Using C4.5<br />

Constructive induction<br />

Recommended texts:<br />

Biological Sequence Analysis: Probabilistic Models <strong>of</strong> Proteins <strong>and</strong> Nucleic Acids Richard<br />

Durbin, et al, Cambridge University Press (Paperback - 1998 £23.99)<br />

Teaching:<br />

Lectures: 20 x 1 hr lectures.<br />

Practicals: 9 x 2 hr practicals.<br />

Private study (including assessment): 62 hrs.<br />

Assessment: Open - Assessment is via a written assignment done over a period <strong>of</strong> two weeks.<br />

There is a section on pr<strong>of</strong>ile hidden Markov models (40%), a section on machine learning<br />

(40%) <strong>and</strong> a section on artificial neural networks (20%).<br />

Prerequisites: None.


0381045 Current Research Spring term, 10 Credits<br />

Module Organiser: Julie Wilson (<strong>Mathematics</strong>/<strong>Chemistry</strong>), Room: B/S/108, Tel: 8282, email:<br />

julie@ysbl.york.ac.uk<br />

Aims:<br />

This module aims to provide a comprehensive knowledge base to underst<strong>and</strong> <strong>and</strong> appraise<br />

contemporary chemoinformatics research, <strong>and</strong> to underst<strong>and</strong> its applications.<br />

Learning objectives: At the end <strong>of</strong> this module students are expected to:<br />

appreciate major theoretical <strong>and</strong> conceptual issues in current chemoinformatics research<br />

underst<strong>and</strong> how research methods relate to underlying theoretical <strong>and</strong> conceptual<br />

assumptions <strong>and</strong> how these can influence research outcomes<br />

make informed choices <strong>of</strong> appropriate research methods for specific research questions.<br />

Teaching:<br />

Seminars/workshops: 9 x 3 hr.<br />

Private study (including assessment): 73 hrs.<br />

Recommended texts:<br />

Reading lists will be provided during the module. Some lecturers require you to have read<br />

some relevant material before the session.<br />

Assessment: Assessment is by way <strong>of</strong> a 1000-word assignment submitted by the end <strong>of</strong> Week<br />

1 <strong>of</strong> the summer term. Write a critical review <strong>of</strong> an issue <strong>of</strong> your choice associated with any <strong>of</strong><br />

the presentations given in the current academic year. The review should include relevant<br />

references.<br />

Prerequisites: None.


0381046 Chemical Informatics Spring term, 10 Credits<br />

Module Organiser: Garib Murshudov (<strong>Chemistry</strong>), Room: B/K/065, Tel: 8253, e-mail:<br />

garib@ysbl.york.ac.uk<br />

Aims: Small molecular compounds are essential in underst<strong>and</strong>ing function <strong>of</strong><br />

macromolecules. The search for compounds similar structural <strong>and</strong> chemical properties in large<br />

databases is a first step in drug design. The course aims to introduce the design, organisation,<br />

retrieval, visualisation <strong>and</strong> some use <strong>of</strong> chemical information.<br />

Learning objectives: At the end <strong>of</strong> the course students should be aware <strong>of</strong> the organisation <strong>of</strong><br />

<strong>and</strong> search in small molecular databases. They will also learn some <strong>of</strong> the techniques to<br />

generate three-dimensional structures from one or two-dimensional representations.<br />

Syllabus:<br />

1. Chemical structure representation.<br />

2. Resources.<br />

3. QSAR - quantitative structure activity relationship.<br />

4. Graph <strong>and</strong> sub-graph isomorphism algorithms.<br />

5. Smiles - Simplified Molecular Input Line Entry System.<br />

6. Molecular Dimension Limited (MDL) file format for chemical connectivity.<br />

7. Chemical Structure similarity.<br />

8. Fingerprints <strong>and</strong> search for substructure similarity.<br />

9. Generation <strong>of</strong> 3D structures from 2D representations.<br />

10.3D structure similarity.<br />

11. Elements <strong>of</strong> molecular descriptors.<br />

Teaching:<br />

Lectures: 12 x 2 hr lectures.<br />

Practicals: 6 x 1 hr practicals.<br />

Private study (including assessment): 70 hrs.<br />

Assessment: The module will be assessed by a short project carried out over a two-week<br />

period.<br />

Recommended texts:<br />

An introduction to chemoinformatics. Andrew R. Leach <strong>and</strong> Valerie J. Gillet.<br />

<strong>Chemoinformatics</strong>, Edited by Casteiger J. <strong>and</strong> Engel T.<br />

Prerequisites: None.


0580500 Stochastic Processes Spring term, 10 Credits<br />

Module Organisers:<br />

Jon Pitchford, Room: Biology S Block, Tel: 8559, e-mail: jwp5@york.ac.uk<br />

Bo Wang, Room: G/043, Tel: 3096, e-mail: bw527@york.ac.uk<br />

Aims: The module is designed to raise awareness <strong>of</strong> the differences between deterministic <strong>and</strong><br />

stochastic approaches to mathematical modelling, <strong>and</strong> to provide the students with the<br />

necessary techniques to investigate processes in which r<strong>and</strong>om effects are important. Students<br />

will see how stochastic modelling methods are being applied to practical problems,<br />

particularly in biological <strong>and</strong> ecological systems. Where possible the core syllabus is<br />

augmented by research seminars given by biologists <strong>and</strong> mathematicians. Students will also<br />

work in groups to present seminars on recent research papers.<br />

Learning objectives: At the end <strong>of</strong> the module you should:<br />

have a sound knowledge <strong>of</strong> the theory <strong>of</strong> Markov chains <strong>and</strong> their dynamics<br />

underst<strong>and</strong> the behaviour <strong>and</strong> the analysis <strong>of</strong> some simple stochastic paradigms,<br />

including r<strong>and</strong>om walks, branching processes, Poisson processes, <strong>and</strong> birth-death<br />

processes.<br />

have a broad overview <strong>of</strong> the ways in which mathematics have been used to enhance<br />

underst<strong>and</strong>ing <strong>of</strong> ecological dynamics <strong>and</strong> other processes susceptible to stochastic<br />

effect.<br />

be able to tackle problems in ecological modelling <strong>and</strong> more broadly in mathematical<br />

modelling in general.<br />

Syllabus:<br />

First part <strong>of</strong> course (BW): These lectures present a generalised <strong>and</strong> comprehensive<br />

description <strong>of</strong> Markov chains, demonstrate widely applicable methods for their solution, <strong>and</strong><br />

discuss their applications in some detail. Topics covered include definitions <strong>and</strong> classifications<br />

for Markov chains (with examples), stationary distributions <strong>and</strong> recurrence, continuous-time<br />

Markov chains, <strong>and</strong> Markov chain Monte Carlo methods.<br />

Second part <strong>of</strong> course (JWP): Revision <strong>of</strong> sets, events, <strong>and</strong> probability (part 1). SRW<br />

example (gambler's ruin); Revision <strong>of</strong> sets, events, <strong>and</strong> probability (part 2); Network<br />

reliability example; SRWs <strong>and</strong> difference equation methods for their solution; Generating<br />

functions<br />

Applications <strong>of</strong> generating functions to SRWs; Simple branching processes; Age<br />

dependent branching processes, <strong>and</strong> Poisson processes; Simple birth processes; Simple<br />

death processes, <strong>and</strong> general birth-death processes.<br />

In addition, there will be at least three student-led seminars based on recent mathematical<br />

modelling research papers (typically involving r<strong>and</strong>om walks, branching processes, <strong>and</strong><br />

stochastic differential equations).


Recommended texts:<br />

G Grimmett <strong>and</strong> D Stirzaker, Probability <strong>and</strong> R<strong>and</strong>om Processes, Oxford.<br />

E Renshaw, Modelling Biological Populations in Space <strong>and</strong> Time, Cambridge.<br />

Teaching:<br />

Lectures: 9 x 2 hr.<br />

Seminars: 9 x 1 hr.<br />

Private study (including assessed coursework) : 70 hrs.<br />

Assessment: 3hr examination


0281203 Biocomputing <strong>and</strong> Web Applications Autumn term, 10 Credits<br />

Module Organiser: Kevin Cowtan (<strong>Chemistry</strong>) Room: BK/064, Tel: 8254, e-mail:<br />

cowtan@york.ac.uk<br />

Aims: To introduce the basic elements <strong>of</strong> the programming language Python, including:<br />

values, variables <strong>and</strong> expressions; data types; strings; arrays <strong>and</strong> vectors; conditions <strong>and</strong> loops;<br />

methods; recursion; classes. To provide tools <strong>and</strong> experience in the use <strong>of</strong> existing<br />

computational biology resources <strong>and</strong> the construction <strong>of</strong> new resources using local <strong>and</strong><br />

distributed computing techniques.<br />

This module will build on the introduction to the programming language Python <strong>and</strong> introduce<br />

local <strong>and</strong> distributed data organisation <strong>and</strong> manipulation as applied to computational biology<br />

problems. As useful as word processors, databases <strong>and</strong> other tools might be, the one thing that<br />

makes computers so important is our ability to program them to perform a huge variety <strong>of</strong><br />

tasks, from solving equations to processing <strong>and</strong> analysing data. In addition, the large databases<br />

produced today by genomics efforts present significant problems <strong>of</strong> data description <strong>and</strong><br />

organisation, <strong>and</strong> are <strong>of</strong>ten distributed across local <strong>and</strong> external computing resources. This<br />

module is an introduction to programming as a means <strong>of</strong> solving practical problems, using the<br />

python programming language, <strong>and</strong> will explore techniques for data organisation <strong>and</strong> methods<br />

for accessing local <strong>and</strong> remote databases through JDBC <strong>and</strong> web services portals.<br />

Learning objectives: At the end <strong>of</strong> this module students will: be able to create a Python<br />

program to solve a problem <strong>of</strong> moderate complexity; have the foundation for studying<br />

advanced concepts <strong>of</strong> Python programming; underst<strong>and</strong> the purpose <strong>and</strong> benefits <strong>of</strong> data<br />

modelling; be able to design a database <strong>and</strong> describe it using a schema; access from Java using<br />

JDBC; underst<strong>and</strong> the common markup language XML <strong>and</strong> its schema; be able to model data<br />

<strong>and</strong> processes using UML; have an overview <strong>of</strong> GRID computing protocols; the SOAP<br />

protocol <strong>and</strong> WSDL; experience in using <strong>and</strong> web services; awareness <strong>of</strong> available web<br />

services for computational biology.<br />

Teaching:<br />

Lectures: 13 x 1 hr; 4 x 2 hr.<br />

Practicals: 4 x 3 hr; 9 x 2 hr.<br />

Private study (including assessed project) : 49 hrs.<br />

Lectures:<br />

1. Scientific algorithms in python.<br />

- basic algorithms<br />

- searching data<br />

- indexing data<br />

- numerical analysis<br />

- network data<br />

- data clustering<br />

- routefinding<br />

- advanced DB connectivity


2. DB design<br />

- analysing the problem<br />

- designing a database<br />

- optimising the design<br />

- interacting with object oriented programs<br />

3. Basic web applications with mod_python<br />

- basic pages<br />

- introducing dynamic elements<br />

- forms <strong>and</strong> interaction<br />

- persistence<br />

- DB applications<br />

4. Advanced web applications with google apps<br />

- basics<br />

- building complex applications<br />

5. Grid computing <strong>and</strong> web services, with examples<br />

- creating web services<br />

- calling web services<br />

- EBI examples<br />

Recommended texts:<br />

Q. Charatan <strong>and</strong> A. Kans. Java: The First Semester. McGraw-Hill, 2001, ISBN: 0077097572<br />

Core Servlets <strong>and</strong> Java Server Pages (Marty Hall): available as free download from:<br />

http://pdf.coreservlets.com/, or from Prentice Hall, 2000, ISBN: 0130893404<br />

Assessment: Open - The assessment involves writing computer programs to solve specified<br />

problems, <strong>and</strong> is done over a period <strong>of</strong> two weeks. The assessment is not divided into sections<br />

assessing distinct aspects <strong>of</strong> the module. One short report on protein identification workshop<br />

(50% weighting) <strong>and</strong> data analysis problem sheet on t<strong>and</strong>em MS fragmentation <strong>of</strong> peptides<br />

(50% weighting). The student should submit their protein identification work on the form<br />

provided, <strong>and</strong> two hard copies <strong>and</strong> one s<strong>of</strong>t copy (on floppy disk or CD) <strong>of</strong> their data analysis<br />

answers, to the Biology Graduate Office. The student should use only their c<strong>and</strong>idate<br />

identification number on all work submitted for assessment.<br />

Prerequisites: Introduction to programming (Python).


9. H<strong>and</strong>ing in written assignments, practical assignments <strong>and</strong> the dissertation<br />

Written assignments <strong>and</strong> practical assignments<br />

All assignments should be h<strong>and</strong>ed in to the module organiser at the time <strong>and</strong> place specified.<br />

The dissertation<br />

Three printed copies must reach the <strong>Mathematics</strong> Departmental Office (G127) by 16.30 on<br />

Friday 19th August 2011. You should also h<strong>and</strong> in a CD-R containing your dissertation in<br />

both Adobe PDF format <strong>and</strong> Postscript format to the <strong>Mathematics</strong> Departmental Office. Please<br />

use a CD-R pen to write clearly your name on your disk.<br />

The dissertation must be h<strong>and</strong>ed in, in person, or by a person nominated by you; <strong>and</strong> must be<br />

signed in to confirm the date <strong>and</strong> time. Postal or electronic submissions will not be accepted.<br />

Late submissions will incur a penalty unless there are extenuating circumstances.<br />

Dissertations will remain in the Department <strong>of</strong> <strong>Mathematics</strong> until the final assessments have<br />

been approved. See also the Dissertation Guidelines as item 25 <strong>of</strong> this document.<br />

10. Lateness penalties<br />

Each day or part day that an assessment is late will incur a 10% penalty. This includes<br />

weekends <strong>and</strong> bank holidays when the <strong>of</strong>fice is not open. For example, if work is awarded a<br />

mark <strong>of</strong> 30 out <strong>of</strong> 50, <strong>and</strong> the work is up to one day late, the final mark is 25. After five days,<br />

the work is marked at zero. Note however, the penalty cannot take the mark into a negative<br />

result. Please do not ask for deadlines to be extended: the answer will be no, with the<br />

exception <strong>of</strong> illness supported by a medical note or other urgent good cause supported by a<br />

written submission from your supervisor. It is your responsibility to plan your dissertation<br />

around these deadlines.<br />

11. Size restrictions<br />

Your dissertation will be judged on content. The guide in the mentioned before <strong>of</strong><br />

50 - 80 pages for an <strong>MSc</strong> dissertation<br />

is only a rough guide concerning length. The over-riding requirement is for excellent content.<br />

The assessment will be based on underst<strong>and</strong>ing, clarity, insight, innovation <strong>and</strong> presentation.<br />

12. Disclosure <strong>of</strong> marks/feedback meetings<br />

Marks for individual assignments will be given out as soon as possible. You are encouraged<br />

to discuss your performance with the module organiser <strong>and</strong>/or your supervisor. Verbal<br />

feedback sessions will also be organised as required.<br />

13. Marking criteria


The criteria used for marking the dissertation will be made available in a separate document in<br />

due course. These will help you underst<strong>and</strong> what is required for each type <strong>of</strong> assessment. The<br />

dissertation mark to be used in the computation <strong>of</strong> your final average mark in the course is a<br />

weighted average <strong>of</strong> an agreed mark between first <strong>and</strong> second markers <strong>of</strong> the content <strong>of</strong> your<br />

dissertation, a mark given by your dissertation supervisor which accounts for effort <strong>and</strong><br />

dedication, <strong>and</strong> finally a mark for your presentation.<br />

14. Mark record sheets <strong>and</strong> course transcripts<br />

The Department <strong>of</strong> <strong>Mathematics</strong> keeps a running record <strong>of</strong> your marks as the course proceeds.<br />

This will be available for inspection in the Departmental Office (G/127) <strong>and</strong> will enable you to<br />

keep a check on your marks. At the end <strong>of</strong> the course a transcript is prepared for each student<br />

THAT gives a breakdown <strong>of</strong> the assessment.<br />

15. Appeals procedure<br />

This is only an outline <strong>of</strong> the procedure <strong>and</strong> is condensed from University Ordinances <strong>and</strong><br />

Regulations.<br />

A. Students wishing to appeal against their result<br />

The university regulations are quite clear that there is no appeal available against the<br />

judgement <strong>of</strong> the examiners (Regulation 2.8c). There is only appeal against the decision to<br />

allow resits or if there is prima facie evidence that one or more <strong>of</strong> the following has occurred:<br />

seriously inadequate supervision; examinations conducted unfairly or improperly or examiners<br />

showed prejudice against the student.<br />

If an appeal is desired then it must be made in writing to the Registrar within four weeks <strong>of</strong><br />

being notified <strong>of</strong> the decision.<br />

B. Students wishing to appeal against the mark <strong>of</strong> a specific assessment<br />

Again the university regulations state that there is no appeal against the judgement <strong>of</strong> the<br />

examiners (Regulation 2.8c). However, if a student wishes to draw the attention <strong>of</strong> the external<br />

examiner to a particular mark, which they believe to be in error, then they should write to the<br />

examinations <strong>of</strong>ficer stating their case.<br />

All special cases, including late submissions, will be marked with no allowance made but will<br />

be brought to the attention <strong>of</strong> the external examiner. Any particular circumstance that might<br />

affect performance in an examination should be made clear, in writing, to the examinations<br />

<strong>of</strong>ficer. This should be done as near to the time <strong>of</strong> the assessment as possible (Regulation<br />

5.4h). In the case <strong>of</strong> illness a doctor's note is required (Regulation 5.4g). These circumstances<br />

will be brought to the attention <strong>of</strong> the external examiner <strong>and</strong>, if necessary, raised at the Board<br />

<strong>of</strong> Studies before it considers examination results.<br />

There should be no correspondence with the external examiner. All material will be passed on<br />

by the examinations <strong>of</strong>ficer before the viva voce examinations <strong>and</strong> final examiner's meeting.<br />

All appeals will be treated in strictest confidence.<br />

16. Academic misconduct: University guidelines


You are responsible for ensuring that your work does not contravene the University‟s rules on<br />

academic misconduct, which are set out in regulation 5. The University takes a very serious<br />

view <strong>of</strong> such misconduct <strong>and</strong> penalties will be applied to students who are found to have<br />

attempted to mislead examiners. Forms <strong>of</strong> academic misconduct include:<br />

cheating deliberate failure to comply with the rules governing examinations, e.g. by<br />

making arrangements to have unauthorised access to information;<br />

collusion assisting another individual to gain an advantage by unfair means, or receiving<br />

such assistance yourself;<br />

fabrication misleading the examiners by presenting work for assessment in a way which<br />

intentionally or recklessly suggests that you have collected factual information<br />

which has not in fact been collected, or falsifies factual information;<br />

personation producing work to be submitted as that not <strong>of</strong> yourself but <strong>of</strong> another, or<br />

assuming the identity <strong>of</strong> another individual in order to deceive the examiners,<br />

or soliciting another individual to act or appear as yourself, or to produce work<br />

on your behalf;<br />

plagiarism incorporating within your work without appropriate acknowledgement material<br />

derived from the work (published or unpublished) <strong>of</strong> another.<br />

The penalties for academic misconduct will depend on the seriousness <strong>of</strong> the <strong>of</strong>fence.<br />

Students found guilty <strong>of</strong> academic misconduct may, for example, have their degree class<br />

reduced, fail their degree or be asked to leave the University.<br />

If you have any queries about what constitutes academic misconduct, <strong>and</strong> in particular about<br />

the proper attribution <strong>of</strong> material derived from another‟s work, you should seek advice from<br />

your supervisors or tutors.<br />

The important University regulations on plagiarism <strong>and</strong> collusion are reproduced below.<br />

(i) C<strong>and</strong>idates must not by implication or otherwise represent the work <strong>of</strong> others as their<br />

own. All sources, whether published books <strong>and</strong> articles or unpublished material <strong>of</strong> any<br />

kind must be explicitly acknowledged, <strong>and</strong> quotations <strong>and</strong> close paraphrases clearly<br />

attributed.<br />

(ii) C<strong>and</strong>idates must not by implication or otherwise represent work done in collaboration<br />

with others as their own unaided work, nor may any member <strong>of</strong> the University,<br />

whether or not he or she is a c<strong>and</strong>idate in the examination, knowingly allow his or her<br />

work to be used without acknowledgement by examination c<strong>and</strong>idates.<br />

(iii) The examiners will take full account <strong>of</strong> any breach <strong>of</strong> the requirements in (i) <strong>and</strong> (ii)<br />

above in determining a mark for the work affected. In serious cases this may result in<br />

a mark <strong>of</strong> zero for the paper or papers concerned with consequent effects on the<br />

assessment <strong>of</strong> the c<strong>and</strong>idates overall performance, even failure in the examination as a


whole.<br />

If, in the opinion <strong>of</strong> the examiners, the case is <strong>of</strong> particular gravity, they may also<br />

recommend disciplinary action.<br />

The penalties available in such cases are:<br />

(a) Suspension or exclusion from the University<br />

(b) A lowering <strong>of</strong> the class <strong>of</strong> degree to be awarded<br />

(c) Withholding the award <strong>of</strong> a degree<br />

(d) In the event <strong>of</strong> failure or action under (iii) (b) above, withdrawal <strong>of</strong><br />

any entitlement to repeat the examination concerned.<br />

17. The York Award<br />

As part <strong>of</strong> your career planning (see 19 below) you might wish to consider enrolling for the<br />

York Award. The York Award is a certificate awarded to the University to people who<br />

complete a significant programme <strong>of</strong> personal development during their time as a student at<br />

York.<br />

The programme <strong>of</strong>fers courses aimed at developing skills such as communication (including<br />

presentations <strong>and</strong> negotiating), numeracy, improving own learning, working with others,<br />

information literacy, time management, etc.<br />

More information is available at<br />

http://www.york.ac.uk/services/careers/skills.cfm<br />

or in a brochure available from the Student Skills Development Unit (in the Careers Service<br />

building).<br />

18. Careers<br />

Placements are not guaranteed but (provided they do happen for you) might give a good<br />

access to a job after you have completed the course. Nonetheless you should maximise your<br />

chances <strong>of</strong> pursuing a career <strong>of</strong> interest to you by following the path below.<br />

Find your way to the Careers Service <strong>and</strong> discover how it can help you choose <strong>and</strong> get<br />

the right job or further course.<br />

Complete the “Personal Development Record” for your supervisor <strong>and</strong> begin to add up<br />

what you have to <strong>of</strong>fer the world beyond education - use the self assessment exercises<br />

available at the Careers Service <strong>and</strong> pick up a copy <strong>of</strong> “Planning Your Future”<br />

Explore the extensive range <strong>of</strong> information on jobs <strong>and</strong> courses, at home <strong>and</strong> overseas,<br />

held at the Careers Service.<br />

Use every opportunity to confirm you know all you need to know about your chosen<br />

option; lots <strong>of</strong> Careers Service events, especially in the Autumn term, will give you<br />

direct access to potential employers. Collect a copy <strong>of</strong> the events programme from the


Careers Service or from the Graduate Students Association.<br />

Always collect, or read on the Web, the Careers Service Vacancy Bulletin containing<br />

details <strong>of</strong> vacancies advertised for graduates, including those employers who conduct<br />

their initial interviews on campus; don‟t miss options with early application times such<br />

as teacher training, Civil Service recruitment competitions, financial careers <strong>and</strong><br />

overseas research scholarships - the recruitment season begins in September for the<br />

following Autumn.<br />

Attend training sessions on application form, CV <strong>and</strong> interview technique.<br />

Consult your referees about your plans <strong>and</strong> give them a copy <strong>of</strong> your CV or Personal<br />

Development Record completed for your supervisor.<br />

If you have questions or concerns about what to do next arrange to see a Careers<br />

Adviser, either “drop in” to see the duty adviser or make an appointment for a personal<br />

interview.<br />

The Careers Service is located between the Language Centre <strong>and</strong> the large car park on the<br />

Central Hall access road <strong>of</strong>f the University Road. You are welcome to call, on 2685 (internal),<br />

or visit when we are open. Most <strong>of</strong> our facilities are available on a “self-service” basis with<br />

help from our Receptionist, Information Officer or duty Careers Adviser.<br />

For information on Opening Hours <strong>and</strong> Duty Careers Adviser availability, please visit the<br />

webpage http://www.york.ac.uk/services/careers<br />

19. Facilities<br />

Computer room<br />

G/120 has been set up with computers for you to use on the <strong>MSc</strong> course. You will be sharing<br />

this facility with students on the MRes course. TESA or Onity security cards will be given to<br />

you to access this room <strong>and</strong> the department.<br />

Computing <strong>and</strong> printing<br />

When you register you will be automatically registered with the Computing Service <strong>and</strong> have<br />

an account on the central system. Access to this is available via the dedicated terminals in the<br />

<strong>MSc</strong> Computer Room, via terminals in each college building, or via the networked PC in the<br />

departmental <strong>of</strong>fice. There will be a dedicated "<strong>MSc</strong> printer" in the <strong>MSc</strong> Computer Room.<br />

Printing will be free up to a point <strong>and</strong> then you will be charged per sheet. The detailed<br />

regulations will be specified later.<br />

Library<br />

When you register you will be automatically registered with the library <strong>and</strong> be able to borrow<br />

books. As a graduate student you may find that journals will become as important to you as<br />

books; these can only be borrowed at weekends. The articles you need can be photocopied (at<br />

your own expense) in the library. You may well find that you need to read papers in journals<br />

that our library does not take. These (as well as books that are not in the library) can be


obtained by requesting an Inter-Library Loan; in the case <strong>of</strong> a journal article, this will result in<br />

a photocopy being sent to you.<br />

The library has a clearly signed Information Desk, where you can ask where to find the<br />

Mathematical Books collection as well as the Mathematical Journals.<br />

Telephones<br />

Internal telephones can be used freely. Note that four-figure telephone numbers are internal<br />

university numbers. They can be converted into York telephone numbers by prefixing 43.<br />

The York code is 01904.<br />

Other courses<br />

Extra lecture courses are sometimes arranged by individual research groups <strong>and</strong> you will be<br />

informed <strong>of</strong> any that are relevant.<br />

20. The Department <strong>of</strong> <strong>Mathematics</strong><br />

The decision-making body in the department is the Board <strong>of</strong> Studies, which consists <strong>of</strong> all<br />

teaching staff in the department, three elected undergraduate students <strong>and</strong> an elected graduate<br />

student. Students are not present for the discussion <strong>of</strong> “starred items" <strong>of</strong> business, which are<br />

items concerning individual students or members <strong>of</strong> staff. Graduate studies are considered by<br />

the <strong>Mathematics</strong> Graduate School Committee, which reports to the Board <strong>of</strong> Studies. The<br />

committee is chaired by the Graduate Tutor, who is responsible for the welfare <strong>of</strong> graduate<br />

students.<br />

Head <strong>of</strong> Department: Pr<strong>of</strong>essor Stephen Donkin<br />

Graduate Tutor: Pr<strong>of</strong>essor Paul Busch<br />

Chair <strong>of</strong> the Board <strong>of</strong> Studies: Dr Jason Levesley<br />

Members <strong>of</strong> the Graduate School Committee:<br />

Pr<strong>of</strong>essor Stephen Donkin, Pr<strong>of</strong>essor Paul Busch (Chair), Pr<strong>of</strong>essor Zdzislaw Brzezniak, Dr<br />

Chris Fewster, Dr Chris Wood, Dr Ian McIntosh.<br />

Departmental Administrator: Mrs Cathy Moore<br />

Secretaries: Mrs Sue Adams<br />

Mrs Chris Higgins (Monday am, Tuesday, Wednesday,<br />

Thursday am, Friday am)<br />

Mrs Christine Cockett (Monday to Friday am)<br />

Ms Stella Fisher<br />

Mr Nicholas Page<br />

21. Academic staff involved with the <strong>MSc</strong> <strong>and</strong> their research interests<br />

e-mail Room Tel.<br />

Julie Wilson julie@ysbl.york.ac.uk B/S/108 8282


(Chemometrics, biostatistics <strong>and</strong> image processing)<br />

James Cussens james.cussens@cs.york.ac.uk B/S/111 8396<br />

(Machine learning, inductive logic programming)<br />

Garib Murshudov garib@ysbl.york.ac.uk K/066 8252<br />

(Statistical study <strong>of</strong> macromolecular crystal structures)<br />

Asher Rashid asher@cs.york.ac.uk CS/203F 2767<br />

(Machine Learning)<br />

Jon Pitchford jwp5@york.ac.uk B/S/109 8282<br />

(Mathematical biology <strong>and</strong> ecology, stochastic processes)<br />

Bo Wang bw527@york.ac.uk G/043 3096<br />

(Computational statistics, stochastic analysis)<br />

Rod Hubbard rod@ysbl.york.ac.uk B/K/176 8267<br />

(Structure based drug discovery)<br />

Yuriy Zakharov yz1@ohm.york.ac.uk P/K/001 2399<br />

(Signal Processing for Communications)<br />

Kevin Cowtan cowtan@ysbl.york.ac.uk B/K/264 8253<br />

(Automated protein structure solution)<br />

22. Supervisors<br />

You will have been informed <strong>of</strong> your supervisor by October 2009.<br />

23. Welfare Support Services<br />

The University‟s Welfare Network is designed to provide students with quick <strong>and</strong> easy access<br />

to a variety <strong>of</strong> sources <strong>of</strong> help <strong>and</strong> advice on all aspects <strong>of</strong> life as a student. Personal<br />

supervisors in academic departments are responsible for overseeing both academic progress<br />

<strong>and</strong> general welfare. In addition each college has a welfare team which includes the Provost<br />

<strong>and</strong> a College Dean who has special responsibility for student welfare. Every full time student<br />

is a member <strong>of</strong> a college, <strong>and</strong> students may approach their college welfare team for help <strong>and</strong><br />

advice whether or not they are resident in the college at the time. Central support services<br />

available to all students include the Student Counselling Service, the Adviser on Disabilities,<br />

the Welfare Information Officer, the Equal Opportunities Adviser, the International Office, the<br />

Student Support Funds Co-ordinator <strong>and</strong> the First Contact Network (which <strong>of</strong>fers support in<br />

cases <strong>of</strong> harassment). In addition administrative <strong>of</strong>fices such as the Undergraduate Office, the<br />

Graduate Schools Office <strong>and</strong> the Accommodation Office all provide information <strong>and</strong> advice.<br />

Welfare support is also available through the student-run organisations, particularly the<br />

Students‟ Union <strong>and</strong> the Graduate Students Association. Information about the welfare<br />

network <strong>and</strong> its co-ordination is widely disseminated, so that students seeking assistance in<br />

any quarter can, if necessary, be referred quickly to those with the specialist knowledge <strong>and</strong><br />

skills to help them. The Student Support Services H<strong>and</strong>book, issued to all incoming students,<br />

describes the main contributors to the Welfare Network, <strong>and</strong> includes information about the<br />

Campus Nursery, the Health Centre <strong>and</strong> the Chaplaincy, which <strong>of</strong>fers a contact for all faiths.<br />

Comprehensive information on welfare within the University can also be found at<br />

http://www.york.ac.uk/admin/gso/gsp/living/welfare.htm


24. Contact Details<br />

When you are about to finish the <strong>MSc</strong> course, you are required to complete a form giving<br />

details <strong>of</strong> how to be contacted after completion <strong>of</strong> your course, for the purposes <strong>of</strong> graduation<br />

invitation <strong>and</strong> to receive the formal transcripts <strong>of</strong> the course.<br />

We will require you to write a report on your experience as an <strong>MSc</strong> student, which will be<br />

added in our website.<br />

For security reasons, please do not forget to return to the <strong>Mathematics</strong> Departmental Office<br />

your TESA or Onity security access card before you leave.<br />

25. Presentations<br />

After you h<strong>and</strong> in your dissertation, you will be informed <strong>of</strong> the day in which you should give<br />

a presentation highlighting the main points <strong>of</strong> the content <strong>of</strong> your dissertation <strong>and</strong> your<br />

conclusions. This will take place in a lecture room, where there will be a laptop attached to a<br />

data projector for you to use. You will have 10 minutes to give your presentation <strong>and</strong> there<br />

will be 5 minutes available at the end for possible questions from the audience. Typically the<br />

audience will consist <strong>of</strong> the external examiner, the <strong>MSc</strong> teaching staff, invited members <strong>of</strong><br />

staff <strong>and</strong> some <strong>of</strong> your <strong>MSc</strong> colleagues.<br />

In case you wish to use your own computer, that is fine, but come in advance to test whether it<br />

works with the data projector on the day. If you are not using your own computer, then it is<br />

expected that you prepare your presentation in PowerPoint or Adobe PDF <strong>and</strong> bring a CD-R<br />

containing the file <strong>of</strong> your presentation.<br />

Attention: Please do not bring a floppy disk (this is an unreliable media <strong>and</strong> considered<br />

obsolete), also the laptop provided on the day is not going to be a networked computer, so you<br />

will not be able to download your presentation from your university network account, so make<br />

sure to burn a CD-R with your presentation in advance.<br />

26. Dissertation Guidelines<br />

The intention <strong>of</strong> this section is to give you some advice on how you should write your <strong>MSc</strong><br />

dissertation. However, it is not a strict rule that you should take literally. The style in which a<br />

dissertation is written may reflect the subject it is about, <strong>and</strong> since the <strong>MSc</strong> course has many<br />

different directions one may pursue, the dissertation may differ in length <strong>and</strong> style. Also, you<br />

may have to write your dissertation when you are in placement, where placement means to do<br />

a project outside the <strong>Mathematics</strong> Department <strong>of</strong> the University <strong>of</strong> York, which can be a<br />

project supervised by someone from another department in the University <strong>of</strong> York, or from<br />

someone working in one <strong>of</strong> our industrial partners.<br />

In the final section <strong>of</strong> this document, we give you information that is passed to the first <strong>and</strong><br />

second markers <strong>of</strong> your dissertation as well as to the external examiner <strong>of</strong> our <strong>MSc</strong>. These are<br />

our dissertation marking guidelines <strong>and</strong> it gives you a good idea <strong>of</strong> what the markers will be<br />

looking for in your dissertation to evaluate its quality <strong>and</strong> attach a corresponding mark.


We finish this initial section with highlights <strong>of</strong> some information already present in the <strong>MSc</strong><br />

h<strong>and</strong>book <strong>and</strong> with some additional information. Your dissertation should normally be in A4<br />

paper format <strong>of</strong> length:<br />

About 50 to 80 pages long.<br />

Important Note: It is compulsory that your dissertation should have a cover page containing<br />

the title <strong>of</strong> the dissertation with your name underneath. Also there should be a statement<br />

saying: Dissertation submitted for the <strong>MSc</strong> in Data Analysis, Networks <strong>and</strong> Nonlinear<br />

Dynamics, Department <strong>of</strong> <strong>Mathematics</strong>, University <strong>of</strong> York, UK. (With month <strong>and</strong> year <strong>of</strong><br />

submission underneath.)<br />

This year we are asking you to submit three copies <strong>of</strong> your <strong>MSc</strong> dissertation with deadline<br />

4:30pm on Friday 20th August 2010 to be h<strong>and</strong>ed-in in the General Office G/127 in<br />

<strong>Mathematics</strong>. The copies <strong>of</strong> your dissertation should be spiral s<strong>of</strong>t-bound with transparent<br />

front cover <strong>and</strong> blue back. You can use the University Printing Unit <strong>of</strong> the University <strong>of</strong> York,<br />

located in the Market Square <strong>of</strong> Heslington Campus. They would be able to do the binding<br />

while you wait, however, note that there may be others that will require the same service at the<br />

same time. So you should organise yourself in advance to avoid congestion. Each copy will<br />

cost you around £3 to be bound.<br />

You are also required to submit a CD-R containing your dissertation in both Postscript <strong>and</strong><br />

PDF formats, <strong>and</strong> you have until 4:30pm Friday 20th August 2010 to h<strong>and</strong>-in your disk to the<br />

General Office in <strong>Mathematics</strong>. As mentioned earlier, do not forget to write your name on the<br />

disk using a CD-R pen.<br />

Writing the dissertation<br />

As mentioned above, the dissertation may have very different styles depending on the subject<br />

it covers. What will follow are simply general guidelines.<br />

Usually you have gathered research about a topic or various interconnecting topics during the<br />

work towards the dissertation. In many cases, the dissertation has a practical element, i.e. you<br />

may need to write some programs to carry out some calculations, tests, etc. It is expected that<br />

you would spend an initial part <strong>of</strong> your dissertation covering some <strong>of</strong> the background material<br />

<strong>of</strong> the subject your research topic is all about. Then you would describe the topic or topics <strong>of</strong><br />

research in more detail, <strong>and</strong> then if there are some work that you have carried out on your own<br />

or not, you should complete with some conclusions, maybe suggesting further work or further<br />

reading.<br />

It is expected that you will have a contents page for your dissertation, where the headings <strong>of</strong><br />

chapters <strong>and</strong>/or sections are clearly laid out. This gives a quick way for the reader to have an<br />

impression <strong>of</strong> the content organisation <strong>of</strong> your dissertation.<br />

It is important to note the following general aspects for the dissertation. If you make a<br />

comment about a result or calculation, then you should give it a reference (in case it is referred<br />

somewhere in the literature), otherwise any comments you make will be attributed to yourself.


If you state something that is your own work or your own opinion, then make it clear for the<br />

reader. When covering background material, make clear the references you have used to write<br />

that part <strong>of</strong> your dissertation.<br />

If your dissertation contains programming material, then it is good if you attach the source<br />

code as an Appendix to your dissertation. If you have written the code yourself, state it clearly,<br />

<strong>and</strong> if you modified someone else‟s code then also state it clearly. If you use code that has<br />

been downloaded from the internet, you should explicitly mention the name <strong>of</strong> the authors,<br />

<strong>and</strong> the webpage where the code was obtained. If it is a long program, then you can mention it<br />

in the text <strong>and</strong> in the references, without explicitly including the code in an Appendix.<br />

Finally, as mentioned in the first week <strong>of</strong> your course in the Transferable <strong>and</strong> Generic Skills<br />

module, there is no obligation to write your dissertation in LaTeX <strong>and</strong> you can use Micros<strong>of</strong>t<br />

Word or other typesetting package. Remember that the presentation <strong>of</strong> your dissertation is one<br />

aspect which receives marks <strong>and</strong> counts for the overall quality assessment <strong>of</strong> your dissertation.<br />

Your dissertation supervisor will be able to help you with various aspects <strong>of</strong> the writing-up,<br />

which has been mentioned above. However, in order to make the most <strong>of</strong> their help, you need<br />

to submit regular updates <strong>of</strong> your progress, <strong>and</strong> when close to submission, you should give<br />

enough time for your supervisor to comment on an almost complete draft <strong>of</strong> your dissertation.<br />

Do not leave to write-up your dissertation in the last two weeks or less before submission.<br />

There are good <strong>and</strong> bad examples <strong>of</strong> submitted dissertations, which you can see in the Library<br />

from previous <strong>MSc</strong> students. Ask your supervisor for advice on how to write your dissertation.<br />

Dissertation Marking Guidelines<br />

These are the Marking Guidelines for the dissertation supervisor <strong>and</strong> second marker (as well<br />

as the external examiner, when considering borderline cases).<br />

INSTRUCTIONS FOR MARKERS:<br />

Please mark the dissertation out <strong>of</strong> 100.<br />

You are encouraged to annotate comments <strong>and</strong> corrections on a separate piece <strong>of</strong> paper. You<br />

will be asked to write a short report about the quality <strong>of</strong> the dissertation <strong>and</strong> the reasoning for<br />

your chosen mark. The following is a general guideline:<br />

1-29: No useful results obtained. Inadequate or incorrect descriptions <strong>of</strong> the methods or<br />

theory involved. Little or no underst<strong>and</strong>ing <strong>of</strong> what the work was about. Mathematical errors.<br />

30-39: Very limited results <strong>of</strong> poor quality. Poor description <strong>of</strong> the methods or theory<br />

involved. Very limited underst<strong>and</strong>ing <strong>of</strong> the work. Very limited mathematical skill displayed.<br />

40-49: Some worthwhile results <strong>of</strong> modest quality, but not presented at all well. Reasonable<br />

description <strong>of</strong> the methods or theory involved. Signs <strong>of</strong> misunderst<strong>and</strong>ing <strong>of</strong> the work


involved. Limited ability in developing sound applications <strong>of</strong> the methods <strong>and</strong> theory. Limited<br />

mathematical underst<strong>and</strong>ing.<br />

50-59: Reasonable number <strong>of</strong> results <strong>of</strong> satisfactory quality, presented in a comprehensible<br />

way. Evidence <strong>of</strong> reasonable underst<strong>and</strong>ing <strong>of</strong> the basis <strong>of</strong> the methods or theory, <strong>and</strong> how to<br />

apply them. Evidence <strong>of</strong> an ability to utilise accurate relevant mathematical arguments within<br />

applications.<br />

60-69: A good number <strong>of</strong> results <strong>of</strong> satisfactory quality presented in a lucid <strong>and</strong> thoughtful<br />

way. The methods <strong>and</strong> theory well described <strong>and</strong> understood with clear evidence <strong>and</strong> <strong>of</strong> ability<br />

to apply them in a sensible <strong>and</strong> appropriate way. Evidence <strong>of</strong> an ability to utilise sound<br />

mathematical arguments with applications.<br />

70-100: A high quality dissertation in which the results (<strong>of</strong> good quality <strong>and</strong> quantity) are<br />

presented with clarity <strong>and</strong> precision, the methods <strong>and</strong> theory are well described <strong>and</strong><br />

understood, <strong>and</strong> there is very clear evidence <strong>of</strong> the student‟s ability to apply these in an<br />

appropriate manner, thoughtfully <strong>and</strong> incisively. Substantial evidence <strong>of</strong> an ability to utilise<br />

sound mathematical arguments in applications.<br />

Further guidance on marking in the range 70-100 is given below:<br />

Awarding excellence marks in the range 70-100<br />

The basic criteria for marks greater than 70 are that the c<strong>and</strong>idate shows signs <strong>of</strong> the<br />

following:<br />

E1 Some originality <strong>and</strong> creativity – clever ideas, novel combination <strong>of</strong> ideas;<br />

E2 Knowledge <strong>of</strong> facts or opinions or methodologies not directly taught, but gleaned by the<br />

student from reading <strong>and</strong>/or his/her own work;<br />

E3 Critical ability – an ability to make valid criticisms <strong>of</strong> evidence, views or arguments;<br />

E4 An ability to synthesise a fine argument from many sources – evidence <strong>of</strong> an ability to<br />

combine many ideas <strong>and</strong>/or opinions to build a convincing argument. Evidence <strong>of</strong> the<br />

ability to apply mathematical reasoning accurately <strong>and</strong> sensitively within the relevant<br />

context.<br />

As a guide it is suggested that these „excellence indicators‟ can be used as follows to assign<br />

marks:<br />

70-79: The c<strong>and</strong>idate shows one (marks nearer to 70) or two (marks nearer to 80) <strong>of</strong> the<br />

excellence indicators E1 to E4.<br />

80-89: The c<strong>and</strong>idate shows two (marks nearer to 80) or three (marks nearer to 90) <strong>of</strong> the<br />

indicators E1 to E4.<br />

90-100: The c<strong>and</strong>idate shows all <strong>of</strong> the indicators E1 to E4.


The above can clearly only represent guidelines in forming your judgement. There may be for<br />

example other factors influencing the quality <strong>and</strong> quantity <strong>of</strong> results that need to be taken into<br />

account in assessing the dissertation. The dissertation supervisors should make such<br />

circumstances clear in their reports for second markers.<br />

The dissertation supervisor gives a mark out <strong>of</strong> 20 towards the effort <strong>and</strong> dedication <strong>of</strong> the<br />

student in the work during the dissertation, <strong>and</strong> should write a short paragraph about the<br />

reasoning behind the mark. This information should be available to the second marker, prior to<br />

the independent marking <strong>of</strong> the dissertation.<br />

Finally, a good presentation in the dissertation is also important as it is one <strong>of</strong> the skills we<br />

seek to develop in the <strong>MSc</strong> students. At the end <strong>of</strong> the course, the students should give a 10<br />

minutes oral presentation based on their dissertations. The oral presentation will be marked<br />

out <strong>of</strong> 5 marks. Note that although the presentation mark counts little towards the degree, for<br />

students that are borderline cases this could be an important element. Also, since the external<br />

examiner may not be able to read all the dissertations, it is important that you give a good<br />

impression <strong>of</strong> the work done in your dissertation since the external examiner will attend all the<br />

presentations.<br />

The dissertation mark is a weighted average <strong>of</strong> the agreed mark between first <strong>and</strong> second<br />

markers, the effort <strong>and</strong> dedication mark given by the dissertation supervisor <strong>and</strong> the<br />

presentation mark.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!