COMPM058 - Bioinformatics

Note: Whilst every effort is made to keep the syllabus and assessment records correct, the precise details must be checked with the lecturer(s).

Code
COMPM058 (Also taught as: COMPGI10)
Year
4
Prerequisites
IIt is expected that students will already be familiar with the principles of techniques such as neural networks, Support Vector Machines, and ideally Hidden Markov Models from earlier parts of their degree course. Also, students will need to have taken the Supervised Learning option (4055)
Term
2
Taught By
David Jones (66%)
Kevin Bryson (33%)
Aims
The overall aim of this course is to introduce students to the new field of bioinformatics (computational biology) and how machine learning techniques can be employed in this area. The course is aimed at students who have no previous knowledge of biology and so the aim of Part 1 of the course is to give a basic introduction to molecular biology as a background for bioinformatics. Part 2 will concentrate on modern bioinformatics applications, particularly those which make good use of pattern recognition and machine learning methods.
Learning Outcomes
To have a basic knowledge of modern molecular biology and genomics. To understand the advantages and disadvantages of different machine learning techniques in bioinformatics and how the relative merits of different approaches can be evaluated by correct benchmarking techniques. To understand how theoretical approaches can be used to model and analyse complex biological systems.

Content:

Part 1: Basic molecular biology (6 lectures)
Introduction to Basic Cell Chemistry: Cell chemistry and macromolecules. Biochemical pathways e.g. Glycolysis. Protein structure and functions.
Part 1
Cell Structure and Function: Cell components. Different types of cell. Chromosome structure and organisation. Cell division.
Part 1
The Hereditary Material: DNA structure, replication and protein synthesis. Structure and roles of RNA. Genetic code. Mechanism of protein synthesis: transcription and translation. Mutation.
Part 1
Recombinant DNA Technology: Restriction enzymes. Hybridisation techniques. Gene cloning. Polymerase chain reaction.
Part 1
Genomics and Structural Genomics: Genes, genomes, mapping and DNA sequencing.
Part 2: Bioinformatics Applications (3 lectures per subject)
Biological Databases: Overview of the use and maintenance of different databases in common use in biology. Case study: the CATH database of protein structure.
Part 2
Gene Prediction: Methods for analysing genomic DNA to identify genes. Techniques: neural networks and HMMs.
Part 2
Detecting Distant Homology: Methods for inferring remote relationships between genes and proteins. Techniques: dynamic programming, HMMs, hierachical clustering.
Part 2
Protein Structure Prediction: Methods for predicting the secondary and tertiary structure of proteins. Techniques: neural networks, SVMs, genetic algorithms and stochastic global optimization.
Part 2
Transcriptomics: Methods for analysing gene expression and microarray data. Techniques: clustering, SVMs.
Part 2
Agent-based Genome Analysis: Automation of genome analysis using intelligent software agents
Part 2
Drug Discovery Informatics: Approaches to drug discovery using bioinformatics techniques

Method of Instruction:

Lecture presentations with associated class problems and group presentation/discussion of key research papers

Assessment:

The course has the following assessment components:

  • Written Examination (2.5 hours, 85%)
  • Coursework Section (1 piece, 15%)

To pass this course, students must:

  • Obtain an overall pass mark of 50% for all sections combined

The examination rubric is:
Answer THREE questions out of FIVE. All questions cary equal marks.

Resources:

Biochemistry- Lubert Stryer, WH Freeman and Co.

Post-genome Informatics, M. Kanehisa, Oxford University Press.

Bioinformatics- Genes, Proteins and Computers, C.A. Orengo, D.T. Jones and

J.M. Thornton, BIOS Scientific Publishers, 2003

Mathematical Biology, J.D. Murray, Springer, 1993.

Other references (including research papers) to be confirmed.

Lecture notes

Lecture notes