Machine Learning for Bioinformatics (COMP-766-02)

Winter Session, 2004

Taught by: Theodore J. Perkins
Office: McGill Centre for Bioinformatics
Email: perkins@mcb.mcgill.ca
Phone: 398-7071 x 09317

Course web page: http://www.mcb.mcgill.ca/~perkins/COMP76602/COMP76602.html
Location: Arts Building 210
Time: 10:35 am - 11:25 am, MWF


The purpose of this course is to introduce students with background in bioinformatics to the major principles and techniques of machine learning, and to look at how these can be applied to problems in bioinformatics. This course is aimed at students who have not previously studied machine learning and who may have limited background in probability and statistics, but who do have a basic background in computer science and an understanding of the problems studied in bioinformatics. The goals of this course are to:

  1. Provide students with a "toolbox" of practical machine learning techniques that are useful for bioinformatics data analysis and research.
  2. Describe proper methodology for applying machine learning techniques, and common pitfalls.
  3. Give students enough expertise to understand and evaluate bioinformatics research papers that involves machine learning.
  4. Provide a sense of what can and what cannot be inferred from data.
  5. To examine which machine learning approaches have been most successful in bioinformatics to date.

Format: Approximately half of the classes will be lectures taught by Dr. Perkins, and half will be discussions of bioinformatics research papers that use machine learning.

Evaluation:

Credits: 4

Prerequisites:

Readings: The official text for the course is

Other references:

Course outline:

Section 1: Unsupervised Learning (Dimensionality reduction, visualization, clustering -- approx 2 weeks)

Section 2: Supervised Learning (a.k.a. function approximation -- approx 4 weeks)

Section 3: Probabilistic Modeling (including some more supervised learning -- approx 4 weeks)

Section 4: Modeling Dynamical Systems (approx 2 weeks)