COMP 565 Winter 2023: ML in genomics and healthcare

Course Overview

Genetics is instrumental in understanding complex human phenotypes ranging from human heights to complex diseases. Large-scale molecular and phenotypic profiling technologies provide exciting opportunities and unique challenges for conducting research in data-driven way, thereby linking common diseases to novel phenotypes and novel mutations via the lense of regulatory genomics.

The rapid advance in machine learning research and computing technologies provide new ways to address some of the most important problems in genomics and healthcare. In this topic course, we will gain a broad perspective on the current fields of computational genomics with primary focus on using ML approaches for modeling genomic and healthcare data. We will explore in-depth some of the recently developed and crucial computational methods for modeling big data in life science and health.

Class location and time:

Class Format

There are participation mark taken at each lecture based on the questions and discussion.

There are 10 quizzes. Typically released at the end of every week and due the end of following week.

Each student needs to write a 2-page review of five research papers chosen by the instructor based on the topics discussed in class.

There are five assignments. In each assignment, students will derive and/or implement the key components of some of the algorithms discussed in class and use them to analyze real or simulated dataset.

Students will be working on a course project in a small group. There will be a few default projects provided by the instructor. Provided this is a research-oriented course, each student can also come up with a suitable project based on the research topics discussed in class upon approval of the instructor. The last few lectures of the class will mainly consist of students' project presentations.

Prerequisite courses:

Recommended courses:

Instructor

Yue Li, Office hours: 11 am every Friday in Trottier Bldg 3105.

Teaching Assistant

Liam Hodgson

Evaluation

Relevant Textbooks

Course Syllabus (Tentative):

  1. Statistical genetic approaches (9 hrs, 3 weeks):
  2. Genomics and multi-omic learning (9 hrs, 3 week)
  3. Computational approaches in single-cell analysis (6 hrs, 2 weeks):
  4. Mining big data in healthcare (9 hrs, 3 weeks)