BINF 733 Spring 2005 Microarry Analysis

 

Instructors - Jennifer Weller and Jeff Solka
Meeting Place - Discovery Hall 185

Meeting Time - 4:30 pm - 7:10 pm Wednesdays

Important Dates
First day of classes January 24
Last day to drop with no tuition penalty February 8
Last day to add classes February 8
Last day to drop February 25
Spring Break March 13 - March 20
Last day of classes May 7
Reading Days (Monday all day, Tuesday until 4:30 p.m.) May 9 - 10
Exam period (beginning 4:30 p.m. on Tues., May 10) May 10 - 18
Grades available via Patriot Web May 10
Degree conferral date May 21, 2005

Course Structure - Students will be introduced to both concepts and practical examples for handling and analyzing the data sets arising from large-scale microarray experiments. General analysis tools will include R, and the Bioconductor packages in its library, and Matlab, as well as software packages developed for handling specific analysis problems. Each instructor will lecture for one hour of each class meeting, and there will be a 30-minute practical demonstration of some of the tools discussed at the end of each class session. Homework problems will be assigned to reinforce concepts and develop skills needed for analyzing relevant data sets. Many of the problems will come from the Amaratunga and Cabrera end of chapter problems. Students will give one presentation that comprises a critical discussion of a relevant journal article (the instructors will provide a list of articles from which students may select or a student may independently find an article, subject to the instructors' approval) and a second presentation that presents an original project, for which a report must be written and submitted following an approved format.

Course Description - The student will learn concepts underlying manufacture and experimental design for micro-array studies and numerous micro-array data analysis techniques including data normalization, visualization, discriminant analysis, dimensionality reduction, and clustering.
Course grades: There will be six homework assignments (50%), an in-class presentation of an article from the literature (20%), and an in-class presentation of a project (20%) with an accompanying written report (10%). All homework must be turned in as hard copy at the time it is due; no late assignments will be accepted (however the lowest grade of the six may be dropped). There are no 'extra credit' assignments.


Course Textbooks -
"Exploration and Analysis of DNA micorarray and protein array data" by Amaratunga, D., and Cabrera, J. from Wiley Series in Probability and Statistics, 2004. [abbreviated AC for chapter assignments]

"Analyzing Microarray Gene Expression Data: by McLachlan, GJ, Do, K-A, and Ambroise, C from Wiley series in Probability and Statistics, 2004. [abbreviated MDA for chapter assignments]

"Microarray Bioinformatics" by Stekel, D. from Cambridge University Press, 2003. [abbreviated Stek for chapter assignments]


Tentative Course Schedule -

Data Instructors Topics Assigned Reading Homework

Jan. 26, 2005

Class # 1

Weller/Solka

Manufacturing microarrays & probe design;

Intro to R

MDA: Ch 1, 3

Stek: Ch 1, 3

AC: Ch 2, 3

 

Feb. 2, 2005

Class # 2

Weller/Solka

Data standards and public databases;

Introduction to BioConductor

Stek: Ch 2, 11

AC:Ch1

Biocondutor synopsis

HW 1 Assigned

Feb. 9, 2005

Class # 3

Weller/Solka

Scanning the image;

Image processing and quality measures

MDA: Ch 2.1

Stek: Ch 4

AC: Ch 4

 

Feb. 16, 2005

Class # 4

Solka/Weller

Data cleansing and preprocessing;

Statistical measures for outlier detection

MDA: Ch 2.2

AC: Ch 5.3, 5.7

HW 2 Assigned

HW 1, Due

Feb. 23, 2005

Class # 5

Solka/Weller

Case studies of classic normalization and standardization papers;

Selecting appropriate methods for normalization and standardization

MDA: Ch 2

Stek: Ch 5

AC: Ch 5

 

March 2, 2005

Class # 6

Weller/Solka

Experimental design, replicates and model-based inference;

Choosing appropriate statistical methods for a given design

MDA: Ch 2.5

Stek: Ch 6, 10

AC: Ch 6, 8

HW 3 assigned

HW 2 due

March 9, 2005

Class # 7

Solka

Case studies for identifying differentially expressed genes;

Methods for defining and determining significant levels of differential expression

Stek: Ch 7

AC: Ch 6, 7

 

March 23, 2005

Class # 8

Solka/Weller

Case studies using a variety of clustering techniques;

Methods and parameters for clustering data

MDA: Ch 3-5

Stek: Ch 8

AC: Ch 9

HW 4 assigned

HW 3 due

March 30, 2005

Class # 9

Solka/Weller

Case studies using discriminant analysis methods;

Appropriate selection of discriminant analysis techniques

MDA: Ch 6,7

Stek: Ch 8, 9

AC: Ch 10

 

April 6, 2005

Class # 10

Weller/Solka

Resources for obtaining meaningful annotation of genes;

Survival analysis

MDA: Ch 8 Handouts in class

HW 5 assigned

HW 4 due

April 12, 2005   First Student Presentations (articles)   Slides Due for presentations from all students (media to Dr. Weller by 5pm)

April 13, 2005

Class # 11

Student Presentations      

April 20, 2005

Class # 12

Student Presentations    

HW 6 assigned

HW 5 due

April 26, 2005   Second Student Presentation (projects)   Slides Due for presentations from all students (media to Dr. Weller by 5pm)

April 27, 2005

Class # 13

Student Projects      

May 4, 2005

Class # 14

Student Projects     HW # 6 Due
May 6, 2005   Project Reports   Project reports due from all students (hard copy) by 5pm to Prof. Weller

Slides
Feb. 2, 2005
Introduction to R (slides)
Introduction to R (handout)

Introduction to Bioconductor and R from the Recent MBI Conference

An Introductory Lab from the Seattle 2002 Short Course


My solutions to lab1.


Feb. 9, 2005
Preprocessing Microarry Data from the Recent MBI Conference

Solka Preprocessing Slides

An Introductory Marray Lab from the Recent MBI Course


An Introductory Affy Lab from the Recent MBI Course


Feb. 16, 2005
Solka Outlier Detection Slides

A Couple of Possible Papers for Presentation
"Approximate geodesic distances reveal biologically relevant structures in microarray data," Jens Nilsson ,Thoas Fioretos , Mattias Höglund and Magnus Fontes, Bioinformatics. 2004 Apr 12;20(6):874-80. Epub 2004 Jan 29.

"BagBoosting for tumor classification with gene expression data," Marcel Dettling, Bioinformatics 2004 20(18):3583-3593; doi:10.1093/bioinformatics/bth447 .

"Identifying Splits with Clear Separation: A New Class Discovery Method for Gene Expression Data," Anja von Heydebreck, Wolfgang Huber, Annemarie Poustka and Martin Vingron, Bioinformatics, vol. 1, no. 1, 2001.

"Exploration, Normalization, and Summaries of High Density Oligonucleotide Array Probe Level Data," Rafael A. Irizarry et al., Biostatistics. Vol. 4, Number 2: 249-264..

Links
Here is a link to the webpage from BINF733 - Microarray Analysis 2004 Class

 


Colloquium Talk 2/15/2005