This lecture will take place online only (via Zoom). Link will be send to registered students in due time. Online lecture: This lecture will primarily take place online. Reserved rooms will remain blocked on campus for students to follow the course from there.
The course offers an introduction to graphical models and their application to complex biological systems. Graphical models combine a statistical methodology with efficient algorithms for inference in settings of high dimension and uncertainty. The unifying graphical model framework is developed and used to examine several classical and topical computational biology methods.
Learning objective
The goal of this course is to establish the common language of graphical models for applications in computational biology and to see this methodology at work for several real-world data sets.
Content
Graphical models are a marriage between probability theory and graph theory. They combine the notion of probabilities with efficient algorithms for inference among many random variables. Graphical models play an important role in computational biology, because they explicitly address two features that are inherent to biological systems: complexity and uncertainty. We will develop the basic theory and the common underlying formalism of graphical models and discuss several computational biology applications. Topics covered include conditional independence, Bayesian networks, Markov random fields, Gaussian graphical models, EM algorithm, junction tree algorithm, model selection, Dirichlet process mixture, causality, the pair hidden Markov model for sequence alignment, probabilistic phylogenetic models, phylo-HMMs, microarray experiments and gene regulatory networks, protein interaction networks, learning from perturbation experiments, time series data and dynamic Bayesian networks. Some of the biological applications will be explored in small data analysis problems as part of the exercises.
Lecture notes
no
Literature
- Airoldi EM (2007) Getting started in probabilistic graphical models. PLoS Comput Biol 3(12): e252. doi:10.1371/journal.pcbi.0030252 - Bishop CM. Pattern Recognition and Machine Learning. Springer, 2007. - Durbin R, Eddy S, Krogh A, Mitchinson G. Biological Sequence Analysis. Cambridge university Press, 2004
Performance assessment
Performance assessment information (valid until the course unit is held again)
The performance assessment is only offered in the session after the course unit. Repetition only possible after re-enrolling.
Mode of examination
written 90 minutes
Additional information on mode of examination
Repetition possible only with re-enrollment, including projects. The final grade is 70% written session examination and 30% project. The practical projects are an integral part (60 hours of work, 2 credits) of the course. The project has to be re-run in case of a repetition.
Written aids
None
This information can be updated until the beginning of the semester; information on the examination timetable is binding.
Learning materials
No public learning materials available.
Only public learning materials are listed.
Groups
No information on groups available.
Restrictions
There are no additional restrictions for the registration.