# 252-0526-00L Statistical Learning Theory

Semester | Spring Semester 2019 |

Lecturers | J. M. Buhmann |

Periodicity | yearly recurring course |

Language of instruction | English |

### Courses

Number | Title | Hours | Lecturers | |||||||
---|---|---|---|---|---|---|---|---|---|---|

252-0526-00 V | Statistical Learning Theory | 3 hrs |
| J. M. Buhmann | ||||||

252-0526-00 U | Statistical Learning Theory | 2 hrs |
| J. M. Buhmann | ||||||

252-0526-00 A | Statistical Learning Theory | 1 hrs | J. M. Buhmann |

### Catalogue data

Abstract | The course covers advanced methods of statistical learning : Statistical learning theory;variational methods and optimization, e.g., maximum entropy techniques, information bottleneck, deterministic and simulated annealing; clustering for vectorial, histogram and relational data; model selection; graphical models. |

Objective | The course surveys recent methods of statistical learning. The fundamentals of machine learning as presented in the course "Introduction to Machine Learning" are expanded and in particular, the theory of statistical learning is discussed. |

Content | # Theory of estimators: How can we measure the quality of a statistical estimator? We already discussed bias and variance of estimators very briefly, but the interesting part is yet to come. # Variational methods and optimization: We consider optimization approaches for problems where the optimizer is a probability distribution. Concepts we will discuss in this context include: * Maximum Entropy * Information Bottleneck * Deterministic Annealing # Clustering: The problem of sorting data into groups without using training samples. This requires a definition of ``similarity'' between data points and adequate optimization procedures. # Model selection: We have already discussed how to fit a model to a data set in ML I, which usually involved adjusting model parameters for a given type of model. Model selection refers to the question of how complex the chosen model should be. As we already know, simple and complex models both have advantages and drawbacks alike. # Statistical physics models: approaches for large systems approximate optimization, which originate in the statistical physics (free energy minimization applied to spin glasses and other models); sampling methods based on these models |

Lecture notes | A draft of a script will be provided; transparencies of the lectures will be made available. |

Literature | Hastie, Tibshirani, Friedman: The Elements of Statistical Learning, Springer, 2001. L. Devroye, L. Gyorfi, and G. Lugosi: A probabilistic theory of pattern recognition. Springer, New York, 1996 |

Prerequisites / Notice | Requirements: knowledge of the Machine Learning course basic knowledge of statistics, interest in statistical methods. It is recommended that Introduction to Machine Learning (ML I) is taken first; but with a little extra effort Statistical Learning Theory can be followed without the introductory course. |

### Performance assessment

Performance assessment information (valid until the course unit is held again) | |

Performance assessment as a semester course | |

ECTS credits | 7 credits |

Examiners | J. M. Buhmann |

Type | session examination |

Language of examination | English |

Repetition | The performance assessment is only offered in the session after the course unit. Repetition only possible after re-enrolling. |

Mode of examination | written 180 minutes |

Additional information on mode of examination | 70% session examination, 30% project; the final grade will be calculated as weighted average of both these elements. As a compulsory continuous performance assessment task, the project must be passed on its own and has a bonus/penalty function. The practical project are an integral part (60 hours of work, 2 credits) of the course. Participation is mandatory. Failing the project results in a failing grade for the overall examination of Statistical Learning Theory (252-0526-00S). Students who fail to fulfil the project requirement have to de-register from the exam. Otherwise, they are not admitted to the exam and they will be treated as a no show. |

Written aids | 2 sheets A4 (= 4 pages) summary, script |

This information can be updated until the beginning of the semester; information on the examination timetable is binding. |

### Learning materials

Main link | Information |

Recording | Statistical Learning Theory recorings |

Only public learning materials are listed. |

### Groups

No information on groups available. |

### Restrictions

There are no additional restrictions for the registration. |