NDAK15007U Machine Learning (ML)

Volume 2015/2016
Education

MSc Programme in Computer Science
MSc Programme in IT and Cognition
MSc Programme in Bioinformatics

Content

The amount and complexity of available data is steadily increasing. To make use of this wealth of information, computing systems are needed that turn the data into knowledge. Machine learning is about developing the required software that automatically analyses data for making predictions, categorizations, and recommendations. Machine learning algorithms are already an integral part of today's computing systems - for example in search engines, recommender systems, or biometrical application. Machine learning provides a set of tools that are widely applicable for data analysis within a diverse set of problem domains such as data mining, search engines, digital image and signal analysis, natural language modeling, bioinformatics, physics, economics, biology, etc.

The purpose of the course is to introduce students to probabilistic data modeling and the most common techniques from statistical machine learning and pattern recognition. The students will obtain a working knowledge of probabilistic data modeling and statistical machine learning for pattern recognition.

This course is relevant for computer science students as well as for students from others studies (e.g., Bioinformatics, Physics, Mathematics, …) with sufficient mathematical background and programming skills.

The course covers the following tentative topic list:

  • Foundations of statistical learning, probability theory.
  • Likelihood framework, parametric and non-parametric representations. 
  • Classification methods, such as: Linear models, K-Nearest Neighbor, kernel-based methods (e.g., support vector machines), and neural networks.
  • Regression methods, such as: Linear regression, non-linear regression.
  • Clustering.
  • Dimensionality reduction and visualization techniques such as principal component analysis (PCA).
Learning Outcome

At course completion, the successful student will have:

Knowledge of

  • the general principles of machine learning;
  • basic probability theory for modeling and analyzing data;
  • the theoretical concepts underlying classification, regression, and clustering;
  • the mathematical foundations of selected machine learning algorithms;
  • common pitfalls in machine learning.

 

Skills in

  • applying linear and non-linear techniques for classification and regression;
  • performing elementary dimensionality reduction;
  • elementary data clustering;
  • implementing selected machine learning algorithms;
  • visualizing and evaluating results obtained with machine learning techniques;
  • using software libraries for solving machine learning problems;
  • identifying and handling common pitfalls in machine learning.

 

Competences in

  • recognizing and describing possible applications of machine learning;
  • comparing, appraising and selecting machine learning methods of for specific tasks;
  • solving real-world data mining and pattern recognition problems by using machine learning techniques.

See Absalon when the course is set up.

Knowledge of linear algebra corresponding to the course Linear Algebra. Knowledge of basic calculus. Knowledge of programming, sufficient to implement basic machine learning and other data analysis algorithms.
Lecture and exercise classes.
This course was formerly known as "Statistical Methods for Machine Learning". The replacement course for non-Computer Science students is called "Data Analysis Methods".
  • Category
  • Hours
  • Lectures
  • 28
  • Practical exercises
  • 57
  • Preparation
  • 14
  • Project work
  • 50
  • Theory exercises
  • 57
  • Total
  • 206
Credit
7,5 ECTS
Type of assessment
Written assignment, Due on the last day of the block.
One written take-home assignment, which includes programming tasks.
Exam registration requirements

There are three mandatory written take-home assignments (which include programming tasks) that must be passed in order to be eligible for the exam.

Aid
All aids allowed
Marking scale
7-point grading scale
Censorship form
External censorship
Re-exam

If student is not qualified then qualification can be achieved by hand-in and approval of equivalent assignments.

20 minutes oral exam without preparation in course curriculum.

Criteria for exam assesment

See learning outcome.