NDAA09028U Statistical Methods for Machine Learning
Volume 2013/2014
Education
MSc Programme in Computer
Science
MSc Programme in It and Cognition (HUM)
MSc Programme in It and Cognition (HUM)
Content
The amount and complexity
of available data is steadily increasing. To make use of this
wealth of information, computing systems are needed that turn the
data into knowledge. Machine learning is about developing the
required software that automatically analyses data for making
predictions, categorizations, and recommendations. Machine learning
algorithms are already an integral part of today's computing
systems - for example in search engines, recommender systems, or
biometrical application. Machine learning provides a set of
tools that are widely applicable for data analysis within a diverse
set of problem domains such as data mining, search engines, digital
image and signal analysis, natural language modeling,
bioinformatics, physics, economics, biology, etc.
The purpose of the course is to introduce students to probabilistic data modeling and the most common techniques from statistical machine learning and pattern recognition. The students will obtain a working knowledge of probabilistic data modeling and statistical machine learning for pattern recognition.
This course is relevant for students from among others the studies of Computer Science, E-Science, Bioinformatics, Physics, and Mathematics.
The course covers the following tentative topic list:
The purpose of the course is to introduce students to probabilistic data modeling and the most common techniques from statistical machine learning and pattern recognition. The students will obtain a working knowledge of probabilistic data modeling and statistical machine learning for pattern recognition.
This course is relevant for students from among others the studies of Computer Science, E-Science, Bioinformatics, Physics, and Mathematics.
The course covers the following tentative topic list:
- Foundation of statistical learning, probability theory.
- Likelihood framework, parametric and non-parametric representations. This includes Gaussian distributions, histograms, kernel density estimation, neighborhood based estimation (KNN).
- Classification methods: Linear models, K-Nearest Neighbor (KNN), kernel-based methods such as support vector machines (SVMs), and neural networks.
- Regression methods: Linear regression, non-linear regression
- K-means clustering and mixture modeling.
- Dimensionality reduction and visualization techniques such as principal component analysis (PCA).
Learning Outcome
At course completion, the successful student will have:
Knowledge of
- the general principles of machine learning;
- basic probability theory for modeling and analyzing data;
- the theoretical concepts underlying classification, regression, and clustering;
- the mathematical foundations of selected machine learning algorithms;
- common pitfalls in machine learning.
Skills in
- applying linear and non-linear techniques for classification and regression;
- performing elementary dimensionality reduction;
- elementary data clustering;
- implementing selected machine learning algorithms;
- visualizing and evaluating results obtained with machine learning techniques;
- using software libraries for solving machine learning problems;
- identifying and handling common pitfalls in machine learning.
Competences in
- recognizing and describing possible applications of machine learning;
- comparing, appraising and selecting machine learning methods of for specific tasks;
- solving real-world data mining and pattern recognition problems by using machine learning techniques.
Literature
See Absalon when the course
is set up.
Academic qualifications
Knowledge of basic linear
algebra corresponding to the course Linear Algebra. Knowledge of
programming at an introductory level.
Teaching and learning methods
Lecture and exercise
classes
Workload
- Category
- Hours
- Lectures
- 28
- Practical exercises
- 57
- Preparation
- 14
- Project work
- 50
- Theory exercises
- 57
- Total
- 206
Sign up
Self Service at KUnet
As an exchange, guest and credit student - click here!
Continuing Education - click here!
As an exchange, guest and credit student - click here!
Continuing Education - click here!
Exam
- Credit
- 7,5 ECTS
- Type of assessment
- Written assignmentWritten take-home assignment including a mini-project. Grade: 7-point scale. External grading. Submission in Absalon.
- Exam registration requirements
- All mandatory written take-home assignments must be passed in order to be eligible for the exam.
- Aid
- All aids allowed
- Marking scale
- 7-point grading scale
- Censorship form
- External censorship
- Re-exam
- 20 minutes oral exam without preparation in course curriculum. Grade: 7-point scale. External grading.
Criteria for exam assesment
See learning outcome.
Course information
- Language
- English
- Course code
- NDAA09028U
- Credit
- 7,5 ECTS
- Level
- Full Degree Master
- Duration
- 1 block
- Placement
- Block 3
- Schedule
- A
- Continuing and further education
- Study board
- Study Board of Mathematics and Computer Science
Contracting department
- Department of Computer Science
Course responsibles
- Christian Igel (igel@di.ku.dk)
Lecturers
Sami Brandt
Saved on the
30-04-2013