NFYK15002U Advanced Methods in Applied Statistics
MSc Programme in Physics
MSc Programme in Environmental Science
The course will offer the practical knowledge and hands-on experience in computational analysis of data in all frontier physics research, with particular relevance for particle physics, astrophysics, and cosmology. The course content is based on statistical methods and does not require a specific or broad physics background. It is therefore applicable for many non-physics disciplines in the physical sciences.
Interested Ph.D. students and non-physics M.Sc. students in the Physical Sciences are very welcome to enroll.
Knowledge:
- Be familiar with a supervised machine learning algorithm and multivariate analysis techniques, e.g. Boosted Decision Trees
- Parameter estimation and uncertainty estimation using likelihood and Bayesian techniques
- Minimization techniques using Markov Chain Monte Carlo and numerical methods
Skills:
- Maximum Likelihood fitting
- Construction of Confidence Intervals and contours
- Code a chi-squared function in the language of the students preference (Python, C/C++, Ruby, JAVA, R, etc)
- Creation and usage of spline functions
- Application of Kernel Density Estimators
- Inputing and processing data from both ASCII-readable files as well as internet data scraping.
Competences:
This course will help students develop the computational tools, software development, and use of statistical software packages for data analysis. The data analysis techniques are reinforced through assignments, which are import for manuscript preparation, thesis writing, and understanding the methodology and statistical relevance of results in journal articles. The students will have enhanced general coding skills useful in the both academia and industry. Students will develop their own software solutions and tools and strengthen their independent problem solving skills.
No required literature.
For those looking for additional material, “Statistical Data Analysis” by G. Cowan is an excellent choice.
Class lecture notes and links to scholarly articles will be posted online.
- The ability and experience to install external software packages, e.g. the MultiNest Bayesian inference package or “emcee” Markov Chain Monte Carlo sampler.
- Completion of “Applied Statistics: From Data to Results”, or equivalent, is strongly encouraged but not strictly required.
Academic qualifications equivalent to a BSc degree is recommended.
- Category
- Hours
- Lectures
- 36
- Practical exercises
- 32
- Preparation
- 102
- Project work
- 36
- Total
- 206
- Credit
- 7,5 ECTS
- Type of assessment
- Continuous assessmentWritten assignment, 28 hoursAssessment will be based on:
- An in-class short oral presentation (10%)
- Graded problem set(s) and project(s) centering around the coding, implementation, and execution of a statistical method (50%)
- Take home final exam (40%) - Aid
- All aids allowed
- Marking scale
- 7-point grading scale
- Censorship form
- No external censorship
Several internal examiners
- Re-exam
Take-home exam in coordination with the course responsible, 28 hours.
Criteria for exam assesment
see learning outcome
Course information
- Language
- English
- Course code
- NFYK15002U
- Credit
- 7,5 ECTS
- Level
- Full Degree Master
- Duration
- 1 block
- Placement
- Block 3
- Schedule
- A
- Course capacity
- no restriction
- Continuing and further education
- Study board
- Study Board of Physics, Chemistry and Nanoscience
Contracting department
- The Niels Bohr Institute
Contracting faculty
- Faculty of Science
Course Coordinators
- D. Jason Koskinen (koskinen@nbi.ku.dk)