NKEB22002U Machine Learning and Molecules (MLmol)

Volume 2024/2025
Content

Students write their own neural network code from scratch using Python and use it to predict chemical properties of molecules. The performance is compared to standard machine learning packages such as scikit-learn, Keras, and DeepChem.

 

Learning Outcome

Knowledge:
Basic principles behind Python programming, machine learning, and cheminformatics. Classification and regression using neural networks. Activation functions, back propagation using gradient descent, Overfitting, regularisation, hyperparameter optimisation, and training/validation/test sets.  SMILES strings, molecular fingerprints, and graph convolution as applied to molecules.

Skills:
Data manipulation and visualisation using Pandas, numpy, and Matplotlib/Seaborn. Manipulation of chemical data using RDKit. Use of scikit-learn, Keras, and DeepChem. 

Competences:
Prediction of chemical properties using machine learning. Critical evaluation of machine learning models.

Literature

See Absalon for a list of course literature

First year organic chemistry and mathematics
Videolectures and classroom discussion
  • Category
  • Hours
  • Class Instruction
  • 12
  • Preparation
  • 93,5
  • E-Learning
  • 50
  • Project work
  • 50
  • Exam
  • 0,5
  • Total
  • 206,0
Written
Oral
Individual
Collective
Continuous feedback during the course of the semester
Feedback by final exam (In addition to the grade)
Credit
7,5 ECTS
Type of assessment
Oral examination, 30 minutes with no preparation time
Type of assessment details
A final 30-minute individual oral examination is without preparation and based on the project report. The exam begins with a 5 minute presentation from the examinee, after which the internal assessors ask questions in the project area.
Exam registration requirements

All three assignments have to be approved by the end of the 7th week; b) the exam report has to be submitted by the end of the 8th week of the block

Aid
All aids allowed

The use of Large Language Models (LLM)/Large Multimodal Models (LMM) – such as ChatGPT and GPT-4 – is permitted.

Marking scale
passed/not passed
Censorship form
No external censorship
Several internal examiners
Re-exam

Same as ordinary exam.

The same report, possibly improved based on feedback, can be used for the oral re-exam.

Criteria for exam assesment

See Learning Outcome