Workshop on Machine Learning organised at the Data Science Center on 20 May 2022.
In the past decade, Deep Learning, originally a subfield of Machine Learning, has gained considerable influence and momentum. In order to distinguish "traditional" Machine Learning such as Random Forest or Regularised Regression from Deep Learning models, we decided to coin this workshop "conventional Machine Learning".
Note: Machine Learning is often abbreviated as ML.
- Get familiarised with Machine Learning terminology and terms.
- Learn about automated Machine Learning: choosing the best ML model
- Practice autoML implementations on two simple datasets, one for classification and one for regression.
To perform autoML, we will use the TPOT Python package that has a great documentation linked to it.
Option 1: To install TPOT using conda/mamba:
conda install -c conda-forge mamba && conda create --name tpot -c conda-forge jupyterlab tpot=0.11.6
A faster alternative to conda is called mamba and is used here.
Option 2: To install TPOT using pip:
pip install virtualenv
(if not available)virtualenv tpot_env
source tpot_env/bin/activate
pip install tpot==0.11.6
Link to the Python package repository of TPOT: https://pypi.org/project/TPOT/
Datasets can be found under 00_datasets/
as a .csv file and a .txt file describing the dataset.
Taken from the UCI Machine Learning Repository Link
Taken from the UCI Machine Learning Repository Link
Authors:
- Iris van der Knaap @Library, UvA Data Science Center Digital Skills Coordinator.
- Casper Thuis, data scientist @IBED, UvA.
- Marc Galland, support data scientist, @SILS, UvA.
- Machine Learning at the Vrije University of Amsterdam: https://mlvu.github.io/
- Genetic programming in Python: https://towardsdatascience.com/genetic-algorithm-implementation-in-python-5ab67bb124a6