diff --git a/notebooks/01_structuring_input.ipynb b/notebooks/01_structuring_input.ipynb index bce7443..fc5e0da 100644 --- a/notebooks/01_structuring_input.ipynb +++ b/notebooks/01_structuring_input.ipynb @@ -6,36 +6,47 @@ "source": [ "# 1. Structuring the input\n", "\n", - "In order to train and test models, we need to structure the input data in a way that is compatible with the model and framweork's experiments.\n", + "In order to train and test models, we need to structure the input data in a way that is compatible with the model and the framework's experiments.\n", "\n", - "This framework is designed to work with time-series data and Pytorch Lightning. \n", - "Thus, it provides the necessary tools to create a `Dataset` object and a `LightningDataModule` object.\n", + "For now, this framework is designed to work with time-series data and Pytorch Lightning. \n", + "Thus, it provides the necessary tools to create `Dataset` and `LightningDataModule` objects, required by Pytorch Lightning to train and test models.\n", "\n", - "The `Dataset` object is responsible for loading the data. It is a Pytorch object that is used to load the data and make it available to the model. \n", - "Every `Dataset` class must implement two methods: `__len__` and `__getitem__`.\n", - "The `__len__` method returns the number of samples in the dataset, and the `__getitem__`, given an integer from 0 to `__len__` - 1, returns the corresponding sample from the dataset.\n", - "The returned type of the `__getitem__` method is not specified, but it is usually a 2-element tuple with the input and the target. The input is the data that will be used to make the predictions, and the target is the data that the model will try to predict.\n", - "\n", - "For now, this framework provide implementations for the `Dataset` objects for time-series data, where data is organized in two different ways:\n", + "In this notebook, we explain the default data pipeline, which includes:\n", + "1. Creating `Dataset` objects, that are responsible for loading the data.\n", + "2. Creating `DataLoader` objects, that are responsible for batched loading of the data. It encapsulates the `Dataset` object and provides an iterator to iterate over the data in batches.\n", + "3. Creating `LightningDataModule` objects, that are responsible for loading the data and creating the `Dataset` and encapsulate it into `DataLoader` objects for training, validation, and test sets.\n", "\n", - "- A directory with several CSV files, where each file contains a time-series. Each row in a CSV file is a time-step, each column is a feature, and the whole file is a time-series. This is handled by the `SeriesFolderCSVDataset` class.\n", - "- A single CSV file with a windowed time-series. Each row in the CSV file is a window, and each column is a feature. This is handled by the `MultiModalSeriesCSVDataset` class.\n", - "\n", - "We explain both classes in detais nextly." + "\n" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ - "## Time-series dataset implementations" + "## Time-series dataset implementations\n", + "\n", + "The `Dataset` object is responsible for loading the data. \n", + "It is a Pytorch object that is used to load the data and make it available to the model. \n", + "\n", + "Every `Dataset` class must implement two methods: `__len__` and `__getitem__`.\n", + "The `__len__` method returns the number of samples in the dataset, and the `__getitem__`, given an integer from 0 to `__len__` - 1, returns the corresponding sample from the dataset.\n", + "The returned type of the `__getitem__` method is not specified, but it is usually a 2-element tuple with the input and the target. The input is the data that will be used to make the predictions, and the target is the data that the model will try to predict.\n", + "\n", + "The first step when creating a `Dataset` object is to identify the layout of the data directory and choose the appropriate class to handle it.\n", + "For now, this framework provides default `Dataset` classes for time-series data, which are the `SeriesFolderCSVDataset` and `MultiModalSeriesCSVDataset` classes.\n", + "Both classes assumes that data are stored in CSV files, but with different layouts, to know:\n", + "\n", + "- A directory with several CSV files, where each file contains a time-series. Each row in a CSV file is a time-step, each column is a feature. Thus, the whole file is a single multi-modal time-series. Also, if you want to use labels, it must be in a separated column of the CSV file and it should exists to all rows (time-steps). This layout is handled by the `SeriesFolderCSVDataset` class.\n", + "- A single CSV file with a windowed time-series. Each row contains different modalities of the same windowed time-series. The prefix of the column names is used to identify the modalities. For instance, if the is `accel-x`, all columns that start with this prefix, like `accel-x-1`, `accel-x-2`, `accel-x-3`, are considered time-steps from the same modality (`accel-x`). Also, if you want to use labels, it must be in a separated column and it should exists to all rows, that is, for each windowed multimodal time-series. This layout is handled by the `MultiModalSeriesCSVDataset` class.\n", + "\n", + "We will show how to use these classes in the next sections." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ - "### SeriesFolderCSVDataset\n", + "### `SeriesFolderCSVDataset`\n", "\n", "The `SeriesFolderCSVDataset` class is designed to work with a directory containing several CSV files, where each file represent a time-series. \n", "Each row in a CSV file is a time-step, and each column is a feature. \n", @@ -50,7 +61,7 @@ " ...\n", "```\n", "\n", - "Where each CSV file represents a time-series. \n", + "Where each CSV file represents a time-series, similar to the one below: \n", "\n", "| accel-x | accel-y | accel-z | gyro-x | gyro-y | gyro-z | class |\n", "|---------|---------|---------|---------|---------|---------|---------|\n", @@ -59,7 +70,8 @@ "| 0.498217| 0.00001 | 0.12312 | 0.12312 | 0.12312 | 0.12312 | 1 |\n", "\n", "\n", - "Note that the CSV must have a header with the column names." + "Note that the CSV must have a header with the column names.\n", + "Also, columns that are not used as features or labels are ignored." ] }, { @@ -67,14 +79,13 @@ "metadata": {}, "source": [ "To handle this kind of data, we use the `SeriesFolderCSVDataset` class. This class is a Pytorch `Dataset` object that loads the data from the CSV files and makes it available to the model.\n", - "For this class, we must specify the path to the directory containing the CSV files, the name of the columns that will be used as features, and the name of the column that will be used as the target.\n", - "Note that, each feature (column) represent a dimension of the time-series, while the rows represent the time-steps.\n", + "Note that, each feature (column) represent a dimension of the time-series, while the rows represent the time-steps. The sample is a numpy array.\n", "\n", - "Thus, the `SeriesFolderCSVDataset` class minimally requires:\n", + "For this class, we must specify the following parameters:\n", "\n", - "- `data_path`: the path to the directory containing the CSV files\n", - "- `features`: a list of strings with the names of the features columns, e.g. `['accel-x', 'accel-y', 'accel-z', 'gyro-x', 'gyro-y', 'gyro-z']`\n", - "- `label`: a string with the name of the label column, e.g. `'class'`" + "- `data_path`: the path to the directory containing the CSV files;\n", + "- `features`: a list of strings with the names of the features columns, *e.g.* `['accel-x', 'accel-y', 'accel-z', 'gyro-x', 'gyro-y', 'gyro-z']`;\n", + "- `label`: a string with the name of the label column, *e.g.* `'class'`." ] }, { @@ -83,11 +94,10 @@ "metadata": {}, "outputs": [ { - "name": "stderr", + "name": "stdout", "output_type": "stream", "text": [ - "/usr/local/lib/python3.10/dist-packages/tqdm/auto.py:21: TqdmWarning: IProgress not found. Please update jupyter and ipywidgets. See https://ipywidgets.readthedocs.io/en/stable/user_install.html\n", - " from .autonotebook import tqdm as notebook_tqdm\n" + "[1706883997.242541] [aae107fc745c:2264626:f] vfs_fuse.c:281 UCX ERROR inotify_add_watch(/tmp) failed: No space left on device\n" ] }, { @@ -122,10 +132,10 @@ "metadata": {}, "source": [ "We can get the number of samples in the dataset with the `len` function, and we can retrive a sample with the `__getitem__` method, that is, using `[]`, such as `dataset[0]`.\n", - "The dataset may return:\n", + "The dataset return type is different depending on the `label` parameter.\n", "\n", - "- A 2-element tuple, where the first element is a 2D numpy array with shape `(num_features, time_steps)`, and the second element is a 1D tensor with shape `(time_steps,)`.\n", - "- A 2D numpy array with shape `(num_features, time_steps)`, if `label` is `None`, at the time of the dataset object's creation.\n", + "- If `label` is speficied, the return type is a 2-element tuple, where the first element is a 2D numpy array with shape `(num_features, time_steps)`, and the second element is a 1D tensor with shape `(time_steps,)`.\n", + "- If `label` is not speficied, the return type is a single 2D numpy array with shape `(num_features, time_steps)`.\n", "\n", "Let's check the number of samples and access the first sample and its label." ] @@ -163,7 +173,7 @@ } ], "source": [ - "# Get the first sample\n", + "# Get the first sample. We can go from 0 to length_of_dataset - 1 (56)\n", "sample = dataset[0]\n", "type_of_sample = type(sample).__name__\n", "print(f\"Type of sample: {type_of_sample} with {len(sample)} elements\")" @@ -201,12 +211,14 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "### MultiModalSeriesCSVDataset\n", + "### `MultiModalSeriesCSVDataset`\n", "\n", "\n", - "The `MultiModalSeriesCSVDataset` class is designed to work with a single CSV file containing a windowed time-series. \n", - "The CSV is a multi-modal time-series, where each row is a sample and each column is a feature at a given time-step. \n", - "Features are organized in a way that each group of columns represent a different modality.\n", + "The `MultiModalSeriesCSVDataset` class is designed to work with a single CSV file containing a windowed time-series.\n", + "Each row contains different modalities of the same windowed time-series. \n", + "The prefix of the column names is used to identify the modalities. \n", + "For instance, if the prefix is `accel-x`, all columns that start with this prefix, like `accel-x-1`, `accel-x-2`, `accel-x-3`, are considered time-steps from the same modality (`accel-x`). \n", + "Also, if you want to use labels, it must be in a separated column and it should exists to all rows, that is, for each windowed multimodal time-series. \n", "\n", "The CSV file looks like this:\n", "\n", @@ -217,24 +229,19 @@ "| 0.6820123 | 0.02123 | 0.502123 | 0.502123 | 1 |\n", "| 0.498217 | 0.00001 | 1.414141 | 3.141592 | 1 |\n", "\n", - "In the example, columns `accel-x-0` and `accel-x-1` are the `accel-x` feature at time `0` and time `1`, respectively. The same goes for the `accel-y` feature. Finally, the `class` column is the label. " - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "To handle this kind of data, we use the `MultiModalSeriesCSVDataset` class.\n", - "For this class, we must specify the path to the CSV file, the prefix of the columns that will be used as features, and the columns that will be used as label.\n", - "Note that, each feature (column) represent a dimension of the time-series, while the rows represent the samples.\n", + "In the example, columns `accel-x-0` and `accel-x-1` are the `accel-x` feature at time `0` and time `1`, respectively. \n", + "The same goes for the `accel-y` feature. Finally, the `class` column is the label. \n", + "Columns that are not used as features or labels are ignored.\n", "\n", - "The `MultiModalSeriesCSVDataset` class minimally requires:\n", + "To use `MultiModalSeriesCSVDataset`, we must specify the following parameters:\n", "\n", "- `data_path`: the path to the CSV file\n", "- `feature_prefixes`: a list of strings with the prefixes of the feature columns, e.g. `['accel-x', 'accel-y']`. The class will look for columns with these prefixes and will consider them as features of a modality.\n", "- `label`: a string with the name of the label column, e.g. `'class'`\n", "- `features_as_channels`: a boolean indicating if the features should be treated as channels, that is, if each prefix will become a channel. If ``True``, the data will be returned as a vector of shape `(C, T)`, where C is the number of channels (features/prefixes) and `T` is the number of time steps. Else, the data will be returned as a vector of shape `T*C` (a single vector with all the features).\n", "\n", + "Note that, each feature (column) represent a dimension of the time-series, while the rows represent the samples.\n", + "\n", "Let's show how to read this data and create a `MultiModalSeriesCSVDataset` object." ] }, @@ -343,15 +350,19 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "## Loading batches of data\n", + "## Loading batches of data using DataLoader\n", "\n", "Pytorch models are trained using batches of data. Thus, we do not feed the model with a single sample at a time, but with a batch of samples.\n", "If we see the last example, the `MultiModalSeriesCSVDataset` object returns a single sample at a time. Each sample is a 2-element tuple, where first element is a `(6, 60)` numpy array and the second is an integer, representing the label.\n", "\n", - "A batch of samples add an extra dimension to the data. Thus, in our case, a batch of samples is a 3D tensor, where the first dimension is the batch size (`B`), the second dimension is the number of features, or channels (`C`), and the third dimension is the number of time-steps (`T`).\n", - "Thus, if the data have the shape `(6, 60)`, a batch of samples will have the shape `(B, 6, 60)`. The same happens to `label`, which gains an extra dimension. \n", + "A batch of samples add an extra dimension to the data. \n", + "Thus, in our case, a batch of samples would be a 3D tensor, where the first dimension is the batch size (`B`), the second dimension is the number of features, or channels (`C`), and the third dimension is the number of time-steps (`T`).\n", + "Thus, if the data have the shape `(6, 60)`, a batch of 32 samples will be a tensor with shape `(32, 6, 60)`. \n", + "The same happens to `label`, which gains an extra dimension, and would be an 1D tensor with shape `(32,)`.\n", "\n", - "The batching of samples is done using a `DataLoader` object. This object is a Pytorch object that takes a `Dataset` object and returns batches of samples. The `DataLoader` object is responsible for shuffling the data, dividing it into batches, and loading the data in parallel.\n", + "The batching of samples is done using a `DataLoader` object. \n", + "This object is a Pytorch object that takes a `Dataset` object and returns batches of samples. \n", + "The `DataLoader` object is responsible for shuffling the data, dividing it into batches, and loading the data in parallel.\n", "Thus, given a `Dataset` object, we can easilly create a `DataLoader` object using the `torch.utils.data.DataLoader` class." ] }, @@ -363,7 +374,7 @@ { "data": { "text/plain": [ - "" + "" ] }, "execution_count": 9, @@ -383,8 +394,11 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "We can fetch a batch of samples from the `DataLoader` object using the `__iter__` method, that is, using a `for` loop. Each iteration returns a batch of samples.\n", - "In our case, each batch is a 2-element tuple, where the first element is a 3D tensor with shape `(B, C, T)`, and the second element is a 1D tensor with shape `(B,)`." + "Datasets implement the `__len__` and `__getitem__` methods. \n", + "However, the `DataLoader` object implements the iterable protocol, that is, it implements the `__iter__` method, which returns an iterator to iterate over the data in batches.\n", + "Thus, to fetch a batch of samples from the `DataLoader` object, we can use a `for` loop, as we do with any other iterable object in Python, like lists and tuples (*e.g.* `for batch in dataloader: ...`).\n", + "We can also use the `next` function to fetch a single batch of samples, such as `batch = next(iter(dataloader))`.\n", + "Let's fetch a single sample from the `DataLoader` object and check its shape." ] }, { @@ -401,24 +415,30 @@ } ], "source": [ - "for batch in dataloader:\n", - " inputs, labels = batch\n", - " print(f\"Inputs shape: {inputs.shape}, labels shape: {labels.shape}\")\n", - " break" + "batch = next(iter(dataloader))\n", + "# Batch is a tuple with two elements: inputs and labels. \n", + "# Let's extract it to two different variables\n", + "inputs, labels = batch\n", + "# Print the shape of the inputs and labels\n", + "print(f\"Inputs shape: {inputs.shape}, labels shape: {labels.shape}\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ - "## Handling data splits (train, validation, and test)\n", + "## Handling data splits (train, validation, and test) using `LightningDataModule`\n", "\n", "Usually, we create a `DataLoader` object for the training data, another for the validation data, and another for the test data. \n", - "A simple way to do this is to create a `LightningDataModule` object, which is a Pytorch Lightning object that is responsible for creating the `DataLoader` objects for the training, validation, and test data.\n", + "We can encapsulate the `DataLoader` creation logic in a single place, and make it easy to use the same data processing logic across different experiments.\n", + "A simple way to do this is to create a `LightningDataModule` object.\n", + "\n", + "A `LightningDataModule` object is responsible for splitting the data into training, validation, and test sets, and creating the `DataLoader` objects for each set. \n", + "This object may also be responsible for setting up the data, such as downloading the data from the internet, checking the data, and add the augmentations. \n", "\n", - "A `LightningDataModule` object is responsible for splitting the data into training, validation, and test sets, and creating the `DataLoader` objects for each set. This object may also be responsible for setting up the data, such as downloading the data from the internet, checking the data, and add the augmentations. This module is used to encapsulate all the data loading and processing logic in a single place, and to make it easy to use the same data processing logic across different experiments.\n", + "The `LightningDataModule` object must implement four methods: `setup`, `train_dataloader`, `val_dataloader`, and `test_dataloader`. The `setup` is optional, and is responsible for splitting the data into training, validation, and test sets, and `train_dataloader`, `val_dataloader` and `test_dataloader` methods are responsible for creating the `DataLoader` objects for the training, validation and test sets, respectively.\n", "\n", - "The `LightningDataModule` object must implement three methods: `setup`, `train_dataloader`, and `val_dataloader`. The `setup` is optional, and is responsible for splitting the data into training, validation, and test sets, and the `train_dataloader` and `val_dataloader` methods are responsible for creating the `DataLoader` objects for the training and validation sets, respectively." + "A data module may be implemented as shown below." ] }, { diff --git a/notebooks/02_training_model.ipynb b/notebooks/02_training_model.ipynb index e7260ef..4ce9a64 100644 --- a/notebooks/02_training_model.ipynb +++ b/notebooks/02_training_model.ipynb @@ -7,7 +7,7 @@ "# 2. Training a Pytorch Lighning model\n", "\n", "In this notebook, we show the training of a simple CNN model using Pytorch Lightning. \n", - "We first start with data, then we define the model, and finally we train it." + "We first start with data, then define the model, and finally train it for a HAR task." ] }, { @@ -16,8 +16,8 @@ "source": [ "## Creating KuHar LightningDataModule\n", "\n", - "In order to train a model, we must first create a LightningDataModule.\n", - "In this work, we will use the Standartized KuHar HAR data. Our data folder looks like this:\n", + "In order to train a model, we must first create a `LightningDataModule`, that will define the data loaders for training, validation and test.\n", + "Here, we will use the Standartized KuHar data. Therefore, the data directory may looks like this:\n", "\n", "```\n", "KuHar/\n", @@ -28,35 +28,47 @@ "\n", "The `train.csv` file may look like this:\n", "\n", - "| accel-x-0 | accel-x-1 | accel-y-0 | accel-y-1 | class |\n", - "|-----------|-----------|-----------|-----------|--------|\n", - "| 0.502123 | 0.02123 | 0.502123 | 0.502123 | 0 |\n", - "| 0.6820123 | 0.02123 | 0.502123 | 0.502123 | 1 |\n", - "| 0.498217 | 0.00001 | 1.414141 | 3.141592 | 1 |\n", + "| accel-x-0 | accel-x-1 | accel-y-0 | accel-y-1 | ... | standard activity code |\n", + "|-----------|-----------|-----------|-----------|------|------------------------|\n", + "| 0.502123 | 0.02123 | 0.502123 | 0.502123 | ... | 0 |\n", + "| 0.6820123 | 0.02123 | 0.502123 | 0.502123 | ... | 0 |\n", + "| 0.498217 | 0.00001 | 1.414141 | 3.141592 | ... | 1 |\n", "\n", - "As each CSV file contains time-windows signals of two 3-axis sensors (accelerometer and gyroscope), we must use the `MultiModalSeriesCSVDataset` class. After it, we must create a LightningDataModule, that will define the data loaders for training, validation and test. \n", + "As each CSV file contains windowed time signals of two 3-axial sensors, we may use the `MultiModalSeriesCSVDataset` class to handle this data structure.\n", + "After it, we must create a `LightningDataModule`, that will define the data loaders for training, validation and test. \n", + "The implementation of `LightningDataModule` may look like the snippet below:\n", "\n", - "### Faciliting the creation of the LightningDataModule with MultiModalHARSeriesDataModule\n", - "\n", - "In order to facilitate the `Dataset` and `DataLoader` creation, we will use the `MultiModalHARSeriesDataModule`. If:\n", - "\n", - "1. Your directory is organized like the one above; and \n", - "2. Each CSV file is a collection os time-windows of signals (that possibly would be used as a dataset wrapping `MultiModalSeriesCSVDataset`).\n", - "\n", - "Then, you can use the `The `train.csv` file may look like this:\n", + "```python\n", + "import lightning as L\n", + "from torch.utils.data import DataLoader\n", + "from ssl_tools.data.datasets import MultiModalSeriesCSVDataset\n", "\n", - "| accel-x-0 | accel-x-1 | accel-y-0 | accel-y-1 | class |\n", - "|-----------|-----------|-----------|-----------|--------|\n", - "| 0.502123 | 0.02123 | 0.502123 | 0.502123 | 0 |\n", - "| 0.6820123 | 0.02123 | 0.502123 | 0.502123 | 1 |\n", - "| 0.498217 | 0.00001 | 1.414141 | 3.141592 | 1 |\n", + "class HARDataModule(L.LightningDataModule):\n", + " def __init__(self, data_path: Path, batch_size: int):\n", + " super().__init__()\n", + " self.data_path = data_path\n", + " self.batch_size = batch_size\n", + " \n", + " def train_dataloader(self):\n", + " dataset = MultiModalSeriesCSVDataset(self.data_path / 'train.csv')\n", + " return DataLoader(dataset, batch_size=self.batch_size, shuffle=True)\n", + " \n", + " ...\n", + "```" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "### Faciliting the creation of the LightningDataModule with MultiModalHARSeriesDataModule\n", "\n", - "As each CSV file contains time-windows signals of two 3-axis sensors (accelerometer and gyroscope), we must use the `MultiModalSeriesCSVDataset` class. After it, we must create a LightningDataModule, that will define the data loaders for training, validation and test. ` to create a `LightningDataModule`, easily. \n", - "The `train_dataloader` method will use `train.csv`, `val_dataloader` will use `validation.csv` and `test_dataloader` will use `test.csv`.\n", + "If your directory is organized like the one above, the CSVs are a collection of time-windows of signals, and the `LightningDataModule` implementation may looks like the one above, you can use the `MultiModalHARSeriesDataModule` to create a `LightningDataModule` easily for you.\n", + "The `train_dataloader` method will use `train.csv`, `val_dataloader` will use `validation.csv` and `test_dataloader` will use `test.csv` to create the `MultiModalSeriesCSVDataset` and encapsulate into `DataLoader`.\n", "\n", "To create a `MultiModalHARSeriesDataModule`, we must pass:\n", "\n", - "- `data_path`: the path to the `KuHar` folder;\n", + "- `data_path`: the path to the directory containing the CSV files (`train.csv`, `validation.csv` and `test.csv`). We use `standardized_balanced/KuHar` in this case;\n", "- `feature_prefixes`: the prefixes of the features in the CSV files. In this case, we have `accel-x`, `accel-y`, `accel-z`, `gyro-x`, `gyro-y` and `gyro-z`;\n", "- `batch_size`: the batch size for the data loaders; and\n", "- `num_workers`: the number of workers for the data loaders. Essentially, the number of parallel processes to load the data.\n", @@ -69,18 +81,10 @@ "execution_count": 1, "metadata": {}, "outputs": [ - { - "name": "stderr", - "output_type": "stream", - "text": [ - "/usr/local/lib/python3.10/dist-packages/tqdm/auto.py:21: TqdmWarning: IProgress not found. Please update jupyter and ipywidgets. See https://ipywidgets.readthedocs.io/en/stable/user_install.html\n", - " from .autonotebook import tqdm as notebook_tqdm\n" - ] - }, { "data": { "text/plain": [ - "" + "MultiModalHARSeriesDataModule(data_path=/workspaces/hiaac-m4/ssl_tools/data/standartized_balanced/KuHar, batch_size=64)" ] }, "execution_count": 1, @@ -99,7 +103,6 @@ " label=\"standard activity code\",\n", " features_as_channels=True,\n", " batch_size=64,\n", - " num_workers=0, # Sequential, for notebook compatibility\n", ")\n", "data_module" ] @@ -108,12 +111,14 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "We can test the dataloaders by getting the first batch of each one. Let's do it, but just for the `train_dataloader`. Note that the `.setup()` method must be called before getting the data loaders. If you don't call it, the data loaders will not be created. However, when used to train a model, the Pytorch Lightning `.fit()` method will call the `.setup()` method for you. So, we put it here just to show how to use it." + "We can test the dataloaders by getting the first batch of each one. Let's do it (only for`train_dataloader`)!. \n", + "\n", + "> **NOTE**: We use the data_module.train_dataloader() method to get the data loader for the training set. Note that the `.setup()` method must be called before getting the data loaders. If you don't call it, the data loaders will not be created. However, when used to train a model, the Pytorch Lightning `Trainer.fit()` method will automatically call the `.setup()` method for you. So, we put it here just to show how to fetch a data from `train_dataloader` and check if it is working." ] }, { "cell_type": "code", - "execution_count": 2, + "execution_count": 4, "metadata": {}, "outputs": [ { @@ -126,14 +131,21 @@ ], "source": [ "data_module.setup(\"fit\") # We just put it here to test.\n", - " # When training a model, the Trainer will call this method.\n", + " # When training a model, the Trainer will \n", + " # call this method.\n", + "\n", "train_dataloader = data_module.train_dataloader()\n", "\n", - "# Pick the first batch to inspect. The batch size is 64, so we have 64 samples.\n", + "# Pick the first batch to inspect. As batch size is 64, we will have 64 samples.\n", + "# Note that dataloader only implement iterator protocol, \n", + "# so we can use next() to fetch one batch.\n", "batch = next(iter(train_dataloader))\n", - "# Each batch is a 2-element tuple with the first element being the 64 sample input and the second the 64 sample target.\n", + "# Each batch is a 2-element tuple:\n", + "# First element is a Tensor with 64 input samples\n", + "# and the second is a Tensor with 64 labels.\n", "inputs, targets = batch\n", "\n", + "# (B, C, T) = (Batch size, Channels, Time steps) = (64, 6, 60)\n", "print(f\"Inputs shape: {inputs.shape}, Targets shape: {targets.shape}\")" ] }, @@ -143,22 +155,23 @@ "source": [ "## Training a simple model\n", "\n", - "We will create a simple 1D CNN Pytorch Lightning model using the `Simple1DConvNetwork`. The model will be trained to classify the activities in the KuHar dataset. \n", + "We will create a simple 1D CNN Pytorch Lightning model using the `Simple1DConvNetwork`. The model will be trained to classify the activities in KuHar dataset. \n", "\n", - "Pytorch Lightning models must implement the `forward` method, `training_step` and `configure_optimizers` methods. Also, the `__init__` method is used to define the model.\n", + "Pytorch Lightning models must implement the `forward` method, `training_step` and `configure_optimizers` methods. \n", + "Also, the `__init__` method is used to define the model.\n", "The `forward` method is the same as the Pytorch `forward` method. \n", - "The `training_step` method is the method that will be called for each batch of data during the training. \n", + "The `training_step` method is the method that will be called for each batch of data during the training. It should return the loss of the batch.\n", "The `configure_optimizers` method is the method that will define the optimizer to be used during the training.\n", "\n", - "The `Simple1DConvNetwork` is a simple 1D CNN model that will be used to classify the activities in the KuHar dataset. It has 3 convolutional layers and 2 fully connected layers. It is trained using the `Adam` optimizer and the `CrossEntropyLoss` loss function.\n", + "The `Simple1DConvNetwork` is a simple 1D CNN model, that has 3 convolutional layers and 2 fully connected layers. \n", + "It is trained using the `Adam` optimizer and the `CrossEntropyLoss` loss function.\n", "\n", - "Besides that, Lightning models implemented in this framework, usually logs the training and validation losses.\n", - "Also, the `test` usually implement common metrics, such as accuracy." + "Besides that, Lightning models implemented in this framework, usually logs the training and validation losses." ] }, { "cell_type": "code", - "execution_count": 3, + "execution_count": 5, "metadata": {}, "outputs": [ { @@ -186,7 +199,7 @@ ")" ] }, - "execution_count": 3, + "execution_count": 5, "metadata": {}, "output_type": "execute_result" } @@ -195,10 +208,10 @@ "from ssl_tools.models.nets.convnet import Simple1DConvNetwork\n", "\n", "model = Simple1DConvNetwork(\n", - " input_channels=6, # The number of input channels (accel-x, accel-y, accel-z, gyro-x, gyro-y, gyro-z)\n", - " num_classes=6, # The number of output classes\n", - " time_steps=60, # Used to automatically calculate the input size of the linear layer\n", - " learning_rate=1e-3, # The learning rate for the optimizer\n", + " input_channels=6, # The number of input channels (accel-x, accel-y, ...)\n", + " num_classes=6, # The number of output classes\n", + " time_steps=60, # Used to auto calculate the input size of FC layers\n", + " learning_rate=1e-3, # The learning rate of the Adam optimizer\n", ")\n", "\n", "model" @@ -208,16 +221,19 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "To train a Lightning model using Pytorch Lightning, we must create a `Trainer` and call the `fit` method. The `Trainer` is responsible for training the model. It has several parameters, such as the number of epochs, the number of GPUs to use, the number of TPU cores to use, etc. \n", + "To train a Lightning model using Pytorch Lightning, we must create a `Trainer` and call the `fit` method. The `Trainer` is responsible for training the model. \n", + "It has several parameters, such as the number of epochs, the number of GPUs/CPUs to use, *etc*. \n", "\n", - "We will train our model using the already defined dataloader. The `fit` method will be responsible for training the model using the training and validation data loaders. After the training, we will test the model using the test data loader.\n", + "We will train our model using the already defined dataloader. \n", + "The `fit` method will be responsible for training the model using the training and validation data loaders. \n", + "After training, we will test the model using the test data loader and Trainer's `test` method.\n", "\n", - "The training will run for 300 epochs (`max_epochs`) and will use 1 (`devices`) GPU only (`accelerator`)." + "Here, the training will run for 300 epochs (`max_epochs`) and will use only 1 (`devices`) GPU (`accelerator`)." ] }, { "cell_type": "code", - "execution_count": 4, + "execution_count": 6, "metadata": {}, "outputs": [ { @@ -227,13 +243,7 @@ "GPU available: True (cuda), used: True\n", "TPU available: False, using: 0 TPU cores\n", "IPU available: False, using: 0 IPUs\n", - "HPU available: False, using: 0 HPUs\n" - ] - }, - { - "name": "stderr", - "output_type": "stream", - "text": [ + "HPU available: False, using: 0 HPUs\n", "LOCAL_RANK: 0 - CUDA_VISIBLE_DEVICES: [0,1]\n", "\n", " | Name | Type | Params\n", @@ -249,114 +259,133 @@ ] }, { - "name": "stdout", - "output_type": "stream", - "text": [ - "Sanity Checking DataLoader 0: 0%| | 0/2 [00:00┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓\n", - "┃ Test metric DataLoader 0 ┃\n", - "┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩\n", - "│ test_acc 0.9027777910232544 │\n", - "│ test_loss 0.626140832901001 │\n", - "└───────────────────────────┴───────────────────────────┘\n", - "\n" - ], + "application/vnd.jupyter.widget-view+json": { + "model_id": "22489dabd4ac414597c2e5684f91780b", + "version_major": 2, + "version_minor": 0 + }, "text/plain": [ - "┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓\n", - "┃\u001b[1m \u001b[0m\u001b[1m Test metric \u001b[0m\u001b[1m \u001b[0m┃\u001b[1m \u001b[0m\u001b[1m DataLoader 0 \u001b[0m\u001b[1m \u001b[0m┃\n", - "┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩\n", - "│\u001b[36m \u001b[0m\u001b[36m test_acc \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 0.9027777910232544 \u001b[0m\u001b[35m \u001b[0m│\n", - "│\u001b[36m \u001b[0m\u001b[36m test_loss \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 0.626140832901001 \u001b[0m\u001b[35m \u001b[0m│\n", - "└───────────────────────────┴───────────────────────────┘\n" + "Validation: | | 0/? [00:00┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓\n", + "┃ Test metric DataLoader 0 ┃\n", + "┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩\n", + "│ test_acc 0.8333333134651184 │\n", + "│ test_loss 1.9901254177093506 │\n", + "└───────────────────────────┴───────────────────────────┘\n", + "\n" + ], + "text/plain": [ + "┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓\n", + "┃\u001b[1m \u001b[0m\u001b[1m Test metric \u001b[0m\u001b[1m \u001b[0m┃\u001b[1m \u001b[0m\u001b[1m DataLoader 0 \u001b[0m\u001b[1m \u001b[0m┃\n", + "┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩\n", + "│\u001b[36m \u001b[0m\u001b[36m test_acc \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 0.8333333134651184 \u001b[0m\u001b[35m \u001b[0m│\n", + "│\u001b[36m \u001b[0m\u001b[36m test_loss \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 1.9901254177093506 \u001b[0m\u001b[35m \u001b[0m│\n", + "└───────────────────────────┴───────────────────────────┘\n" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/plain": [ + "[{'test_loss': 1.9901254177093506, 'test_acc': 0.8333333134651184}]" + ] + }, + "execution_count": 7, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "trainer.test(model, data_module)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "### Using any other set from data module" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "And if we want to test the model using the validation data loader, we also can use the `trainer.test` method, but passing the `val_dataloader`. \n", + "Remember that as we are not passing a `LightningDataModule` to the `test` method, but a `DataLoader`, we must call `setup` method." + ] + }, + { + "cell_type": "code", + "execution_count": 8, + "metadata": {}, + "outputs": [ + { + "name": "stderr", + "output_type": "stream", + "text": [ + "LOCAL_RANK: 0 - CUDA_VISIBLE_DEVICES: [0,1]\n" + ] + }, + { + "data": { + "application/vnd.jupyter.widget-view+json": { + "model_id": "bd31957d8c5a40bfa0624948e306396e", + "version_major": 2, + "version_minor": 0 + }, + "text/plain": [ + "Testing: | | 0/? [00:00┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓\n", "┃ Test metric DataLoader 0 ┃\n", "┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩\n", - "│ test_acc 0.5680751204490662 │\n", - "│ test_loss 13.804328918457031 │\n", + "│ test_acc 0.5962441563606262 │\n", + "│ test_loss 14.916933059692383 │\n", "└───────────────────────────┴───────────────────────────┘\n", "\n" ], @@ -419,8 +4653,8 @@ "┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓\n", "┃\u001b[1m \u001b[0m\u001b[1m Test metric \u001b[0m\u001b[1m \u001b[0m┃\u001b[1m \u001b[0m\u001b[1m DataLoader 0 \u001b[0m\u001b[1m \u001b[0m┃\n", "┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩\n", - "│\u001b[36m \u001b[0m\u001b[36m test_acc \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 0.5680751204490662 \u001b[0m\u001b[35m \u001b[0m│\n", - "│\u001b[36m \u001b[0m\u001b[36m test_loss \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 13.804328918457031 \u001b[0m\u001b[35m \u001b[0m│\n", + "│\u001b[36m \u001b[0m\u001b[36m test_acc \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 0.5962441563606262 \u001b[0m\u001b[35m \u001b[0m│\n", + "│\u001b[36m \u001b[0m\u001b[36m test_loss \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 14.916933059692383 \u001b[0m\u001b[35m \u001b[0m│\n", "└───────────────────────────┴───────────────────────────┘\n" ] }, @@ -430,17 +4664,18 @@ { "data": { "text/plain": [ - "[{'test_loss': 13.804328918457031, 'test_acc': 0.5680751204490662}]" + "[{'test_loss': 14.916933059692383, 'test_acc': 0.5962441563606262}]" ] }, - "execution_count": 6, + "execution_count": 8, "metadata": {}, "output_type": "execute_result" } ], "source": [ "data_module.setup(\"fit\")\n", - "trainer.test(model, data_module.val_dataloader())" + "validation_dataloader = data_module.val_dataloader()\n", + "trainer.test(model, validation_dataloader)" ] } ], diff --git a/notebooks/03_training_ssl_model.ipynb b/notebooks/03_training_ssl_model.ipynb index 88886ea..0442211 100644 --- a/notebooks/03_training_ssl_model.ipynb +++ b/notebooks/03_training_ssl_model.ipynb @@ -4,20 +4,20 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "# 3. Training a self-supervised model (CPC)\n", + "# 3. Training a self-supervised model: Contrastive Predictive Coding (CPC)\n", "\n", "In this notebook, we will train a self-supervised model using the Contrastive Predictive Coding (CPC) method. \n", "This method is based on the idea of predicting future tokens in a sequence, and it has been shown to be very effective in learning useful representations for downstream tasks.\n", "This framework already provides an implementation of CPC, so we will use it to train the model.\n", "\n", - "We will pre-train the model using KuHar dataset, and then we will use the learned representations to train a classifier for the downstream task. \n", + "We will pre-train the model using KuHar dataset, and then we will use the learned representations to train a classifier for the downstream task (fine tuning). \n", "For both stages of training, as the last notebook, we will:\n", "\n", "1. Create a `Dataset` and then `LightningDataModule` to load the data;\n", "2. Instantiate the CPC model; and\n", "3. Train the model using PyTorch Lightning.\n", "\n", - "We can instantiate the model in two ways:\n", + "Every SSL model in this framework can instantiate in two ways:\n", "\n", "1. Instantiate each element, such as the encoder, the autoregressive model, and the CPC model, and then pass them to the CPC model; or\n", "2. Using builder methods to instantiate the model. In this case, we do not need to instantiate each element separately, but we can still customize the model by passing the desired parameters to the builder methods. This is the approach we will use in this notebook.\n", @@ -31,7 +31,9 @@ "source": [ "## Pre-training the model\n", "\n", - "We will pre-train the model using the KuHar dataset. CPC is a self-supervised method, so we do not need labels to train the model. However, CPC assumes that the input data is sequential, that is, an input is a sequence of time-steps comprising different acitivities. Thus, for HAR, usually, one sample (a multi-modal time-series) correspond to the whole time-series of a single user.\n", + "We will pre-train the model using the KuHar dataset. CPC is a self-supervised method, so we do not need labels to train the model. \n", + "However, CPC assumes that the input data is sequential, that is, an input is a sequence of time-steps comprising different acitivities. \n", + "Thus, for HAR, usually, one sample is a multi-modal time-series correspond to the whole time-series of a single user.\n", "\n", "### Creating the LightningDataModule\n", "\n", @@ -53,7 +55,7 @@ " ...\n", "```\n", "\n", - "And the content of each file should be something like:\n", + "And the content of each CSV file should be something like:\n", "\n", "| timestamp | accel-x | accel-y | accel-z | gyro-x | gyro-y | gyro-z | activity |\n", "|-----------|---------|---------|---------|--------|--------|--------|-----------|\n", @@ -66,16 +68,16 @@ "In this way, we should use the `SeriesFolderCSVDataset` to load the data.\n", "This will create a `Dataset` for us, where each CSV file is a sample, and each row of the CSV file is a time-step, and the columns are the features.\n", "\n", - "> **NOTE**: The samples may have different lengths, so, for this method, the `batch_size` must be 1.\n", - "\n", "If your data is organized as above, where inside the root folder (`data/` in this case) there are sub-folders for each split (`train/`, `validation/`, and `test/`), and inside each split folder there are the CSV files, you can use the `UserActivityFolderDataModule` to create a `LightningDataModule` for you.\n", "This class will create `DataLoader` of `SeriesFolderCSVDataset` for each split (train, validation, and test), and will setup data correctly.\n", "\n", - "In this notebook, we will use the `UserActivityFolderDataModule` to create the `LightningDataModule` for us. This class minimally requires:\n", + "In this notebook, we will use the `UserActivityFolderDataModule` to create the `LightningDataModule` for us. This class requires the following parameters:\n", "\n", "- `data_path`: the root directory of the data;\n", "- `features`: the name of the features columns;\n", - "- `pad`: a boolean indicating if the samples should be padded to the same length, that is, the length of the longest sample in the dataset. The padding scheme will replicate the samples, from the beginning, until the length of the longest sample is reached. " + "- `pad`: a boolean indicating if the samples should be padded to the same length, that is, the length of the longest sample in the dataset. The padding scheme will replicate the samples, from the beginning, until the length of the longest sample is reached. \n", + " \n", + "> **NOTE**: The samples may have different lengths, so, for this method, the `batch_size` must be 1." ] }, { @@ -83,6 +85,13 @@ "execution_count": 1, "metadata": {}, "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "[1706884475.353781] [aae107fc745c:2265333:f] vfs_fuse.c:281 UCX ERROR inotify_add_watch(/tmp) failed: No space left on device\n" + ] + }, { "data": { "text/plain": [ @@ -120,9 +129,10 @@ "### Pre-training the model\n", "\n", "Here we will use the builder method `build_cpc` to instantiate the CPC model.\n", - "This will instantiate an CPC self-supervised model, with the default encoder (`ssl_tools.models.layers.gru.GRUEncoder`), that is an GRU+Linear, and the default autoregressive model (`torch.nn.GRU`), a linear layer.\n", + "This will instantiate an CPC self-supervised model, with the default encoder (`ssl_tools.models.layers.gru.GRUEncoder`), that is an GRU+Linear, and the default autoregressive model (`torch.nn.GRU`).\n", "\n", - "We can parametrize the creation of the model by passing the desired parameters to the builder method. The `build_cpc` method can be parametrized the following parameters:\n", + "We can parametrize the creation of the model by passing the desired parameters to the builder method. T\n", + "he `build_cpc` method can be parametrized the following parameters:\n", "\n", "- `encoding_size`: the size of the encoded representation;\n", "- `in_channels`: number of input features;\n", @@ -131,9 +141,10 @@ "- `learning_rate`: the learning rate of the optimizer;\n", "- `window_size` : size of the input windows (`X_t`) to be fed to the encoder (GRU).\n", "\n", - "All parameters are optional, and have default values. You may want to consult the documentation of the method to see the default values and additional parameters.\n", + "All parameters are optional, and have default values. \n", + "You may want to consult the documentation of the method to see the default values and additional parameters.\n", "\n", - "Note that the `LightningModule` returned by the `build_cpc` method is already configured to use the `CPC` loss, and the `Adam` optimizer." + "Note that the `LightningModule` returned by the `build_cpc` method is already configured to use the CPC loss, and the `Adam` optimizer." ] }, { @@ -217,7 +228,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "ae5e2aa76b724bd28d4a6a30a24741b6", + "model_id": "8e3634f46d0440d78d1cc4df789b6f63", "version_major": 2, "version_minor": 0 }, @@ -231,7 +242,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "09c22282fde24cfd9de81b97f95e86fd", + "model_id": "640b0f0a79794c2d97ca00a311e4b08d", "version_major": 2, "version_minor": 0 }, @@ -245,7 +256,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "bef77ce96a814d7285754998c34e9ac9", + "model_id": "36aa3a12d53f47e39962e445c39d2af3", "version_major": 2, "version_minor": 0 }, @@ -259,7 +270,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "1937e63c20084b50bb692146e61413a1", + "model_id": "9b41ae7b312341bb8872f7b4e56a753e", "version_major": 2, "version_minor": 0 }, @@ -273,7 +284,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "f69d85b4c6d740a69283266cec51d423", + "model_id": "dbcc4c7c3da648fa94d7a3f2663ffe5e", "version_major": 2, "version_minor": 0 }, @@ -287,7 +298,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "b646a368e4604032afa6c76ad58e442c", + "model_id": "7f89209f2f5b44488be9b5d002fb949f", "version_major": 2, "version_minor": 0 }, @@ -301,7 +312,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "34378bbe5c29451b8e8699f5d5e5c5f3", + "model_id": "52322530fcc641b2b3f2da9e76d23a5d", "version_major": 2, "version_minor": 0 }, @@ -315,7 +326,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "7eb66332ab514500beab85981de2cc86", + "model_id": "268860b5d4104d1f83692fe48f173f7c", "version_major": 2, "version_minor": 0 }, @@ -329,7 +340,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "c497708ecea642399c3972d6588834dc", + "model_id": "96c45fd34de246cea3b0321434c8205a", "version_major": 2, "version_minor": 0 }, @@ -343,7 +354,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "81d632bcc570499580819d00fad1f48c", + "model_id": "66aa47a3e2e34cd383f03c62dde9bbaf", "version_major": 2, "version_minor": 0 }, @@ -357,7 +368,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "5f64cf4dbb41406da82a5a8028f50080", + "model_id": "e6ce5719fcd540dfad7536ac5ac75baf", "version_major": 2, "version_minor": 0 }, @@ -371,7 +382,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "2c930a3f8e18484bb2fc18b2a4773d1a", + "model_id": "e472b12bf71c4b2f909f481385a2b371", "version_major": 2, "version_minor": 0 }, @@ -404,7 +415,9 @@ "source": [ "This finishes the pre-training stage. \n", "\n", - "To obtain the latent representations of the data, we must use cpc `forward` method on the data. In this framework, the `forward` method of the SSL models returns the latent representations of the input data. Usually this is the output of the encoder, as in this case, but it may vary depending on the model.\n", + "To obtain the latent representations of the data, we must use `model.forward()` method on the data. \n", + "In this framework, the `forward` method of the SSL models returns the latent representations of the input data. \n", + "Usually this is the output of the encoder, as in this case, but it may vary depending on the model.\n", "\n", "We will use the encoder to obtain the latent representations of the data, and then we will use these representations to train a classifier for the downstream task." ] @@ -429,7 +442,8 @@ "Human acivity recognition is a supervised classification task, that usually receives multi-modal windowed time-series as input, diferently from the self-supervised task, that receives the whole time-series of a single user.\n", "Thus, we cannot use the same `LightningDataModule` to load the data for the downstream task. \n", "\n", - "In this notebook, we will use the windowed time-series version of the KuHar dataset, that each split is a single CSV file, containing windowed time-series of the users. The content of the file should be something like:\n", + "In this notebook, we will use the windowed time-series version of the KuHar dataset, that each split is a single CSV file, containing windowed time-series of the users. \n", + "The content of the file should be something like:\n", "\n", "```\n", "KuHar/\n", @@ -438,7 +452,7 @@ " test.csv\n", "```\n", "\n", - "The `train.csv` file may look like this:\n", + "The CSVs file may look like this:\n", "\n", "| accel-x-0 | accel-x-1 | accel-y-0 | accel-y-1 | class |\n", "|-----------|-----------|-----------|-----------|--------|\n", @@ -494,18 +508,24 @@ "To handle the fine-tune process, we can design a new model, that is composed of the pre-trained backbone and the prediction head, and then train this new model with the labeled data. \n", "In order to facilitate this process, this framework provides the `SSLDiscriminator` class, that receives the backbone model and the prediction head, and then trains the classifier with the labeled data.\n", "\n", - "In summary, the `SSLDiscriminator` class is a `LightningModule` that generate the representations of the input data using the backbone model, that is, using the `forward` method of the backbone model, and then uses the prediction head to output the predictions. The predictions and labels are then used to compute the loss and train the model. \n", - "By default, the `SSLDiscriminator` is trained using the `Adam` optimizer with the `learning_rate` defined by the user (1e-3 by default).\n", + "In summary, the `SSLDiscriminator` class is a `LightningModule` that generate the representations of the input data using the backbone model, that is, using the `forward` method of the pre-trained backbone model, and then uses the prediction head to output the predictions, something like `y_hat = prediction_head(backbone(sample))`. \n", + "The predictions and labels are then used to compute the loss and train the model. \n", + "By default, the `SSLDiscriminator` is trained using the `Adam` optimizer with parametrizable `learning_rate`.\n", "\n", - "It worth to mention that the `SSLDiscriminator` class `forward` method receives the input data and the labels, and returns the predictions. This is different from the `forward` method of the self-supervised models, that receives only the input data and returns the latent representations of the input data.\n", + "It worth to mention that the `SSLDiscriminator` class `forward` method receives the input data and the labels, and returns the predictions. \n", + "This is different from the `forward` method of the self-supervised models, that receives only the input data and returns the latent representations of the input data.\n", "\n", "It worth to notice that the fine-tune train process can be done in two ways: \n", "\n", "1. Fine-tuning the whole model, that is, backbone (encoder) and classifier, with the labeled data; or \n", "2. Fine-tuning only the classifier, with the labeled data.\n", - "The `SSLDisriminator` class can handle both cases, with the `update_backbone` parameter. If `update_backbone` is `True`, the whole model is fine-tuned (case 1, above), otherwise, only the classifier is fine-tuned (case 2, above).\n", "\n", - "Let's create our prediction head and `SSLDisriminator` model and train it with the labeled data. Prediction heads for most popular tasks are already implemented in the `ssl_tools.models.ssl.modules.heads` module. In this notebook, we will use the `CPCPredictionHead` prediction head, that is a MLP with 3 hidden layers and dropout." + "The `SSLDisriminator` class can handle both cases, with the `update_backbone` parameter. \n", + "If `update_backbone` is `True`, the whole model is fine-tuned (case 1, above), otherwise, only the classifier is fine-tuned (case 2, above).\n", + "\n", + "Let's create our prediction head and `SSLDisriminator` model and train it with the labeled data. \n", + "Prediction heads for most popular tasks are already implemented in the `ssl_tools.models.ssl.modules.heads` module. \n", + "In this notebook, we will use the `CPCPredictionHead` prediction head, that is a MLP with 3 hidden layers and dropout." ] }, { @@ -556,14 +576,14 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "We will create the `SSLDisriminator` model. \n", - "The `SSLDisriminator` minimally requires:\n", + "Now we create the `SSLDisriminator` model. This class requires the following parameters:\n", "\n", "- `backbone`: the backbone model, that is, the pre-trained model;\n", "- `head`: the prediction head model;\n", "- `loss_fn`: the loss function to be used to train the model;\n", "\n", - "Also, we can attach metrics that will be calculated with for every batch of `validation` and `test` sets. The metrics is passed using the `metrics` parameter of the `SSLDisriminator` class, that receives a dictionary with the name of the metric as key and the `torchmetrics.Metric` as value.\n", + "Also, we can attach metrics that will be calculated with for every batch of `validation` and `test` sets. \n", + "The metrics is passed using the `metrics` parameter of the `SSLDisriminator` class, that receives a dictionary with the name of the metric as key and the `torchmetrics.Metric` as value.\n", "\n", "Let's create the `SSLDiscriminator` and attach the `Accuracy` metric to the model, to check the validation accuracy per epoch." ] @@ -667,7 +687,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "2cf48079d50145c7afc3309882108058", + "model_id": "b4a9bc62bb02433c814bb266ad167222", "version_major": 2, "version_minor": 0 }, @@ -690,7 +710,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "e2ae4d6c11b94de5995bb8a7ef1c495b", + "model_id": "5a315d4c45dc43baac89162331a17467", "version_major": 2, "version_minor": 0 }, @@ -704,7 +724,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "b5eb6ffc4ee4423a93c60c73a1edfbf1", + "model_id": "a43e85394ce34b1f987664c15e50fd30", "version_major": 2, "version_minor": 0 }, @@ -718,7 +738,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "c66686d98e9d4f6daf3e11ad6dbba580", + "model_id": "ae46f2bf6c324fe7b442a2f88f27a28e", "version_major": 2, "version_minor": 0 }, @@ -732,7 +752,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "f5ddf90cc6cb45c698b80b613d67ba43", + "model_id": "242bcdc871da408eaf38a5a56334a8b1", "version_major": 2, "version_minor": 0 }, @@ -746,7 +766,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "a2e606bf41644aa3ba73f1dd1af1f0ba", + "model_id": "1970a89b26fd4cf39c0ca393b9b6c027", "version_major": 2, "version_minor": 0 }, @@ -760,7 +780,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "5225a29cc4734fe2a51e0a672c017e44", + "model_id": "6bdbd86f03074702a869b386d177fb7d", "version_major": 2, "version_minor": 0 }, @@ -774,7 +794,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "1daec8bdfe244e98a0506f59cadce3e8", + "model_id": "d2167f1510bb4d64a54da935335ccc21", "version_major": 2, "version_minor": 0 }, @@ -788,7 +808,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "0bf6a6c07005402aa725b3711cfc1e8a", + "model_id": "1f3e63d7f34f4dd2b0dc31467dd464dd", "version_major": 2, "version_minor": 0 }, @@ -802,7 +822,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "6f8c6f1f5b7e42c5b4f7a94e3ab5c929", + "model_id": "69e4bfc1a324439a8ecb59e82b9e810f", "version_major": 2, "version_minor": 0 }, @@ -816,7 +836,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "e378c409ea68429b80f4e26858a472dd", + "model_id": "220928c965124f8fa5d5feeb1ec05f2d", "version_major": 2, "version_minor": 0 }, @@ -830,7 +850,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "e6535154a526421997abf0f6c82cda9f", + "model_id": "6e0696c0f6a144909060b0ab7d53b45a", "version_major": 2, "version_minor": 0 }, @@ -862,7 +882,7 @@ "metadata": {}, "source": [ "Let's evaluate the model using the test set. If we have added the `Accuracy` metric to the model, it will calculate the accuracy of the model on the test set.\n", - "All logged metrics will be returnet by `.test()` method, as a dictionary." + "All logged metrics will be returnet by `.test()` method, as a list of dictionaries." ] }, { @@ -881,7 +901,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "0075d6e46e5243eca4724a1a3ead2011", + "model_id": "96b0536249cb4290abbe1fb6367f6de6", "version_major": 2, "version_minor": 0 }, @@ -899,7 +919,7 @@ "┃ Test metric DataLoader 0 ┃\n", "┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩\n", "│ test_acc 0.5277777910232544 │\n", - "│ test_loss 1.5032936334609985 │\n", + "│ test_loss 1.4903016090393066 │\n", "└───────────────────────────┴───────────────────────────┘\n", "\n" ], @@ -908,7 +928,7 @@ "┃\u001b[1m \u001b[0m\u001b[1m Test metric \u001b[0m\u001b[1m \u001b[0m┃\u001b[1m \u001b[0m\u001b[1m DataLoader 0 \u001b[0m\u001b[1m \u001b[0m┃\n", "┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩\n", "│\u001b[36m \u001b[0m\u001b[36m test_acc \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 0.5277777910232544 \u001b[0m\u001b[35m \u001b[0m│\n", - "│\u001b[36m \u001b[0m\u001b[36m test_loss \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 1.5032936334609985 \u001b[0m\u001b[35m \u001b[0m│\n", + "│\u001b[36m \u001b[0m\u001b[36m test_loss \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 1.4903016090393066 \u001b[0m\u001b[35m \u001b[0m│\n", "└───────────────────────────┴───────────────────────────┘\n" ] }, @@ -918,7 +938,7 @@ { "data": { "text/plain": [ - "[{'test_loss': 1.5032936334609985, 'test_acc': 0.5277777910232544}]" + "[{'test_loss': 1.4903016090393066, 'test_acc': 0.5277777910232544}]" ] }, "execution_count": 8, @@ -957,7 +977,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "c344339416884a01b84c3bc5dace1252", + "model_id": "64924b90def24c998374ee4e9ebd8478", "version_major": 2, "version_minor": 0 }, diff --git a/notebooks/04_using_experiments.ipynb b/notebooks/04_using_experiments.ipynb index f16629a..4ece8ba 100644 --- a/notebooks/04_using_experiments.ipynb +++ b/notebooks/04_using_experiments.ipynb @@ -6,9 +6,12 @@ "source": [ "# 4. Using Experiments\n", "\n", - "Although the process of training and evaluating models becomes easier due to the abstractions and facilities provided by this framework and Pytorch Lightning, we also standarize the way we conduct experiments, in order to allow for a more systematic and organized approach to the development of models.\n", + "Although the process of training and evaluating models becomes easier due to the abstractions and facilities provided by this framework and Pytorch Lightning, we also aims to standarize the way we conduct experiments, in order to allow for a more systematic and organized approach to the development of models.\n", "\n", - "The `LightningExperiment` class aims to standartize the way we conduct experiments, including: default callacks and loggers, the directory structure for the logs and checkpoints, logging of hyperparameters, and the way we handle the training and evaluation of models and data modules.\n", + "The `LightningExperiment` implements a default pipeline (similar to ones used in previous notebooks) and provides a set of default configurations and settings for the experiments. \n", + "This includes the default configurations for the Lightning Trainer, Logger, and Callbacks, as well as model and data module configurations.\n", + "Also, it standardize the ouputs of the experiments, and the way we log the hyperparameters and results.\n", + "However, it also provides flexibility as the user can customize the experiment by overriding the default configurations and settings.\n", "\n", "In this notebook, we will demonstrate how to use the `LightningExperiment` class to conduct experiments in a systematic and organized way." ] @@ -20,55 +23,86 @@ "\n", "## Experiment Structure\n", "\n", - "The `LightningExperiment` follows the structure below. The first box is the name of the class, the second box is the name of the attributes and their type, and the third box is the methods of that class, the input parameters and return type. \n", - "The arrows represent the inheritance relationship between the classes. \n", - "Derived classes inherit the attributes and methods of their parent classes, that is, it have access to all the attributes and methods of the parent class. \n", - "Methods named in italic are abstract methods, that is, they must be implemented by the derived class. Some methods are not abstract, or it may already e implemented in some childs (overriden). \n", + "The `LightningExperiment` follows the structure below. Each rectangle (vertex of the graph) corresponds to a class, and the arrows (edges of the graph) correspond to the inheritance relationship between the classes. Inside each rectangle, there are three boxes. The first box is the name of the class, the second box is the name of the attributes and their type, and the third box is the methods of that class, the input parameters and return type. \n", "\n", + "As derived classes inherit the attributes and methods of their parent classes, they have access to all the attributes and methods of the parent class. \n", + "Methods named in italic are abstract methods, that is, they must be implemented in some of the derived classes (below him). \n", + "Some methods are not abstract, thus they have a default implementation, but can be overriden by the user to customize the experiment.\n", + "You may want to check some useful material, if you are not familiar with the concept of inheritance in object-oriented programming, such as [this one from Real Python](https://realpython.com/inheritance-composition-python/), that gives a comprehensive overview of inheritance in Python, or [this one from Geeks For Geeks](https://www.geeksforgeeks.org/inheritance-in-python/).\n", "\n", - "![Experiment Structure](experiment_classes.svg)\n", "\n", + "![Experiment Structure](experiment_classes.svg)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ "### The `Experiment` class\n", "\n", "The `Experiment` class is the base class for all experiments and includes the `experiment_dir` (where logs, checkpoints, and outputs are saved), the `name` and `run_id` (tipically, the time).\n", "The experiment directory is created when the experiment is instantiated, and the `experiment_dir` attribute is set to the path of the created directory.\n", "The experiment consist in 3 stages: `setup`, `run` and `teardown`.\n", - "You can use the `execute` method to run the experiment, that will call the `setup`, `run` and `teardown` methods in sequence.\n", + "The `run` method is an abstract method, and must be implemented in the derived classes.\n", "\n", + "You can use the `execute` method to run the experiment, that will call the `setup`, `run` and `teardown` methods in sequence." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ "### The `LightningExperiment` class\n", "\n", - "The `LightningExperiment` adds common parameters for train and test models using Pytorch Lightning. Usually this is the base class for any experiment that uses Pytorch Lightning.\n", - "This class also implements the `run` method, that execute a generic Pytorch Lightning pipeline, and calls the `get_callbacks`, `get_logger`, `get_data_module`, `get_model`, `get_trainer`, `load_checkpoint`, `run_model` and `log_hyperparameter` methods. \n", - "The pseudo-code for the `run` method is:\n", + "The `LightningExperiment` adds common parameters when using Pytorch Lightning for training or testing. \n", + "Usually this is the base class for any experiment that uses Pytorch Lightning.\n", + "This class also implements the `run` method, that execute a generic Pytorch Lightning pipeline, similar to ones used in previous notebooks. \n", + "This pipeline calls some methods that it defines.\n", + "In fact, the pseudo-code for pipeline implemented by the `run` method is:\n", "\n", "1. Get the model and data module using `get_model` and `get_data_module` methods.\n", - "2. If `self.load` is provided, load the checkpoint using the `load_checkpoint` method.\n", + "2. If `self.load` is provided (path to the checkpoint), load the checkpoint using the `load_checkpoint` method.\n", "3. Get the callbacks and logger using `get_callbacks` and `get_logger` methods.\n", - "4. Log the hyperparameters using the `log_hyperparameters` method.\n", - "5. Get the trainer using the `get_trainer` method.\n", + "4. Log the hyperparameters of experiment and model using the `log_hyperparameters` method.\n", + "5. Get the trainer using the `get_trainer` method and attach the logger and callbacks.\n", "6. Run the model using the `run_model` method.\n", "\n", - "The user can override these methods to customize the experiment. By default, `get_callbacks`, `get_logger`, `load_checkpoint`, and `log_hyperparameters` have default implementations, and `get_data_module`, `get_model`, `get_trainer`, and `run_model` are abstract methods that must be implemented by the derived class.\n", - "\n", - "\n", + "The user can override these methods to customize the experiment. By default, `get_callbacks`, `get_logger`, `load_checkpoint`, and `log_hyperparameters` have default implementations, and `get_data_module`, `get_model`, `get_trainer`, and `run_model` are abstract methods that must be implemented by the derived class." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ "### The `LightningTrain` and `LightningTest` classes\n", "\n", - "The `LightningTrain` and `LightningTest` classes are derived from `LightningExperiment` and are used to train and test models, respectively. These classes adds more specific parameters for training and testing models using Pytorch Lightning and implements specific `get_callbacks`, `get_trainer`, and `run_model` methods, that are specific for training and testing models, respectively. This standardizes the way we train and test models, logging the same information and using the same callbacks and loggers. Thus, it allows the user to focus on the model and data module, and not on the training and testing process, that is already standardized (and can be customized) and can be reused in different experiments.\n", - "In fact, `get_model` and `get_data_module` are abstract methods that must be implemented by the derived class, that varies according to the model and data module used in the experiment.\n", - "\n", + "The `LightningTrain` and `LightningTest` classes are derived from `LightningExperiment` and are used to train and test models, respectively. \n", + "These classes adds more specific parameters for different contexts, such as training and testing. \n", + "For instance, training usually requires the number of epochs, learning rate, and other parameters.\n", + "Both classes implement parent's `get_callbacks`, `get_trainer` and `run_model` methods. \n", "\n", - "### The `LightningSSLTrain` class\n", + "This standardizes the way we train and test models, logging the same information and using the same callbacks and loggers. \n", + "It allows the user to focus on the model and data module, and not on the training and testing process, that is already standardized (and can be customized) and can be reused in different experiments.\n", + "In fact, `get_model` and `get_data_module` are abstract methods that must be implemented by the derived class, as it varies according to the model and data module used in the experiment." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "### `LightningSSLTrain` class\n", "\n", - "The `LightningTrain` class allow to train arbitrary models. \n", - "The `LightningSSLTrain` class is a derived class that is used to train models using self-supervised learning. It adds 4 new methods: \n", + "While `LightningTrain` class allows training arbitrary models, the `LightningSSLTrain` class is a derived class used to train models using self-supervised learning. This class adds more 4 new abstract methods: \n", "\n", "* `get_pretrain_model` and `get_pretrain_data_module`: the user must return the model and data module used to pretrain the model.\n", "* `get_finetune_model` and `get_finetune_data_module`: the user must return the model and data module used to finetune the model.\n", "\n", - "The `training_mode` variable is used to indicate if the model is in pretrain or finetune mode. In fact, the `get_model` and `get_data_module` methods will call the `get_pretrain_model` and `get_pretrain_data_module` methods if `training_mode` is `pretrain`, and the `get_finetune_model` and `get_finetune_data_module` methods if `training_mode` is `finetune`. \n", + "The `training_mode` variable is also introduced and it is used to indicate if the model is in pretrain or finetune mode. \n", + "Then, the `get_model` and `get_data_module` methods will call the `get_pretrain_model` and `get_pretrain_data_module` methods if `training_mode` is `pretrain`, and the `get_finetune_model` and `get_finetune_data_module` methods if `training_mode` is `finetune`. \n", "\n", "One important thing to note is about `load` parameter. \n", - "If it is provided, the `load_checkpoint` method will load the checkpoint for the model, in order to resume the training. The `get_finetune_model` receives an additional parameter, the `load_backbone` parameter. After the backbone is loaded, the `load` parameter is used to resume the finetuning, that is, load the checkpoint for the finetune model (`SSLDiscriminator`)." + "If it is provided, it will load the checkpoint for the model, in order to resume the training. This is valid both for pretrain and finetune modes. \n", + "The `load_backbone` parameter is only used in finetune mode, in order to load the checkpoint for the backbone model, that is, load a model that was pretrained using self-supervised learning. This is usually used to start a finetuning from a model that was pretrained using self-supervised learning. If you want to resume a finetuning from a checkpoint, you should use the `load` parameter." ] }, { @@ -77,18 +111,24 @@ "source": [ "## Running CPC Experiment\n", "\n", - "In this notebook, we will demonstrate how to run a CPC experiment, from pretrain to finetune. The `CPCTrain` class derives from `LightningSSLTrain` and implements the `get_pretrain_model`, `get_pretrain_data_module`, `get_finetune_model` and `get_finetune_data_module` methods, while the `CPCTest` class derives from `LightningTest` and implements the `get_model` and `get_data_module` methods.\n", - "Both classes add specific parameters to create CPC model and instantiate the data module.\n", + "In this notebook, we will demonstrate how to run a CPC experiment, from pretrain to finetune and, finally, test. \n", + "The `CPCTrain` class derives from `LightningSSLTrain` and implements the `get_pretrain_model`, `get_pretrain_data_module`, `get_finetune_model` and `get_finetune_data_module` methods, while the `CPCTest` class derives from `LightningTest` and implements the `get_model` and `get_data_module` methods.\n", + "Both classes add specific parameters to instantiate CPC model and the respective data module.\n", "\n", - "Let's first start by pretraining the CPC model, using KuHAR dataset, as in previous notebooks.\n", - "\n", - "### Experiment of Pretraining CPC\n", + "Let's first start by pretraining the CPC model, using KuHAR dataset, as in previous notebooks." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "### Experiment: Pretraining CPC\n", "\n", "The `CPCTrain` class will encapsuate the default code for creating models and data modules from previous notebooks into the `get_pretrain_model` and `get_pretrain_data_module` methods. \n", - "Thus, we just need to pass the required parameters to the `CPCTrain` class and call the `execute` method to run the experiment.\n", - "As `CPCTrain` is a derived class, we can pass the parameters from all parent classes (`epochs`, `accelerator`, `batch_size`, *etc.*), as well as the parameters from the `CPCTrain` class (`window_size`, `num_classes`, *etc.*) in the class constructor.\n", + "Thus, we just need to pass the required parameters to the `CPCTrain` class constructor and call the `execute` method to run the experiment.\n", + "As `CPCTrain` is a derived class, we can pass the parameters from all parent classes (`epochs`, `accelerator`, `seed`, *etc.*), as well as the parameters from the `CPCTrain` class (`window_size`, `num_classes`, *etc.*) in the class constructor.\n", "\n", - "The `CPCTrain` includes parameters to create the model as well as the data module. These parameters include:\n", + "These main parameters include for `CPCTrain` class are:\n", "\n", "* `data`: the path to the dataset folder. For pretrain, the data must be the path to a dataset where the samples are the whole time-series of an user. For finetune, the data must be the path to a dataset where the samples are the windows of the time-series, as in previous notebooks.\n", "* `encoding_size`: the size of the latent representation of the CPC model.\n", @@ -98,25 +138,28 @@ "* `num_classes`: number of classes in the dataset.\n", "* `update_backbone`: boolean indicating if the backbone should be updated during finetuning (only useful for fine-tuning process).\n", "\n", - "Only the `data` parameter is required, the others have default values. Please check the documentation of the `CPCTrain` class for more details.\n", + "Only the `data` parameter is required, the others have default values. \n", + "Please check the documentation of the `CPCTrain` class for more details.\n", "\n", "Let's create the `CPCTrain` class and run the pretraining experiment." ] }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [] - }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "[1706883882.274114] [aae107fc745c:2257856:f] vfs_fuse.c:281 UCX ERROR inotify_add_watch(/tmp) failed: No space left on device\n" + ] + }, { "data": { "text/plain": [ - "LightningExperiment(experiment_dir=logs/pretrain/CPC/2024-02-01_23-52-39, model=CPC, run_id=2024-02-01_23-52-39, finished=False)" + "LightningExperiment(experiment_dir=logs/pretrain/CPC/2024-02-02_11-24-49, model=CPC, run_id=2024-02-02_11-24-49, finished=False)" ] }, "execution_count": 1, @@ -141,7 +184,6 @@ " num_classes=6,\n", " # Trainer params\n", " epochs=10,\n", - " num_workers=12,\n", " batch_size=1,\n", " accelerator=\"gpu\",\n", " devices=1,\n", @@ -159,7 +201,7 @@ "name": "stderr", "output_type": "stream", "text": [ - "/usr/local/lib/python3.10/dist-packages/lightning/fabric/loggers/csv_logs.py:198: Experiment logs directory logs/pretrain/CPC/2024-02-01_23-52-39 exists and is not empty. Previous log files in this directory will be deleted when the new ones are saved!\n", + "/usr/local/lib/python3.10/dist-packages/lightning/fabric/loggers/csv_logs.py:198: Experiment logs directory logs/pretrain/CPC/2024-02-02_11-24-49 exists and is not empty. Previous log files in this directory will be deleted when the new ones are saved!\n", "GPU available: True (cuda), used: True\n", "TPU available: False, using: 0 TPU cores\n", "IPU available: False, using: 0 IPUs\n", @@ -175,7 +217,7 @@ "Setting up experiment: CPC...\n", "Running experiment: CPC...\n", "Training will start\n", - "\tExperiment path: logs/pretrain/CPC/2024-02-01_23-52-39\n" + "\tExperiment path: logs/pretrain/CPC/2024-02-02_11-24-49\n" ] }, { @@ -234,7 +276,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "d3d79b4d1870487a9225f9f9afdc262c", + "model_id": "84a17deffcaa4fa992ad2c2e009290d2", "version_major": 2, "version_minor": 0 }, @@ -255,11 +297,11 @@ { "data": { "text/html": [ - "
--> Overall fit time: 19.928 seconds\n",
+       "
--> Overall fit time: 31.599 seconds\n",
        "
\n" ], "text/plain": [ - "--> Overall fit time: 19.928 seconds\n" + "--> Overall fit time: 31.599 seconds\n" ] }, "metadata": {}, @@ -293,7 +335,7 @@ "output_type": "stream", "text": [ "Training finished\n", - "Last checkpoint saved at: logs/pretrain/CPC/2024-02-01_23-52-39/checkpoints/last.ckpt\n", + "Last checkpoint saved at: logs/pretrain/CPC/2024-02-02_11-24-49/checkpoints/last.ckpt\n", "Teardown experiment: CPC...\n" ] } @@ -307,13 +349,13 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "Once the experiment finished, we may have a directory structure like this:\n", + "Once the experiment finished, we may have a directory structure similar to this:\n", "\n", "```\n", "logs/\n", " pretrain/\n", " CPC/\n", - " 2024-02-01_22-01-31/\n", + " 2024-02-02_10-53-31/\n", " checkpoints/\n", " epoch=9-step=570.ckpt\n", " last.ckpt\n", @@ -321,16 +363,14 @@ " metrics.csv\n", "```\n", "\n", - "This is the default directory structure for experiments, where the experiment directory is `logs/pretrain/CPC/2024-02-01_22-01-31/`. The `checkpoints directory` contains the saved checkpoints and inside it we may have a `last.ckpt` file which is the last checkpoint saved.\n", - "The `hparams.yaml` file contains the hyperparameters, and the `metrics.csv` file contains the metrics logged during training.\n", - "\n", + "This is the default directory structure for experiments. The experiment directory is `logs/pretrain/CPC/2024-02-01_22-01-31/`, and can be accessed using the `cpc_experiment.experiment_dir` attribute. The `checkpoints directory` contains the saved checkpoints and inside it we may have a `last.ckpt` file which is the last checkpoint saved. It can be accessed using the `cpc_experiment.checkpoint_dir` attribute. The `hparams.yaml` file contains the hyperparameters, and the `metrics.csv` file contains the metrics logged during training.\n", "\n", "We can obtain the experiment's model, data module, logger, checkpoint directory, callbacks, trianer, and hyperparameters using the `cpc_experiment.model`, `cpc_experiment.data_module`, `cpc_experiment.logger`, `cpc_experiment.checkpoint_dir`, `cpc_experiment.callbacks`, `cpc_experiment.trainer`, and `cpc_experiment.hyperparameters` attributes, respectively. \n", "These objects are cached in the `cpc_experiment` object, thus, it is instantiated only once, and can be accessed multiple times.\n", "Also, the `cpc_experiment.finished` attribute is a boolean indicating if the experiment has finished sucessfuly or not.\n", "\n", - "We will need this checkpoint to load the weights of the backbone for the finetuning process.\n", - "Let's obtain the checkpoint file and the experiment's model and data module, and then run the finetuning experiment." + "For fine-tunning, we will need this checkpoint to load the weights of the backbone.\n", + "Let's obtain the checkpoint file and then run the finetuning experiment." ] }, { @@ -341,7 +381,7 @@ { "data": { "text/plain": [ - "PosixPath('logs/pretrain/CPC/2024-02-01_23-52-39/checkpoints/last.ckpt')" + "PosixPath('logs/pretrain/CPC/2024-02-02_11-24-49/checkpoints/last.ckpt')" ] }, "execution_count": 3, @@ -358,16 +398,18 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "### Experiment of Fine-tune CPC\n", + "### Experiment: Fine-tunning CPC\n", "\n", "The `CPCTrain` class also encapsuate the default code for creating models and data modules from previous notebooks into the `get_finetune_model` and `get_finetune_data_module` methods. \n", "The behaviour of these methods is similar to the `get_pretrain_model` and `get_pretrain_data_module` methods, but they are used to create the model and data module for the finetuning process.\n", "In fact, the `get_finetune_model` will encapsulate the CPC code inside `SSLDisriminator` class, as seen in previous notebooks.\n", "\n", - "As we use the same class for pretrain and finetune, we just need to set the `training_mode` attribute to `finetune` and set the `load_backbone` parameter to the checkpoint file obtained in the pretrain process. \n", + "As we use the same class for pretrain and finetune, we just need to set the `training_mode` attribute to `finetune` and set the `load_backbone` parameter to the checkpoint file obtained in the pretrain process, in order to load the weights of the backbone model. \n", "Then, we can call the `execute` method to run the experiment.\n", "\n", - "However, it worth to notice that fine tune is an supervised learning process and uses windowed time-series as input. Thus, the `data` parameter must be the path to a dataset where the samples are the windows of the time-series, as in previous notebooks. In our case, we will use the standardized balanced view of the KuHar dataset." + "However, it worth to notice that fine tune is an supervised learning process and uses windowed time-series as input. \n", + "Thus, the `data` parameter must be the path to a dataset where the samples are the windows of the time-series, as in previous notebooks. \n", + "In our case, we will use the standardized balanced view of the KuHar dataset." ] }, { @@ -378,7 +420,7 @@ { "data": { "text/plain": [ - "LightningExperiment(experiment_dir=logs/finetune/CPC/2024-02-02_00-03-12, model=CPC, run_id=2024-02-02_00-03-12, finished=False)" + "LightningExperiment(experiment_dir=logs/finetune/CPC/2024-02-02_11-31-12, model=CPC, run_id=2024-02-02_11-31-12, finished=False)" ] }, "execution_count": 4, @@ -420,7 +462,7 @@ "name": "stderr", "output_type": "stream", "text": [ - "/usr/local/lib/python3.10/dist-packages/lightning/fabric/loggers/csv_logs.py:198: Experiment logs directory logs/finetune/CPC/2024-02-02_00-03-12 exists and is not empty. Previous log files in this directory will be deleted when the new ones are saved!\n", + "/usr/local/lib/python3.10/dist-packages/lightning/fabric/loggers/csv_logs.py:198: Experiment logs directory logs/finetune/CPC/2024-02-02_11-31-12 exists and is not empty. Previous log files in this directory will be deleted when the new ones are saved!\n", "GPU available: True (cuda), used: True\n", "TPU available: False, using: 0 TPU cores\n", "IPU available: False, using: 0 IPUs\n", @@ -435,10 +477,10 @@ "text": [ "Setting up experiment: CPC...\n", "Running experiment: CPC...\n", - "Loading model from: logs/pretrain/CPC/2024-02-01_23-52-39/checkpoints/last.ckpt...\n", + "Loading model from: logs/pretrain/CPC/2024-02-02_11-24-49/checkpoints/last.ckpt...\n", "Model loaded successfully\n", "Training will start\n", - "\tExperiment path: logs/finetune/CPC/2024-02-02_00-03-12\n" + "\tExperiment path: logs/finetune/CPC/2024-02-02_11-31-12\n" ] }, { @@ -495,7 +537,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "f478640308374e5894df3302aa127e92", + "model_id": "92081a68a29343e18b11f8ae49a8f37c", "version_major": 2, "version_minor": 0 }, @@ -518,6 +560,58 @@ }, "metadata": {}, "output_type": "display_data" + }, + { + "name": "stderr", + "output_type": "stream", + "text": [ + "`Trainer.fit` stopped: `max_epochs=10` reached.\n" + ] + }, + { + "data": { + "text/html": [ + "
--> Overall fit time: 12.987 seconds\n",
+       "
\n" + ], + "text/plain": [ + "--> Overall fit time: 12.987 seconds\n" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "
\n"
+      ],
+      "text/plain": []
+     },
+     "metadata": {},
+     "output_type": "display_data"
+    },
+    {
+     "data": {
+      "text/html": [
+       "
\n",
+       "
\n" + ], + "text/plain": [ + "\n" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "name": "stdout", + "output_type": "stream", + "text": [ + "Training finished\n", + "Last checkpoint saved at: logs/finetune/CPC/2024-02-02_11-31-12/checkpoints/last.ckpt\n", + "Teardown experiment: CPC...\n" + ] } ], "source": [ @@ -534,13 +628,13 @@ }, { "cell_type": "code", - "execution_count": null, + "execution_count": 6, "metadata": {}, "outputs": [ { "data": { "text/plain": [ - "PosixPath('logs/finetune/CPC/2024-02-01_22-38-32/checkpoints/last.ckpt')" + "PosixPath('logs/finetune/CPC/2024-02-02_11-31-12/checkpoints/last.ckpt')" ] }, "execution_count": 6, @@ -557,26 +651,27 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "### CPC performance evaluation experiment\n", + "## Experiment: Evaluating CPC Performance\n", "\n", - "Finally, we can evaluate the performance of the CPC model using the `CPCTest` class. This class inherits from `LightningTest` and encapsulate the default code for creating models and data modules from previous notebooks into the `get_model` and `get_data_module` methods.\n", + "Finally, we can evaluate the performance of the CPC model using the `CPCTest` class. \n", + "This class inherits from `LightningTest` and encapsulate the default code for creating models and data modules from previous notebooks into the `get_model` and `get_data_module` methods.\n", "\n", - "The signature of the `CPCTest` class is very similar to the `CPCTrain` class. Also, we will use the same data module used in the finetuning process. However, differently from the train process the test process uses the `.test` method in the trainer and not the `.fit` method.\n", - "Also, the `load` parameter is used to load the checkpoint obtained in the finetuning process (that load the weights from `SSLDiscriminator`, backbone and prediction haad).\n", + "The signature of the `CPCTest` class is very similar to the `CPCTrain` class. However, differently from the train process the test process uses the `.test()` method in the Trainer and not the `.fit()` method.\n", + "Also, the `load` parameter is used to load the checkpoint obtained in the finetuning process (that load the weights from `SSLDiscriminator`, backbone and prediction head).\n", "\n", "Let's create experiments to test the CPC model, using the test set from different datasets besides KuHAR." ] }, { "cell_type": "code", - "execution_count": null, + "execution_count": 7, "metadata": {}, "outputs": [ { "name": "stderr", "output_type": "stream", "text": [ - "/usr/local/lib/python3.10/dist-packages/lightning/fabric/loggers/csv_logs.py:198: Experiment logs directory logs/test/CPC/2024-02-01_23-01-24 exists and is not empty. Previous log files in this directory will be deleted when the new ones are saved!\n", + "/usr/local/lib/python3.10/dist-packages/lightning/fabric/loggers/csv_logs.py:198: Experiment logs directory logs/test/CPC/2024-02-02_11-32-55 exists and is not empty. Previous log files in this directory will be deleted when the new ones are saved!\n", "GPU available: True (cuda), used: True\n", "TPU available: False, using: 0 TPU cores\n", "IPU available: False, using: 0 IPUs\n", @@ -590,17 +685,17 @@ "output_type": "stream", "text": [ "Dataset at: /workspaces/hiaac-m4/ssl_tools/data/standartized_balanced/KuHar\n", - "Loading model from logs/finetune/CPC/2024-02-01_22-38-32/checkpoints/last.ckpt and executing test using dataset at KuHar...\n", + "Loading model from logs/finetune/CPC/2024-02-02_11-31-12/checkpoints/last.ckpt and executing test using dataset at KuHar...\n", "Setting up experiment: CPC...\n", "Running experiment: CPC...\n", - "Loading model from: logs/finetune/CPC/2024-02-01_22-38-32/checkpoints/last.ckpt...\n", + "Loading model from: logs/finetune/CPC/2024-02-02_11-31-12/checkpoints/last.ckpt...\n", "Model loaded successfully\n" ] }, { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "f580fe5d3fa9441a846ad0962953d3c3", + "model_id": "d2398c86ef214c4d96783616d573caf7", "version_major": 2, "version_minor": 0 }, @@ -617,8 +712,8 @@ "
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓\n",
        "┃        Test metric               DataLoader 0        ┃\n",
        "┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩\n",
-       "│         test_acc              0.4583333432674408     │\n",
-       "│         test_loss              1.576676845550537     │\n",
+       "│         test_acc              0.4652777910232544     │\n",
+       "│         test_loss             1.5481091737747192     │\n",
        "└───────────────────────────┴───────────────────────────┘\n",
        "
\n" ], @@ -626,8 +721,8 @@ "┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓\n", "┃\u001b[1m \u001b[0m\u001b[1m Test metric \u001b[0m\u001b[1m \u001b[0m┃\u001b[1m \u001b[0m\u001b[1m DataLoader 0 \u001b[0m\u001b[1m \u001b[0m┃\n", "┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩\n", - "│\u001b[36m \u001b[0m\u001b[36m test_acc \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 0.4583333432674408 \u001b[0m\u001b[35m \u001b[0m│\n", - "│\u001b[36m \u001b[0m\u001b[36m test_loss \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 1.576676845550537 \u001b[0m\u001b[35m \u001b[0m│\n", + "│\u001b[36m \u001b[0m\u001b[36m test_acc \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 0.4652777910232544 \u001b[0m\u001b[35m \u001b[0m│\n", + "│\u001b[36m \u001b[0m\u001b[36m test_loss \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 1.5481091737747192 \u001b[0m\u001b[35m \u001b[0m│\n", "└───────────────────────────┴───────────────────────────┘\n" ] }, @@ -661,13 +756,12 @@ "name": "stderr", "output_type": "stream", "text": [ - "/usr/local/lib/python3.10/dist-packages/lightning/fabric/loggers/csv_logs.py:198: Experiment logs directory logs/test/CPC/2024-02-01_23-01-28 exists and is not empty. Previous log files in this directory will be deleted when the new ones are saved!\n", + "/usr/local/lib/python3.10/dist-packages/lightning/fabric/loggers/csv_logs.py:198: Experiment logs directory logs/test/CPC/2024-02-02_11-32-57 exists and is not empty. Previous log files in this directory will be deleted when the new ones are saved!\n", "GPU available: True (cuda), used: True\n", "TPU available: False, using: 0 TPU cores\n", "IPU available: False, using: 0 IPUs\n", "HPU available: False, using: 0 HPUs\n", - "`Trainer(limit_test_batches=1.0)` was configured so 100% of the batches will be used..\n", - "LOCAL_RANK: 0 - CUDA_VISIBLE_DEVICES: [0,1]\n" + "`Trainer(limit_test_batches=1.0)` was configured so 100% of the batches will be used..\n" ] }, { @@ -675,19 +769,26 @@ "output_type": "stream", "text": [ "Teardown experiment: CPC...\n", - "Test on dataset KuHar finished !\n", + "Test on dataset KuHar finished!\n", "Dataset at: /workspaces/hiaac-m4/ssl_tools/data/standartized_balanced/MotionSense\n", - "Loading model from logs/finetune/CPC/2024-02-01_22-38-32/checkpoints/last.ckpt and executing test using dataset at MotionSense...\n", + "Loading model from logs/finetune/CPC/2024-02-02_11-31-12/checkpoints/last.ckpt and executing test using dataset at MotionSense...\n", "Setting up experiment: CPC...\n", "Running experiment: CPC...\n", - "Loading model from: logs/finetune/CPC/2024-02-01_22-38-32/checkpoints/last.ckpt...\n", + "Loading model from: logs/finetune/CPC/2024-02-02_11-31-12/checkpoints/last.ckpt...\n", "Model loaded successfully\n" ] }, + { + "name": "stderr", + "output_type": "stream", + "text": [ + "LOCAL_RANK: 0 - CUDA_VISIBLE_DEVICES: [0,1]\n" + ] + }, { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "483322b4b438455c8455b6417a625b5e", + "model_id": "813175344f71488aaf2c634d00c5f080", "version_major": 2, "version_minor": 0 }, @@ -704,8 +805,8 @@ "
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓\n",
        "┃        Test metric               DataLoader 0        ┃\n",
        "┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩\n",
-       "│         test_acc              0.3860640227794647     │\n",
-       "│         test_loss             1.6338088512420654     │\n",
+       "│         test_acc              0.43879473209381104    │\n",
+       "│         test_loss             1.5955957174301147     │\n",
        "└───────────────────────────┴───────────────────────────┘\n",
        "
\n" ], @@ -713,8 +814,8 @@ "┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓\n", "┃\u001b[1m \u001b[0m\u001b[1m Test metric \u001b[0m\u001b[1m \u001b[0m┃\u001b[1m \u001b[0m\u001b[1m DataLoader 0 \u001b[0m\u001b[1m \u001b[0m┃\n", "┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩\n", - "│\u001b[36m \u001b[0m\u001b[36m test_acc \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 0.3860640227794647 \u001b[0m\u001b[35m \u001b[0m│\n", - "│\u001b[36m \u001b[0m\u001b[36m test_loss \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 1.6338088512420654 \u001b[0m\u001b[35m \u001b[0m│\n", + "│\u001b[36m \u001b[0m\u001b[36m test_acc \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 0.43879473209381104 \u001b[0m\u001b[35m \u001b[0m│\n", + "│\u001b[36m \u001b[0m\u001b[36m test_loss \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 1.5955957174301147 \u001b[0m\u001b[35m \u001b[0m│\n", "└───────────────────────────┴───────────────────────────┘\n" ] }, @@ -748,7 +849,7 @@ "name": "stderr", "output_type": "stream", "text": [ - "/usr/local/lib/python3.10/dist-packages/lightning/fabric/loggers/csv_logs.py:198: Experiment logs directory logs/test/CPC/2024-02-01_23-01-39 exists and is not empty. Previous log files in this directory will be deleted when the new ones are saved!\n", + "/usr/local/lib/python3.10/dist-packages/lightning/fabric/loggers/csv_logs.py:198: Experiment logs directory logs/test/CPC/2024-02-02_11-32-59 exists and is not empty. Previous log files in this directory will be deleted when the new ones are saved!\n", "GPU available: True (cuda), used: True\n", "TPU available: False, using: 0 TPU cores\n", "IPU available: False, using: 0 IPUs\n", @@ -761,12 +862,105 @@ "output_type": "stream", "text": [ "Teardown experiment: CPC...\n", - "Test on dataset MotionSense finished !\n", + "Test on dataset MotionSense finished!\n", "Dataset at: /workspaces/hiaac-m4/ssl_tools/data/standartized_balanced/RealWorld_thigh\n", - "Loading model from logs/finetune/CPC/2024-02-01_22-38-32/checkpoints/last.ckpt and executing test using dataset at RealWorld_thigh...\n", + "Loading model from logs/finetune/CPC/2024-02-02_11-31-12/checkpoints/last.ckpt and executing test using dataset at RealWorld_thigh...\n", + "Setting up experiment: CPC...\n", + "Running experiment: CPC...\n", + "Loading model from: logs/finetune/CPC/2024-02-02_11-31-12/checkpoints/last.ckpt...\n", + "Model loaded successfully\n" + ] + }, + { + "name": "stderr", + "output_type": "stream", + "text": [ + "LOCAL_RANK: 0 - CUDA_VISIBLE_DEVICES: [0,1]\n" + ] + }, + { + "data": { + "application/vnd.jupyter.widget-view+json": { + "model_id": "612ec0d361884c7399217888425f69d8", + "version_major": 2, + "version_minor": 0 + }, + "text/plain": [ + "Output()" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓\n",
+       "┃        Test metric               DataLoader 0        ┃\n",
+       "┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩\n",
+       "│         test_acc              0.40372908115386963    │\n",
+       "│         test_loss              1.643248438835144     │\n",
+       "└───────────────────────────┴───────────────────────────┘\n",
+       "
\n" + ], + "text/plain": [ + "┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓\n", + "┃\u001b[1m \u001b[0m\u001b[1m Test metric \u001b[0m\u001b[1m \u001b[0m┃\u001b[1m \u001b[0m\u001b[1m DataLoader 0 \u001b[0m\u001b[1m \u001b[0m┃\n", + "┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩\n", + "│\u001b[36m \u001b[0m\u001b[36m test_acc \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 0.40372908115386963 \u001b[0m\u001b[35m \u001b[0m│\n", + "│\u001b[36m \u001b[0m\u001b[36m test_loss \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 1.643248438835144 \u001b[0m\u001b[35m \u001b[0m│\n", + "└───────────────────────────┴───────────────────────────┘\n" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "
\n"
+      ],
+      "text/plain": []
+     },
+     "metadata": {},
+     "output_type": "display_data"
+    },
+    {
+     "data": {
+      "text/html": [
+       "
\n",
+       "
\n" + ], + "text/plain": [ + "\n" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "name": "stderr", + "output_type": "stream", + "text": [ + "/usr/local/lib/python3.10/dist-packages/lightning/fabric/loggers/csv_logs.py:198: Experiment logs directory logs/test/CPC/2024-02-02_11-33-01 exists and is not empty. Previous log files in this directory will be deleted when the new ones are saved!\n", + "GPU available: True (cuda), used: True\n", + "TPU available: False, using: 0 TPU cores\n", + "IPU available: False, using: 0 IPUs\n", + "HPU available: False, using: 0 HPUs\n", + "`Trainer(limit_test_batches=1.0)` was configured so 100% of the batches will be used..\n" + ] + }, + { + "name": "stdout", + "output_type": "stream", + "text": [ + "Teardown experiment: CPC...\n", + "Test on dataset RealWorld_thigh finished!\n", + "Dataset at: /workspaces/hiaac-m4/ssl_tools/data/standartized_balanced/RealWorld_waist\n", + "Loading model from logs/finetune/CPC/2024-02-02_11-31-12/checkpoints/last.ckpt and executing test using dataset at RealWorld_waist...\n", "Setting up experiment: CPC...\n", "Running experiment: CPC...\n", - "Loading model from: logs/finetune/CPC/2024-02-01_22-38-32/checkpoints/last.ckpt...\n", + "Loading model from: logs/finetune/CPC/2024-02-02_11-31-12/checkpoints/last.ckpt...\n", "Model loaded successfully\n" ] }, @@ -780,7 +974,7 @@ { "data": { "application/vnd.jupyter.widget-view+json": { - "model_id": "bc9753a4e8d549daad98bc8a70185c97", + "model_id": "1d507d05b32d4f5e9563d6393d53b022", "version_major": 2, "version_minor": 0 }, @@ -790,6 +984,147 @@ }, "metadata": {}, "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓\n",
+       "┃        Test metric               DataLoader 0        ┃\n",
+       "┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩\n",
+       "│         test_acc              0.4031635820865631     │\n",
+       "│         test_loss             1.6421446800231934     │\n",
+       "└───────────────────────────┴───────────────────────────┘\n",
+       "
\n" + ], + "text/plain": [ + "┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓\n", + "┃\u001b[1m \u001b[0m\u001b[1m Test metric \u001b[0m\u001b[1m \u001b[0m┃\u001b[1m \u001b[0m\u001b[1m DataLoader 0 \u001b[0m\u001b[1m \u001b[0m┃\n", + "┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩\n", + "│\u001b[36m \u001b[0m\u001b[36m test_acc \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 0.4031635820865631 \u001b[0m\u001b[35m \u001b[0m│\n", + "│\u001b[36m \u001b[0m\u001b[36m test_loss \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 1.6421446800231934 \u001b[0m\u001b[35m \u001b[0m│\n", + "└───────────────────────────┴───────────────────────────┘\n" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "
\n"
+      ],
+      "text/plain": []
+     },
+     "metadata": {},
+     "output_type": "display_data"
+    },
+    {
+     "data": {
+      "text/html": [
+       "
\n",
+       "
\n" + ], + "text/plain": [ + "\n" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "name": "stderr", + "output_type": "stream", + "text": [ + "/usr/local/lib/python3.10/dist-packages/lightning/fabric/loggers/csv_logs.py:198: Experiment logs directory logs/test/CPC/2024-02-02_11-33-03 exists and is not empty. Previous log files in this directory will be deleted when the new ones are saved!\n", + "GPU available: True (cuda), used: True\n", + "TPU available: False, using: 0 TPU cores\n", + "IPU available: False, using: 0 IPUs\n", + "HPU available: False, using: 0 HPUs\n", + "`Trainer(limit_test_batches=1.0)` was configured so 100% of the batches will be used..\n", + "LOCAL_RANK: 0 - CUDA_VISIBLE_DEVICES: [0,1]\n" + ] + }, + { + "name": "stdout", + "output_type": "stream", + "text": [ + "Teardown experiment: CPC...\n", + "Test on dataset RealWorld_waist finished!\n", + "Dataset at: /workspaces/hiaac-m4/ssl_tools/data/standartized_balanced/UCI\n", + "Loading model from logs/finetune/CPC/2024-02-02_11-31-12/checkpoints/last.ckpt and executing test using dataset at UCI...\n", + "Setting up experiment: CPC...\n", + "Running experiment: CPC...\n", + "Loading model from: logs/finetune/CPC/2024-02-02_11-31-12/checkpoints/last.ckpt...\n", + "Model loaded successfully\n" + ] + }, + { + "data": { + "application/vnd.jupyter.widget-view+json": { + "model_id": "a48db052fd0141dea8947f2ffebf7fbf", + "version_major": 2, + "version_minor": 0 + }, + "text/plain": [ + "Output()" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓\n",
+       "┃        Test metric               DataLoader 0        ┃\n",
+       "┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩\n",
+       "│         test_acc              0.35652172565460205    │\n",
+       "│         test_loss             1.6707664728164673     │\n",
+       "└───────────────────────────┴───────────────────────────┘\n",
+       "
\n" + ], + "text/plain": [ + "┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓\n", + "┃\u001b[1m \u001b[0m\u001b[1m Test metric \u001b[0m\u001b[1m \u001b[0m┃\u001b[1m \u001b[0m\u001b[1m DataLoader 0 \u001b[0m\u001b[1m \u001b[0m┃\n", + "┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩\n", + "│\u001b[36m \u001b[0m\u001b[36m test_acc \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 0.35652172565460205 \u001b[0m\u001b[35m \u001b[0m│\n", + "│\u001b[36m \u001b[0m\u001b[36m test_loss \u001b[0m\u001b[36m \u001b[0m│\u001b[35m \u001b[0m\u001b[35m 1.6707664728164673 \u001b[0m\u001b[35m \u001b[0m│\n", + "└───────────────────────────┴───────────────────────────┘\n" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "
\n"
+      ],
+      "text/plain": []
+     },
+     "metadata": {},
+     "output_type": "display_data"
+    },
+    {
+     "data": {
+      "text/html": [
+       "
\n",
+       "
\n" + ], + "text/plain": [ + "\n" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "name": "stdout", + "output_type": "stream", + "text": [ + "Teardown experiment: CPC...\n", + "Test on dataset UCI finished!\n" + ] } ], "source": [ @@ -803,8 +1138,7 @@ " \"MotionSense\",\n", " \"RealWorld_thigh\",\n", " \"RealWorld_waist\",\n", - " \"UCI\"\n", - " \"WISDM\"\n", + " \"UCI\",\n", "]\n", "\n", "results = dict()\n", @@ -822,6 +1156,7 @@ " in_channel=6,\n", " num_classes=6,\n", " # Trainer params\n", + " batch_size=256,\n", " accelerator=\"gpu\",\n", " devices=1,\n", " )\n", @@ -830,6 +1165,34 @@ " print(f\"Test on dataset {dataset} finished!\")" ] }, + { + "cell_type": "code", + "execution_count": 9, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "{'KuHar': [{'test_loss': 1.5481091737747192, 'test_acc': 0.4652777910232544}],\n", + " 'MotionSense': [{'test_loss': 1.5955957174301147,\n", + " 'test_acc': 0.43879473209381104}],\n", + " 'RealWorld_thigh': [{'test_loss': 1.643248438835144,\n", + " 'test_acc': 0.40372908115386963}],\n", + " 'RealWorld_waist': [{'test_loss': 1.6421446800231934,\n", + " 'test_acc': 0.4031635820865631}],\n", + " 'UCI': [{'test_loss': 1.6707664728164673, 'test_acc': 0.35652172565460205}]}" + ] + }, + "execution_count": 9, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "# We can acess the results\n", + "results" + ] + }, { "cell_type": "markdown", "metadata": {}, diff --git a/notebooks/experiment_classes.pdf b/notebooks/experiment_classes.pdf deleted file mode 100644 index 73a79b2..0000000 Binary files a/notebooks/experiment_classes.pdf and /dev/null differ