This is the repository for the Barchelor's Degree Thesis/Project carried out during the 2020/21 Winter Semester by Joaquim Gómez Sanchez.
The thesis/project is available in: UPC's Repository.
This repository contains all the code developed for the thesis, as well as the raw results obtained and the trained and used models. Following, the code not implemented by the author and the pretrained models used are referenced.
Code:
- Code for training GloVe. Obtained from the official repository, mantained by model's authors.
- Code for computing Normalized Relative Compression distance. Provided by the thesis' director, who got it from Armando J. Pinho.
Models:
- ELMo model. The used model is the one available in TensorFlow Hub.
- BERT model. The used model (12/768 or BERT-base) is the one provided by Google in the official BERT's repository.
Regarding the data, it has been decided not to publish anything in order to avoid legal problems. The data used for the experiments are listed in the thesis and can be obtained from the UPC's (Universitat Politècnica de Catalunya) repository or from other papers' repositories. The data used for training the models has been completely collected from UPC's repository.
In case you are interested in knowing about the training data, the preprocessed files or the experiments files elaborated, send me an e-mail.