Protein Mutant generator : a platform for generating advantageous point mutants based on explicit modelling of evolutionary history.
ProMutor is a web-based tool that predicts the effect of suggested mutations based on explicit modelling of the evolutionary history of natural sequences. Given an input sequence, it either generates a complete landscape of protein mutations or predicts an epistatic effect of mutations of interest. It also incorporates mutant sequence generation step as well as provides direct access to ColabFold notebook for predicting protein structure.
Depending on the selected mode, steps of the pipeline vary. However, both start with the generation of a multiple sequence alignment (MSA).
Screening mode:
- MSA generation.
- Mutation effect landsccape generation (using GEMME tool).
Mutant generation mode:
- MSA generation.
- Generation of all possible mutants containing desired mutation sites.
- Prediction of their effect on protein structure based on evolutionary conservation (GEMME).
As an additional suggested step to procceed with, we propose using AlphaFold 2 or mutant protein structure prediction. We provide direct access to its notebook version ColabFold.
There are several options of running promutor remotely either using ProMutor webpage (does not require any installation) or via downloading promutor.html
web-form to submit request to the remote compute from a local device.
ProMutor is written in python. In addition, it utilises singularity
for running hhblits
.
To run ProMutor locally the following tools are needed:
python
v3.9 or highersingularity
To use ProMutor, you have to download the repository and pull required containers from the DockerHub as well as install required python packages:
git clone https://gitlab.igem.org/2022/software-tools/stockholm.git
cd stockholm
singularity pull docker://elodielaine/gemme:gemme
singularity pull docker://soedinglab/hh-suite:latest
pip3 install -r requirements.txt
For constructing an MSA, hhlits
tool utilises protein sequence databases. Here, we provide commands to install Pfam-A database, however other databases could be easily installed by providing an appropriate link.
mkdir ./uniclust
cd uniclust/
wget https://wwwuser.gwdg.de/~compbiol/data/hhsuite/databases/hhsuite_dbs/pfamA_35.0.tar.gz
tar xvfz pfamA_35.0.tar.gz
List of available databases
In addition, one can build its own database. Please check hh-suite
user guide or more information.
- Go to ProMutor webpage
- Fill in the form acccording to tutorials provided a. Screening mode b. Mutant Generation mode
- Download
promutor.html
file - Open file in any browser
- Fill in the form acccording to tutorials provided a. Screening mode b. Mutant Generation mode
- Clone the repository
cd stockholm
- Run tool locally in Mutant generation mode:
python3 run_container.py protein example/P27352.fasta 1,2,3 Blosum62 True pfama
- Run tool locally in Screening mode:
python3 run_container.py protein example/P27352.fasta 0 Blosum62 True pfama
A command to run ProMutor can be used as follows:
python3 run_container.py <type_inp> <fasta> <pos> <matrix> <preserve> <database> --mand <string> --num <string>
Arguments | Description | Valid value |
---|---|---|
type_inp string | Type of input. | nucleotide or protein |
fasta string | Input sequence. | -- |
pos string | Comma separated list of positions to be mutated. | -- |
matrix string | Substitution matrix to be used. | Blosum45, Blosum50, Blosum62, Blosum80, Blosum90, Pam30, Pam90 or Pam250 |
preserve string | Whether the closest (True) or the furthest (False) amino acid substituent is used. | True or False |
database string | Database to be used for creating multiple sequence alignment (MSA). | uniclust, pdb70, scop70, pfama |
‑‑mand string | Comma separated list of positions ALWAYS to be mutated. | -- |
‑‑num string | Maximal number of mutations per sequence. | -- |
Example:
Example input fasta
file is provided in example/
directory. This simple command will run ProMutor in the Mutant generation mode.
python3 run_container.py protein example/P27352.fasta 1,2,3 Blosum62 True pfama
We are open to any contributions and collaborations. When contributing to this repository, please first discuss the change you wish to make via issue, email, or any other method with the owners of this repository before making a change.
We would like to express our greatest gratitude to our principal technical supervisor Frédéric Mauron who has been of the greatest help on the technical parts of the software creation. His availability, knowledge and faith in our project allowed us to bring it to life. More specifically, Mr Mauron guided us throughout the webform-server connection steps. Last but not least, we are remarkably thankful for the server he provided us with for free.
Raphaël Mauron & Mariia Minaeva, iGEM Stockholm Team 2022