Antibody Modelling Benchmark

This repository contains the raw, intermediate and output data involved in the comparison of three tools of protein structure modelling from plain amino acid sequence. The evaluated tools are:

Each folder in this repository is named after an antibody PDB entry whose crystal structure was used as the baseline in the calculation of RMSD values.

Overall Procedure

Antibody amino acid sequences were obtained from IMGT.
Antibody crystal structures were obtained from RCSB.
The three tools were used to generate 3D structures for each antibody sequence.
ProFit was used to calculate RMSD of the modelled structure and the crystal structure.
GraphPad was used for statistical analysis.

RoseTTAFold

Input: Antibody heavy and light chain sequences in FASTA format, i.e., input_h.fa and input_l.fa
Output: complex_0001.pdb

Run the following scripts at a command line. It may take hours to days to run depending on user's computer performance.

# Generating MSA
run_e2e_ver.sh input_h.fa .
run_e2e_ver.sh input_l.fa .

# Generating pMSA
python make_joint_MSA_bacterial.py H.a3m L.a3m

# Filtering paired alignment
hhfilter -i paired.a3m -o filtered.a3m -id 90 -cov 75

# Modeling 3D structure
complex.sh H.fa .
complex.sh L.fa .

# Relaxing complex structure
relax.static.linuxgccrelease \
-database rosetta_bin_linux_2021.16.61629_bundle/main/database \
-s complex.pdb \
-out:pdb \
-relax:fast \
-relax:constrain_relax_to_start_coords \
-relax:ramp_constraints false \
-ex1 \
-ex2 \
-use_input_sc \
-flip_HNQ \
-no_optH false \
-nstruct 1

SWISS-MODEL Modelling

Input: Plain antibody heavy and light chain sequences
Output: <PDB_ID>_GMQE.pdb

ABodyBuilder Modelling

Input: Plain antibody heavy and light chain sequences
Output: <PDB_ID>_ab.pdb

RMSD Calculation

A documentation of ProFit can be found here.

To calculate the RMSD, first launch ProFit at a command line:

profit

Then run the following scripts under the ProFit prompt, replacing with actual file names and parameters:

reference 1fl5_heavy.pdb # Heavy chain of 1fl5 crystal structure
mobile 1fl5_090_heavy.pdb # Heavy chain of structure which chose template with GMQE score 0.9
atoms N,CA,C,O # Backbone atoms
zone 1-25 # The framework regions of the first fit
zone 34-50
zone 61-98
zone 109-119
fit
rzone 26-33 # Fit CDR regions
delrzone * # Must clear the rzone before testing another CDR region
rzone 51-60
delrzone *
rzone 99-108

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
1fl5		1fl5
1it9		1it9
1jhk		1jhk
1kcu		1kcu
2w9d		2w9d
3cfc		3cfc
3dif		3dif
3n9g		3n9g
3utz		3utz
3vfg		3vfg
4r4b		4r4b
4r97		4r97
5dr5		5dr5
5f6h		5f6h
5ihz		5ihz
5jor		5jor
5jqd		5jqd
5x4g		5x4g
5ysl		5ysl
6btj		6btj
6dwi		6dwi
6gfe		6gfe
6mty		6mty
6obz		6obz
6pzh		6pzh
6qnk		6qnk
6rnn		6rnn
6u8c		6u8c
6uga		6uga
6vu2		6vu2
6wj0		6wj0
6wos		6wos
README.md		README.md
RMSD.pzfx		RMSD.pzfx
ltjdata.xlsx		ltjdata.xlsx

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Antibody Modelling Benchmark

Overall Procedure

RoseTTAFold

SWISS-MODEL Modelling

ABodyBuilder Modelling

RMSD Calculation

About

stcmz/antibody-modelling-benchmark

Folders and files

Latest commit

History

Repository files navigation

Antibody Modelling Benchmark

Overall Procedure

RoseTTAFold

SWISS-MODEL Modelling

ABodyBuilder Modelling

RMSD Calculation

About

Topics

Resources

Stars

Watchers

Forks