Skip to content

This repository includes the scripts used for analysis investigating the dynamics of indels in mammalian orthologous proteins and the examination of the ancestral reconstruction of multiple-character indels under the PIP.

License

Notifications You must be signed in to change notification settings

acg-team/single-char-indel-ASR-preserves-long-indels

Repository files navigation

The Python scripts used for the paper entitled:

Single-character insertion-deletion model preserves long indels in ancestral sequence reconstruction

Requirments

For this tutorial, you should already have Python 3.9 or higher, jupyter notebook along with the following libraries:

numpy, ete3, biopython, seaborn, matplotlib, sklearn and pandas

Installation

To install the package you can simply download the repository and run the following command in the root directory.

Install the dependencies using this command:

pip3 install -r requirements.txt

Files

  • mammals_01_stat.ipynb contains functions for indel pattern plots for mammalian data.
  • mammals_02_dynamic_of_gaps.ipynb includes functions for calculating dynamic of gap pattern for each mammalian data.
  • mammals_03_indel_length.ipynb includes functions for ploting indel length for mammalian data.
  • simulation_01_acc.ipynb contains functions for computing accuracy of ARPIP inference on simulated data.
  • simulation_02_dynamic_of_gaps.ipynb contains functions for calculating dynamic of gap pattern for each simulated data.
  • simulation_03_stat.ipynb contains functions for indel pattern plots for simulated data.
  • simulation_04_discussion.ipynb contains scripts for the appendix figures.
  • requirements.txt contains library versions of dependencies.

To get the figures in the manuscript all the necessary files and scripts are provided here. Moreover, suplemental data is stored in another repository with this link.

Citation

Please cite:

Gholamhossein Jowkar, Julija Pecerska, Manuel Gil, and Maria Anisimova
Single-character insertion-deletion model preserves long indels in ancestral sequence reconstruction.
BioRxiv, 2024;
doi:10.1101/2024.03.09.584071


Author

Gholam-Hossein Jowkar E-mail

About

This repository includes the scripts used for analysis investigating the dynamics of indels in mammalian orthologous proteins and the examination of the ancestral reconstruction of multiple-character indels under the PIP.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published