GitHub

M0 model

Model M0 has single omega and kappa value for the whole tree. To generate M0 data use the following steps:

Generate parameters for the simulations (src/m0_par.R).
Simulate trees (src/m0_trees.sh).
Simulate sequences (src/m0_seqs.sh).
Export data to numpy format (src/m0_export.py).

The resulting file is data/m0.npy.xz. This is lzma compressed numpy file. It can be decompressed using xz -d. It contains a dictionary, every item is a single dataset (tree + alignment).

import numpy as np

data = np.load('data/m0.npy')[()]

This is dictionary where keys are dataset IDs, and values are dictionaries. Dictionary describing a single dataset looks as follows:

{
    'tlen': '1.84', # tree length in codon substitutions
    'nseq': '8', # number of leaf sequences
    'w': '0.5', # omega (dN/dS), simulation parameter
    'k': '2.5', # kappa, ts/tv ratio, simulation parameter
    'ncodons': '110', # alignment length in codons
    'bifurcations': [ # list of bifurcations, post-ordered
        {
            'level': 2, # node depth, 0 is root
            'dist': 0.18, # distance between the two nodes (in codon substitions)
            'input': np.array, # child sequences, shape (2, ncodons*3)
            'output': np.array, # parent sequence, shape (ncodons*3,)
        },
        ...
    ],
}

Sequences represented as arrays of bytes, each bytes is a nucleotide (ATGC -> 0123). Bifurcations are listed in post-order.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
res		res
src		src
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

M0 model

About

Releases

Packages

Languages

idavydov/treenet

Folders and files

Latest commit

History

Repository files navigation

M0 model

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages