Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[New pipeline] EvidenceAlignment #35

Open
Juke34 opened this issue May 20, 2020 · 0 comments
Open

[New pipeline] EvidenceAlignment #35

Juke34 opened this issue May 20, 2020 · 0 comments

Comments

@Juke34
Copy link
Collaborator

Juke34 commented May 20, 2020

See #17 for the general picture.

The purpose of this pipeline is to generate gff alignment from protein or transcript fasta files.
Those gff must be formatted in match match/part (see AGAT script agat_sp_alignment_output_style.pl for that purpose if tools producing the gff output do not do it by default)

2 type of inputs: Protein fasta file and/or nucleotide fasta file.
For both type of alignment we could offer an option to select which tool to use (indeed many tools exist this task). so would be nice to allow several choices (e.g for protein splice aware alignment, genomethreader, exonerated gmap, etc...).

For protein alignment:
diamond or blastx for raw alignment and exonerate or scipio or spawn or genome threader for polished (splice aware) alignment
=>priority to implement diamond, blastx and exonerate

For transcript alignment:
=> Minimap2
=> we should also implement the MAKER method in two steps: 1) raw alignment with tblastx for related species data, or blastn for species-specific data; 2) exonerate for polished alignment.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant