[New pipeline] EvidenceAlignment #35

Juke34 · 2020-05-20T07:18:53Z

See #17 for the general picture.

The purpose of this pipeline is to generate gff alignment from protein or transcript fasta files.
Those gff must be formatted in match match/part (see AGAT script agat_sp_alignment_output_style.pl for that purpose if tools producing the gff output do not do it by default)

2 type of inputs: Protein fasta file and/or nucleotide fasta file.
For both type of alignment we could offer an option to select which tool to use (indeed many tools exist this task). so would be nice to allow several choices (e.g for protein splice aware alignment, genomethreader, exonerated gmap, etc...).

For protein alignment:
diamond or blastx for raw alignment and exonerate or scipio or spawn or genome threader for polished (splice aware) alignment
=>priority to implement diamond, blastx and exonerate

For transcript alignment:
=> Minimap2
=> we should also implement the MAKER method in two steps: 1) raw alignment with tblastx for related species data, or blastn for species-specific data; 2) exonerate for polished alignment.

The text was updated successfully, but these errors were encountered:

mahesh-panchal mentioned this issue May 20, 2020

Integrating workflows into a single workflow #17

Open

10 tasks

Juke34 added the New pipeline label Jun 22, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[New pipeline] EvidenceAlignment #35

[New pipeline] EvidenceAlignment #35

Juke34 commented May 20, 2020 •

edited

Loading

[New pipeline] EvidenceAlignment #35

[New pipeline] EvidenceAlignment #35

Comments

Juke34 commented May 20, 2020 • edited Loading

Juke34 commented May 20, 2020 •

edited

Loading