Commit

updated the vignette
KarakulakTulay committed May 7, 2024
1 parent 7096fe1 commit a0204df
Showing 1 changed file with 2 additions and 3 deletions.
5 changes: 2 additions & 3 deletions vignettes/MDTToolset-vignette.Rmd
@@ -75,7 +75,7 @@ knitr::kable(head(cond2_ENST_ENSG,2), caption = 'TPM Matrix')
As transcripts with the same sequences might have different peptide lengths, we select the one with the longest sequence when there is redundancy. This information cannot be accessed directly via Ensembl BioMart, so we need to download the transcripts' peptide sequences and then calculate the peptide lengths. We provide Ensembl version 104 as an example. If you want to prepare a different version, download the ENSG IDs, ENST IDs, and peptide sequences from BioMart and use the following function from MDTToolset:

df_biomart_seq_length <- MDTToolset::prepare_seq_length('mart_export.txt')
-File format of mart_export.txt if fasta format:
+The file 'mart_export.txt' is in the fasta file format:
>ENSG0000000XXXX|ENST0000000KKK
MQRSPLEKASVVSKLFFSWTRPILRKGYRQRLELSDIYQIPSVDSADNLSEKLEREWDRE
LASKKNPKLINALRRCFFWRFMFYGIFLYLGEVTKAVQPLLLGRIIASYDPDNKEERSIA
@@ -84,8 +84,7 @@ IYLG*
MTAEEMKATESGAQSAPLPMEGVDISPKQDEGVLKVIKREGTGTEMPMIGDRVFVHYTGW
LLDGTKFDSSLD
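
The length calculation described above can be sketched in base R. This is only an illustration and not the MDTToolset implementation: the helper name `read_peptide_lengths`, the output column names, and the example IDs are hypothetical, and the sketch assumes each header has the form `>ENSG|ENST` and that a trailing `*` marks a stop codon that should not be counted.

```r
# Hypothetical helper (not part of MDTToolset): compute peptide lengths
# from FASTA lines whose headers look like ">ENSG...|ENST...".
read_peptide_lengths <- function(fasta_lines) {
  # Drop surrounding whitespace and empty lines
  fasta_lines <- trimws(fasta_lines)
  fasta_lines <- fasta_lines[nzchar(fasta_lines)]

  is_header <- startsWith(fasta_lines, ">")
  # Every line gets the index of the most recent header above it
  record_id <- cumsum(is_header)

  # Split "ENSG|ENST" headers into their two identifiers
  headers <- sub("^>", "", fasta_lines[is_header])
  ids <- strsplit(headers, "|", fixed = TRUE)

  # Join the wrapped sequence lines of each record into one string
  seq_lines <- fasta_lines[!is_header]
  seq_record <- record_id[!is_header]
  sequences <- vapply(
    seq_along(headers),
    function(i) paste(seq_lines[seq_record == i], collapse = ""),
    character(1)
  )
  # Assumption: a trailing '*' is a stop codon and is not counted
  sequences <- sub("\\*$", "", sequences)

  data.frame(
    ENSG = vapply(ids, `[`, character(1), 1),
    ENST = vapply(ids, `[`, character(1), 2),
    length = nchar(sequences),
    stringsAsFactors = FALSE
  )
}

# Made-up records; a real file could be read with readLines('mart_export.txt')
example_lines <- c(
  ">ENSG_A|ENST_A1",
  "MQRSPLEKAS",
  "VVSKL*",
  ">ENSG_A|ENST_A2",
  "MTAEEMK"
)
peptide_lengths <- read_peptide_lengths(example_lines)
```

The result has one row per transcript, so choosing the longest peptide among redundant transcripts reduces to comparing the `length` column.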

-
-```{r }
+```{r PeptideLength}
# Example dataframe for df_biomart_seq_length
df_biomart_seq_length <- MDTToolset::df_biomart_seq_length
