what dose the "N" mean in the targetSequences filed? #1395
-
I've observed that the exported tsv files contain numerous reads with 'N' in the sequence. I utilized the Thanks! Such as, GGAGGTCTCAGAGAGGAGCCTTAGCCCTGGACTCCAAGGCCTTTCCACTTGGTGATCAGCACTGAGCACAGAGGACTCACCATGGAATTGGGGCTGATCTGGGTTTTCCTTGTTGCTGTTTTAGAAGGTGTCNAGTGNNANNNNCGNNGAGTCGCGTATAATGTAGTCTTGGNNNNNGNGGNNNC,TAAACCCTGATAGAAGTGAATTCTATGTGGACTCTGTGAAGGGCCGATTCACCATCTCCAGAGACAACGGCAAGAACTCACTGTATCTGCAAATGAATAGCCTGAGAGTCGAGGACTCGGCTGTATATTACTGTGCAAAAGAAGCAGGAACCCACTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAGCATCCCCGACCAGCCCCAAGGTCTTCCCGCTGAG |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 7 replies
-
Hi, Does the original read include the 'N' nucleotides? An alignment displays how the original read sequence matches up to the reference. If the sequence has 'N' nucleotides, they will appear in the target sequence of the alignment. This mismatch (an unidentified nucleotide in this context) will impact the alignment score. However, this ambiguity might be resolved during clone assembly if multiple reads covering the same position clearly identify a specific base pair there. Sincerely, |
Beta Was this translation helpful? Give feedback.
-
I see, the N nucleotides present in the clonotype table and not in alignments. In that case most likely there was an ambiguity between different alignments that comprise the clone. You can backtrack all alignments that were assembled into the clone and check the nucleotide at this position. If most alignments would have reported the same nucleotide at this spot (even if it is a mismatch relative to the reference) you would have seen this nucleotide in the clonal sequence. |
Beta Was this translation helpful? Give feedback.
-
Hi, In that case the 'N' nucleotides originate from the You are correct; you can use |
Beta Was this translation helpful? Give feedback.
Hi,
In that case the 'N' nucleotides originate from the
assembleContigs
command. The algorithms for this command have undergone a complete revision, and several critical bug fixes have been implemented since version 3. There are various reasons why 'N' nucleotides may arise, such as low quality, which is deemed unreliable, too few data to resolve into different clonotypes (for example, if you have two alignments with a different nucleotide at a certain position, there isn’t enough information to confirm they are indeed two different clones), or the discrepancy may be outside of the subcloning region (the region where ambiguous nucleotides lead to splitting into two clones). Given how the …