Remove sequences of a certain length
I need to do the following steps:
1. Set paired reads
2. Remove duplicate reads
3. Merge paired reads
4. Error correct and normalise reads
5. Remove the sequences in each dataset that are shorter than 151 nucleotides or longer than 320 nucleotides.
I am having trouble adding the last step to the workflow.
-
You can extract sequences of the length you want using the Lengths Graph above the viewer, as shown in the screenshot below:
If you want to automate this in a workflow, you need to do two filtering steps - one to remove the short reads and the other to remove the longer reads.
The workflow setup would look something like in the screenshot below:
This produces a sequence list with the same names as the input document but containing only sequences between 151 and 320 bp.
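If you ever need to check the result outside the application, the same two-step logic can be sketched in plain Python. This is only an illustration and not part of the workflow itself: it assumes the reads have been exported as FASTA text, and the names `read_fasta` and `filter_by_length` are made up for this example.

```python
def read_fasta(lines):
    """Yield (name, sequence) pairs from an iterable of FASTA lines."""
    name, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if name is not None:
                yield name, "".join(chunks)
            name, chunks = line[1:], []
        elif name is not None:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def filter_by_length(records, min_len=151, max_len=320):
    """Keep records whose length is between min_len and max_len inclusive."""
    # Step 1: remove the short reads (fewer than min_len nucleotides).
    long_enough = [(n, s) for n, s in records if len(s) >= min_len]
    # Step 2: remove the long reads (more than max_len nucleotides).
    return [(n, s) for n, s in long_enough if len(s) <= max_len]


# Hypothetical example data: one read on each side of both cut-offs.
example = [
    ">read_short", "A" * 150,
    ">read_min", "A" * 151,
    ">read_max", "C" * 320,
    ">read_long", "G" * 321,
]
kept = filter_by_length(read_fasta(example))
print([name for name, _ in kept])  # ['read_min', 'read_max']
```

Both limits are treated as inclusive here, so reads of exactly 151 or 320 nucleotides are kept, which matches the requirement in the question.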