Fastaq_to_unique_by_id
fastaq_to_unique_by_id <infile> <outfile>
Removes duplicate sequences from a fasta/q file, based on their names. If the same name is found more than once, then the longest sequence is kept. Order of sequences is preserved in output
infile
Name of input fasta/q file
outfile
Name of output fasta/q file
-h, --help
show this help message and exit
fastaq_to_unique_by_id was originally written by Martin Hunt ([email protected])
Wellcome Trust Sanger Institute Copyright © 2013 Wellcome Trust Sanger Institute This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 3 of the License, or (at your option) any later version.