gustaf (1)

Gustaf - generic multi-split alignment finder: tool for split-read mapping allowing multiple splits. synopsis gustaf [options] genome fasta file read fasta file description gustaf uses seqans stellar to find splits as local matches on different strands or chromosomes. criteria and penalties to chain these matches can be specified. output file contains the breakpoints along the best chain. the genome file is used as database input, the read file as query input. all stellar options are supported. see stellar documentation for stellar parameters and options. (c) 2011-2012 by kathrin trappe -h, --help displays this help message. --version display version information gustaf options: main options: -tp, --transpen int interchromosomal translocation penalty default: 5. -ip, --invpen int inversion penalty default: 5. -op, --orderpen int intrachromosomal order change penalty default: 0. -oth, --overlapthresh double allowed overlap between matches default: 0.5. -gth, --gapthresh int allowed gap length between matches default: 10. -ith, --initgapthresh int allowed initial or ending gap length at begin and end of read default: 15. -st, --support int number of supporting reads default: 2. input options: -m, --matchfile file file of (stellar) matches valid filetypes are: gff and gff. output options: -bpo, --breakpointout file name of breakpoint output file. valid filetypes are: gff and txt. default: breakpoints.gff. -j, --jobname str job/queue name default: . -do, --dots enable graph output in dot format stellar options: main options: -e, --epsilon num maximal error rate (max 0.25). in range [0.0000001..0.25]. default: 0.05. -l, --minlength num minimal length of epsilon-matches. in range [0..inf]. default: 100. -f, --forward search only in forward strand of database. -r, --reverse search only in reverse complement of database. -a, --alphabet str alphabet type of input sequences (dna, rna, dna5, rna5, protein, char). one of dna, dna5, rna, rna5, protein, and char. -v, --verbose set verbosity mode. filtering options: -k, --kmer num length of the q-grams (max 32). in range [1..32]. -rp, --repeatperiod num maximal period of low complexity repeats to be filtered. default: 1. -rl, --repeatlength num minimal length of low complexity repeats to be filtered. default: 1000. -c, --abundancecut num k-mer overabundance cut ratio. in range [0..1]. default: 1. verification options: -x, --xdrop num maximal x-drop for extension. default: 5. -vs, --verification str verification strategy: exact or bestlocal or bandedglobal one of exact, bestlocal, and bandedglobal. default: exact. -dt, --disablethresh num maximal number of verified matches before disabling verification for one query sequence (default infinity). in range [0..inf]. -n, --nummatches num maximal number of kept matches per query and database. if stellar finds more matches, only the longest ones are kept. default: 50. -s, --sortthresh num number of matches triggering removal of duplicates. choose a smaller value for saving space. default: 500. version gustaf version: 1.0 last update july 2012