NAME
VERSION
DESCRIPTION
AUTHORS
CATEGORY
USAGE
INPUT FORMAT
OUTPUT FORMAT
STATISTICAL MODEL
SEE ALSO

NAME

oligo-diff

VERSION

$program_version

DESCRIPTION

Compare oligonucleotide frequencies between two input sequence files, and return the oligos that are significantly enriched in one file relative to the other.

AUTHORS

Jacques.van-Helden@univ-amu.fr

USAGE

oligo-diff -file1 first_seq_file -file2 second_seq_file [-o outputfile] [-v #] [...]

INPUT FORMAT

The program takes as input a pair of sequence files in FASTA format, specified with the options -file1 and -file2.
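
As an illustration of the input format, the following Python sketch reads FASTA-formatted text into a dictionary. It is not part of oligo-diff: the function name read_fasta and the choice to use the first word of each header line as the sequence identifier are assumptions made for this example only.

```python
from typing import Dict, Iterable


def read_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Parse FASTA-formatted lines into a {sequence_id: sequence} dictionary."""
    sequences: Dict[str, str] = {}
    current_id = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # Header line: keep the first whitespace-delimited token as the ID
            # (an assumption of this sketch, not a rule of the program).
            tokens = line[1:].split()
            current_id = tokens[0] if tokens else ""
            sequences[current_id] = ""
        elif current_id is not None:
            # Sequence line: append to the current record, in upper case.
            sequences[current_id] += line.upper()
    return sequences


# Made-up example input with two sequences.
example = """>seq1 first example
ACGTAC
GTTA
>seq2
ttgaca
""".splitlines()

records = read_fasta(example)
print(records)
```

Each of the two input files would contain one or more records of this kind.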

OUTPUT FORMAT

The output is a tab-delimited file with one row per oligonucleotide and one column per statistic. The content of each column is described in the header of the output, which is printed only when the verbosity is at least 1 (option -v 1).
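
A tab-delimited result of this kind can be loaded with a few lines of Python. The sketch below rests on assumptions that this help text does not state: that comment lines start with ';', that the line holding the column names starts with '#', and that the columns are named seq, occ and sig. The sample rows are invented for illustration and are not actual program output.

```python
from typing import Dict, List


def parse_table(text: str) -> List[Dict[str, str]]:
    """Parse tab-delimited text into a list of {column_name: value} rows."""
    header: List[str] = []
    rows: List[Dict[str, str]] = []
    for line in text.splitlines():
        if not line.strip() or line.startswith(";"):
            continue  # assumed comment convention: lines starting with ';'
        if line.startswith("#"):
            # Assumed column-header convention: a line starting with '#'.
            header = line.lstrip("#").strip().split("\t")
            continue
        rows.append(dict(zip(header, line.split("\t"))))
    return rows


# Invented sample, with hypothetical column names and values.
sample = (
    "; hypothetical header comment\n"
    "#seq\tocc\tsig\n"
    "tatata\t12\t3.4\n"
    "acgtac\t5\t0.2\n"
)
rows = parse_table(sample)
print(rows[0]["seq"], rows[0]["occ"])
```

All values are kept as strings; convert them to numbers as needed once the real column names are known from the output header.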

STATISTICAL MODEL

WISH LIST

OPTIONS

-v #: Level of verbosity (amount of detail in the warning messages printed during execution)
-h: Display full help message
-help: Same as -h
-file1 first_seq_file: First sequence file.
-file2 second_seq_file: Second sequence file.
-l oligo_len: Oligonucleotide length.
-1str: Count oligonucleotides on a single strand only. Alternative option: -2str
-2str: Sum oligonucleotides on both strands. More precisely, each pair of reverse complements is counted as a single motif (the count is performed on a single strand, but pairs of reverse complements are merged). Alternative option: -1str
-noov: Do not accept overlap between successive occurrences of the same word. Only renewing occurrences are counted. E.g.: TATATATATATA is counted as 2 occurrences of TATATA. Alternative option: -ovlp
-ovlp: Count all occurrences of self-overlapping words. E.g.: TATATATATATA is counted as 4 occurrences of TATATA. Alternative option: -noov
-o outputfile: Output file. If no output file is specified, the standard output is used, which allows the command to be used within a pipe.
-lth key value: Lower threshold on some output field. Supported fields for threshold: occ, sig
-uth key value: Upper threshold on some output field.
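
The counting modes described for -1str/-2str and -noov/-ovlp can be illustrated with a short Python sketch. This is not the program's implementation: the function count_oligos and its parameters are hypothetical, and the way non-overlapping counting interacts with strand merging here (an occurrence is skipped if it overlaps the previous counted occurrence of the same merged pair) is an assumption of the sketch.

```python
from collections import Counter

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(oligo: str) -> str:
    """Return the reverse complement of an upper-case DNA word."""
    return oligo.translate(_COMPLEMENT)[::-1]


def count_oligos(seq: str, length: int,
                 overlap: bool = True, both_strands: bool = False) -> Counter:
    """Count words of the given length while scanning a single strand.

    overlap=False mimics -noov: a word is counted again only once the
    previous counted occurrence of that word has ended.
    both_strands=True mimics -2str: each word and its reverse complement
    share one key (the alphabetically smaller of the two).
    """
    counts: Counter = Counter()
    next_allowed = {}  # word -> first position where it may be counted again
    seq = seq.upper()
    for pos in range(len(seq) - length + 1):
        word = seq[pos:pos + length]
        if both_strands:
            word = min(word, reverse_complement(word))
        if not overlap and pos < next_allowed.get(word, 0):
            continue  # overlaps the previous counted occurrence
        counts[word] += 1
        next_allowed[word] = pos + length
    return counts


with_overlap = count_oligos("TATATATATATA", 6, overlap=True)
without_overlap = count_oligos("TATATATATATA", 6, overlap=False)
merged = count_oligos("ACGGT", 2, both_strands=True)

print(with_overlap["TATATA"])     # 4, as in the -ovlp example
print(without_overlap["TATATA"])  # 2, as in the -noov example
print(merged["AC"])               # 2: AC and its reverse complement GT are merged
```

The sketch does not treat ambiguous residues such as N in any special way.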