position-analysis -v 1 -i $RSAT/public_html/tmp/www-data/2026/04/21/peak-motifs.2026-04-21.084826_2026-04-21.084826_JUm4D2/data/sequences/peak-motifs_test_maxlen1000_purged_ml40_mis3.fasta -format fasta -sort -return html,chi,sig,distrib,graphs,rank,index -max_graphs 20 -1str -ovlp -seqtype dna -l 2 -ci 20 -img_format png -title KingstonB -origin center -offset 0 -o $RSAT/public_html/tmp/www-data/2026/04/21/peak-motifs.2026-04-21.084826_2026-04-21.084826_JUm4D2/results/composition/peak-motifs_test_profiles-1str-ovlp_2nt_ci20.tab

Citation: van Helden, et al. (2000). Nucleic Acids Res 28, 1000-1010.

| Parameter | Value |
|---|---|
| Sequence file | $RSAT/public_html/tmp/www-data/2026/04/21/peak-motifs.2026-04-21.084826_2026-04-21.084826_JUm4D2/data/sequences/peak-motifs_test_maxlen1000_purged_ml40_mis3.fasta |
| Sequence format | fasta |
| Sequence type | dna |
| Output file | $RSAT/public_html/tmp/www-data/2026/04/21/peak-motifs.2026-04-21.084826_2026-04-21.084826_JUm4D2/results/composition/peak-motifs_test_profiles-1str-ovlp_2nt_ci20.tab |
| Oligo length | 2 |

Occurrences counted on a single strand.

Conditions of applicability checked.
Background model estimation: homogeneous repartition

Sequence statistics:

| Parameter | Value |
|---|---|
| Nb of sequences | 36 |
| Sum of sequence lengths | 791 |
| Min sequence length | 0 |
| Max sequence length | 22 |
| Average sequence length | 21 |
| Possible positions | 755 |

Sequences:

| # | length | ID |
|---|---|---|
| 1 | 22 | GENE2 |
| 2 | 22 | GENE3 |
| 3 | 22 | GENE4 |
| 4 | 22 | GENE5 |
| 5 | 22 | GENE6 |
| 6 | 22 | GENE49 |
| 7 | 22 | GENE50 |
| 8 | 22 | GENE51 |
| 9 | 22 | GENE52 |
| 10 | 21 | GENE53 |
| 11 | 22 | GENE54 |
| 12 | 22 | GENE55 |
| 13 | 22 | GENE56 |
| 14 | 22 | GENE57 |
| 15 | 22 | GENE |
| 16 | 22 | GENE59 |
| 17 | 22 | GENE60 |
| 18 | 22 | GENE61 |
| 19 | 22 | GENE62 |
| 20 | 22 | GENE84 |
| 21 | 22 | GENE85 |
| 22 | 22 | GENE86 |
| 23 | 22 | GENE87 |
| 24 | 22 | GENE88 |
| 25 | 22 | GENE63 |
| 26 | 22 | GENE64 |
| 27 | 22 | GENE65 |
| 28 | 22 | GENE66 |
| 29 | 22 | GENE67 |
| 30 | 22 | GENE68 |
| 31 | 22 | GENE69 |
| 32 | 22 | GENE70 |
| 33 | 22 | GENE71 |
| 34 | 22 | GENE72 |
| 35 | 22 | GENE73 |
| 36 | 22 | GENE74 |

Oligonucleotide statistics:

| Parameter | Value |
|---|---|
| Total occurrences | 755 |

Position interval parameters:

| Parameter | Value |
|---|---|
| Position interval | 20 |
| Number of windows | 2 |
| Total positions | 755 |
| Degrees of freedom | 1 |

K-mer clustering parameters:

| Parameter | Value |
|---|---|
| Number of clusters | 0 |
| Clustering method | complete |

Position intervals:

| | window | [min,max] | mid | seq | occ |
|---|---|---|---|---|---|
| 1 | -1 | [-19,0] | -9.5 | 36 | 431 |
| 2 | 0 | [1,20] | 10.5 | 36 | 324 |

Column headers:

| # | column | description |
|---|---|---|
| 1 | seq | pattern sequence |
| 2 | id | pattern identifier |
| 3 | occ | pattern occurrences |
| 4 | chi2 | observed chi-square |
| 5 | df | degrees of freedom |
| 6 | Pval | P-value (probability for one word to be a false positive) |
| 7 | Eval | E-value; expected number of false positives (Eval = Pval * nb_tests) |
| 8 | sig | Significance (sig = -log10(Eval)) |
| 9 | rank | rank of the pattern according to sorting criterion |
| 10 | -9.5 | occurrences in window 1 [-19,0] |
| 11 | 10.5 | occurrences in window 2 [1,20] |
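The two position intervals reported above ([-19,0] with midpoint -9.5, and [1,20] with midpoint 10.5) are consistent with a simple binning rule on positions relative to the sequence center. The Python sketch below illustrates one such rule. It is inferred only from the two windows listed in this report, not taken from the RSAT source, and the function names `window_index` and `window_bounds` are hypothetical.

```python
import math

def window_index(pos, ci=20):
    """Window index for a position relative to the origin.

    Assumption inferred from this report: windows are half-open on the
    left, so position 0 belongs to window -1 and position 1 to window 0.
    """
    return math.ceil(pos / ci) - 1

def window_bounds(index, ci=20):
    """Return (min, max, mid) of the window with the given index."""
    lo = index * ci + 1
    hi = (index + 1) * ci
    return lo, hi, (lo + hi) / 2

# The two windows of this report (class interval -ci 20).
for idx in (-1, 0):
    print(idx, window_bounds(idx))
```

With `ci=20` the sketch yields the same bounds and midpoints as the "Position intervals" listing; other values of `-ci` or `-offset` are not covered here.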
| seq | id | occ | chi2 | df | Pval | Eval | sig | rank | -9.5 | 10.5 |
|---|---|---|---|---|---|---|---|---|---|---|
| gg | gg | 62 | 11.8 | 1 | 5.9e-04 | 0.0094 | 2.03 | 1 | 22 | 40 |
| ca | ca | 37 | 6.8 | 1 | 8.9e-03 | 0.14 | 0.85 | 2 | 29 | 8 |
| ag | ag | 83 | 6.4 | 1 | 1.2e-02 | 0.19 | 0.73 | 3 | 36 | 47 |
| cg | cg | 72 | 5.6 | 1 | 1.8e-02 | 0.29 | 0.53 | 4 | 51 | 21 |
| ac | ac | 60 | 3.1 | 1 | 7.8e-02 | 1.3 | -0.10 | 5 | 41 | 19 |
| cc | cc | 68 | 2.3 | 1 | 1.3e-01 | 2.1 | -0.32 | 6 | 45 | 23 |
| aa | aa | 27 | 1.0 | 1 | 3.1e-01 | 5 | -0.70 | 7 | 18 | 9 |
| ga | ga | 113 | 0.7 | 1 | 3.9e-01 | 6.3 | -0.80 | 8 | 60 | 53 |
| ta | ta | 18 | 0.7 | 1 | 4.1e-01 | 6.6 | -0.82 | 9 | 12 | 6 |
| tc | tc | 33 | 0.4 | 1 | 5.2e-01 | 8.3 | -0.92 | 10 | 17 | 16 |
| gt | gt | 33 | 0.4 | 1 | 5.2e-01 | 8.3 | -0.92 | 11 | 17 | 16 |
| ct | ct | 31 | 0.4 | 1 | 5.4e-01 | 8.6 | -0.93 | 12 | 16 | 15 |
| at | at | 26 | 0.2 | 1 | 6.5e-01 | 10 | -1.01 | 13 | 16 | 10 |
| tg | tg | 41 | 0.2 | 1 | 6.6e-01 | 11 | -1.02 | 14 | 22 | 19 |
| gc | gc | 47 | 0.0 | 1 | 9.6e-01 | 15 | -1.19 | 15 | 27 | 20 |
| tt | tt | 4 | 0.0 | 1 | 1.0e+00 | 16 | -1.20 | 16 | 2 | 2 |
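The statistics in the table can be checked by hand. The Python sketch below recomputes chi2, Pval, Eval and sig for the `gg` row. It assumes that the "homogeneous repartition" background gives each window an expected count proportional to that window's total occurrences (431 and 324), and that `nb_tests` is the number of dinucleotides tested (16). Both assumptions are inferred from the reported numbers, not from the RSAT source, and the function names are hypothetical.

```python
import math

def window_chi2(counts, window_totals):
    """Chi-square of one word's per-window counts against expected counts
    proportional to the total occurrences in each window."""
    occ = sum(counts)
    total = sum(window_totals)
    chi2 = 0.0
    for obs, win_total in zip(counts, window_totals):
        expected = occ * win_total / total
        chi2 += (obs - expected) ** 2 / expected
    return chi2

def chi2_pvalue_df1(chi2):
    """Upper-tail P-value of a chi-square statistic with 1 degree of freedom."""
    return math.erfc(math.sqrt(chi2 / 2.0))

window_totals = [431, 324]  # occ per window, from "Position intervals"
counts_gg = [22, 40]        # gg occurrences in windows -9.5 and 10.5
nb_tests = 16               # assumed: one test per dinucleotide

chi2 = window_chi2(counts_gg, window_totals)
pval = chi2_pvalue_df1(chi2)
e_value = pval * nb_tests       # Eval = Pval * nb_tests
sig = -math.log10(e_value)      # sig = -log10(Eval)

# Should agree with the gg row of the table up to rounding.
print(f"chi2={chi2:.1f} Pval={pval:.1e} Eval={e_value:.4f} sig={sig:.2f}")
```

The closed form `erfc(sqrt(chi2 / 2))` only holds for one degree of freedom, which is the case here because there are two windows. A run with more windows would need a general chi-square survival function instead.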
| Parameter | Value |
|---|---|
| Host name | rsat |
| Job started | 2026-04-21.084829 |
| Job done | 2026-04-21.084829 |
| Seconds | 0.1 |
| user | 0.1 |
| system | 0.01 |
| cuser | 0 |
| csystem | 0 |