position-analysis -v 1 -i peaks-rm-orig/data/sequences/peaks_ctrl_purged_ml40_mis3.fasta -format fasta -sort -return html,chi,sig,distrib,graphs,rank,index -max_graphs 20 -1str -ovlp -seqtype dna -l 2 -ci 20 -img_format png -title analysis_M11 -origin end -offset 0 -o peaks-rm-orig/results/composition/peaks_ctrl_profiles-1str-ovlp_2nt_ci20.tab Citation: van Helden, et al. (2000). Nucleic Acids Res 28, 1000-1010. Sequence file peaks-rm-orig/data/sequences/peaks_ctrl_purged_ml40_mis3.fasta Sequence format fasta Sequence type dna Output file peaks-rm-orig/results/composition/peaks_ctrl_profiles-1str-ovlp_2nt_ci20.tab Oligo length 2 Occurrences counted on a single strands Conditions of applicability checked. Background model estimation: homogeneous repartition Sequence statistics: Nb of sequences 27107 Sum of sequence lengths 5448507 Min sequence length 0 Max sequence length 201 Average sequence length 201 Possible positions 5421400 Oligonucleotide statistics: Total occurrences 4496347 Position interval parameters: Position interval 20 Number of windows 11 Total positions 5421400 Degrees of freedom 10 K-mer clustering parameters: Number of clusters 0 Clustering method complete Position intervals: window [min,max] mid seq occ 1 -11 [-219,-200] -209.5 27107 54214 2 -10 [-199,-180] -189.5 27107 542140 3 -9 [-179,-160] -169.5 27107 542140 4 -8 [-159,-140] -149.5 27107 542140 5 -7 [-139,-120] -129.5 27107 542140 6 -6 [-119,-100] -109.5 27107 542140 7 -5 [-99,-80] -89.5 27107 542140 8 -4 [-79,-60] -69.5 27107 542140 9 -3 [-59,-40] -49.5 27107 542140 10 -2 [-39,-20] -29.5 27107 542140 11 -1 [-19,0] -9.5 27107 487926 Column headers 1 seq pattern sequence 2 id pattern identifier 3 occ pattern occurrences 4 chi2 observed chi-square 5 df degrees of freedom 6 Pval P-value (probability for one word to be a false positive) 7 Eval E-value; expected number of false positives (Eval = Pval * nb_tests) 8 sig Significance (sig = -log10(Eval)) 9 rank rank of the pattern according to sorting criterion 10 -209.5 occurrences in window 1 [-219,-200] 11 -189.5 occurrences in window 2 [-199,-180] 12 -169.5 occurrences in window 3 [-179,-160] 13 -149.5 occurrences in window 4 [-159,-140] 14 -129.5 occurrences in window 5 [-139,-120] 15 -109.5 occurrences in window 6 [-119,-100] 16 -89.5 occurrences in window 7 [-99,-80] 17 -69.5 occurrences in window 8 [-79,-60] 18 -49.5 occurrences in window 9 [-59,-40] 19 -29.5 occurrences in window 10 [-39,-20] 20 -9.5 occurrences in window 11 [-19,0]
| seq | id | occ | chi2 | df | Pval | Eval | sig | rank | -209.5 | -189.5 | -169.5 | -149.5 | -129.5 | -109.5 | -89.5 | -69.5 | -49.5 | -29.5 | -9.5 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| tg | tg | 272267 | 5470.4 | 10 | 0.0e+00 | 0 | 300 | 1 | 6519 | 26144 | 26834 | 26546 | 26601 | 26780 | 26280 | 26628 | 26844 | 27768 | 25323 |
| at | at | 327290 | 5037.1 | 10 | 0.0e+00 | 0 | 300 | 2 | 7196 | 35031 | 32880 | 32398 | 31830 | 31825 | 31591 | 31706 | 31817 | 32143 | 28873 |
| aa | aa | 452118 | 1530.6 | 10 | 0.0e+00 | 0 | 300 | 3 | 5480 | 50232 | 48131 | 46385 | 45572 | 45227 | 44228 | 43236 | 43032 | 42812 | 37783 |
| tc | tc | 341919 | 1410.0 | 10 | 0.0e+00 | 0 | 300 | 4 | 2121 | 31076 | 32121 | 33091 | 33665 | 33963 | 35108 | 35530 | 35946 | 36143 | 33155 |
| ta | ta | 240715 | 960.7 | 10 | 0.0e+00 | 0 | 300 | 5 | 2241 | 27280 | 26113 | 25142 | 24106 | 23738 | 23008 | 22908 | 22999 | 22563 | 20617 |
| ct | ct | 322378 | 860.6 | 10 | 0.0e+00 | 0 | 300 | 6 | 2178 | 29908 | 30753 | 31678 | 31788 | 31958 | 33173 | 33117 | 33420 | 33519 | 30886 |
| cc | cc | 270606 | 853.8 | 10 | 0.0e+00 | 0 | 300 | 7 | 1900 | 24747 | 25386 | 25867 | 26792 | 27632 | 28280 | 28099 | 28261 | 28163 | 25479 |
| cg | cg | 124136 | 566.5 | 10 | 0.0e+00 | 0 | 300 | 8 | 955 | 11074 | 11466 | 11858 | 12080 | 12533 | 12432 | 12952 | 13128 | 13394 | 12264 |
| gg | gg | 180688 | 518.6 | 10 | 0.0e+00 | 0 | 300 | 9 | 1399 | 20225 | 17632 | 17307 | 17699 | 17737 | 17618 | 17610 | 17748 | 18301 | 17412 |
| gc | gc | 200513 | 440.8 | 10 | 0.0e+00 | 0 | 300 | 10 | 1346 | 19596 | 18980 | 19552 | 19586 | 20167 | 20440 | 20262 | 20402 | 20987 | 19195 |
| tt | tt | 430433 | 281.0 | 10 | 1.6e-54 | 2.5e-53 | 52.59 | 11 | 3456 | 43700 | 43299 | 43111 | 42720 | 42152 | 42170 | 42889 | 43145 | 43577 | 40214 |
| gt | gt | 206969 | 210.5 | 10 | 1.0e-39 | 1.6e-38 | 37.79 | 12 | 1614 | 21574 | 21061 | 20810 | 20723 | 20249 | 20030 | 20287 | 20406 | 21012 | 19203 |
| ga | ga | 255400 | 206.6 | 10 | 6.8e-39 | 1.1e-37 | 36.96 | 13 | 2316 | 27364 | 25177 | 24938 | 25572 | 25261 | 24955 | 25367 | 25244 | 25673 | 23533 |
| ag | ag | 267178 | 169.1 | 10 | 4.2e-31 | 6.8e-30 | 29.17 | 14 | 2180 | 27570 | 27376 | 26626 | 26942 | 26741 | 26381 | 26164 | 26425 | 26301 | 24472 |
| ac | ac | 254704 | 127.1 | 10 | 1.8e-22 | 2.9e-21 | 20.53 | 15 | 2121 | 26312 | 25633 | 25428 | 25304 | 25737 | 25897 | 25390 | 25310 | 24949 | 22623 |
| ca | ca | 349033 | 107.8 | 10 | 1.5e-18 | 2.3e-17 | 16.63 | 16 | 3047 | 34633 | 34382 | 34310 | 34605 | 35306 | 35624 | 35237 | 35132 | 34960 | 31797 |
Host name rsat Job started 2023-05-30.162506 Job done 2023-05-30.162537 Seconds 31.25 user 31.25 system 0.01 cuser 0.11 ; csystem 0.02