The raw contigs FASTA file, the GTF annotations and the peptide sequences were downloaded from: https://chi.mpipz.mpg.de/index.html The hard-masked FASTA file was produced with ~/github/plant-scripts/repeats/Red2Ensembl.py --msk_file chi_v1.rm.fa chi_v1.fa chi #Genome length: 198654690 - Repeats length: 70437381 - Repeats content: 35.4572% and then converted and sorted with the following Perl script: perl -lne 'if(!/^>/){ s/[a-z]/N/g } print' chi_v1.rm.fa The rest of files were produced by RSAT-Tools to install this genome as explained at https://rsa-tools.github.io/managing-RSAT/genome_installation/install_organisms_FASTA_GTF.html