The raw contig FASTA file, the GTF annotations and the peptide sequences were downloaded from:

https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/900/626/175/GCF_900626175.2_cs10/

The hard-masked FASTA file was produced from the soft-masked sequences from NCBI with the following Perl command, which replaces every lowercase (soft-masked) base with N and leaves the header lines unchanged:

    perl -lne 'if(!/^>/){ s/[a-z]/N/g } print' Cannabis_sativa.cs10.GCF_900626175.2.NCBI.dna.toplevel.fa > \
      Cannabis_sativa.cs10.GCF_900626175.2.NCBI.dna_rm.genome.fa

The remaining files were generated by RSAT-Tools when this genome was installed.