To avoid annotation artifacts due to fragmented gene models, we generated a GFF file of barley genes (PMID: 23075845) whose protein sequences are nearly
completely represented in the Morex WGS assembly. Protein sequences were aligned to the genomic contigs with exonerate (PMID: 15713233). Genes were considered near-complete if at least 98 % of their protein
sequences could be aligned to the genomic sequence. A total of 18,039 (74 %) out of 24,243 high-confidence genes positioned on the Morex WGS assembly had near-complete ORFs.
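
The near-completeness filter can be illustrated with a minimal sketch. It assumes the exonerate alignments have already been summarized per gene as a protein length and a count of aligned residues; how that summary was derived is not stated in the text. The function names, the toy gene IDs, and the reading of 98 % as a minimum are assumptions made for illustration only.

```python
from typing import Dict, List, Tuple

# Assumed threshold: the text gives 98 %; treating it as a minimum is an assumption.
MIN_COVERAGE = 0.98


def aligned_fraction(protein_length: int, aligned_residues: int) -> float:
    """Fraction of a protein's residues covered by its genomic alignment."""
    if protein_length <= 0:
        raise ValueError("protein_length must be positive")
    # Cap at the protein length so the fraction never exceeds 1.0.
    return min(aligned_residues, protein_length) / protein_length


def near_complete_genes(
    alignments: Dict[str, Tuple[int, int]],
    min_coverage: float = MIN_COVERAGE,
) -> List[str]:
    """Return sorted gene IDs whose aligned fraction is at least min_coverage.

    alignments maps a gene ID to (protein_length, aligned_residues).
    """
    return sorted(
        gene_id
        for gene_id, (length, aligned) in alignments.items()
        if aligned_fraction(length, aligned) >= min_coverage
    )


# Hypothetical toy input; gene IDs and lengths are invented for illustration.
example = {
    "gene_A": (500, 500),  # fully aligned
    "gene_B": (500, 490),  # exactly at the 98 % threshold
    "gene_C": (500, 400),  # 80 % aligned, i.e. a fragmented gene model
}
kept = near_complete_genes(example)

# Share of near-complete genes reported in the text: 18,039 of 24,243.
share = 18039 / 24243
print(kept, f"{share:.1%}")
```

With the toy input, only the fragmented model (gene_C) is dropped. The same ratio applied to the counts reported above reproduces the stated 74 %.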