This repository holds the HpGP Phase 1 genomic dataset for Hp26695 and 1011 study samples. All 1012 genomic sequences were annotated using the NCBI Prokaryotic Genome Annotation Pipeline(PGAP). Also, it has 255 curated public available H. pylori genomic sequences used for population structure analysis in Thorell et al. Nature Communications, 14:8184 (2023).
You can check the NCBI BioProject website for the latest annotation and sequence updates.
https://www.ncbi.nlm.nih.gov/bioproject/?term=HpGP
Please cite the above-mentioned paper if you use the data.