The Pathogen–Host Interactions Database (PHI-base) is an online database that catalogues experimentally-verified pathogenicity, virulence and effector genes from fungal, oomycete, and bacterial pathogens, which infect animal, plant, fungal, and insect hosts. PHI-base is a valuable resource in the discovery of genes in medically and agronomically important pathogens, which may be potential targets for chemical intervention.
Information in PHI-base is manually curated by domain experts and is supported by strong experimental evidence (for example, gene disruption and gene complementation experiments), as well as references to the literature in which the original experiments are described. Annotations are made using terms from ontologies and controlled vocabularies, including the Gene Ontology (GO), Brenda Tissue Ontology (BTO), and the Pathogen--Host Interaction Phenotype Ontology (PHIPO).
PHI-base 5 includes data that was curated using a new curation process described in Cuzick et. al (2023). Data releases for PHI-base 5 do not use the same schema as data releases from PHI-base 4, but all data records from PHI-base 4 that can be made compatible with the new schema are included with this release. Data releases from PHI-base 4 and PHI-base 5 will occur in parallel until such time that all data from PHI-base 4 can be migrated to PHI-base 5. The PHI-base 4 data releases are available on Zenodo at https://zenodo.org/doi/10.5281/zenodo.5356870.
Data content
phi-base_v5.0.xlsx: the PHI-base dataset as an Excel spreadsheet. This format follows the layout of the PHI-base 5 website, with sheets corresponding to the sections of gene pages on the website. This format is designed for use by non-technical users.
phi-base_v5.0.json: the PHI-base dataset in JSON format. This format is closer to the data format that is exported by PHI-Canto, the curation tool used by PHI-base. This format is primarily intended for programmatic usage and has additional data (e.g. metadata for curation sessions) that is not included in the spreadsheet format.
phi-base.schema.json: a JSON Schema file for the JSON format of the dataset. This is included as documentation for the fields in the JSON file, but can also be used to validate the dataset.