While there is a great availability of medical datasets, they are usually focused on a specific research question. This is useful for making experiments transparent and reproducible, however, these datasets can be more efficiently used in other kinds of analyses, where it not for data privacy issues. The Personal Health Train (PHT) provides a distributed analysis infrastructure that follows the FAIR principles and gives control to the data owners (providers) about how their data are used by scientists or other users (consumers).