DataCite Commons: Benchmarking the performance of GPT-2 type applications on GPU accelerated computing resources

In this report we look at eight GPU-accelerated systems representing a cross-section of the available Tier 2 HPC systems in the UK. These include a mixture of GPU-accelerated platforms from Nvidia, AMD and Intel. For each system, we perform a set of benchmarking experiments by training a GPT-2 model while varying a mixture of parameters and hyperparameters: model size, number of GPUs, floating-point data type, training data size, distribution strategy and batch size. Our interest is in performance measured by the time taken to complete an epoch of training, rather than convergence speed. We also measure memory usage. The overall aim is to compare systems so that researchers intending to perform AI training have some benchmarks for the training speed to expect for a model in a non-optimised, real-world scenario.

Content published 2025 in Zenodo

Report, Computer and information sciences, English

https://doi.org/10.5281/zenodo.11105322

Tomas Lazauskas	The Alan Turing Institute
David Llewellyn-Jones	The Alan Turing Institute

