Relevant Information: This dataset’s physics problem is a two-dimensional, spatial pattern formed from a pollutant that has been released into the atmosphere and dispersed for up to an hour while undergoing deposition to the surface. The pollutant’s release location (sx,sy) is assumed to occur anywhere in a two-dimensional domain of 5000 m × 5000 m. The release is initialized from a small bubble that is centered five meters above the surface, has a radius of five meters, and has internal momentum that causes it to expand radially and rise to a height of about 100 meters within the initial minute of simulation time. The same bubble source was used for all the simulations as a simplification. Only the (sx,sy) coordinates of the locations of the bubble source are relevant. All the realizations used unit mass releases, and the resulting deposition patterns can be scaled proportionately for other mass amounts. The time scale of the simulated data represents the cumulative mass deposited on the surface for one hour. The pollutant is blown in a direction controlled by the large-scale atmospheric inflow winds expressed as wind speed (ws), which varies from 0.5 to 15 m/s, and wind direction (wd), which can be anywhere in the interval [0,360) degrees following standard mathematical convention. The files “inputs_15k_train.npy” and “inputs_1k_test.npy”, however, includes wu = ws cos(wd) and wv = ws sen(wd), the wind velocity components projected onto the x and y axes. We assume that the spatial patterns were collected by a hypothetical imaging device that records the magnitude of the logarithm of deposition as a red, green, and blue (RGB) color image with channels containing integer values ranging from 0 to 255. The goal is to predict a deposition image given its associated release location and wind velocity (four scalar quantities). In other words, we are interested in the following mapping: [sx,sy,wu,wv]→[height×width×RGB channel]. See [1]. The data is obtained from simulations and later post-processed to make it adequate for machine learning training. Given large-scale winds as an inflow boundary condition, the CFD code Aeolus [2] uses millions of grid cells to simulate fluid flow and material transport in complex, three-dimensional environments at high resolution, accounting for turbulenc...