Influence of sketches of different sizes on the runtime performance, at similarity threshold 0.5.
The red dot marks the best configuration for a given algorithm. Each point is labeled with the running time in minutes. When a value is missing, the corresponding experiment crashed due to exceeding the memory capacity of our cluster.
This is the full version of Figure 3 in the paper.
The red dot marks the best configuration for a given algorithm. Each point is labeled with the running time in minutes. When a value is missing, the corresponding experiment crashed due to exceeding the memory capacity of our cluster.
This is the full version of Figure 3 in the paper.