Representative CAM outputs from two CNN models; images on the left, middle, and right indicate the original image, CAM heatmap from VGG-16, and EfficientNet, respectively. Our proposed method, CAM aggregation, enables to visualize unified representation of the diagnostic features over the entire bark. Regions highlighted in red denote the strongest activation from each model, which indicates the most relevant region for the prediction.