The error counts at the top left show how many 6-frame chunks each network has misclassified. When a method's color changes to red, its prediction does not match the ground truth.
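The bookkeeping behind such a display can be sketched as follows. This is a minimal illustration, not the code used to produce the visualization: the function names, the integer class labels, the network names, and the `"default"` color for correct predictions are all assumptions, as is the choice to evaluate the color once per chunk.

```python
from typing import Dict, Sequence


def count_chunk_errors(
    predictions: Dict[str, Sequence[int]],
    ground_truth: Sequence[int],
) -> Dict[str, int]:
    """Count misclassified chunks per network (hypothetical helper).

    Each sequence holds one class label per 6-frame chunk.
    """
    counts = {}
    for name, preds in predictions.items():
        if len(preds) != len(ground_truth):
            raise ValueError(f"{name}: expected {len(ground_truth)} chunks, got {len(preds)}")
        # A chunk counts as an error when the predicted label differs from the ground truth.
        counts[name] = sum(p != g for p, g in zip(preds, ground_truth))
    return counts


def method_colors(
    predictions: Dict[str, Sequence[int]],
    ground_truth: Sequence[int],
    chunk_index: int,
) -> Dict[str, str]:
    """Return a display color per network for one chunk (hypothetical helper).

    "red" marks a mismatch with the ground truth; "default" is an assumed
    placeholder for whatever color a correct prediction is drawn in.
    """
    target = ground_truth[chunk_index]
    return {
        name: "red" if preds[chunk_index] != target else "default"
        for name, preds in predictions.items()
    }


# Made-up labels for four chunks and two illustrative networks.
ground_truth = [0, 1, 1, 2]
predictions = {
    "net_a": [0, 1, 2, 2],
    "net_b": [0, 0, 2, 2],
}

error_counts = count_chunk_errors(predictions, ground_truth)
colors_chunk_1 = method_colors(predictions, ground_truth, chunk_index=1)
```

With these made-up labels, `net_a` is wrong on one chunk and `net_b` on two, and at chunk index 1 only `net_b` would be drawn in red.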
