12 classes: bannock, chapati, pita, salad, salsa, saute, ketchup, chutney, limpa, strawberry, margarine, shortcake
The breads appear at first level/composition. The different sides and deserts follows at later levels. See Section 4.3 in main paper for exhaustive discussion on this hierarchy.
10 classes: cow, insectivore, hound, puppy, garden spider, ptarmigan, phalanger, killer whale, green lizard, kangaroo
This is a versatile set of classes i.e., the contextual information (like background, typical sets of objects that are found etc.) for cow, hound may be drastically different from kangaroo, ptarmigan.
Clearly, the most distinct classes -- green lizard, killer whale are resolved at highest levels. The composition at first level involves those classes which share context -- there is most certainly grass or trees ot bark in these images.
Given the first level, the graph shows that kangaroo is most expressible by the first level composition, compared to green lizard -- implying a clear sense of hierarchy in learned representations.
11 classes: male horse, war horse, pony, mule, elk, blackbuck, deer, goat, bison, sheep, zebra
The categories that MMF picked for first level compositions are highly correlated to begin with, both with respect to the texture/pixel-values as well as the background (most show up with grassy or landscape background).
It is reasonable to say that human perception would imply that blackbuck, deer, goat are most related to the first level compositions -- which were composed at the next level.
zebra clearly is the most distnct one, implying that many compositions of the first and second level categories may not really infer what constitutes a zebra.
12 classes: rule, rack, squach, stick, ski, sheet, couch, table, sail, roller, pot, toilet seat
8 of these 12 classes are sports accessories/objects and the rest (pot, couch, sheet, toilet seat) are more household-type categories.
This was done to see what the MMF will do, if it is forced to pick hierarchy among arbitrary classes. Clearly, pot shows up at last level.
The first level compositions are 5 of the sports related classes. Interestingly couch is closer in composition to the first lvel than a roller coaster or sail, which may be because the context of four of the first level
classes is that they are `indoors'. sail and roller coaster have sky and water as background respectively, making them distinct enough from the first level composition, and pushing couch or toilet seat instead.
10 classes: bag, bathtub, basket, blackboard, box, bench, building, bottle, ball, basketball
This set includes 5 different household/kitchen classes and at least two outlier classes -- ball and basketball -- which belong to sports.
Clearly, they showed up at last level of the factorization. The first level composition is most informing bench and building rather than bottle, which is interesting.
Many of the type of inferences made with earlier set of 12 classes (and its visualization) still hold here, expecially those with respect to outlier/peculier classes that may not directly relate to the rest of the bunch on which MMF is operating.