Well, I think it's primarily a data-gathering exercise: you have a picture and a human answer (i.e. whether or not the object in question is in the picture). Those picture/answer pairs can then be used as labeled examples for supervised learning (to train the forthcoming generation of robot overlords).
Kind of replacing the Turing Test?
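
To make the data-gathering idea concrete, here's a rough sketch of how picture/answer pairs might be turned into training labels. Everything in it is my own assumption for illustration: the function name `build_labeled_dataset`, the `min_votes` threshold, and the majority-vote rule are hypothetical, not how any particular system actually does it.

```python
from collections import Counter, defaultdict

def build_labeled_dataset(responses, min_votes=3):
    """Collapse raw (image_id, answer) pairs into one label per image.

    Assumption: each image is shown to several people, and the majority
    answer is taken as the label. Images with too few answers, or with a
    tied vote, are left out rather than guessed at.
    """
    votes = defaultdict(list)
    for image_id, answer in responses:
        votes[image_id].append(bool(answer))

    dataset = {}
    for image_id, answers in votes.items():
        if len(answers) < min_votes:
            continue  # not enough human answers to trust yet
        counts = Counter(answers)
        yes, no = counts[True], counts[False]
        if yes == no:
            continue  # tie: no clear majority
        dataset[image_id] = yes > no
    return dataset

# Hypothetical answers: True means "yes, the object is in the picture".
responses = [
    ("img1", True), ("img1", True), ("img1", False),
    ("img2", False),
    ("img3", True), ("img3", False), ("img3", True), ("img3", False),
]
labels = build_labeled_dataset(responses)
```

The resulting `labels` mapping (image id to yes/no) is the kind of thing you'd hand to a supervised learner as ground truth.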