This is an interesting paper. It holds a different viewpoint on the function of IT compared to the paper [Kriegeskorte et al] we studied last week. (Was that because their monkey volunteers came from a different place? :P) I have one concern about their experiments. If I understood correctly, they first performed clustering in each representation space (e.g. IT neuron representation, semantic, shape-based, low-level visual properties). Then, they counted the cluster overlap between the three candidate hypotheses and the IT neuron representation. My main concern is that the parameters used for clustering could strongly influence the results and the final conclusion. For example, they used k-means with K=15 for the shape-based representation. Since K directly determines the granularity of the clusters, it may change the overlap results. The same question applies to the low-level visual properties: why were 15 images selected for each category? If I change that number, will it influence the results? It seems to me they were essentially trying to estimate the correlation between different feature representations. Can we do that directly in the "raw" feature space (i.e. the raw IT responses) rather than in the clustered space?
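On that last point, one clustering-free way to compare spaces is representational similarity analysis: build a dissimilarity matrix per space and correlate the two. A minimal sketch, with made-up array shapes (the 94-neuron and 500-feature dimensions are my placeholders, not numbers from the paper):

```python
# Hypothetical sketch (array names and sizes are made up): compare two
# representations directly in "raw" feature space via representational
# dissimilarity matrices (RDMs), with no clustering step at all.
import numpy as np
from scipy.spatial.distance import pdist
from scipy.stats import spearmanr

rng = np.random.default_rng(0)
n_objects = 213                                     # stimuli in the paper
it_responses = rng.normal(size=(n_objects, 94))     # stand-in IT population vectors
shape_features = rng.normal(size=(n_objects, 500))  # stand-in model (e.g. C2) outputs

# One RDM per space: pairwise dissimilarities between object representations.
rdm_it = pdist(it_responses, metric="correlation")
rdm_shape = pdist(shape_features, metric="correlation")

# Rank-correlate the two RDMs: a clustering-free measure of how similarly
# the two spaces organize the same 213 objects.
rho, _ = spearmanr(rdm_it, rdm_shape)
```

This avoids committing to any K at all, at the cost of losing the interpretable category labels that the clustering-based comparison provides.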
I have a similar concern about the number of images for the low-level visual properties, and about what the respective categories are. The paper refers to 'Materials and Methods' for the details, but unfortunately I could not find any link to it in the paper. So I looked on the author's webpage for supplementary material; please see the link (below). On it, check out Supporting Information Fig. C. It seems to me that the 213 objects were chosen so that there are at least 15 of them in each category. I think they could surely add the remaining (213 - 120) objects to one of these categories. Maybe they chose 15 for the sake of simplicity, OR since 15 might be the lowest number of images that fits in each category, they settled on that.
Sorry. Link is http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1003167#s5
More insight about the clustering hypotheses is on Page 18 ("Clustering Hypotheses"). Pasting here:

"Clustering hypotheses. The neuronal-based object clusters produced by the algorithms described above were compared to object categories obtained according to three different clustering hypotheses: 1) shared semantic membership; 2) shared shape features; and 3) shared low-level visual properties.

Eleven semantic categories (shown in Fig. S1A) were built according to the criteria established in . These categories were further grouped into the two superordinate categories of animate and inanimate objects.

Fifteen categories of objects sharing shape features (shown in Fig. S1B) were obtained as the result of object clustering in the output layer of a well-known hierarchical model of object recognition , , . For our application, we have chosen the version of the model described in  (and downloaded from http://www.mit.edu/~jmutch/fhlib/ – version 8), which consists of four layers of artificial neural units named S1, C1, S2, and C2. Units S1 are a bank of Gabor filters with various orientations, spatial frequencies, positions and scales. Units C1 implement an OR-like operation on subsets of S1 afferent units, having the same orientation tuning but in different positions/scales. Units S2 perform a template matching (AND-like) operation on subsets of C1 afferent units to gain tuning for a particular combination of visual features. In this version of the model, the templates to which these units are tuned are random patches of images taken from the Caltech 101 database (different S2 units are built having as a template the same image patch, but at different positions and scales). In the output layer of the model, C2 units perform again an OR-like operation on subsets of S2 afferent units tuned for the same image patch, but at different positions and scales. In our instantiation of the model, 24,451 C2 output units were built.
These units convey the more explicit (i.e., more shape selective and position/scale tolerant) representation of visual objects provided by the model. They could therefore be used to assess the similarity of our visual objects at the level of shared middle- to high-level shape features. This was achieved by running a k-means clustering algorithm over the representation of our object set provided by the model's output units, so as to obtain 15 groups of objects with similar features. The number of groups was set to 15 to match the optimal number of k-means clusters found in the IT neuronal representation using the BIC and AIC criteria (see previous section).

Eight categories of objects sharing low-level visual properties (shown in Fig. S1C) were defined on the basis of four global properties of the images of the objects – luminance, contrast, area and aspect ratio. Each category contained the 15 images having either the highest or the lowest values of one such property, which were defined as follows. Luminance was defined as the average pixel intensity of the object image, divided by the maximum of the grayscale range (i.e., 255). Area was defined as the fraction of pixels, in the image frame, that was occupied by the image of the object. Note that object area, as defined here, is different from object size, which was fixed to ~2° of visual angle for all the objects. Contrast was defined as: (median(pixels>128) − median(pixels<128)) / (median(pixels>128) + median(pixels<128)). Aspect ratio was defined as the maximum, across all possible rotations, of the height of an object image divided by its width."
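For concreteness, here is how I read those four property definitions in code, on a synthetic image. Two assumptions are mine, not the paper's: I take "average pixel intensity of the object image" to mean the mean over object pixels, and I approximate "maximum across all rotations" with just the 0° and 90° orientations.

```python
# My reading of the four property definitions quoted above, applied to a
# synthetic 100x100 grayscale image (a bright rectangle on a black frame).
import numpy as np

img = np.zeros((100, 100), dtype=float)
img[40:60, 20:80] = 200.0            # the "object": a 20x60 rectangle
obj = img > 0                        # object pixels vs background

luminance = img[obj].mean() / 255.0  # mean object intensity / top of grayscale range
area = obj.mean()                    # fraction of the frame covered by the object

# Contrast = (median(pixels>128) - median(pixels<128)) /
#            (median(pixels>128) + median(pixels<128))
hi = np.median(img[img > 128])
lo = np.median(img[img < 128])
contrast = (hi - lo) / (hi + lo)

# Aspect ratio: bounding-box height/width, maximized over rotations
# (crudely approximated here by the two axis-aligned orientations).
rows = np.where(obj.any(axis=1))[0]
cols = np.where(obj.any(axis=0))[0]
h, w = rows[-1] - rows[0] + 1, cols[-1] - cols[0] + 1
aspect_ratio = max(h / w, w / h)
```

For this toy rectangle the numbers come out as luminance ≈ 0.78, area = 0.12, contrast = 1.0, aspect ratio = 3.0, which makes it easy to sanity-check each definition.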
While the clustering parameters would matter, I think they try to control for the choice of algorithm and its parameters by trying multiple algorithms (DST, etc.). Also, the numbers of shape and IT clusters look similar to me.
They claim that the optimal number of k-means clusters (15) was determined by the Bayesian Information Criterion and the Akaike Information Criterion, which are criteria that balance goodness of fit against model complexity (overfitting).
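To make that trade-off concrete, here is a small sketch of picking k by BIC. The paper applies BIC/AIC to k-means; I use scikit-learn's GaussianMixture instead, since its `.bic()` implements exactly the fit-vs-complexity penalty being discussed. The data are synthetic, so the specifics are illustrative only:

```python
# Illustrative sketch of picking the number of clusters with an information
# criterion: the best k minimizes BIC = -2*log-likelihood + penalty on the
# number of parameters. Synthetic data: 3 well-separated 2D clusters.
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(1)
X = np.vstack([rng.normal(loc=c, scale=0.3, size=(50, 2))
               for c in ([0, 0], [5, 5], [0, 5])])

# Fit candidate models and keep the k with the lowest BIC.
bic = {k: GaussianMixture(n_components=k, random_state=0).fit(X).bic(X)
       for k in range(1, 7)}
best_k = min(bic, key=bic.get)       # should recover k = 3 here
```

The same scan over k is presumably what produced the paper's 15: past the true structure, extra clusters buy too little likelihood to pay their parameter penalty.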
Yes, I was rather troubled to see that they were shooting holes in the gorgeous dissimilarity matrices from [Kriegeskorte et al]. So sad... they were so nice...
I think one of the main reasons for the disagreement with Kriegeskorte et al. (and Kiani et al.), which the authors explain multiple times, is the statistical analysis of the overlap between semantic and neural clusters. Specifically, one example from the paper (Pg 8): "compensating for existence of multiple (very similar) exemplars of the same object (i.e. twins)". The importance of this distinction can clearly be seen in all the tables, as well as Figure 6. For me, the main disagreement the authors want to highlight is over animate vs. inanimate objects.
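A toy illustration of the "twins" point (my own construction with made-up object names, not the paper's exact statistic): near-duplicate exemplars inflate the raw count of objects shared between a neural cluster and a semantic category, so collapsing twins before counting changes the overlap that enters the significance test.

```python
# Toy illustration: a neural cluster that merely groups near-duplicate
# images ("twins") looks like a strong semantic cluster in the raw count.
semantic_category = {"face1a", "face1b", "face2a", "face2b", "dog"}
neural_cluster = {"face1a", "face1b", "face2a", "face2b"}

naive_overlap = len(neural_cluster & semantic_category)   # counts each twin separately

# Collapse each twin pair onto one canonical exemplar before counting.
twins = {"face1b": "face1a", "face2b": "face2a"}
def canon(objects):
    return {twins.get(o, o) for o in objects}

adjusted_overlap = len(canon(neural_cluster) & canon(semantic_category))
# The overlap entering the significance test drops from 4 to 2 objects.
```

This is why the compensated analysis can flip a cluster from "significantly semantic" to "not significant" without the clustering itself changing.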
Figure 5 interested me the most, since it showed a clear correlation between shape and cluster structure. However, I also have some questions about the choice of images. If high-level semantic information is represented in IT, then we would expect the neural responses to be invariant to things such as lighting and viewpoint. However, it seems that among the 213 images used in the paper, images that share a semantic category also more or less share the same viewpoint. For example, all faces were front views, all animals were side views, and so on. It might be that because of this, invariance to viewpoint could not be demonstrated, and sensitivity to shape became much more noticeable. In that sense, the choice of images seems to favor shape-based categories.
Having low-level features represented in IT is the most worrisome part for me. Low-level features are supposed to be processed in V1-V4, and therefore the IT representation should be invariant to them. I think what you suggest is some kind of bias in the dataset... maybe knowing exactly how the dataset was formed is important in this case.
My knowledge of IT is not perfect, but I think variance with low-level features in IT contradicts the findings of a lot (decades) of previous work. It seems really strange.
I agree with the comments. These insights seem to suggest that additional processing of low-level features happens here(?). This might be a similar idea to one of the previous papers, where the authors show distributed processing along with local neural activity.
I agree with Anirudh's point. However, isn't it weird that the coarse clustering of objects depends on low-level properties while the fine-grained structure depends on higher-level shape features? It seems counter-intuitive to me.
Yeah, that is strange, because I thought that [Kriegeskorte et al] clearly showed that their data from IT could NOT be explained by low-level feature processing.
I think some overlap with low level similarity can also be explained by visual similarity. However, the FLD analysis in Fig 6 for pruned low level categories is indeed perplexing, e.g., the high aspect ratio cluster can be seen as a shape based cluster (thin objects) whereas I couldn't assign a shape "name" to the high luminance cluster.
I agree. This possibility should be ruled out, given the intra-class variation within a semantic category. They should either include objects with different viewpoints and shape variations, or at least report results when viewpoint and shape are varied.
This is an intriguing paper. The authors contend that high-level as well as low-level visual features account for the representations in IT cortex, rather than abstract, semantic membership. Two concerns: 1. I think one should be careful about the semantic vs. shape representation dichotomy. It may be that IT has semantic representations for *some* objects but not *all* visual objects. As the authors note, there was some evidence for a cluster of four-legged animals, as well as perhaps birds. 2. Clustering of faces is due to visual similarity, not semantics. However, they did not record in face-selective areas. Is it possible that this generalizes to other categories; i.e., are there actually specific sub-areas of IT which encode semantic categories?
It definitely does not rule out the possibility; in fact, they cite papers stating that "neurons in higher order areas of both streams can learn to encode general categorical associations between arbitrary visual patterns". I really like a point the authors make in the paper, which I think is one of the main takeaways: "Shape similarity among members of the same semantic category...can easily lead to an overestimation of how well semantic membership is represented in the visual cortex". Since objects that fall under the same semantic category also look very similar, we run the risk of over-generalizing.
I agree with Shaurya's point. Even the categories mentioned in the paper have shape similarity. Generally, animate objects are complex, whereas inanimate objects can be generalized to standard shapes, since they are man-made.
I agree with the concerns that Jacob raises here. Specifically, it is possible that the test subjects' brains had not learned several of the inanimate and animate objects presented in the study, and hence discern them as just shapes in higher-level brain areas instead of "understanding" them as semantic categories. The authors apparently probed only a small section of the cortex, with electrodes recording directly from neurons. What would the result be if they ran fMRI studies with the same dataset?
Although the authors mention this point, I am still not entirely convinced about the monkey conditioning part. And to add to the confusion, I can't really understand how exactly the representations of objects were separated as semantic, shape-based, etc. for the monkeys. It seems a bit dicey to impose such structures on this abstract space.
Does that imply that you think different monkey brains would organize category representations differently based on their experienced stimuli? I think it's fairly reasonable to assume that their monkeys interacted with a large number of inanimate and animate objects outside the domain of the experiment (they are lab monkeys, after all, and so were the monkeys in the other paper). Yes, I agree on the latter aspect regarding the representation of abstract concepts in monkey brains. Although there seems to be research suggesting that primate brains are capable of comprehending abstract concepts (like math), I'm not sure whether they use semantic information, or whether their recognition wiring is as complicated as ours.
The previous semantics-related studies were done on monkeys, so I think it was the right thing to run the shape-related experiments on monkeys as well, in order to counter the previous results. I also feel that it is very difficult to separate objects on the basis of semantics for a monkey; shape is still understandable.
The separation of objects based on semantics, shape, etc. was done on the dataset (not on the monkeys). This separation/clustering was then compared to various clusterings and classifications computed from the monkeys' neural data. I hope this clears some of the confusion.
I agree with Ishan on this part -- the authors didn't have to assume that the brain uses these clusters as the primary method for organizing visual information; they simply chose the different groups for comparison purposes. To bring up your first statement, Priyan, how would you change the monkey conditioning? Would you show shapes that the monkeys had likely never seen before during the testing phase? I would guess your problem with it is that the training of the monkey and the training of the neural representation might be inherently linked (?)
It was interesting to note that grayscale images of natural objects were used for this study. I don't know how reliable this is, but Wikipedia (http://en.wikipedia.org/wiki/Inferotemporal_cortex) states that the IT is specifically involved in the processing of color to determine "what" from the visual stimuli. Although the authors do mention this concern while comparing to other studies which use color images as input (Page 15). There must be a way to look at the color and shape interaction in the IT, which they seem to suggest is difficult to quantify. They agree this may be a major reason why their results differ from past studies.
I think their claim is quite valid in that colour can strongly influence detection, and it is difficult to disentangle clustering. Since as you mention the IT is triggered to respond to colour, what prevents us from making erroneous correlations with neurons that may be otherwise firing to colour in the stimuli? I'm not sure what a reasonable control experiment could be - clearly plain colour fields will not cut it, because there must be pathways to distinguish gradients and patterns in coloured stimuli.
I am not sure, but they could show color and grayscale images of the same object and compare the neurons' responses in both cases. I think this might factor out the effect of the color stimuli.
I wonder if it would be possible to find sets of color images of different objects with similar color histograms. I'm not sure that would help though.
The fact that the area is good at distinguishing color is different from saying it only responds to color images. I'm not sure which is true.
One of the things I gained was that the fMRI approach might be giving us wrong conclusions. Directly probing neurons is uncovering that semantic categories might not be the deciding factor for specificity in brain regions. However, that said, I have two doubts: 1. Could the fact that there is no specificity or semantic separation for inanimate objects be because the monkeys haven't learned to manipulate them or have other interactions with them? We know brain-region specificity might be derived from connectivity with other brain regions that process motor movements, emotions, etc. I'm not convinced that the lab monkeys used in this experiment have had sufficient interactions with the inanimate objects in this study. Hence, can we compare this study with the fMRI studies from the previous two classes? 2. Could the limited area of the high-level brain regions being sampled, or other experimental conditions, explain the shape-selectivity conclusion?