Visual Neurons and Machines: How far does top-down travel? V1?

Sunday, April 27, 2014

How far does top-down travel? V1?

Murray et al., The representation of perceived angular size in human primary visual cortex. Nature Neuroscience 2012

16 comments:

UnknownApril 27, 2014 at 12:10 PM
I really liked this paper (ok, given my research, it's a bit predictable). I'm really curious what's actually going on here at a higher level and (more importantly) why.

Here's my first take: the apparently far-away object is given the increased processing in an attempt to ensure that processing power is roughly proportional to actual area (i.e., if the world were actually the 2.1D sketch and all the objects were flat cutouts along the camera axis).This makes some sense to me assuming that some form of grouping is already done: if you've got the 3D of a scene, your processing should somehow be scale independent.

I think what's puzzling is that this is happening at the feature extraction layer. But here's where I think it becomes dangerous to treat the human vision system as a biological camera+matlab setup: I really don't know how the system is hooked up, and I don't know how much surprise to register beyond the fact that top-down information is changing feature extraction. Is this because somehow the eye has top-down information via 3D about how large the object is, and is thus the V1 is extracting features as if the image has been processed to some common scale?

But, irrespective of why, I think the top-down changing of processing was quite interesting. Although it's hard (since the illusion's effect is per-person), I wish they had also done one where the apparent size is the same.
ReplyDelete
Replies
IshanApril 27, 2014 at 3:01 PM
The take-away from this paper is very interesting. This paper however does not spend time analyzing how the retinotopic area increases for far away objects.
Nonetheless, this depth-based feature extraction is something cool.
ReplyDelete
Replies
UnknownApril 27, 2014 at 10:05 PM
This is very interesting, but also seems very strange. I thought V1 is dealing with small 2D features, so I don't understand why depth would be telling V1 to expect a smaller image on the retina. This seems to be another victory for context though.
ReplyDelete
Replies
UnknownApril 27, 2014 at 10:10 PM
One other thing I noticed... The shadows of the spheres also reinforce the depth perspective. The close sphere has a shadow consistent with an overhead view of the object, where the far one is consistent with a glancing anglem. Some of their later experiments don't have this effect, but I wonder if the same illusion would hold if only the shadows were present, with no perspective lines.
ReplyDelete
Replies
M AravindhApril 28, 2014 at 5:07 AM
In the context of top down information affecting the behavior of neurons in the visual cortex, I found this paper interesting.
Feedforward, horizontal, and feedback processing in the visual
cortex, Victor AF Lamme, Hans Sup`er and Henk Spekreijse, Curr Opin Neurobiol. 1998 Aug;8(4):529-35. (http://www.ncbi.nlm.nih.gov/pubmed/9751656)
Among other things, this relatively old survey paper says that the receptive field for neurons in V1 can change due to horizontal connections.
ReplyDelete
Replies
M AravindhApril 28, 2014 at 6:28 AM
In an attempt to disagree with the authors intuition that V1 neurons are adjusting their receptive field based on depth estimates feedback from higher visual areas, I have come up with three alternative hypothesis.
1. The visual cortex could be a prediction machine. The feedforward process is followed by a feedback prediction of what the brain expects to see next. In the second stimulus, where the ball is further into the walls, the prediction corresponds to an increase in stimulus size as the ball rolls forward down the corridor. Whether or not this prediction uses a depth estimate is secondary.
2. The scene layout processed by higher visual areas can feedback to V1 to increase the receptive field of neurons farther from the center, making them fire. The higher level visual areas can capture the alignment of wall edges and the sphere (implicitly inferring the scene layout) and feedback to V1 to facilitate the illusion.
3. The neurons in V1 that are processing information at higher scales (more coarse edges) can laterally stimulate neurons further away from the center to facilitate affine rectification of the image. I'm basically trying to avoid going into the dorsal stream and then coming back into V1.
ReplyDelete
Replies
Yuxiong WangApril 28, 2014 at 8:18 AM
Previously, we most talk about the fMRI responses. It is quite interesting that the perception area of the receptive field is also adjusted according different scenarios. Is the perception area of the receptive field or other regions also a good evaluation criterion helpful for other tasks while not limited to depth estimation?
ReplyDelete
Replies

Add comment