Tuesday, April 8, 2014

Scenes - Objects [Navigation]

    Janzen & van Turennout, Selective Neural Representation of Objects Relevant for Navigation, Nature Neuroscience, 2004

36 comments:

  1. What I liked about the paper is the transition from the readings so far. Specifically, the authors show that the PPA, which is selective for scenes, can also be activated by navigationally relevant objects, which marks a big change in the ideas we've come across about the PPA.

  2. I think my favorite part was that they compared forgotten and not-forgotten objects. I think this is an important point to consider for vision: should we reward systems that act more like a human by ignoring things that are irrelevant? We usually try to classify everything possible, but maybe being more selective about things relevant to a task like navigation would be a good trait in a vision system.

    Replies
    1. I thought it was very interesting that objects reported as forgotten, but located at important junctions in the maze, still yielded a strong response. Clearly the object was not actually forgotten (part of the brain is still responding strongly to it), but somehow it isn't making it up to the conscious brain. I wonder if this would change if the images were presented for longer than 500ms.

    2. I really like this question: "should we reward systems that act more like a human by ignoring things that are irrelevant".

      Frequently, I think there's an emphasis in computer vision on explaining everything in a particular scene (even in cases where a human given huge amounts of time would have trouble getting everything), when in reality we might be better off understanding 30% of the scene really well (and knowing which 30% we understand). Object detection handles this fairly well by flagging "difficult" objects in Pascal VOC, as far as I remember.
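
      Roughly what I mean, as a toy sketch in Python (not the actual VOC devkit code, and the dict keys "score"/"box" are just my own made-up convention): when matching detections to ground truth, boxes flagged "difficult" are simply taken out of the accounting, so a detector is neither rewarded for finding them nor penalized for missing them.

        def iou(box_a, box_b):
            """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
            ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
            ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
            inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
            area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
            area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
            return inter / (area_a + area_b - inter) if inter > 0 else 0.0

        def score_detections(detections, gt_boxes, difficult, iou_thresh=0.5):
            """Count true/false positives while ignoring ground truth flagged
            'difficult': matching such a box is neither rewarded nor penalized."""
            matched = [False] * len(gt_boxes)
            tp = fp = 0
            for det in sorted(detections, key=lambda d: -d["score"]):
                best, best_iou = None, iou_thresh
                for i, gt in enumerate(gt_boxes):
                    overlap = iou(det["box"], gt)
                    if overlap >= best_iou and not matched[i]:
                        best, best_iou = i, overlap
                if best is None:
                    fp += 1               # matched nothing -> false positive
                elif difficult[best]:
                    matched[best] = True  # matched a 'difficult' box -> ignored entirely
                else:
                    matched[best] = True
                    tp += 1               # matched a countable box -> true positive
            n_countable = sum(1 for flag in difficult if not flag)
            return tp, fp, n_countable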

    3. I find it fascinating that the brain still responded to objects at important junctions that the subjects couldn't recall, and what that means for navigation purposes. I like Ishan's analogy with corner detection - it does seem enticing that the brain learns to associate these 'decision objects' as a sort of feature that fires for automatic retrieval. As always, I'm interested in knowing how long this effect lasts. Additionally, I wonder how different the responses would have been if, at the end, subjects were dropped off at a point in the maze and had to resume the toy tour from there by actually moving through the museum rather than just passively watching a video.

      With regard to what Allie and David have to say, I completely agree. Humans are extremely efficient at 'zoning out' the endless stream of data that our senses provide, and it makes a lot of sense to run some sort of anomaly detector that fires only when things are out of the ordinary. Slightly off-topic, but similar reasoning has led to a pretty fancy camera developed at the University of Zurich that only detects changes in the visual field, much like a retina, effectively allowing very high-speed vision at very low latency. Look up the Dynamic Vision Sensor if you're interested.
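
      To make that change-only idea concrete, here is a toy sketch in Python of the principle (the real DVS works asynchronously per pixel in hardware; this is just the gist, assuming frames are float arrays of intensities):

        import numpy as np

        def change_events(prev_frame, frame, threshold=0.15):
            """Toy model of an event camera: return (row, col, polarity) tuples
            for pixels whose log-intensity changed by more than `threshold`;
            unchanged pixels produce no output at all."""
            eps = 1e-3                                  # avoid log(0)
            diff = np.log(frame + eps) - np.log(prev_frame + eps)
            rows, cols = np.nonzero(np.abs(diff) > threshold)
            polarity = np.sign(diff[rows, cols]).astype(int)
            return list(zip(rows.tolist(), cols.tolist(), polarity.tolist()))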

    4. I think it is important to think of biological vision as a means to an end; organisms do not need a "complete" understanding of a scene, they just need something that accomplishes their goal of survival at the end of the day. Rather than scene "understanding" in general, it looks like in this case the PPA is doing scene understanding for the sake of navigation.

    5. I think there are two issues. One is whether a human really ignores things that are irrelevant. I personally think that at the early stages of information processing, humans encode and maintain as much information as possible. It is at the later, decision-related stages that certain information is highlighted while the rest is strongly suppressed. This is task-dependent and processed unconsciously. That is, even if a certain object is forgotten in the navigation task, another scenario would probably remind us that we have seen it before. The other issue is whether we should build machine vision systems according to the human visual system, given that the two are quite different in, for example, computational ability and memory capacity.

  3. This comment has been removed by the author.

  4. One question I am left with is whether or not a similar signal would be apparent for objects associated with non-spatial decision making. It seems they showed that objects associated with a clear choice between paths in a maze produce a stronger signal, but could this be repeated if the object were associated with some other kind of decision? For example, if the subjects were trained to press certain buttons to generate various actions, would objects presented in conjunction with that elicit similar results? I guess the question is whether this is related to spatial paths or to decision making in general.

    Replies
    1. I am also curious about this. In a previous paper we read about how manipulable objects and non-manipulable objects are treated differently. I wonder if there is any similarity in the mechanisms behind these two?

      On one hand, navigation and manipulation seem to be such different tasks that it seems natural that the underlying mechanisms would be different. But then again, both are cases where extra attention is paid to objects due to their relation with actions.

    2. The second paper for today addresses this question by looking not just at navigational context but at more general contextual associations.

  5. I'm puzzled by one thing, and I was hoping someone might be able to fill me in. From what I can tell, their analysis is based on these beta variables, which are per-voxel. Their methods say they fit a GLM. I get what the X might be in y = X \beta + \epsilon. Can someone please explain what y is, and whether y changes per experiment?

    Replies
    1. This is my guess, which may be totally wrong, but let me just put it out there.
      It's based on this sentence from the paper (pg 5):
      "Event-related responses .. response function"
      I am guessing they are doing some sort of least-squares curve fitting. So we can treat the hemodynamic response as "delta functions" -> points through which we fit a curve (a line for the GLM), getting the residual "epsilon". The "betas" tell you which "delta functions" contribute and how much.
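
      To make my guess concrete, here is a toy sketch in Python of what I picture (definitely not the authors' actual pipeline; the HRF below is just a crude gamma-like bump): build one regressor per condition by convolving its event train with a hemodynamic response function, then solve for the per-voxel betas by least squares.

        import numpy as np

        def toy_hrf(length=16):
            """Crude stand-in for a canonical hemodynamic response function."""
            t = np.arange(length, dtype=float)
            return t ** 5 * np.exp(-t) / 120.0          # gamma-like bump peaking around t = 5

        def design_matrix(event_onsets, n_timepoints):
            """One regressor per condition: a delta train at the event onsets
            convolved with the HRF (the 'curve through delta functions' idea)."""
            X = np.zeros((n_timepoints, len(event_onsets)))
            hrf = toy_hrf()
            for j, onsets in enumerate(event_onsets):
                deltas = np.zeros(n_timepoints)
                deltas[onsets] = 1.0
                X[:, j] = np.convolve(deltas, hrf)[:n_timepoints]
            return X

        def fit_voxel(y, X):
            """y is one voxel's BOLD time course; the betas say how strongly each
            condition's predicted response contributes to that voxel's signal."""
            betas, *_ = np.linalg.lstsq(X, y, rcond=None)
            return betas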

    2. Then the y is just how much response you get? But didn't some earlier paper (not sure which one) argue that the amount of signal wasn't actually important? Oh well, I mean, assuming the y is a reasonable response variable, this paper makes sense. And I'm not an expert, so I'm going to have to defer to the authors and assume it's a reasonable response variable...

    3. Y is the BOLD (fMRI signal) response. This was an earlier paper, from when the majority of fMRI studies only looked at the overall amplitude/strength of activity using GLM statistics. Nowadays the field is much more centered on MVPA analyses. The two kinds of analysis ask different questions and make different assumptions about the underlying encoding and its role in the cognitive process. So they don't always line up, but sometimes they do. I wouldn't say one is important and the other isn't; they are just two different ways of doing the analysis.
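
      For contrast, here is a minimal sketch in Python of what an MVPA-style analysis looks like (scikit-learn on made-up data, nothing from this paper): instead of asking whether a region's overall amplitude differs between conditions, you train a classifier on the multi-voxel pattern and ask whether the conditions can be decoded above chance.

        import numpy as np
        from sklearn.svm import LinearSVC
        from sklearn.model_selection import cross_val_score

        # Fake data: one row per trial (the voxel pattern for that trial),
        # one label per trial (e.g. 0 = decision-point object, 1 = non-decision-point).
        rng = np.random.default_rng(0)
        X = rng.normal(size=(80, 200))      # 80 trials x 200 voxels
        y = np.repeat([0, 1], 40)

        # Above-chance cross-validated accuracy would suggest the activity *pattern*
        # carries condition information, even if the mean amplitude is identical.
        acc = cross_val_score(LinearSVC(), X, y, cv=5).mean()
        print(f"decoding accuracy: {acc:.2f}")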

    4. Ah ok -- as I said, I totally believed that it made sense, but I was just curious, especially given that most of the papers we've read haven't used this approach. I'm actually now quite curious what prompted the shift to the MVPA paradigm since then.

  6. This comment has been removed by the author.

  7. I liked reading this paper and their result. To me this recall of objects at "decision points" seems very intuitive because of the nice "corner-like" 3D structure of the decision points. I am not entirely convinced it is because of the "navigational" aspect, and I think the authors are overselling it. I bet the same would hold in an experiment where you put 2D objects in rectangles: people would remember better the ones that are closer to corners. It is just that visually you have the corner to latch on to, and can thus create an association. Is it because you want to navigate? I do not know.

    Replies
    1. This comment has been removed by the author.

    2. I agree. The primary task given to the subjects was to remember the route, and a secondary task was to focus on the toy objects. It seems intuitive that while remembering the routes one would also be focusing on things like "where can I walk in this scene?" and "where are the walls?"; essentially looking at the 3D spatial layout of the scene along with the landmarks. In the last discussion it was concluded that the PPA encodes both spatial and object information. During the recognition of objects, it might be the spatial association in which the objects were seen earlier that helps in recalling whether they have been seen before. The task of navigation may be biasing the subjects to latch on to the spatial associations, though.

    3. I guess they tried to control (somewhat) for this by having turns as the non-decision points, which is at least better than just placing objects along a straight corridor.

  8. It was interesting to see that the paper was organized to look at the information encoded in the brain specific to the task of navigation. This point has been brought up a number of times in class: the fMRI signals we analyze might be task-dependent. However, during the fMRI experiment the objects were shown in isolation and not in the context of the scenes seen during the virtual tour. I wonder if anybody else had concerns about this?

    Replies
    1. I think their aim was to measure the PPA response to the object alone. As we know, the PPA is also responsive to scenes, so if the subjects were shown images with scenes, it would be difficult to know whether the response is due to the scene or the object.

    2. This comment has been removed by the author.

    3. I feel that the authors should take the task one step further. Half of the subjects should be instructed to actively explore the maze sequence from inside the scanner, while the rest do passive exploration. If navigation-related objects are treated differently in the PPA, then I feel the active-exploration subjects would show an even higher increase compared to the passive-exploration subjects.

      Attention cannot be controlled in the active-exploration setup, but the passive-exploration setup shows that attention in fact leads to a decrease in activation.

      The above suggestion is based on our day-to-day experience that people routing themselves know the routes much better than people sitting in the backseat.

    4. I was also curious about this active-navigation setup (although then it's so much harder to control things), for exactly the same reason -- in my experience, you don't actually learn to navigate well from the backseat. Certain family drives that were totally familiar for years became totally unfamiliar when I actually had to do them myself.

  9. This comment has been removed by the author.

  10. I really liked the paper. They have shown that the brain automatically distinguishes between objects at navigationally relevant and irrelevant locations. But I feel that it might be more than just navigation; maybe our brain distinguishes objects on the basis of the task. They should have added one more task that was not related to navigation while viewing the objects, and compared its response with the navigation task and with no task at all.

  11. This paper reminds me of work on robot path following. The researchers represented a path for the robot to follow as a lookup table of feature points and the movement directions associated with them. At test time, the robot searches for feature points in the image and, upon matching one from the lookup table, simply moves in the direction stored for that feature point. (I don't remember the details of how they made the feature matching robust.) This method was able to travel significantly large distances (~1 km) with very small errors (~1 cm) on a quadcopter. Does anyone have a link to this work?
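
    To be a bit more concrete about the scheme as I remember it, here is a rough sketch in Python (hypothetical class and method names, descriptor extraction such as SIFT/ORB left abstract; this is my reconstruction, not the actual system): store (descriptor, direction) pairs while teaching the path, then at test time match the current view's descriptors against the table and move in the stored directions of the best matches.

      import numpy as np

      class FeatureRouteFollower:
          """Toy teach-and-repeat path follower: a lookup table mapping stored
          feature descriptors to the movement directions recorded with them."""

          def __init__(self):
              self.descriptors = []   # 1-D descriptor vectors seen while teaching
              self.directions = []    # movement direction recorded with each one

          def teach(self, descriptor, direction):
              self.descriptors.append(np.asarray(descriptor, dtype=float))
              self.directions.append(np.asarray(direction, dtype=float))

          def step(self, query_descriptors):
              """Match each query descriptor to its nearest stored neighbor and
              return the average of the associated movement directions."""
              table = np.stack(self.descriptors)
              votes = []
              for q in query_descriptors:
                  dists = np.linalg.norm(table - np.asarray(q, dtype=float), axis=1)
                  votes.append(self.directions[int(np.argmin(dists))])
              return np.mean(votes, axis=0)   # crude vote over the matched landmarks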

  12. Is navigational relevance a principle for learning routes on the fly, or is it a more general categorization mechanism for grouping objects? I think this question is interesting and could be answered by studying the PPA in the same subjects a week after the maze experiment. If the increased activity for objects at decision points is still seen, then it favors the hypothesis that navigational relevance is a general categorization mechanism used by the PPA.

  13. I like that this study involves looking at a series of images or video and is more applicable to robots.
    Personal liking aside, I would like to make a point:
    Correct me if I'm wrong, but is it wrong to extrapolate from this finding that what neuroscience vision studies measure strongly reflects human life experience rather than semantics? To take this to its most extreme conclusion: does the response in the PPA reflect only how individual humans encountered scenes and objects in the past, with similar responses grouped by similar real-world experience (objects encountered while navigating, objects encountered while using the hands)?
    It follows that a computer vision system replicating human vision must have a few basic functions and then be flexible enough to respond to influences from experience.
    I'm interested in seeing a study where a *normal* person is compared with someone having a higher level of expertise or interest in the field - for example, a toy enthusiast in this experiment. Will he/she have a different interest in the study and remember toys for being toys and not as navigational waypoints?

    Replies
    1. I think Elissa mentioned that they did something like this with London cab drivers for navigation; they had different hippocampi compared to the average person.

    2. Jacob, but let's think from an average human's perspective. Suppose someone is new to a place and is exploring the city --

      1. For the first k times (until things are firmly fixed in his/her memory), (s)he will look at scene text very carefully to navigate around. The visual cues of the surroundings begin to add up as k increases.

      2. After some time, the same person no longer looks around, and with a slight visual cue can navigate very easily in that same region. Probably it is memory that takes over.

      Some questions --
      1. Can we consider scene text as an object? Or, in general, can we consider text as an object, or are there different places in the brain responsible for recognizing text?

      2. Isn't it the case that, while navigating, visual cues are built on top of linguistic ones? (Probably it is just easier to navigate and describe places starting from text than to say there exists such-and-such a building which looks so and so.)

      3. But what if there is no text to begin with? Suppose I go to the remotest place in, say, India. The only way people can give directions in such a setting is through visual cues ("Go this way, you will see this and this. After that thing, take a turn"). The interesting thing is that our brain processes this information as well.

      4. It seems the brain is very flexible about the information given to it. It tries to pick up whatever works best in the situation. But where is it learning to process all this information? It is said that vision takes up a lot of the brain's processing, yet it seems to me that vision is one costly sensor that is helping make ends meet. There is something else going on that actually governs what to look at.

  14. One thing I like about this paper is its elegant and simple design, in particular the way they ruled out the alternative explanation that the reported effects could be due to more attention being paid to objects at the decision points. Attention is a serious confounding variable in many fMRI studies, and this seems a neat way to control for it. One issue I have trouble following is that the paper mentions allocentric spatial representation at the very beginning, so I assume their objects are in allocentric representations (object-to-object relations). However, watching the virtual museum film seems to me to tap egocentric representations (self-to-object relations). Maybe I am heavily influenced by studies, from both normal and patient (i.e., hemispatial neglect) populations, showing evidence that there are multiple spatial reference frames (allocentric and egocentric representations) in the brain, so I am not sure if I am taking the term allocentric here too seriously. Is the PPA sensitive to different kinds of spatial reference frames (egocentric vs. allocentric)?
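
    Just to pin down what I mean by the two reference frames, a toy example in Python (nothing to do with the paper's methods): an allocentric code stores object positions in a fixed world frame, an egocentric code stores the same positions relative to the observer's current position and heading, and going from one to the other is a rigid transform that changes every time the observer moves or turns.

      import numpy as np

      def allocentric_to_egocentric(obj_xy, observer_xy, observer_heading):
          """Express a world-frame (allocentric) 2-D position relative to the
          observer's own position and heading (egocentric); ego-x points forward."""
          dx, dy = np.asarray(obj_xy, dtype=float) - np.asarray(observer_xy, dtype=float)
          c, s = np.cos(-observer_heading), np.sin(-observer_heading)
          # rotate the world-frame offset into the observer's frame
          return np.array([c * dx - s * dy, s * dx + c * dy])

      # The same landmark keeps one allocentric position, but gets a new egocentric
      # one after every move or turn.
      print(allocentric_to_egocentric((3.0, 4.0), observer_xy=(1.0, 1.0), observer_heading=np.pi / 2))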

  15. From another perspective, I think what the PPA does for navigation is quite similar to sparse coding in lots of related tasks in computer vision: rule out unimportant or unnecessary information while highlighting the factors that are crucial to the current task.
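
    A toy sketch in Python of the analogy I have in mind (a generic ISTA iteration over some dictionary D, nothing from the paper): represent the input with as few dictionary elements as possible by penalizing nonzero coefficients, so most of the information is deliberately discarded and only a small, task-relevant subset is kept.

      import numpy as np

      def sparse_code(x, D, lam=0.1, n_iters=200):
          """Toy ISTA solver for  min_a  0.5 * ||x - D a||^2 + lam * ||a||_1,
          with D an (n_features x n_atoms) dictionary and x an input signal.
          Most entries of the returned code are driven to exactly zero."""
          L = np.linalg.norm(D, ord=2) ** 2          # Lipschitz constant of the gradient
          a = np.zeros(D.shape[1])
          for _ in range(n_iters):
              grad = D.T @ (D @ a - x)               # gradient of the quadratic term
              a = a - grad / L
              a = np.sign(a) * np.maximum(np.abs(a) - lam / L, 0.0)   # soft threshold
          return a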

  16. One takeaway for me from the discussion in class was how context serves as a glue for object and scene responses in the PPA. The open-ended discussion about solving vision 100% versus discarding data to optimize the time to complete a task (which is most likely what biological systems do to make decisions) was also interesting. I also think that although, as biological entities, we are always optimizing time with respect to a task, from a computer vision perspective we are capable of recognizing more given more time, and maybe even of making better decisions.
