Добавил:
kiopkiopkiop18@yandex.ru t.me/Prokururor I Вовсе не секретарь, но почту проверяю Опубликованный материал нарушает ваши авторские права? Сообщите нам.
Вуз: Предмет: Файл:

Ординатура / Офтальмология / Английские материалы / Eye Movements A Window on Mind and Brain_Van Gompel_2007

.pdf
Скачиваний:
0
Добавлен:
28.03.2026
Размер:
15.82 Mб
Скачать

634

L. Gareze and J. M. Findlay

Fixation duration (ms)

400

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

350

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

300

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Image type

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

250

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

200

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Line drawings

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

150

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Photographs

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

N = 1061 1617 746 868

704 885

626 818

483 717

271 483 54

175

 

 

 

 

0

 

 

 

 

1001

2001

3001

4001

5001

 

 

 

6001

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

1000

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

2000

 

 

3000

4000

5000

6000

 

 

7000

 

 

Time bins (ms)

Figure 6. Experiment 5 – Free scene viewing of simple line drawings from the Leuven library and greyscale photographs. Mean fixation duration by time for line drawings and photographs. Error bars display 95% confidence limits around the mean. N = number of fixations within time bin.

for the discrepancy discussed above, where the number of saccades executed prior to target fixation was approximately the same for line drawings and photographs but the time at which the target was fixated was later for photographs. The fixations investigated in this analysis consisted of those directed at regions other than the target, indicating that non-target-directed fixations were longer when viewing photographs than line drawings, although this effect was not seen on direct target fixations. The increased fixation durations for fixations on photographs, compared to line drawings, are compatible with previous reports (Henderson & Hollingworth, 1998). Although the photographs used in this experiment were grey-scale, a similar effect was seen which could reflect additional foveal processing of the more complex and detailed visual image.

8. General discussion

These findings taken as a whole suggest that the semantic inconsistency manipulation generated in the creation of these stimuli, whether subtle and naturalistic as in the photographic stimuli or stronger and less natural as in the Leuven line drawings, is not detected from extrafoveal vision during casual scene viewing. This is not to claim that it is impossible to identify semantic information about a target which has not yet been fixated, but simply that it does not occur habitually when viewing a scene with no explicit instructions.

Ch. 29: Absence of Scene Context Effects in Object Detection and Eye Gaze Capture

635

In Experiment 1, performance for inconsistent targets was facilitated during a brief presentation of a line drawing image. Although this facilitation was extinguished with the inversion of the images (Experiment 2), we cannot conclude that the facilitation was generated by the immediate detection of an inconsistent object in the scene because independent, naïve participants often failed to correctly identify the targets in question, even during unlimited scene viewing. If the effect was not mediated by the successful identification of the inconsistent targets, then the facilitation cannot be due to the perceived discrepancy between the target’s identity and the scene context.

This concern motivated the design of the more recognisable and naturalistic photographs used in Experiment 3, which failed to produce reliable evidence of consistency effects. The line drawings derived from them also failed to elicit any consistency effects (Experiment 4). Although the manipulation of consistency with these stimuli was less strong than with the line drawings used in Experiment 1, as all items could be found in a house and therefore were unusually, rather than impossibly, located, participants reliably rated the inconsistent targets as more unlikely than their matched consistent controls (Gareze, 2003). Therefore, the absence of a consistency effect could be due to an insufficiently strong consistency manipulation, indicating that the effect is unlikely to occur under naturalistic circumstances, or due to the improved recognisability of inconsistent targets, rendering them less salient in extrafoveal vision.

Previous work by Hollingworth and Henderson (1998, 1999, 2000) and Gordon (2004) directly contradicts the results of these experiments. In order to reconcile the two, it is necessary to consider the differences between these experiments, which may modulate such a discrepancy. To begin with, one of the major concerns the current experiments aim to address is the repeated display of the same or similar scenes, often up to eight times each. In these experiments, the number of times a participant viewed each scene background was strictly limited to prevent any learning effects. It was hypothesised that as inconsistent objects are better represented in memory, it would be plausible for an inconsistent target viewed in one background to influence performance on a subsequent repeated presentation. As the appropriate detection of scene gist is a crucial component of any evidenced consistency effects, the continued presentation of the same scene would be worth avoiding.

The importance of fixation position relative to the target has also been underestimated in previous work. Most experiments have involved a central fixation position with targets appearing at differing eccentricities within a scene, which does not consider the possibility that semantic information may be processed differently, if at all, in foveal and extrafoveal vision. Gordon (2004) presented targets at two eccentricities (near and far) but these varied across scenes. In one experiment, the mean near eccentricity was 2 9 but the range of eccentricities (0 8 –5 3 ) overlapped with that of far eccentricities (3 4 –8 8 ), which had a mean of 6 7 . Additionally, although object size was recorded, it was not manipulated systematically. Targets of approximately 2 square in size were presented at the above eccentricities, making many of the displays foveal or near foveal in presentation. In this way, it was impossible to distinguish between possible foveal and extrafoveal consistency effects.

636

L. Gareze and J. M. Findlay

Similarly, although object size was recorded, there was no analysis of whether object size affected performance on the task or any evident consistency effects. The results of the current series of experiments suggest that other object variables, such as object size, may modulate the expression of consistency effects. Significant and sometimes opposing consistency effects were found with data sets which showed no overall effect of the consistency manipulation, suggesting that the closer analysis of object-specific visual features may be valuable in the study of consistency effects.

A further concern relating to previous work which we have attempted to address is the nature of the visual stimuli used in these experiments. While many experiments have used line drawings derived from real-world scenes, few have used photographs of actual scenes. Our attempt to develop a set of naturalistic visual scenes resulted in a less extreme consistency manipulation than that found in the more frequently used stimuli. However, the failure to elicit consistency effects with these images calls into question the applicability of these effects to real-world viewing. Although some significant effects were found for certain target object sizes, the specificity of the conditions in which they occurred argues against the common occurrence of these effects during everyday scene viewing.

The investigation of eye movement behaviour also failed to provide evidence of differential extrafoveal processing for consistent and inconsistent targets. The only significant differences were found in fixation measures, confirming that inconsistent objects are fixated for longer than consistent ones (e.g. Henderson et al., 1999; De Graef et al., 1990). Even the line drawings, which exhibited an inconsistent object advantage in extrafoveal vision in Experiment 1, failed to produce any evidence of the earlier fixation of inconsistent targets. Despite previous findings of a reliable advantage for inconsistent objects presented extrafoveally in brief presentations paradigms (e.g. Gordon, 2004; Hollingworth & Henderson, 1998, 1999, 2000), no similar effect was found using these stimuli, suggesting that the effect may be difficult to elicit under more natural viewing conditions.

Acknowledgements

We are grateful to Peter De Graef for permission to use these scenes from the set available at ftp://michotte.psy.kuleuven.ac.be/pub/line_drw/ and for suggesting the manipulation in Experiment 2. We are also grateful to Lora Findlay for creating the stimuli used in Experiment 4 using Adobe Photoshop.

References

Biederman, I., Mezzanote, R. J., & Rabinowitz, J. C. (1982). Scene perception: Detecting and judging objects undergoing relational violations. Cognitive Psychology, 14, 143–177.

Davenport J. L., & Potter M. C. (2004). Scene consistency in object and background perception. Psychological Science, 15, 559–564.

De Graef, P., Christiaens, D., & d’Ydewalle, G. (1990). Perceptual effects of scene context on object identification. Psychological Research, 52, 317–329.

Ch. 29: Absence of Scene Context Effects in Object Detection and Eye Gaze Capture

637

Findlay, J. M., & Gilchrist, I. D. (2003). Active vision: The psychology of looking and seeing. Oxford: Oxford University Press.

Friedman, A. (1979). Framing pictures: The role of knowledge in automatized encoding and memory for gist.

Journal of Experimental Psychology: General, 108, 316–355.

Gareze, L. (2003). The role of foveal and extrafoveal vision in the processing of scene semantics. PhD Thesis, University of Durham.

Gordon, R. D. (2004). Attentional allocation during the perception of scenes. Journal of Experimental Psychology: Human Perception and Performance, 30, 760–777.

Henderson, J. M., & Hollingworth, A. (1998). Eye movements during scene viewing: an overview. In G. Underwood (Ed.), Eye guidance in reading and scene perception. (pp. 269–293). Amsterdam, North-Holland: Elsevier.

Henderson, J. M., Weeks, P. A., Jr., & Hollingworth, A. (1999). The effects of semantic consistency on eye movements during complex scene viewing. Journal of Experimental Psychology: Human Perception and Performance, 25, 210–228.

Hollingworth, A., & Henderson, J. M. (1998). Does consistent scene context facilitate object perception? Journal of Experimental Psychology: General, 127, 398–415.

Hollingworth, A., & Henderson, J. M. (1999). Object identification is isolated from scene semantic constraint: Evidence from object type and token discrimination. Acta Psychologica, 102, 319–343.

Hollingworth, A., & Henderson, J. M. (2000). Semantic informativeness mediates the detection of changes in scenes. Visual Cognition, 7, 213–235.

Lampinen, J. M., Copeland, S. M., & Neuschatz, J. S. (2001). Recollections of things schematic: Room schemas revisited. Journal of Experimental Psychology: Learning, Memory and Cognition, 27, 1211–1222.

Loftus, G. R., & Mackworth, N. H. (1978). Cognitive determinants of fixation location during picture viewing.

Journal of Experimental Psychology: Human Perception and Performance, 4, 565–572.

Pezdek, K., Whetstone, T., Reynolds, K., Askari, N., & Dougherty, T. (1989). Memory for real-world scenes: The role of consistency with schema expectation. Journal of Experimental Psychology: Learning, Memory and Cognition, 15, 587–595.

Sanocki, T. (2003). Representation and perception of scenic layout. Cognitive Psychology, 47, 43–86. Schyns, P. G., & Oliva, A. (1994). From blobs to boundary edges: evidence for time and spatial scale dependent

scene recognition. Psychological Science, 5, 195–200.

Unema, P. J. A., Pannasch, S., Joos, M., & Velichkovsky, B. M. (2005). Time course of information processing during scene perception: The relationship between saccade amplitude and fixation duration. Visual Cognition, 12, 473–494.

This page intentionally left blank

PART 8

EYE MOVEMENTS IN NATURAL ENVIRONMENTS

Edited by

ROGER P. G. VAN GOMPEL

This page intentionally left blank

Chapter 30

LEARNING WHERE TO LOOK

MARY M. HAYHOE

University of Rochester, USA

JASON DROLL

University of California, USA

NEIL MENNIE

Univeristy of Nottingham, UK

Eye Movements: A Window on Mind and Brain

Edited by R. P. G. van Gompel, M. H. Fischer, W. S. Murray and R. L. Hill Copyright © 2007 by Elsevier Ltd. All rights reserved.

642

M. M. Hayhoe et al.

Abstract

How do the limitations of attention and working memory constrain acquisition of information in the context of natural behavior? Overt fixations carry much information about current attentional state, and are a revealing indicator of this process. Fixation patterns in natural behavior are largely determined by the momentary task. The implication of this is that fixation patterns are a learnt behavior. We review several recent findings that reveal some aspects of this learning. In particular, subjects learn the structure and dynamic properties of the world in order to fixate critical regions at the right time. They also learn how to allocate attention and gaze to satisfy competing demands in an optimal fashion, and are sensitive to changes in those demands. Understanding exactly how tasks exert their control on gaze is a critical issue for future research.

Ch. 30: Learning Where to Look

643

A central feature of human cognition is the strict limitation on the ability to acquire visual information from the environment, set by limitations in attention. Related to this are the limits in retaining this information, set by the capacity of working memory. We are far from understanding how the organization of the brain leads to these limitations. We also have little understanding of how they influence the way that visual perception operates in the natural world, in the service of everyday visually guided behavior. Consideration of how the limited processing capacity of cognition influences acquisition of visual information leads us to the problem of how such acquisition is controlled. It is not really possible to address the question of precisely what information is selected from the image, and when it is selected, in the context of traditional experimental paradigms, where the trial structure is designed to measure a particular visual operation over repeated instances, each of short duration. In natural behavior, on the other hand, observers control what information is selected from the image and when it is selected. By observing natural behavior, knowledge of the task structure often allows quite well constrained inferences about the underlying visual computations, on a time scale of a few hundred milliseconds.

1. Eye movements and task structure

How can we study the acquisition of information in the natural world? Although incomplete, eye movements are an overt manifestation of the momentary deployment of attention in a scene. Covert attentional processes, of course, mean that other information is processed as well, but overt fixations carry a tremendous amount of information about current attentional state, and provide an entrée to studying the problem (Findlay & Gilchrist, 2003). Investigation of visual performance in natural tasks is now much more feasible, given the technical developments in monitoring eye, head, and hand movements in unconstrained observers, as well as the development of complex virtual environments. This allows some degree of experimental control while allowing relatively natural behavior. In natural behavior, the task structure is evident, and this allows the role of individual fixations to be fairly easily interpreted, because the task provides an external referent for the internal computations. In contrast, when subjects simply passively view images, the experimenter often has little control of, and no access to, what the observer is doing. When viewing pictures, observers may be engaged in object recognition, remembering object locations and identity, or performing some other visual operation. Immersion in a real scene probably calls for different kinds of visual computations, because observers may be interacting with the objects in the scene. When viewing images of scenes, some regularities in fixation patterns can be explained by image properties such as contrast or chromatic salience. However, these factors usually account for only a modest proportion of the variance (Itti & Koch, 2001; Mannan, Ruddock & Wooding, 1997; Parkhurst, Law, & Neibur, 2002).

Over the past ten years, a substantial amount of evidence has accumulated about deployment of gaze during ongoing natural behavior. In extended visuomotor tasks such as driving, walking, sports, playing a piano, hand-washing, and making tea or sandwiches,