Добавил:

Sekretar kiopkiopkiop18@yandex.ru t.me/Prokururor I Вовсе не секретарь, но почту проверяю Опубликованный материал нарушает ваши авторские права? Сообщите нам.

Вуз:

Ростовский Государственный Медицинский Университет

Предмет:

Медицина общая

Файл:

Visual Perception Fundamentals of awareness multi-sensory integration and high-order perception_Martinez-Conde_2006

.pdf

Скачиваний:

Добавлен:

28.03.2026

Размер:

13.4 Mб

Скачать

☆

<<< < Предыдущая 8 9 10 11 12 13 14 15 16 17 18 1920 / 3220 21 22 23 24 25 26 27 28 29 30 31 32 > Следующая >>>

192

A Without Feedback

Left eye Monopt L

Dichopt

Right eye Monopt R

B	With Feedback

Left eye MonoptDichopt

Dichopt

Right eye MonoptDichopt

Fig. 12. Overriding issues when considering the viability of feedback mechanisms. (A) A general model of early visual system binocular integration without invoking feedback mechanisms. (B) If signiﬁcant feedback exists between the initial dichoptic levels of processing and earlier levels, it means that, by deﬁnition, the earlier levels should therefore behave in the same way as the dichoptic levels (i.e., they will become dichoptic by virtue of the feedback).

The monoptic masking results presented in Fig. 11 show that the strength of monoptic masking increases, in an iterative fashion, with each successive stage of processing in the visual system. Correspondingly, Hubel and Wiesel (1961) found that inhibitory surrounds were stronger in the LGN than in the retina. We proposed that lateral inhibition mechanisms gather strength iteratively in successive stages of the visual hierarchy, as a general principle. The result that dichoptic inhibition is weak in area V1 may reﬂect such a general principle, given that V1 binocular neurons represent the ﬁrst stage where dichoptic inhibition could exist in the ascending visual system. If this idea is

correct, downstream binocular neurons in the visual hierarchy should show iteratively stronger dichoptic masking suppression effects. Whether these effects represent the discovery of a general principle of inhibitory iterative processing in the visual system, or not, we believe that dichoptic masking effects must become stronger downstream of V1, to account for the fact that the overall psychophysical magnitude of dichoptic visual masking is equivalent to that of monoptic masking (as shown in Fig. 11).

To search for the neural correlates of masking higher up in the visual hierarchy, we turned to whole brain imaging (functional magnetic resonance imaging, fMRI) techniques. Masking illusions are known to evoke reliable blood-oxygen level dependent (BOLD) signals that correlate with perception within the human visual cortex (Dehaene et al., 2001; Haynes et al., 2005; Haynes and Rees, 2005). Since the psychophysical strengths of monoptic and dichoptic masking are equivalent (Schiller, 1965; Macknik and Martinez-Conde, 2004a), we should be able to ﬁnd the point in the ascending visual hierarchy in which monoptic and dichoptic masking activities are both extant, and thus determine the ﬁrst point in the visual hierarchy at which awareness of visibility could potentially be maintained. Previous to this level, target responses will not be well inhibited during dichoptic masking: if these prior areas were sufﬁcient to maintain visual awareness, the target would be perceptually visible during dichoptic masking conditions.

We mapped the retinotopic visual areas with fMRI in human subjects and measured BOLD signal in response to monoptic and dichoptic visual masking within each subject’s individually mapped retinotopic areas (Fig. 13). Our results show that dichoptic masking does not correlate with visual awareness in area V1, but begins only downstream of area V2, within areas V3, V3A/B, V4 and later (Fig. 14). These results agree with previous electrophysiological results in monkeys using both visual masking and binocular rivalry stimuli (Logothetis et al., 1996; Sheinberg and Logothetis, 1997; Macknik and Martinez-Conde, 2004a), as well as with one fMRI study of binocular rivalry in humans (Moutoussis et al., 2005). Furthermore, we found that the iterative increase in lateral inhibition, which we previously

193

Fig. 13. (A and B) Examples of retinotopy mapping from two subjects. Visual areas that have been delineated by retinotopic mapping analysis are indicated in different colors. Reprinted from Tse et al. (2005), with kind permission from the National Academy of Sciences of the United States of America.

discovered in the subcortical visual system and in area V1 for monoptic masking, continued in the extrastriate visual areas for dichoptic masking (Fig. 14C). This fact plays an important role in localizing the circuits responsible for visibility and perception. For instance, if the brain areas that maintained visual awareness exhibited only weak suppression (i.e., areas that are early in the visual hierarchy such as the LGN and area V1), then target masking would be incomplete and targets would not be rendered invisible during masking. Since dichoptic visual masking is as strong as

monoptic visual masking perceptually and targetderived neural activity is only weakly suppressed by dichoptic masks prior to area V3, it follows that the circuits responsible for visibility lie downstream of area V2, or else targets would not appear to be suppressed during dichoptic masking.

Having determined the lower boundary in the visual hierarchy for the perception of visual masking of simple targets, we set out to determine if there was also an upper boundary. To do this, we isolated the parts of the brain that showed both an increase in BOLD signal when nonillusory visible targets

194

were displayed and a decrease in BOLD signal when the same targets were rendered less visible by visual masking. Surprisingly, only areas within the occipital lobe showed differential activation between visible and invisible targets (Fig. 15).

In addressing Question ]3 (What brain areas must activate to achieve awareness of visibility?), our combined results suggest that visual areas beyond V2, within the occipital lobe, are responsible for maintaining our awareness of simple targets (Fig. 16). This conclusion may not hold true for complex visual stimuli or attended stimuli (see Discussion).

Our results show that masking in the early visual system is not caused by feedback from higher cortical areas that also cause dichoptic masking. It follows that the circuit that causes masking must be ubiquitous enough and simple enough that it exists at many or possibly all levels of the visual system. As lateral inhibition is the basis for all the receptive ﬁeld structure that we know, it must therefore exist at all levels of the visual system that have receptive ﬁelds. Thus, lateral inhibition is a candidate circuit. This idea is strengthened by our ﬁndings that lateral inhibition is strengthened iteratively at each progressive level of the early visual system.

What speciﬁc neural circuits within the relevant brain areas maintain visibility?

We previously showed that the parts of the target that are most important to conveying its visibility are its spatial edges. We also found that, temporally, the parts of the target’s lifetime most important to its visibility are its onset and termination, rather than its midlife. That is, the target’s temporal edges seemed to convey the

strongest signal concerning the target’s visibility. Similarly, the parts of the mask that were most important to its ability to suppress the perception of the target were the mask’s spatiotemporal edges. We moreover found that the neural correlates of the spatiotemporal edges of stimuli (both targets and masks) were transient bursts of spikes that occurred after the stimulus turned on and off, within neurons with receptive ﬁelds positioned at the spatial edges of the stimulus on the retinotopic map. Brain areas downstream of area V2, but nevertheless lying within the occipital lobe, were sufﬁcient to maintain our awareness of visibility of simple unattended targets. Finally, we showed that visual masking and visual awareness do not rely on feedback mechanisms from higher cortical areas.

On the basis of these ﬁndings, we proposed that the peculiar timing conditions associated with visual masking may be explained through a simple lateral inhibitory network, in which the transient responses to the mask’s spatiotemporal edges inhibit the transient responses to the target’s spatiotemporal edges. Because the target and the mask do not overlap each other spatially, the circuit underlying masking must be called ‘‘lateral inhibition,’’ as deﬁned by Hartline and Ratliff (Hartline, 1949; Ratliff, 1961; Ratliff et al., 1974). The circuit has three properties:

(1)Excitatory input and output.

(A)Monosynaptic connections between the retina and the LGN, and between the LGN and cortex, are excitatory (Cleland et al., 1971; Levick et al., 1972; Reid and Alonso, 1995).

(2)Self-inhibition.

Fig. 14. Retinotopic analysis of monoptic vs. dichoptic masking. (A) The logic underlying the analysis of masking magnitude for hypothetical retinotopic areas. The Mask Only response is bigger than the Target Only response because masks subtend a larger retinotopic angle than targets, and are moreover presented twice in each cycle for 100 ms each ﬂash, whereas the target is single-ﬂashed for only 50 ms. If the target response adds to the mask response in the SWI condition (no-masking percept), then the SWI response will be bigger than the Mask Only response, whereas if the target does not add (masking percept), then the SWI response will be equal or smaller (as the mask itself may also be reciprocally inhibited by the target) than the Mask Only response. (B) Monoptic and dichoptic masking magnitude (% BOLD difference of Mask Only/SWI conditions) as a function of occipital retinotopic brain area, following the analysis described in (A). Negative values indicate increased activation to the SWI condition (no masking), whereas values X0 indicate decreased or unchanged SWI activation (masking). (C) Dichoptic masking magnitude (% BOLD difference of Mask Only/SWI conditions) as a function of occipital retinotopic brain area within the dorsal and ventral processing streams. The strength of dichoptic masking builds up as a function of level in the visual hierarchy for both the dorsal (R2 ¼ 0.90) and ventral (R2 ¼ 0.72) processing streams. Reprinted from Tse et al. (2005), with kind permission from the National Academy of Sciences of the United States of America.

195

A	AREA X [No Masking]			% BOLD Difference
A				% BOLD Difference
BOLD				(MO/SWI)
%				(MO/SWI)	SWI Magnitude
	Mask	Target	SWI	Difference
	Mask	Target	SWI
	Only	Only			0
BOLD	AREA Y [Masking]			BOLD%	AREA X	AREA Y
BOLD				BOLD%

%

% BOLD Difference

Mask

Target

SWI

Only

(MO/SWI)

Monoptic

Dichoptic

Difference

MASKING

% BOLD

NO MASKING

-10

C 25

V2d

V2v

V3d

V3v

V3A/B

V4v

(MO/SWI)

Dorsal

Ventral

Difference

MASKING

NO MASKING

% BOLD

-5

-10

-15

-20

196

B		Occipital Masking	C
B			C
%BOLD	1		%BOLD	1
%BOLD		• Control	%BOLD
		• Control
		• SWI Masking
		STIMULUS
		20

TIME (secs)

Non-Occipital Masking

• Control

• SWI Masking

STIMULUS

20 TIME (secs)

D Occipital vs Non-Occipital Non-illusory Response

	2
%BOLD	1	Non-Occipital
		Non-Occipital
		Occipital
		Occipital
		STIMULUS
		20

TIME (secs)

(3)Lateral inhibition as a function of excitation (thus inhibition follows excitation in time).

(A)Spatiotemporal responses of neurons in the LGN and cortex show that excitation precedes inhibition (Ferster, 1986; Golomb et al., 1994). Fig. 17 shows a descriptive model of how a basic lateral inhibitory circuit that has these properties might account for the spatiotemporal properties found in masking.

(a)Several previous groups have suggested that masking might be explained by lateral inhibition (Bridgeman, 1971; Lawwill, 1973; Anbar and Anbar, 1982; Francis, 1997, 1998). However, various aspects of these models do not match the timing parameters of visual masking we discovered, such as the importance of the target’s after-discharge to its visibility (Macknik and Livingstone, 1998; Macknik et al., 2000a).

(i)For instance, Breitmeyer (1984) noted that Bridgeman’s model predicts that when the target and the mask are presented with nonoptimal stimulus onset

Fig. 15. Localization of visibility-correlated responses to the occipital lobe. (A) An individual brain model from all perspectives, including both hemispheres ﬂat-mapped, overlaid with the functional activation from 17 subjects. The green-shaded areas are those portions of the brain that did not show signiﬁcant activation to Target Only stimuli. The blue voxels exhibited signiﬁcant target activation (Target Only activation4Mask Only activation). Yellow voxels indicate a signiﬁcant difference found between Control (target-visible) and SWI (target-invis- ible) conditions, indicating potentially effective visual masking, and thus a correlation with perceived visibility. (B) Response time-course plots from Control vs. SWI conditions in the occipital cortex. (C) Response time-course plots from Control vs. SWI conditions in nonoccipital cortex. (D) Response timecourse plots from the nonillusory conditions (Target Only and Mask Only combined) in occipital vs. nonoccipital cortex. Error bars in (B), (C), and (D) represent SEM between subjects. Reprinted from Tse et al. (2005), with kind permission from the National Academy of Sciences of the United States of America.

197

Fig. 16. Layout of retinotopic areas that potentially maintain awareness of simple targets. An individual brain model from all perspectives, including both hemispheres ﬂat-mapped, overlaid with the functional activation from one typical subject. The yellowshaded areas are those portions of the brain that did not show signiﬁcant dichoptic masking (as in Fig. 14B and C) and thus are ruled out for maintaining visual awareness of simple targets. The pink-colored voxels represent the cortical areas that exhibited signiﬁcant dichoptic masking and thus are potential candidates for maintaining awareness of simple targets. Reprinted from Tse et al. (2005), with kind permission from the National Academy of Sciences of the United States of America.

asynchronies (to produce masking), the apparent contrast of the target should beat (called ‘‘ringing’’ by Breitmeyer, 1984) in time (like two tuning forks of slightly different pitches) because of poor phase alignment of target and mask resonance. If this is true, then it follows that one should be able to

tune the optimal SOA by noting the beat frequency of the target. At nonoptimal stimulus onset asynchronies, however, the percept of the target is quite steady (Breitmeyer, 1984).

Our model is the ﬁrst to speciﬁcally propose an explanation of how the spatiotemporal edges of stimuli may interact to cause various novel

198

Time-out

spontaneous

very excited

LEVEL OF EXCITATION

160 spike/sec

500 milliseconds

Fig. 17. (A) A representation of the spatial lateral inhibition model originally proposed by Hartline and Ratliff (Ratliff, 1961; Ratliff et al., 1974). The four excitatory neurons (highlighted in yellow and gray) in the center of the upper row receive excitatory input from a visual stimulus. This excitation is transmitted laterally in the form of inhibition, resulting in edge enhancement of the stimulus: the neuronal underpinnings of the Mach band illusion (Mach, 1965). (B) One excitatory and one inhibitory neuron taken from the spatial model in (A), now followed through an arbitrary period of time. Several response phases are predicted, including the onset-response, and the transient after-discharge (Adrian and Matthews, 1927). (C) Reprinted from Adrian and Matthews (1927); their Figs. 4 and 5). The top ﬁgure is the peri-stimulus time histogram of neuronal ﬁring rate from the eel optic nerve when the retina was stimulated by a disk, 36 mm in diameter and 830 candles/m2 in intensity. Duration of the stimulus is denoted with the white bar below the recording. The bottom ﬁgure is similar, except that the disk was 12.8 mm in diameter. Notice that the histograms retain their characteristic shape across different physical intensities, and that each response to the target is subsequently followed by a prominent after-discharge. (D) The average response, from 28 neurons in separate recording sites in area V1 of an anesthetized cynomolgus monkey when visually stimulated for 500 ms with an optimally oriented bar (some recording sites may not have been orientation selective, in which case orientation was arbitrarily chosen). The white bar on the bottom of the histogram represents the time in which the target was on. The onset-response period (pink) lasts for between 45 and 100 ms. The sustained period (orange) varies in duration systematically with target duration and does not appear until the target’s transient onset-response peaks. The combined duration of the transient and sustained periods matches the duration of the visual stimulus. The time-out period (green) and after-discharge (blue) directly follow the cessation of the target and each lasted for about 50 ms. The magnitude of the after-discharge seems to grow in size, as the target duration increases. Reprinted from Macknik and Martinez-Conde (2004b), with kind permission from Elsevier.

illusions of invisibility (below) (Macknik and Martinez-Conde, 2004b). Quantiﬁable computational models of a lateral inhibitory network from the Herzog’s laboratory have also shown that lateral inhibition circuits can potentially predict novel Gestalt-level masking behavior (Herzog and Fahle, 2002; Herzog et al., 2003a, b). The Herzog’s computational model, based on a Wilson–Cowan algorithm, may behave so as to match our results (not yet tested).

Predictions of the lateral inhibitory model

The SWI illusion

The SWI illusion was the ﬁrst perceptual prediction of the model. This illusion combined forward and backward masking together in a cyclic version of visual masking, thus suppressing all of the transient responses associated with each ﬂicker of the target. Without the mask, the target is a highly salient ﬂickering bar, but with the mask present, the target becomes virtually invisible (Macknik and Livingstone, 1998). The Enns and McGraw groups quantiﬁed the psychophysics of the SWI illusion (Enns, 2002; McKeefry et al., 2005). To the best of our knowledge, this is the ﬁrst illusion to have been predicted from electrophysiological data, rather than the other way around (Fig. 18).

The temporal fusion illusion

Temporal fusion is a second illusion predicted from the model. In this illusion, two short (i.e., 50 ms) duration targets fuse perceptually into one single, long-duration target due to the presentation of a spatially and temporally abutting mask (100 ms long) during the ISI (i.e., inter-target interval) (Fig. 19). Without the intervening mask, two 50-ms targets, separated by an interval of 100 ms, are easily perceived as two sequential ﬂashes. However, when the nonoverlapping mask (nonoverlapping either spatially or temporally) is presented during the inter-target interval, the on- set-response from the mask suppresses (through backward masking) the after-discharge from the ﬁrst ﬂash of the target, while the after-discharge from the mask suppresses (through forward masking) the onset-response to the second ﬂash of the target. Therefore, no transient responses remain to

199

A B C

mask

Stimulus

target

mask

Response

Fig. 18. Optical imaging of the SWI illusion. (A) A visual target with a width of 0.121, and the correlated optical image. (B) Two masking stimuli presented without a target, and the correlated response. (C) The SWI stimulus (note: the target and the masks were never presented on the display at the same time) and the correlated response. Notice that the image of the target in (C) is now missing, compared to the response seen in (A), despite that both targets were displayed in an identical manner. Modiﬁed from Macknik and Haglund (1999), with kind permission from the National Academy of Sciences of the United States of America.

indicate that the ﬁrst ﬂash turned off, and the second ﬂash turned on, rendering an illusory percept of a single, long-duration target. This illusion was originally suggested as a prediction of our model by Prof. David McCormick of Yale Medical School, and we reported this illusion at the SFN meeting in 2000 (Macknik et al., 2000b). The temporal fusion illusion veriﬁes the lateral inhibitory model’s predictions, but it has not yet been quantiﬁed psychophysically or physiologically.

Flicker fusion

A third consequence of the model is the wellknown ﬂicker fusion illusion. Flicker fusion (sometimes called ‘‘persistence of vision’’) is essential to everyday vision in the modern world (especially when viewing motion pictures, computer displays, TVs, working under artiﬁcial lighting, etc.). In 1824, Peter Mark Roget (who also wrote the famous Thesaurus) ﬁrst presented the concept of ‘‘persistence of vision’’ to the Royal Society of London, as the ability of the retina to retain an image of an object for 1/20–1/5 s after its removal

200

Target

50ms

Time

Backward Masking

Forward Masking

Target	50ms		50ms
Mask
D		100ms

Time

Perception: Two short flashes

Perception: One long flash

Fig. 19. Physiological prediction underlying the temporal fusion illusion. (A) Timeline depicting two sequential ﬂashes of the same target. (B) The predicted neural responses to the stimuli in (A). The perception is of two ﬂashes. (C) Timeline of the temporal fusion illusion. (D) The predicted neural responses to the stimuli in (C). The after-discharge to the ﬁrst target and the onset-response to the second target are obliterated by the presentation of a 100-m-long mask in between both targets. The perception is of one long ﬂash.

from our ﬁeld of vision (Roget, 1825). A second principle, the ‘‘phi phenomenon’’ or stroboscopic effect (the basis for the famous Gestalt School of Psychology), is closely related to ﬂicker fusion. It was ﬁrst studied by Max Wertheimer (the founder of Gestalt Psychology) and Hugo Munsterberg between 1912 and 1916, who found that subjects can perceptually bridge the temporal gap in between two consecutive displays, allowing them to perceive a series of static images in a continuous movement (Munsterberg, 1916; Wertheimer, 1925).

Several excellent studies of the physiology of ﬂicker fusion have been published (De Valois et al., 1974; Merigan, 1980; Merigan et al., 1985; Merigan and Maunsell, 1990; Gur and Snodderly, 1997). However, these studies have focused on understanding or exploiting the difference between

the critical ﬂicker fusion frequencies in the parvocellular vs. magnocellular retino-geniculo-cortical pathways (in order to understand the function of those pathways to perception), rather than on the physiology underlying the circuit that fundamentally causes ﬂicker fusion (whatever the pathway).

It follows from our model that for two briefduration targets presented in close succession, the after-discharge from the ﬁrst target may interfere (i.e., abolish or diminish) with the onset-response from the second target. Thus the transient responses from a series of ﬂashes would be suppressed, thereby diminishing the salience of the ﬂashes and rendering them less visible. The ﬂashes would therefore be more difﬁcult to differentiate and appear to be fused.

We studied the effect of inhibition at the termination of the stimulus by ﬂashing a target twice, with varied intervals between the ﬁrst and the second ﬂash (see Fig. 20). We found that with short ISIs, both the after-discharge of the ﬁrst ﬂash and the onset-response of the second ﬂash were inhibited. This suggested that when ﬂicker fusion occurs perceptually, it may be due to the lack of

Target Only (300ms)

ISI = 110ms

ISI = 90ms

ISI = 70ms

ISI = 50ms

ISI = 30ms

ISI = 10ms

ISI = 0ms

Fig. 20. Multiunit recording from upper layers of area V1 in a rhesus monkey. The receptive ﬁeld was foveal, 0.11 square, and well oriented. The transparently shaded parts of the histograms represent the various phases of the cells’ response to the second ﬂash. The different phases of response are color coded as in Fig. 17D. Notice that the second-ﬂash response is rescued as the inter-stimulus interval (ISI), or the interval between the end of the ﬁrst-pulse and the beginning of the second-pulse, is increased beyond 30 ms.

201

robust ﬁring to the ﬂickering stimulus (which, in a sense, forwardly and backwardly masks itself). The duration of the inhibitory effect after the ﬁrst ﬂash coincided in time and had the same duration as the time-out period of the ﬁrst ﬂash. Fig. 19 shows that suppression of the onset-response from the second ﬂash only occurs with ISIs of 30 ms or shorter (equivalent to 33 Hz periodic). If we separate the ﬂashes by more than 30 ms, the afterdischarge to the ﬁrst ﬂash and the onset-response to the second ﬂash begin to recover (i.e., equivalent to o3 Hz ﬂicker). These intervals roughly coincide with the minimum ﬂicker fusion threshold in humans for 100% contrast stimuli in the fovea (as we used in anesthetized monkeys in these studies) (Fukuda, 1979; Lennie et al., 1993; Di Lollo and Bischof, 1995; Gorea et al., 2000).

In answer to Question ]4 (What is the structure of the circuits that process visibility?), our experiments suggest that lateral inhibitory networks play a role in suppressing the circuits that maintain visibility. While the nature of the circuits that directly maintain awareness is yet unknown, the knowledge that those circuits are suppressed by lateral inhibition may help us to identify them in the future.

Discussion

This chapter describes our results to date in answer to the following four questions:

(1)What stimulus parameters are important to visibility?

(A)Our results show that spatiotemporal edges are critical to target visibility. When we suppressed spatiotemporal edges, targets became less visible or invisible. Recent research suggests that spatiotemporal corners may be even more important than edges (Troncoso et al., 2005).

(B)Masks produce their strongest suppressive effect at their spatiotemporal edges as well, suggesting that inhibition produced by a mask is a function of the same excitatory activity that makes stimuli visible.

<<< < Предыдущая 8 9 10 11 12 13 14 15 16 17 18 1920 / 3220 21 22 23 24 25 26 27 28 29 30 31 32 > Следующая >>>

Соседние файлы в папке books.google.com

#
28.03.202610.83 Mб0Surgical Techniques in Opthalmology_Garg, Alio_2010.pdf
#
28.03.202612.2 Mб0The Macula Diagnosis, Treatment and Future Trends_Binder_2004.pdf
#
28.03.202626.61 Mб0The Ocular Fundus From Findings to Diagnosis_Wolf, Kirchhof, Reim_2006.pdf
#
28.03.202623.38 Mб0Vision Rehabilitation Assessment, Intervention, and Outcomes_Stuen_2000.pdf
#
28.03.202652.41 Mб0Visual Development, Diagnosis, and Treatment of the Pediatric Patient_Duckman_2006.pdf
#
28.03.202613.4 Mб0Visual Perception Fundamentals of awareness multi-sensory integration and high-order perception_Martinez-Conde_2006.pdf