0网e 194 DESIMONE DUNCAN NPX N N P NEURAL MECHANISMS OF · SELECTIVE VISUAL E E 0 ATTENTION in human visice Robert Desimonel and John Duncan? 8a6ppaw以 NohoyomBethesda Taken together,such results suggest the following general model (Broadbent KEY WORDS:vision.cortex,primaes.visual search.neglect 1958:Neisser 1967:Treisman 1960,1993).At some point (or several points) between input and response,objects in the visual input compete for represen- tation analy vsis,or control.The competition is biased,however,towards in- formation that is currently relevant to behavior.Attended stimuli make demands on processing capacity.while unattended ones often do not. INTRODUCTION In the follow ing sections,we first outline the major behavioral characteris- tics of competition and consider the limitations within the nervous system that make com etition necessary.We then describe selectivity,or how the co The two basic phenomena that define the problem of visual attention can be of tition may h esolved at hoth the e be s shown in each avioral and ne eural vel To some ext nt builds models of biased co ion by Walley Weiden In ake differs e0【o repor ters appe 01 color(targets ere b (1973 d Harte (1984).The appr from 0f hich a on f 0 ar and the su hancing the pro sing(and perhaps bin opportunity for eye movements,would give their report.The display mimic illuminated item.Instead,the model we develop is that attention is an cmergent our usual cluttered visual environment:It contains one or more objects that property of many neural mechanisms working to resolve competition for visual are relevant to current behavior,along with others that are irrelevant. processing and control of behavior. The first basic phenomenon is limited capacity for processing information. At any given time,only a small amount of the information available on the COMPETITION retina can be processed and used in the control of behavior.Subjectively.giving attention to any one target leaves less available for others.In Figure 1,the Behavioral Data probability of reporting the target letter N is much lower with two accompa- nying targets(Figure 1a)than with none (Figure 1b). In one simple type of experiment,two objects are presented in the visual field. The second basic phenomenon is selectivity. Subjects must identify some property of both objects,with a separate response -the ability to filter out un wanted information.Subjectively,one is aware of attended stimuli and largely for each.Such studies reveal several important facts.First,dividing attention unaware of unattended ones.cor espondingly,accuracy in identifying an between two objects almost always results in poorer performance than focusing attended stimulus may be independent of the r number of nontargets in a display attention on one.Identifying simple properties of each object such as size (Figure la vs lc)(see Bundesen 1990.Duncan 1980). brightness,orientation,or spatial position gives much the same result as iden- tifving more complex properties such as shape (see Duncan 1984.1985,1993)
Annu. Rev. Neurosci. 1995. 18:193-222 Copyright © 1995 by Annual Reviews Inc. All rights reserved NEURAL MECHANISMS OF SELECTIVE VISUAL ATTENTION Robert Desimone 1 and John Duncan 2 1Laboratory of Neuro~sychology, NIMH, Building 49, Room 1B80, Bethesda, Maryland 20892 and MRC Applied Psychology Unit, 15 Chaucer Road, Cambridge CB2 2EF, England KEY WORDS: vision, cortex, primates, visual search, neglect INTRODUCTION The two basic phenomena that define the problem of visual attention can be illustrated in a simple example. Consider the arrays shown in each panel of Figure 1. In a typical experiment, before the arrays were presented, subjects would be asked to report letters appearing in one color (targets, here black letters), and to disregard letters in the other color (nontargets, here white letters). The array would then be briefly flashed, and the subjects, without any opportunity for eye movements, would give their report. The display mimics our. usual cluttered visual environment: It contains one or more objects that are relevant to current behavior, along with others that are irrelevant. The first basic phenomenon islimited capacity for processing information. At any given time, only a small amount of the information available on the retina can be processed and used in the control of behavior. Subjectively, giving attention to any one target leaves less available for others. In Figure 1, the probability of reporting the target letter N is much lower with two accompanying targets (Figure la) than with none (Figure lb). The second basic phenomenon isselectivity--the ability to filter out unwanted information. Subjectively, one is aware of attended stimuli and largely unaware of unattended ones. Correspondingly, accuracy in identifying an attended stimulus may be independent of the number of nontargets in a display (Figure lavs lc) (see Bundesen 1990, Duncan 1980). 193 0147-006X/95/0193505.00 www.annualreviews.org/aronline Annual Reviews Annu. Rev. Neurosci. 1995.18:193-222. Downloaded from arjournals.annualreviews.org by University of California - San Diego on 01/05/07. For personal use only. 194 DESIMONE & DUNCAN a) p b) c) N N D N P @ D Figure 1 Displays demonstrating limited processing capacity and selectivity in human vision. Subjects are shown the displays briefly and asked to report only the black letters. Limited capacity is shown by reduced accuracy as the number oftargets is increased (compare b and a). Selectivity is shown by negligible impact of nontargets (compare a and c). Taken together, such results suggest the following general model (Broadbent 1958; Neisser 1967; Treisman 1960, 1993). At some point (or several points) between input and response, objects in the visual input compete for representation, analysis, or control. The competition is biased, however, towards information that is currently relevant to behavior. Attended stimuli make demands on processing capacity, while unattended ones often do not. In the following sections, we first outline the major behavioral characteristics of competition and consider the limitations within the nervous system that make competition necessary. We then describe selectivity, or how the competition may be resolved, at both the behavioral and neural level. To some extent, our account builds on early models of biased competition by Walley & Weiden (1973) and Harter & Aine (1984). The approach we take differs from standard view of attention, in which attention functions as a mental spotlight enhancing the processing (and perhaps binding together the features) of the illuminated item. Instead, the model we develop is that attention is an emergent property of many neural mechanisms working to resolve competition fo~ visual processing and control of behavior. COMPETITION Behavioral Data In one simple type of experiment, two objects are presented in the visual field. Subjects must identify some property of both objects, with a separate response for each. Such studies reveal several important facts. First, dividing attention between two objects almost always results in poorer performance than focusing attention on one. Identifying simple properties of each object such as size, brightness, orientation, or spatial position gives much the same result as identifying more complex properties such as shape (see Duncan 1984, 1985, 1993). www.annualreviews.org/aronline Annual Reviews Annu. Rev. Neurosci. 1995.18:193-222. Downloaded from arjournals.annualreviews.org by University of California - San Diego on 01/05/07. For personal use only
VISUAL ATTENTION 195 196 DESIMONE DUNCAN et al 1992) ond,as long a uses brief stimulus exposures and me sures the accuracy of tion,the major perfo ance lim appears to occur at stimulus input rather than subsequent short-term storage and response.For example,interference from processing two objects is abo ished if they are shown one after the other,with an interval of perhaps a second between them(Duncan 1980),even though the two responses called for must still be remembered and made together at the end of the trial. Third,interference is independent of eye movements.Even though gaze is always maintained at fixation,it is easier to identify one object in the periphery than two Fourth,interference is largely independent of the spatial separation between two objects,at least when the field is otherwise empty (Sagi Julesz 1985, Vecera Farah 1994).Though attention is sometimes seen as a mental spot (TE light lluminating or selectin ion of vi per not d on the ab ion of ween-object competi on sterior parie been rgued that ful sis of every ob ject in a scene ould be impossibly complex(Broadbent 1958.Tsotsos199).Competition have fughe ctions of the two refects a limit on visual i ntification capacity.Equally strong.however,has been the view that competition concerns control of response systems(Allport 1980.Deutsch Deutsch 1963).Certainly.some response activation often occurs from objects a person has been told to ignore (Eriksen Eriksen 1974). which shows that unwanted information is not entirely filtered out in early important for spatial perception and visuomotor performance(Ungerleider vision.Very probably,competition between objects occurs at multiple levels Haxby 1994,Ungerleider Mishkin 1982).Since competition impacts object between sensory input and motor output (Allport 1993). recoenition would expect to find one basis for it in the ventral stream tral al subr Neural Basis for Competition The cific ar ea V2(thin are TEO and TE in the nfe If the nervous system had unlimited capacity to process information in parallel eds fr necessary only at final oud pres (T pro xt along this path nway,neuronal properties change in tw bvio Befor ses.For examplc,where what lim many V c fanction eetially oca paotem p the ulfield compete for V2 neurons may respond to virtual or illusory contours in certain figures (von ng within der Heydt et al 1984),and IT neurons respond sele vely to global or ove visual areas(Des object features such as shape (Desimone et al 1984.Schwartz et al 1983 Tanaka et al 1991).Second,the receptive field size of individua neurons corical processing pathways.or streams,each of which begins with the pri- increases at each stage.As one moves from VI to V4 to TEO to TE,typical mary visual cortex,or VI (see Figure 2).The first.a ventral stream,is directed receptive fields in the central field representation are on the order of 0.2.3.6. nto the inferior temporal cortex and is important for object recognition,while and 25 in size,respectively (see Boussaoud et al 1991.Ungerleider the other,a dorsal stream.is directed into the posterior parietal cortex and is Desimone 1989).I.arge recentive fields mav contribute towards the recognition
VISUAL ATTENTION 195 A possible exception is simple detection of simultaneous energy onsets or offsets (Bonnel et al 1992). Second, as long as the experiment uses brief stimulus exposures and measures the accuracy of stimulus identification, the major performance limitation appears to occur at stimulus input rather than subsequent short-term storage and response. For example, interference from processing two objects is abolished if they are shown one after the other, with an interval of perhaps a second between them (Duncan 1980), even though the two responses called for must still be remembered and made together at the end of the trial. Third, interference is independent of eye movements. Even though gaze is always maintained at fixation, it is easier to identify one object in the periphery than two. Fourth, interference is largely independent of the spatial separation between two objects, at least when the field is otherwise empty (Sagi & Julesz 1985, Vecera & Farah 1994). Though attention is sometimes seen as a mental spotlight illuminating or selecting information from a restricted region of visual space (Eriksen & Hoffman 1973, Posner et al 1980), performance seems not to depend on the absolute spatial distribution of information. An enduring issue is the underlying reason for between-object competition. It has often been argued that full visual analysis of every object in a scene would be impossibly complex (Broadbent 1958, Tsotsos 1990). Competition reflects a limit on visual identification capacity. Equally strong, however, has been the view that competition concerns control of response systems (Allport 1980, Deutsch & Deutsch 1963). Certainly, some response activation often occurs from objects a person has been told to ignore (Eriksen & Eriksen 1974), which shows that unwanted information is not entirely filtered out in early vision. Very probably, competition between objects occurs at multiple levels between sensory input and motor output (Allport 1993). Neural Basis for Competition If the nervous system had unlimited capacity to process information in parallel throughout the visual field, competition between objects would presumably be necessary only at final motor output stages. Before discussing these motor stages, we first consider what limitations in the visual system make competition necessary at the input. Objects in the visual field compete for processing within a network of 30 or more cortical visual areas (Desimone & Ungerleider 1989, Felleman & Van Essen 1991). These areas appear to be organized within two major corticocortical processing pathways, or streams, each of which begins with the primary visual cortex, or V1 (see Figure 2). The first, a ventral stream, is directed into the inferior temporal cortex and is important for object recognition, while the other, a dorsal stream, is directed into the posterior parietal cortex and is www.annualreviews.org/aronline Annual Reviews Annu. Rev. Neurosci. 1995.18:193-222. Downloaded from arjournals.annualreviews.org by University of California - San Diego on 01/05/07. For personal use only. 196 DESIMONE & DUNCAN Figure 2 Striate cortex, or V1, is the source of two conical visual streams. A dorsal stream is directed into the posterior parietal cortex and underlies spatial perception and visuomotor performance. A ventral stream is directed into the inferior temporal cortex and underlies object recognition. Both streams have further projections into prefrontal cortex. Adapted from Mishkin et al (1983) and Wilson et al (1993). For a "wiring diagram" of the areas and connections of the two streams, see Desimone & Ungerleider (1989) and Felleman & Van Essen (1991). important for spatial perception and visuomotor performance (Ungerleider Haxby 1994, Ungerleider & Mishkin 1982). Since competition impacts object recognition, we would expect to find one basis for it in the ventral stream. The ventral stream includes specific anatomical subregions of area V2 (thin and interstripe regions), area V4, and areas TEO and TE in the inferior temporal (IT) cortex (see Desimone & Ungerleider 1989). As one proceeds from area to the next along this pathway, neuronal properties change in two obvious ways. First, the complexity of visual processing increases. For example, whereas many V1 cells function essentially as local spatiotemporal energy filters, V2 neurons may respond to virtual or illusory contours in certain figures (von der Hey& et al 1984), and IT neurons respond selectively to global or overall object features, such as shape (Desimone et al 1984, Schwartz et al 1983, Tanaka et al 1991). Second, the receptive field size of individual neurons increases at each stage. As one moves from V1 to V4 to TEO to TE, typical receptive fields in the central field representation are on the order of 0.2, 3, 6, and 25° in size, respectively (see Boussaoud et al 1991, Ungerleider Desimone 1989). Large receptive fields may contribute towards the recognition www.annualreviews.org/aronline Annual Reviews Annu. Rev. Neurosci. 1995.18:193-222. Downloaded from arjournals.annualreviews.org by University of California - San Diego on 01/05/07. For personal use only
VISUAL ATTENTION 197 198 DESIMONE DUNCAN of objects over retinal translation (Gross&Mishkin 1977.Lueschow et al the competition in their favor.This issue,which we term selectivity,is con- 1994. sidered in later sections. These receptive fieldscn be viewed asacritical visual proce ing re source. If the dorsal stream receives its visual input in parallel to the ventral stream for which objects in the visual field must compete(Desimone 1992.Olshausen as the anatomy suggests (Desimone Ungerleider 1989),then it is presumably et al 1993.Tsotsos 1990).If one were to add ever more independent objects faced with competition among objects as well.As in IT cortex,receptive fields to a V4 or IT receptive field,the information available about any one of them in posterior parietal cortex are very large,and it seems likely that increasing would certainly decrease.If,for example,a color-sensitive IT neuron were to the number of independent objects in the visual field will ev integrate wavelength over its large receptive field,one might not be able to the c parietal cor to ext act the locations of eah of them in tell from that cell alone if a given level of response was due to,say,one red object or two yellow ones or three green ones at different locations in the field. etition.to the must also deal nt th dist y filter of the Such ambiguity may be responsible for the interference effects found in divided input (e.g. &Wurtz 1993a,b.U1 fo attention sible eyes to only arget at a time.Acritic e is how may be reduced.in part by linking objects and their features selectivity is coord different systems so that the same target retinal o ed that object is selected for perceptual and spatial analysis as well as for motor tfro om the entra stbe supp cd by th control. e In fact,the ven about the location of complex object featur es.V and TEO eurons process relatively sophisticated information about SELECTIVITY:SCREENING OUT UNWANTED STIMULI object shape (Desimone Schein 1987,Gallant et al 1993.Tanaka et al 1991)and have retinotopically organized receptive fields(Boussaoud et al 1991.Gattass et al 1988).At any Behavioral Data given retinotopic locus in these areas,receptive fields show considerable The ability to screen out irrelevant objects(Figure 1)is not absolute.It is easy scatter.One could.in principle.derive information about the relative locations in some cases and difficult in others.as is well illustrated in visual search.The of nearby features from a population of cells with partially overlapping fields subject detects or identifies a single target presented in an array of nontargets the same way one could derive information about a specific color from a Examples are shown in Figure 3.In easy cases,the target appears to"pop out" population of neurons with broad but different color tuning.Similarly,although of the array,as if attention were drawn directly to it (Donderi Zelnicker receptive fields in IT cortex may span 20-30 degrees or more,they are not 1969,Treisman Gelade 1980).Under such circumstances,the number of ogeneous Typically.the fields have a hol spot nontargets has little effect on the speed or accuracy of target detection or may extend asyr etrically into the or lo wer contralateral visual field. identification.In hard cases,how are not filtered out well.In Alth gh the ces o of IT n the sam these instand number of nta edisplay has a large effect on ions,for ge min ab for eac anges signi i.e cells are t Gelade 1980) hough in fact me w y are tuned to oth er obj ect features (Desi one et a ■ b) Thus,in principle,objects and their locations might be linked to some exten Q within the ventral stream.Even so,parallel processing across the visual field 0 is likely to be limited To sum up.retinal location,as with other object features,is coarsely coded in the ventral stream.Information about more than one object may,to some Q ■ X extent,be processed in parallel,but the information available about any given object will decline as more and more objects are added to receptive fields n the target is a mis atching Therefore,objects must compete for processing in the ventral stream,and the visual svstem should use anv information it has about relevant ohiects to hias
VISUAL ATTENTION 197 of objects over retinal translation (Gross & Mishkin 1977, Lueschow et al 1994). These receptive fields can be viewed as a critical visual processing resource, for which objects in the visual field must compete (Desimone 1992, Olshausen et al 1993, Tsotsos 1990). If one were to add ever more independent objects to a V4 or IT receptive field, the information available about any one of them would certainly decrease. If, for example, a color-sensitive IT neuron were to integrate wavelength over its large receptive field, one might not be able to tell from that cell alone if a given level of response was due to, say, one red object or two yellow ones or three green ones at different locations in the field. Such ambiguity may be responsible for the interference effects found in divided attention. This ambiguity may be reduced, in part, by linking objects and their features to retinal locations. It is sometimes presumed that location information is absent from the ventral "what" stream altogether and must be supplied by the dorsal "where" stream. In fact, the ventral stream itself contains information about the retinal location of complex object features. V4 and TEO neurons process relatively sophisticated information about object shape (Desimone Schein 1987, Gallant et al 1993, Tanaka et al 1991) and have retinotopically organized receptive fields (Boussaoud et al 1991, Gattass et al 1988). At any given retinotopic locus in these areas, receptive fields show considerable scatter. One could, in principle, derive information about the relative locations of nearby features from a population of cells with partially overlapping fields the same way one could derive information about a specific color from a population of neurons with broad but different color tuning. Similarly, although receptive fields in IT cortex may span 20-30 degrees or more, they are not homogeneous. Typically, the fields have a hot spot at the center of gaze, which may extend asymmetrically into the upper or lower contralateral visual field. Although the stimulus preferences of IT neurons remain the same over large retinal regions, for a large minority of cells the absolute response to a given stimulus changes significantly with retinal location, i.e. cells are tuned to retinal location the same way they are tuned to other object features (Desimone et al 1984, Lueschow et al 1994, Schwartz et al 1983; also see Chelazzi et al 1993a). Thus, in principle, objects and their locations might be linked to some extent within the ventral stream. Even so, parallel processing across the visual field is likely to be limited. To sum up, retinal location, as with other object features, is coarsely coded in the ventral stream. Information about more than one object may, to some extent, be processed in parallel, but the information available about any given object will decline as more and more objects are added to receptive fields. Therefore, objects must compete for processing in the ventral stream, and the visual system should use any information it has about relevant objects to bias www.annualreviews.org/aronline Annual Reviews Annu. Rev. Neurosci. 1995.18:193-222. Downloaded from arjournals.annualreviews.org by University of California - San Diego on 01/05/07. For personal use only. 198 DESIMONE & DUNCAN the competition in their favor. This issue~ which we term selectivity, is considered in later sections. If the dorsal stream receives its visual input in parallel to the ventral stream as the anatomy suggests (Desimone & Ungerleider 1989), then it is presumably faced with competition among objects as well. As in IT cortex, receptive fields in posterior parietal cortex are very large, and it seems likely that increasing the number of independent objects in the visual field will eventually exceed the capacity of parietal cortex to extract the locations of each of them in parallel. Likewise, neural systems for visuomotor control must also deal with competition, to the extent that distractors are not already filtered out of the visual input (e.g. Munoz & Wurtz 1993a,b). Ultimately, for example, it possible to move the eyes to only one target at a time. A critical issue is how selectivity is coordinated across the different systems so that the same target object is selected for perceptual and spatial analysis as well as for motor control. SELECTIVITY: SCREENING OUT UNWANTED STIMULI Behavioral Data The ability to screen out irrelevant objects (Figure 1) is not absolute. It is easy in some cases and difficult in others, as is well illustrated in visual search. The subject detects or identifies a single target presented in an array of nontargets. Examples are shown in Figure 3. In easy cases, the target appears to "pop out" of the array, as if attention were drawn directly to it (Donderi & Zelnicker 1969, Treisman & Gelade 1980). Under such circumstances, the number of nontargets has little effect on the speed or accuracy of target detection or identification. In hard cases, however, nontargets are not filtered out well. In these instances, the number of nontargets in the display has a large effect on performance. An increase of 50 ms in target detection time for each nontarget added to the array is typical (Treisman & Gelade 1980), though in fact, this ¯ ¯ P [] [] C ¯ X 3 J Figure 3Selectivity in visual search. Target pop-out is revealed when the target is a mismatching element in an otherwise homogeneous field (panel a). Search is also extremely easy, however, whenever targets and nontargets are highly discriminable. Pop-out can also be based on more complex properties (panel b; search for the single digit). www.annualreviews.org/aronline Annual Reviews Annu. Rev. Neurosci. 1995.18:193-222. Downloaded from arjournals.annualreviews.org by University of California - San Diego on 01/05/07. For personal use only
VISUAL ATTENTION 199 200 DESIMONE DUNCAN figure varies widely and continuously from one task to another (Treisman ican 1988) According to the biased competition model,targets and nontargets comp city in visu sea ch.One fa ing mple,t find a uniquc et in a Figure 3a),per ring com 1984).There ma be similar biases towards sudden appearances of new objects in the visual field (Jonides Yantis 1988)and towards obiects that are larger,brighter,faster- moving,etc (Treisman Gormican 1988). K 5 0 An attentional system,however,would be of little use if it were entirely dominated by bottom-up biases.What is needed is a way to bias competition L 8 towards whatever information is relevant to current behavior that is on needs too-down control in addition to bottom-uD.stimulus-driven biases.Cor gly,there are m ny cases of easy search that do no depend on local arget onset. A colored et in s ned.(c d)No hias.Iis 989 At least after a pop-out if th e p nng search pt a 72 Schneider Shiffrin 1977). of working memory(Baddeley 1986).The template can specify any property vn control,the ability to find 对月 Even when target selection is guided by top-do of required input-shape,color,location,ete. targets is still dependent on bottom-up stimulus factors.especially the visual Visual search is easy if targets and nontargets are easily discriminable.In similarity of targets to nontargets.Provided that targets and nontargets are this case,nontargets are poor matches to the attentional template and receive sufficiently different.however.easy search can be based on many different a weak competitive bias.Thus,the time it takes to find the target may be visual attributes,including simple features.such as size or color.and more independent of the number of nontargets in the display.By contrast,search is complex coniunctions of these features (Duncan Humphreys 1989.Mcleod difficult if nontargets are similar to the target.In this case,the competitive et al 1988 Wolfe ct al 1989).Coniunction search provides a good example of advantage of the target is reduced because each nontarget shares in the bias the importance of similarity.In Figures 4a and b.the target is a large,white provided by the attentional template.Thus,each nontarg added to the display vertical bar.This target is much harder to find in Figure 4a,where each I-search accounts are consid. nontarget shares two oroperties with the target,than in Figure 46,where or red be 51987).lndc ed the l atter ca cifically with spatial selection,i.e.sele can e xcellen scTiminabiyieachconc be produ ed n' mply by 197 Pos 80.Sperling 1960). al often special case.We do not reviev in detail t wa uch results suggest the following model of biased covered earlier by Posner Petersen (1990),and Colby (1991)has reviewe to the task,any kind of input-objects of a certain kind,objects with a certain the ncural mechanisms of spatial selection.Certainly,however,space is only color or motion,objects in a certain location,etc-can be behaviorally relevant one of the many cues that can be used in efficient target selection.A general Some kind of short-term description of the information currently nceded must account of selectivity must deal with both spatial and nonspatial cases.In terms be used to control competitive bias in the visual system.such that inputs of the biased competition model.prior knowledge of the target's spatial loca- matching that description are favored in the visual cortex (Bundesen 1990. tion is just another type of attentional template that can be used to bias Duncan Humphreys 1989).This short-term description has been called the competition in favor of the target attentional template (Duncan Humnhrevs 1989):it mav he seen as one asnect A final consideration is hias deriv ed from Iono-term memory One interest-
VISUAL ATTENTION 199 figure varies widely and continuously from one task to another (Treisman Gormican 1988). According to the biased competition model, targets and nontargets compete for processing capacity in visual search. One factor influencing selectivity is bottom-up bias. It is very easy, for example, to find a unique target in an array of homogeneous nontargets (Figure 3a), perhaps reflecting an enduring competitive bias towards local inhomogeneities (Sagi & Julesz 1984). There may be similar biases towards sudden appearances of new objects in the visual field (Jonides & Yantis 1988) and towards objects that are larger, brighter, fastermoving, etc (Treisman & Gormican 1988). An attentional system, however, would be of little use if it were entirely dominated by bottom-up biases. What is needed is a way to bias competition towards whatever information is relevant to current behavior. That is, one needs top-down control in addition to bottom-up, stimulus-driven biases. Correspondingly, there are many cases of easy search that do not depend on local inhomogeneity or sudden target onset. A colored target in a multicolored display, for example, may show good pop-out if the colors are highly discriminable (Duncan 1989). At least after a little practice, pop-out can be obtained during search for a single digit among letters (Figure 3b) (see Egeth et al 1972, Schneider & Shiffrin 1977). Even when target selection is guided by top-down control, the ability to find targets is still dependent on bottom-up stimulus factors, especially the visual similarity of targets to nontargets. Provided that targets and nontargets are sufficiently different, however, easy search can be based on many different visual attributes, including simple features, such as size or color, and more complex conjunctions of these features (Duncan & Humphreys 1989, McLeod et al 1988, Wolfe et al 1989). Conjunction search provides a good example of the importance of similarity. In Figures 4a and b, the target is a large, white vertical bar. This target is much harder to find in Figure 4a, where each nontarget shares two properties with the target, than in Figure 4b, where only one property is shared (Quinlan & Humphreys 1987). Indeed, the latter case can give excellent pop-out; a similar result can be produced simply by increasing the discriminability of each conjunction’s component features (Wolfe et al 1989). Such results suggest the following model of biased competition. According to the task, any kind of input---objects of a certain kind, objects with a certain color or motion, objects in a certain location, etc-~can be behaviorally relevant. Some kind of short-term description of the information currently needed must be used to control competitive bias in the visual system, such that inputs matching that description are favored in the visual cortex (Bundesen 1990, Duncan & Humphreys 1989). This short-term description has been called the attentional template (Duncan & Humphreys 1989); it may be seen as one aspect www.annualreviews.org/aronline Annual Reviews Annu. Rev. Neurosci. 1995.18:193-222. Downloaded from arjournals.annualreviews.org by University of California - San Diego on 01/05/07. For personal use only. 200 DESIMONE & DUNCAN c) Y d) h K Q £) L V g H P F Figure 4(a, b) Discriminability between targets and nontargets in conjunction search. Searching for a large, white vertical bar is harder when nontargets share two (panel a) rather than one (panel b) property with the target. In the latter case good pop-out can be obtained. (c, at) Novelty bias. It easier to find a single inverted letter among upright nontargets (panel c) than the reverse (panel d). of working memory (Baddeley 1986). The template can specify any property of required input--shape, color, location, etc. Visual search is easy if targets and nontargets are easily discriminable. In this case, nontargets are poor matches to the attentional template and receive a weak competitive bias. Thus, the time it takes to find the target may be independent of the number of nontargets in the display. By contrast, search is difficult if nontargets are similar to the target. In this case, the competitive advantage of the target is reduced because each nontarget shares in the bias provided by the attentional template. Thus, each nontarget added to the display interferes with target detection. Alternative, serial-search accounts are considered below. A great deal of work has dealt specifically with spatial selection, i.e. selection based on some cue to the location of target information (Eriksen Hoffman 1973, Posner et al 1980, Sperling 1960). Indeed, spatial selection is often dealt with as a special case. We do not review this work in detail; it was covered earlier by Posner & Petersen (1990), and Colby (1991) has reviewed the neural mechanisms of spatial selection. Certainly, however, space is only one of the many cues that can be used in efficient target selection. A general account of selectivity must deal with both spatial and nonspatial cases. In terms of the biased competition model, prior knowledge of the target’ s spatial location is just another type of attentional template that can be used to bias competition in favor of the target. A final consideration is bias derived from long-term memory. One interestwww.annualreviews.org/aronline Annual Reviews Annu. Rev. Neurosci. 1995.18:193-222. Downloaded from arjournals.annualreviews.org by University of California - San Diego on 01/05/07. For personal use only
VISUAL ATTENTION 201 202 DESIMONE DUNCAN ing case is bias to novelty.As shown in Figures 4c and d,for example,it is while monkeys performed delayed matching-to-sample(DMS)tasks with much easier to find an inverted (novel)target among upright (familiar)non- either novel or familiar stimuli.In DMS.a sample stimulus is followed by targets (Figure 4c)than the reverse (Figure 4d)(Reicher et al 1976).In fact. one or more test stimuli,and the animal signals when a test stimulus matches the time it takes to find an inverted character may be independent of the number the sample.For up to a third of the cells in this region,responses to novel of upright ones in a display (Wang et al 1992).which implies that multiple sample stimuli become suppressed as the animal acquires familiarity with objects have parallel access to memory and that familiarity is a type of object them (Fahy et al 1993,Li et al 1993,Miller et al 1991.Riches et al 1991). feature that can be used to bias attentional competition.A second consideration The cells are not novelty detectors.in that they do not respond to any nove ned imp tance.In a busy om,attention can be attracted by stimulus.Rather,they remain stimulus selective both before and after the b 1959).Similarly visual e with rd t n the In fact this shrinkage in the population of activated neurons as stimuli eider 1977).Thus,the top- fam the down selection bias of a current task can sometimes be overturned by infor for those stim ams the ures hew stim mation of long-term or general significance acting in a bottom-up fashion.In fashion drop out of the next sections we consider both bottom-up and top-down mechanisms for et al 1993).leaving those that are most selective.There is also direct evidence resolving competition. that some IT cells selective for faces become more tuned to a familiar face following expcrience (Rolls et al 1989). Bottom-Up Neural Mechanisms for Object Selection An effect akin to the novelty effect is also found for familiar stimuli that The first neural mechanisms for resolving competition we consider are those have been seen recently.When a test stimulus matches the previously seen that derive from the intrinsic or learned biases of the perceptual systems sample in the DMS trial,responses to that stimulus tend to be suppressed towards certain types of stimuli.We describe them bere as bottom-up pro (Miller et al 1991,1993:also see Baylis Rolls 1987,Eskandar et al 1992. cesses,not because they do not involve feedback pathways in visual Fahy et al 1993.Riches ct al 1991).Although it was originally roposed that (hey may well do so)but be se th utomatic processes this suppressive effect was dependent on that e hey appede sample,recent work has shown it to be an automatic outcome of any stimulus uli that stand out f the e pro sed preferentially a repetition (Miller&Desimone )For many cells.this uppression occurs near of the of ma en if the muli differ in size or etinal locations otherwis re ppea optimal stim y c receptiv al 1994).Thus,the detec ion ovelty and ency apparently may b letely are within a l arge sur at a high ev of s repres rounding region (for reviews see Allman et al 1985,Desimone et al 1985). together, at bo The greater the density of stimuli in the surround,the greater the suppression ve no been recently seen vill have a larger neural (Knierim Van Essen 1992).In the middle temporal area (MT),for example, giving them a competitive a vantage in ga ing con a cell that normally responds to vertically moving stimuli within its receptive orienting systems.This would explain the bi as towards novelty in the H feld may be unresponsive if the same stimuli are part of a larger moving behavioral data described above.The longer the organism attends to the obje pattern coverine the receptive field and surround (allman et al 1985 tanaka the more knowledge about the object is incorporated into the structure of the et al 1986).These mechanisms almost certainly contribute to the pop-out cortex;this reduces the visual signal.It will also reduce the drive on the effects of targets in visual search orienting system so that the organism is free to orient to the next new object As indicated above,the visual system also seems to be biased towards new (Li et al 1993,Desimone et al 1994).This view is compatible with Adaptive objects or objects that have not h n.Thus the t Resonance Theory (Carpenter Grossberg 1987),in which novel stimuli of a stimulu s m activate attentional systems that allow new lone-term memories to he formed. he e stim li n as the Consistent with these neurophysiological results in animals.a reduction in nd,or cor which t neural activation with stimulus epetition in human subjects has been seen in King exam cn ter both event-related p ve ocen entials of the temporal cortex(Begleiter et al 1993)and in brain-im oing studies (Squire et al 1992)
VISUAL ATTENTION 201 ing case is bias to novelty. As shown in Figures 4c and d, for example, it is much easier to find an inverted (novel) target among upright (familiar) targets (Figure 4c) than the reverse (Figure 4d) (Reicher et al 1976). In the time it takes to find an inverted character may be independent of the number of upright ones in a display (Wang et al 1992), which implies that multiple objects have parallel access to memory and that familiarity is a type of object feature that can be used to bias attentional competition. A second consideration is long-term learned importance. In a busy room, attention can be attracted by the sound of one’s own name spoken nearby (Moray 1959). Similarly, long practice with one set of visual targets makes them hard to ignore when they are subsequently made irrelevant (Shiffrin & Schneider 1977). Thus, the topdown selection bias of a current task can sometimes be overturned by information of long-term or general significance acting in a bottom-up fashion. In the next sections we consider both bottom-up and top-down mechanisms for resolving competition. Bottom-Up Neural Mechanisms for Object Selection The first neural mechanisms for resolving competition we consider are those that derive from the intrinsic or learned biases of the perceptual systems towards certain types of stimuli. We describe them here as bottom-up processes, not because they do not involve feedback pathways in visual cortex (they may well do so) but because they appear to be largely automatic processes that are not dependent on cognition or task demands. Stimuli that stand out from their background are processed preferentially at nearly all levels of the visual system. In visual cortex, the responses of many cells to an otherwise optimal stimulus within their classically defined receptive field may be completely suppressed if similar stimuli are within a large surrounding region (for reviews see Allman et al 1985, Desimone et al 1985). The greater the density of stimuli in the surround, the greater the suppression (Knierim & Van Essen 1992). In the middle temporal area (MT), for example, a cell that normally responds to vertically moving stimuli within its receptive field may be unresponsive if the same stimuli are part of a larger moving pattern covering the receptive field and surround (Allman et al 1985, Tanaka et al 1986). These mechanisms almost certainly contribute to the pop-out effects of targets in visual search. As indicated above, the visual system also seems to be biased towards new objects or objects that have not been recently seen. Thus, the temporal context of a stimulus may contribute as much to its saliency as its spatial context. In the temporal domain, stimuli stored in memory may function as the temporal surround, or context, against which the present stimulus is compared. Striking examples of such temporal interactions have been found in the anteroventral portion of IT cortex. Most studies in this region recorded cells www.annualreviews.org/aronline Annual Reviews Annu. Rev. Neurosci. 1995.18:193-222. Downloaded from arjournals.annualreviews.org by University of California - San Diego on 01/05/07. For personal use only. 202 DESIMONE & DUNCAN while monkeys performed delayed matching-to-sample (DMS) tasks with either novel or familiar stimuli. In DMS, a sample stimulus is followed by one or more test stimuli, and the animal signals when a test stimulus matches the sample. For up to a third of the cells in this region, responses to novel sample stimuli become suppressed as the animal acquires familiarity with them (Fahy et al 1993, Li et al 1993, Miller et al 1991, Riches et al 1991). The cells are not novelty detectors, in that they do not respond to any novel stimulus. Rather, they remain stimulus selective both before and after the visual experience. In fact, this shrinkage in the population of activated neurons as stimuli become familiar may increase the selectivity of the overall neuronal population for those stimuli. As one learns the critical features of a new stimulus, cells activated in a nonspecific fashion drop out of the activated pool of cells (Li et al 1993), leaving those that are most selective. There is also direct evidence that some IT cells selective for faces become more tuned to a familiar face following experience (Rolls et al 1989). An effect akin to the novelty effect is also found for familiar stimuli that have been seen recently. When a test stimulus matches the previously seen sample in the DMS trial, responses to that stimulus tend to be suppressed (Miller et al 1991, 1993; also see Baylis & Rolls 1987, Eskandar et al 1992, Fahy et al 1993, Riches et al 1991). Although it was originally proposed that this suppressive effect was dependent on active working memory for the sample, recent work has shown it to be an automatic outcome of any stimulus repetition (Miller & Desimone 1994). For many cells, this suppression occurs even if the repeated stimuli differ in size or appear in different retinal locations (Lueschow et al 1994). Thus, the detection of novelty and recency apparently occurs at a high level of stimulus representation. Taken together, the results indicate that both novel stimuli and stimuli that have not been recently seen will have a larger neural signal in the visual cortex, giving them a competitive advantage in gaining control over attentional and orienting systems. This would explain the bias towards novelty in the human behavioral data described above. The longer the organism attends to the object, the more knowledge about the object is incorporated into the structure of the cortex; this reduces the visual signal. It will also reduce the drive on the orienting system so that the organism is free to orient to the next new object (Li et al 1993, Desimone et al 1994). This view is compatible with Adaptive Resonance Theory (Carpenter & Grossberg 1987), in which novel stimuli activate attentional systems that allow new long-term memories to be formed. Consistent with these neurophysiological results in animals, a reduction in neural activation with stimulus repetition in human subjects has been seen in both event-related potentials of the temporal cortex (Begleiter et al 1993) and in brain-imaging studies (Squire et al 1992). www.annualreviews.org/aronline Annual Reviews Annu. Rev. Neurosci. 1995.18:193-222. Downloaded from arjournals.annualreviews.org by University of California - San Diego on 01/05/07. For personal use only
VISUAL ATTENTION 203 204 DESIMONE DUNCAN Top-Down Control of Selection in the Ventral Stream 1ocalizedaregionmodulstedbyspatialatienioninlsiepmlprestnatecortex av-down biases on ss at hand. region may correspond to area V4(Mangun et al 1993). templa e,der e from the requiremen of the Although we Recently,Motter(1993)has reported attentional effects on responses of cells in V1,V2,and V4.In contrast to the Moran Desimone (1985)study,these share many features. effects were found when one stimulus was inside the field,and others outside. Most surprisingly,cueing the animal for the target location was almost as likely to suppress responses to the target as to facilitate them.A possible reason for SELECTION BASED ON SPATIAL LOCATION As we described above,one central the discrepancy between the two studies is that Motter (1993)found these resource for which stimuli compete in the ventral stream seems to be the effects only when there were a large number of distractors in the visual field receptive field.Not surprisingly then,spatial selection in this stream does not whereas Moran&Desimone(1985)used only a single distractor.Increasing simply enhance processing of the stimulus at the attended location but rathe the c competitio objects in the vis have in sed the rol ms to resolve competition between stimuli in the receptive field. Mo study of cells in v4 and It conex monkevs f med a discrim ination task on to stimuli at one ioin the visual d,rn plicit p the target locatio ge (b t not any of the distractors)was physically adde possibly ractors at a seco ran De 85).The on (M he monkey by cing some complex sensory effects.In any event,other recent studies have indi i to confirmed that attenti onal effects in V4 are muc larger when target and t the star of th un,1. the spatial was purer distractor compete within the same receptive field than in any other configu- d spatia mory. ration (Luck et al 1993:L Chelazzi,unpublished data). were both within the receptive field of the recorded cell,the uronal response was determined primarily by the target;responses to the CIRCUTTRY UNDERLYING SPATIAL SELECTION Although the synaptic mecha- distractor The cells responded as though their nisms mediating the gating of V4 and IT responses are unknown,anatomy Ishrunk around the target.Consistent with this,Richmond dictates that they fall into either of two classes (Desimone 1992).In the first et al(1983)found that the presence of a central fixation target in the receptive class,spatial biasing inputs to visual cortex determine which specific subset field of an IT neuron may block the response to a more peripheral stimulus of a cell's inputs causes the cell to fire,whereas in the second class,the inputs in the field. determine which specific cells in a population are allowed to fire.In other In the Moran Desimone (1985)study,when one of the two locations was words.one can either gate some of the inputs to a cell on or off,or one can placed outside the receptive field of the recorded cell.attention no longer had gate some of the cells on or off.Theoretical models for both classes of circuitry any effect on the response.This was consistent with the biased competition have been developed (Anderson Van essen 1987.Crick&Koch 1990 model:Target and distractor were no longer competing for the cell'respor e none 1992 Ni 9g301 eptieheaspatialbiase7 bur et al 1 sen ct al 1993,Tsotsos 1994).All and thus,top-dow had ar effect f the are ouli within th which operated were much larger in the en ent data to he ortex r were loc y. 19 he same hemifield and,there e gating of V an I responses occurs as a res lt of an exteral inpu targ that biases competition in favor of the target,one might expect to see some n competition(Sato 1988). evidence for it.A possible candidate has been found in a new study of spatial receptive field s were too small to test the effects of placing both attention in V4 (Luck et al 1993).V4 cells in this study showed a sustained inoe e a theroVh elevation of their baseline (prestimulus)firing rates whenever the animal's attention was directed inside their receptive field.This elevation of activity was no effect of attention on VI cells in this paradigm.These results suggest with attention could be the neural analogue of the attentional template for that target selection is a two-stage process:The first stage works over a small location.The elevation occurred at the start of each trial before any stimulus spatial range in V4.and the second stage works over a much larger spatial had appeared.Since the only information about where to attend was given to range in IT cortex:both are in line with their receptive field sizes (Moran the animal minutes carlier at the start of a block of trials,the relevant location Desimone 1985).Studies of event-related notentials in hnmans have also met have heen stomed in wnrkino memory The snatial resnlntion nf this sonrre
VISUAL ATq~NTION 203 Top-Down Control of Selection in the Ventral Stream As we have said, top-down biases on visual processing, or the attentional template, derive from the requirements of the task at hand. Although we consider mechanisms for spatial and object selection separately, they in fact share many features. SELECTION BASED ON SPATIAL LOCATION Aswe described above, one central resource for which stimuli compete in the ventral stream seems to be the receptive field. Not surprisingly then, spatial selection in this stream does not simply enhance processing of the stimulus at the attended location but rather seems to resolve competition between stimuli in the receptive field. In one study of cells in V4 and IT cortex, monkeys performed a discrimination task on target stimuli at one location in the visual field, ignoring simultaneously presented distractors at a second location (Moran & Desimone 1985). The target location for a given run was indicated to the monkey by special instruction trials at the start of that run, i.e. the spatial bias was purely top down and presumably required spatial working memory. When target and distractor were both within the receptive field of the recorded cell, the neuronal response was determined primarily by the target; responses to the distractor were greatly attenuated. The cells responded as though their receptive fields had shrunk around the target. Consistent with this, Richmond et al (1983) found that the presence of a central fixation target in the receptive field of an IT neuron may block the response to a more peripheral stimulus in the field. In the Moran & Desimone (1985) study, when one of the two locations was placed outside the receptive field of the recorded cell, attention no longer had any effect on the response. This was consistent with the biased competition model: Target and distractor were no longer competing for the cell’s response, and thus, top-down spatial bias no longer had any effect. Receptive fields and the region of space over which attention operated were much larger in the IT cortex. However, even here attentional effects were larger when target and distractor were located within the same hemifield and, therefore, more likely to be in competition (Sato 1988). In V1, receptive fields were too small to test the effects of placing both target and distractor within them. However, when one stimulus was located inside, and one outside (at the same spatial separation used in area V4), there was no effect of attention on V1 cells in this paradigm. These results suggest that target selection is a two-stage process: The first stage works over a small spatial range in V4, and the second stage works over a much larger spatial range in IT cortex; both are in line with their receptive field sizes (Moran Desimone 1985). Studies of event-related potentials in humans have also www.annualreviews.org/aronline Annual Reviews Annu. Rev. Neurosci. 1995.18:193-222. Downloaded from arjournals.annualreviews.org by University of California - San Diego on 01/05/07. For personal use only. 204 DESIMONE & DUNCAN localized a region modulated by spatial attention in lateral prestriate cortex; this region may correspond to area V4 (Mangun et al 1993). Recently, Motter (1993) has reported attentional effects on responses of cells in V1, V2, and V4. In contrast to the Moran & Desimone (1985) study, these effects were found when one stimulus was inside the field, and others outside. Most surprisingly, cueing the animal for the target location was almost as likely to suppress responses to the target as to facilitate them. A possible reason for the discrepancy between the two studies is that Motter (1993) found these effects only when there were a large number of distractors in the visual field, whereas Moran & Desimone (1985) used only a single distractor. Increasing the competition among objects in the visual field may have increased the role of attentional biases. Other differences include the fact that Motter used an explicit spatial cue to indicate the target location in the display, and the target (but not any of the distractors) was physically added to the cue, possibly inducing some complex sensory effects. In any event, other recent studies have confirmed that attentional effects in V4 are much larger when target and distractor compete within the same receptive field than in any other configuration (Luck et al 1993; L Chelazzi, unpublished data). CIRCUITRY UNDERLYING SPATIAL SELECTION Although the synaptie mechanisms mediating the gating of V4 and IT responses are unknown, anatomy dictates that they fall into either of two classes (Desimone 1992). In the first class, spatial biasing inputs to visual cortex determine which specific subset of a cell’s inputs causes the cell to fire, whereas in the second class, the inputs determine which specific cells in a population are allowed to fire. In other words, one can either gate some of the inputs to a cell on or off, or one can gate some of the cells on or off. Theoretical models for both classes of circuitry have been developed (Anderson & Van Essen 1987, Crick & Koch 1990, Desimone 1992, Niebur et al 1993, Olshausen et al 1993, Tsotsos 1994). All of the models resolve competition when there are mulitple stimuli within the receptive field. Presently, there are insufficient data to decide between them. If the gating of V4 and IT responses occurs as a result of an external input that biases competition in favor of the target, one might expect to see some evidence for it. A possible candidate has been found in a new study of spatial attention in V4 (Luck et al 1993). V4 cells in this study showed a sustained elevation of their baseline (prestimulus) firing rates whenever the animal’s attention was directed inside their receptive field. This elevation of activity with attention could be the neural analogue of the attentional template for location. The elevation occurred at the start of each trial before any stimulus had appeared. Since the only information about where to attend was given to the animal minutes earlier at the start of a block of trials, the relevant location must have been stored in working memory. The spatial resolution of this source www.annualreviews.org/aronline Annual Reviews Annu. Rev. Neurosci. 1995.18:193-222. Downloaded from arjournals.annualreviews.org by University of California - San Diego on 01/05/07. For personal use only
VISUALATTENTION 205 206 DESIMONE DUNCAN receptive field,the magnitude of the e shift varied ccording to the whatever spatial bias signal enters the cortex,it apparently has a spatial reso- lution finer than the receptive field dimensions of v4 celis. SELECTION BASED ON FEATURES The mechanism underlying the selection of 30 objects by their features(when their location is not known in advance)requires a means to hold the sought-after object in working memory and to use this memory (or attentional template)to resolve competition among the elements in the scene.Recently,evidence for this selection mechanism has been found 25 in the anteroventral portion of IT cortex,the same portion in which memory related activity has been found (Chelazzi et al 1993a) briefly pre rotgactoho esented with a co mp ex picture (the cue)at the cent ory.The cue on a goo at elicited a strong espons Good cue icited or no resp onse elf.Fol ⑧15 ga delay,the and the po oth presented nuli,at an extrafoveal lo tion.The monkey made a sacc to the target stimulus that matched the cue,ignoring the nonmatching stimulus (the distractor). As shown in Figure 5.the choice array initially activated IT cells tuned to 小 the properties of either stimulus,in parallel,irrespective of which stimulus was the target.Within 200 ms after array onset,however,the response changed Poor cue dramatically depending on whether the animal was about to make an cye 0 movement to the good or poor stimulus.When the target was the good stimulus .7 00.71.42.12.83.5 the response remained high.However.when the tareet was the poor stimulus for the recorded cell,the resp onse to the good dist actor stimulus was sup Time from cue onset (s) pressed even tho gh ithin ms be ve field.This change Figure 5 Effects ofobject selection onr 100 ocellsinthe ITcore.The upp movemen he general aphs sho erage r the ta ity in IT cortex reflected only th target's prop Whenthe choice animal made a sacc m to the stin and remained ecedighcchaieea suppressed until well after the eye movement was made.Similar effects were the cu ving the delay.celis found for choice arrays of larger sizes. n whe Just as with spatially directed attention,these effects of object selection in s diverged pending o get was the good or th the IT cortex were much smaller when target and nontargets were located in et al (1993ak opposite hemifields than when they were in the same hemifield,i.e.when they were maximally in competition.Interestingly,similar competitive effects are receptive field were more suppressed when the competing target was located seen even at high levels of oculomotor control in the frontal eye field.Cells just ouside the receptive field,and thus maximally competitive,than when it in this region were recorded while monkeys made eye movements to a target was further in a field of distractors(Schall Hanes 1993).Responses to distractors in the Two find sueeest that the taroet is selected in the IT cortex as a result
VISUAL ATTENTION 205 was very high; when attention was shifted to different regions within the same receptive field, the magnitude of the baseline shift varied according to the distance between the focus of attention and the receptive field center. Thus, whatever spatial bias signal enters the cortex, it apparently has a spatial resolution finer than the receptive field dimensions of V4 cells. SELECTION BASED ON FEATURES The mechanism underlying the selection of objects by their features (when their location is not known in advance) requires a means to hold the sought-after object in working memory and to use this memory (or attentional template) to resolve competition among the elements in the scene. Recently, evidence for this selection mechanism has been found in the anteroventral portion of IT cortex, the same portion in which memoryrelated activity has been found (Chelazzi et al 1993a). Monkeys were briefly presented with a complex picture (the cue) at the center of gaze to hold in memory. The cue on a given trial was either a good stimulus that elicited a strong response from the cell or a poor stimulus that elicited little or no response when presented by itself. Following a delay, the good and the poor stimuli were both presented simultaneously as choice stimuli, at an extrafoveal location. The monkey made a saccadic eye movement to the target stimulus that matched the cue, ignoring the nonmatching stimulus (the distractor). As shown in Figure 5, the choice array initially activated IT cells tuned to the properties of either stimulus, in parallel, irrespective of which stimulus was the target. Within 200 ms after array onset, however, the response changed dramatically depending on whether the animal was about to make an eye movement to the good or poor stimulus. When the target was the good stimulus, the response remained high. However, when the target was the poor stimulus for the recorded cell, the response to the good distractor stimulus was suppressed even though it was still within the receptive field. This change in response occurred about 100 ms before the onset of the eye movement. The cells responded as though the target stimulus captured their response, so neuronal activity in IT cortex reflected only the target’s properties. Cells selective for the nontargets were suppressed within 200 ms and remained suppressed until well after the eye movement was made. Similar effects were found for choice arrays of larger sizes. Just as with spatially directed attention, these effects of object selection in the IT cortex were much smaller when target and nontargets were located in opposite hemifields than when they were in the same hemifield, i.e. when they were maximally in competition. Interestingly, similar competitive effects are seen even at high levels of oculomotor control in the frontal eye field. Cells in this region were recorded while monkeys made eye movements to a target in a field of distractors (Schall & Hanes 1993). Responses to distractors in the www.annualreviews.org/aronline Annual Reviews Annu. Rev. Neurosci. 1995.18:193-222. Downloaded from arjournals.annualreviews.org by University of California - San Diego on 01/05/07. For personal use only. 206 DESIMONE & DUNCAN 30- 25- 20- ~15- ,., ~ -- Poor cue ,,-~, -0.7 0 0.7 1.4 2.1 2.8 3.5 Time from cue onset (s) Figure5 Effects ofobject selection on responses of cells in the IT cortex. The upper insert illustrates the general visual search task. Graphshow the average response of 22 cells recorded while monkeys performed the task. The cue was chosen to be either a good or a poor stimulus for the recorded cell. When the choice array was presented, the animal made a saccadic eye movement to the stimulus (target) that matched the previous cue. The saccadic latency was 300 ms, indicated by the asterisk. Cells had a higher firing rate in the delay preceding the choice array when their prefered stimulus was the cue. Following the delay, cells were activated (on the average) by their prefered stimulus the array, regardless of whether it was the target. However, 100 ms before the eye movement was made, responses diverged depending on whether the target was the good or the poor stimulus. The two dark horizontal bars indicate when the cue and the choice were presented. Adapted from Chelazzi et al (1993a). receptive field were more suppressed when the competing target was located just outside the receptive field, and thus maximally competitive, than when it was further away. Two findings suggest that the target is selected in the IT cortex as a result www.annualreviews.org/aronline Annual Reviews Annu. Rev. Neurosci. 1995.18:193-222. Downloaded from arjournals.annualreviews.org by University of California - San Diego on 01/05/07. For personal use only
VISUAL ATTENTION 207 208 DESIMONE DUNCAN of inputs(initiated at the time of the cue)that bias competition in favor of the a disregard or neglect of objects and actions in contralateral space (for review target.First,IT cells selective for the properties of the cue-target show higher see Bisiach Vallar 1988).Neglect can be manifested as failure to cop maintained activity in the delay following the cue than do cells selective for the distractor.This could be the neural correlate of the attentional template for face etc the target.Second,a subpe onses to Neglect of one form or another has been as sociated with the cho prefe the amage to a grea uctures, tend to be parietal cortex (e.g.Bisiach&Vallar ctive e results i alenstein 1972).the cingulate gyrus ve for th ntal cortex (Heilman d to it by an e ter (Watson et al 1973),the basal ganglia (Hier et al 1977),the thalamus (Rafal source before the onsc Posner 1987.Watson Heilman 1979),the midbrain and supcrior colliculus response to the target when t appears.Eventual (Posner et al 1982),and even the temporal lobe (Shelton et al 1990). interactions during the initial visual activation.cells selective for the distractors Most importantly,neglect of the contralesional side can be exaggerated by are suppressed.At some point in time,mechanisms for spatial selection may competing events on the other,unimpaired,side-a phenomenon termed ex- also be engaged to facilitate localization of the target for the eye movement. tinction.Thus,neglect manifests more as a competitive bias against one side Cue-or template-related activity during delay periods (Fuster Jervey than as an absolute inability to deal with that side(Kinsbourne 1993).If there 1981.Miller et al 1993.Miyashita Chang 1988)and enhanced responses to are critical spatial gating inputs to the ventral stream.they probably arise from (target)stimuli matching a prior cue (Miller Desimone 1994)have also been more than o 6dinanwomeakmcKM民hes ms of the bia areas of ompetition model,damage to the s one he uted in space rather than time imt ortantly,the same seems to be true of spatial ment.Fi h the d which shares many f民 object selection,including which ma inc fu ssion Second 月 nor Th s ITcells in working mani wher sual system this lo oss co ld m red be sual corte eith ectly,through the elimination of ructures th at similar mechanisms mayberef ted i studies of human brain ute to stimulus saliency or that supply top-down spatial selection inputs,or activation that use positron emission tomography (PET).In one study (Corbetta indirectly,through the elimination of structures that supply the critical ones et al 1991),subjects were asked to compare one feature of two successive with inputs.The superior colliculus does not project directly to the visual displays,each containing a moving field of colored shapes.Different portions cortex,for example,but ultimately provides inputs to other structures that do. of extrastriate cortex were preferentially activated depending on whether the Either unilateral or focal damage to the colliculus could affect competition relevant feature was motion,on the one hand,or color or shape.on the other Physverityofhiopdow within these other parts of the sy tem.thus throwing them out of balance (Desimone et al 19 0b)A loss of c mpetitive weights would also explain why influences on ventral stream neurons that may influence object selectic on hias nly follo ilateral rather than bilatera but they are b nd the se eof this review(e.g.Mau nsel et al 1991.Spitzer vith bilate h em iGeld ha et al 1988.Spita Rich ond 1991) over 4hnmulpk Neural Sources of Spatial Selection Bias effe otha loss of function ee neural systems that might a loss of compettive weights and that competuve weights may be affected at ourc h ttentional lemplate for spanial ation.The lesion data any level between sensory input and motor output,it is not surprising that there are readily explaine Iby the ased competition modei but,unfortunately,do are many reports of dissociation between one form of neglect and another.For not by themselves pin down the critical sources. example,there are reports of neglect of body vs environmental objects (e.g. Following the formation of a lesion on one side of the brain.there is often Guariglia Antonucci 1992).neelect of close vs far space (e.e.Halligan
VISUAL ATTENTION 207 of inputs (initiated at the time of the cue) that bias competition in favor of the target. First, IT cells selective for the properties of the cue-target show higher maintained activity in the delay following the cue than do cells selective for the distractor. This could be the neural correlate of the attentional template for the target. Second, a subpopulation of the cells gives enhanced responses to the choice array when their preferred stimulus is the target, even during the first 200 ms in which all cells tend to be active. Together, the results indicate that cells selective for the target are primed to respond to it by an external source before the onset of the choice array; they then give an enhanced response to the target when it appears. Eventually, as a result of competitive interactions during the initial visual activation, cells selective for the distractors are suppressed. At some point in time, mechanisms for spatial selection may also be engaged to facilitate localization of the target for the eye movement. Cue-, or template-, related activity during delay periods (Fuster & Jervey 1981, Miller et al 1993, Miyashita & Chang 1988) and enhanced responses to (target) stimuli matching a prior cue (Miller & Desimone 1994) have also found in studies of working memory in IT cortex. Visual search simply appears to be a variant of a working memory task, in which the distractors are distributed in space rather than time. Importantly, the same seems to be true of spatial selection, which shares many features with object selection, including template-related activity during delays followed by response-suppression to competing nontargets. The major difference may simply be the nature of the template. The potential sources of the template that primes IT cells in working memory is considered below. Somewhat similar mechanisms may be reflected in studies of human brain activation that use positron emission tomography (PET). In one study (Corbetta et al 1991), subjects were asked to compare one feature of two successive displays, each containing a moving field of colored shapes. Different portions of extrastriate cortex were preferentially activated depending on whether the relevant feature was motion, on the one hand, or color or shape, on the other. Physiological studies have also shown a variety of other nonspatial, top-down influences on ventral stream neurons that may influence object selection bias, but they are beyond the scope of this review (e.g. Maunsell et al 1991, Spitzer et al 1988, Spitzer & Richmond 1991). Neural Sources of Spatial Selection Bias LESION STUDIES IN HUMANS We now turn to the neural systems that might be the source of the attentional template for spatial location. The lesion data are readily explained by the biased competition model but, unfortunately, do not by themselves pin down the critical sources. Following the formation of a lesion on one side of the brain, there is often www.annualreviews.org/aronline Annual Reviews Annu. Rev. Neurosci. 1995.18:193-222. Downloaded from arjournals.annualreviews.org by University of California - San Diego on 01/05/07. For personal use only. 208 DESIMONE & DUNCAN a disregard or neglect of objects and actions in contralateral space (for review see Bisiach & Vallar 1988). Neglect can be manifested as failure to copy one half of a drawing, to read text on one half of a page, to shave one half of the face, etc. Neglect of one form or another has been associated with damage to a great variety of brain structures, including the parietal cortex (e.g. Bisiach & Vallar 1988), the frontal cortex (Heilman & Valenstein 1972), the cingulate gyrus (Watson et al 1973), the basal ganglia (Hier et al 1977), the thalamus (Rafal & Posner 1987, Watson & Heilman 1979), the midbrain and superior col liculus (Posner et al 1982), and even the temporal lobe (Shelton et al 1990). Most importantly, neglect of the contralesional side can be exaggerated by competing events on the other, unimpaired, side--a phenomenon termed extinction. Thus, neglect manifests more as a competitive bias against one side than as an absolute inability to deal with that side (Kinsboume 1993). If there are critical spatial gating inputs to the ventral stream, they probably arise from more than one structure. In terms of the biased competition model, damage to the spatially mapped areas of one hemisphere may cause two different types of behavioral impairment. First is the loss of whatever functions are mediated by the damaged areas, which may include perceptual, visuospatial, and oculomotor functions. Second is the loss of competitive weights afforded to objects in the affected portion of the contralesional field, which may be manifested anywhere between sensory input and motor output. In the visual system, this loss could affect visual cortex either directly, through the elimination of structures that contribute to stimulus saliency or that supply top-down spatial selection inputs, or indirectly, through the elimination of structures that supply the critical ones with inputs. The superior colliculus does not project directly to the visual cortex, for example, but ultimately provides inputs to other structures that do. Either unilateral or focal damage to the colliculus could affect competition within these other parts of the system, thus throwing them out of balance (Desimone et al 1990b). A loss of competitive weights would also explain why neglect and extinction most commonly follow unilateral rather than bilateral lesions; with bilateral lesions, neither hemifield has a competitive advantage over the other. It seems likely that competition in multiple brain systems is coordinated so that a loss of competitive weights in one system has general effects in others. Considering that lesions will typically result in both a loss of function and a loss of competitive weights and that competitive weights may be affected at any level between sensory input and motor output, it is not surprising that there are many reports of dissociation between one form of neglect and another. For example, there are reports of neglect of body vs environmental objects (e.g. Guariglia & Antonucci 1992), neglect of close vs far space (e.g. Halligan www.annualreviews.org/aronline Annual Reviews Annu. Rev. Neurosci. 1995.18:193-222. Downloaded from arjournals.annualreviews.org by University of California - San Diego on 01/05/07. For personal use only
VISUAL ATTENTION 209 210 DESIMONE DUNCAN Marshall 1991),and sensory vs motor neglect (e.g.Tegner Levander 1991). 1993).This large structure contains several different nuclei,each of which There are also strong laterality effects,which we do not cover here (see Posner contains one or more functionally distinct regions connected anatomically to Peterson 1990). aspecific region ofthe(Bender Benevento Rezak 197 Unge parietal cortex sp eider e t)The pulvinar has been impli nediates disenga control ba neuropsychological studies of humans with thalamic brair the halance of co weightsf damage (Rafal Posner 1987),PET activation studics (LaBerge Adi Buchsbaum 1990).and physiological recording and chemical deactivation re cases and studies in monkeys(Desimone et al 1990b;Petersen et al 1985,1987:Robinson lobe (e.g.Hu In thes patients,attention can b ecome locked onto one object;nor atrendedobjects et al 1986).However,pulvinar lesions raise the same issues of interpretation as lesions in other structures we have considered. seem to disappear.According to Posner and colleagues,this disengage function In one study,the portion of the pulvinar termed Pdm.which is anatomically of the parietal corex differs from that of the superior colliculus and pulvinar. interconnected with the posterior parietal cortex,was reversibly deactivated in which they propose mediate moving attention and focusing attention,respec- one hemisphere (Petersen et al 1987).Following deactivation.reaction tim tively(reviewed in Posner Petersen 1990).This division is based primarily to targets in the contralesional field were slower than normal,esp ecially wher on reaction time data from patients with large unilateral lesions affecting.but ion was cted int e lesional field (i.e a dis. generally not limited to.one of the three structures.However.monkeys with ed to the of gag discrete unilateral lesions or deactivation of any one of these structures all show a general slowing of reaction times for targets in the contralesional field control R en 1992) as well as a dis ment impairment when attention is switched from the haar2 may ha mply n om the Pdn follow from a loss of competitive weights in the affected field.Thus, nere,s【h structure is role for parietalc ortex in disengagement is still an open question. ntional control in its own right (see bel ow).B n unilateral deactivation o superior colliculus and unilatera e ions of the post etal contex LESION STUDIES IN PRIMATES The had effects similar to those of Pdm deacti vation (see Colby 1991). e s in hu with ey f s Analogous resu ate Its were found with unilateral chemical deactivation of the senta t in a lo lateral pulvinar(PL).the part connected with areas V4 and IT cortex.Monkeys are me aged area ect and syndr s from a loss of discriminated the color of a target in the (contralesional)field opposite the competitive weights in the contralesional field.Bilateral lesions,which do not deactivated pulvinar,with or without a distractor in the unaffected (ipsilesio- upset the competitive balance between the fields,tend to have less effect on nal)field (Desimone et al 1990b).The deactivation had no effect on the spatial attention. monkey's ability to discriminate the target unless it was paired with a distractor. In fact,there are at least two instances when adding a lesion in one hemi- a result reminiscent of extinction.If PL was the source of critical gating inputs sphere corrects an attentional impairment caused by a lesion in the other to extrastriate cortex,moving the distractor closer to the target should have Monkeys with unilateral lesions of the posterior parietal cortex tend to mak had a devastating effec when the distractor was voluntary eye movements into the ipsilesional field when presented with bi. d into the lateral stimuli.However,this bias is corrected when an additional lesion is iminished, sam ifield a was substantially had subsequently made in the po sterior parietal cortex of the opposite hem ispher (Lynch McLaren 1989).Similar cats with unilateral lesions of s arget 1 a les ons have no eff t on the eys to fir nigra in the opposite hemisphere subs antially s the neglect( target embedded in distractors,which further suggests tha PL docs not have al1990. a necessary role in attentional gating (Bender Butter 1987). PULVINAR The In fact,the biased competition model predicts results similar to those of propo sed sour nputs to the pulvinar deactivation from partial lesions in any spatially mapped visual strue orex has probably been the pulvinar (e..see Crick 1984 Olhausen et a ture that makes a contribution to saliency and hence competitive weight.Such
VISUAL ATTENTION 209 Marshall 1991), and sensory vs motor neglect (e.g. Tegner & Levander 1991). There are also strong laterality effects, which we do not cover here (see Posner & Peterson 1990). One interesting possibility raised by lesion studies is that the posterior parietal cortex specifically mediates disengaging attention from its current focus (Posner et al 1984), or in our terms, shifting the balance of competitive weights from one object to another. A disengagement deficit may partly explain rare cases of Balint’s syndrome and simultagnosia following extensive bilateral damage to the parietal lobe (e.g. Humphreys & Riddoch 1993). In these patients, attention can become locked onto one object; nonattended objects seem to disappear. According to Posner and colleagues, this disengage function of the parietal cortex differs from that of the superior colliculus and pulvinar, which they propose mediate moving attention and focusing attention, respectively (reviewed in Posner & Petersen 1990). This division is based primarily on reaction time data from patients with large unilateral lesions affecting, but generally not limited to, one of the three structures. However, monkeys with discrete unilateral lesions or deactivation of any one of these structures all show a general slowing of reaction times for targets in the contralesional field as well as a disengagement impairment when attention is switched from the ipsilesional to contralesional field (see below). These impairments may simply follow from a loss of competitive weights in the affected field. Thus, a specific role for parietal cortex in disengagement is still an open question. LESION STUDIES IN PRIMATES The general rule for lesion effects in monkeys is the same as in humans: Unilateral lesions of structures with a contralateral field representation result in a loss of whatever functions are mediated by the damaged area as well as neglect and extinction syndromes from a loss of competitive weights in the contralesional field. Bilateral lesions, which do not upset the competitive balance between the fields, tend to have less effect on spatial attention. In fact, there are at least two instances when adding a lesion in one hemisphere corrects an attentional impairment caused by a lesion in the other. Monkeys with unilateral lesions of the posterior parietal cortex tend to make voluntary eye movements into the ipsilesional field when presented with bilateral stimuli. However, this bias is corrected when an additional lesion is subsequently made in the posterior parietal cortex of the opposite hemisphere (Lynch & McLaren 1989). Similarly, cats with unilateral lesions of striate cortex show a severe contralateral neglect; however, a lesion of the substantia nigra in the opposite hemisphere substantially reduces the neglect (Wallace et al 1990). PULVINAR The most frequently proposed source of attentional inputs to the cortex has probably been the pulvinar (e.g. see Crick 1984, Olhausen et al www.annualreviews.org/aronline Annual Reviews Annu. Rev. Neurosci. 1995.18:193-222. Downloaded from arjournals.annualreviews.org by University of California - San Diego on 01/05/07. For personal use only. 210 DESIMONE & DUNCAN 1993). This large structure contains several different nuclei, each of which contains one or more functionally distinct regions connected anatomically to a specific region of the visual cortex (Bender 1981; Benevento & Rezak 1976; Ungerleider et al 1983, 1984). The pulvinar has been implicated in attentional control based on neuropsychological studies of humans with thalamic brain damage (Rafal & Posner 1987), PET activation studies (LaBerge Buchsbaum 1990), and physiological recording and chemical deactivation studies in monkeys (Desimone et al 1990b; Petersen et al 1985, 1987; Robinson et al 1986). However, pulvinar lesions raise the same issues of interpretation as lesions in other structures we have considered. In one study, the portion of the pulvinar termed Pdm, which is anatomically interconnected with the posterior parietal cortex, was reversibly deactivated in one hemisphere (Petersen et al 1987). Following deactivation, reaction times to targets in the contralesional field were slower than normal, especially when attention was first misdirected into the ipsilesional field (i.e. a disengage impairment). Thus, Pdm deactivation seemed to reduce the saliency of contralesional stimuli thereby reducing their competitive weights for either visual processing or control over behavior (Robinson & Petersen 1992). This loss weights may have simply resulted from the loss of Pdm inputs to the posterior parietal cortex of the same hemisphere, as the latter structure is implicated in attentional control in its own fight (see below). Both unilateral deactivation the superior colliculus and unilateral lesions of the posterior parietal cortex had effects similar to those of Pdm deactivation (see Colby 1991). Analogous results were found with unilateral chemical deactivation of the lateral pulvinar (PL), the part connected with areas V4 and IT cortex. Monkeys discriminated the color of a target in the (contralesional) field opposite the deactivated pulvinar, with or without a distractor in the unaffected (ipsilesional) field (Desimone et al 1990b). The deactivation had no effect on monkey’s ability to discriminate the target unless it was paired with a distractor, a result reminiscent of extinction. If PL was the source of critical gating inputs to extrastriate cortex, moving the distractor closer to the target should have had a devastating effect on performance. However, when the distractor was moved into the same hemifield as the target, the impairment was substantially diminished, presumably because neither stimulus then had a competitive advantage. As with Pdm, deactivation of PL most likely deprived visual cortex in the same hemisphere of excitatory inputs and reduced target saliency. Bilateral pulvinar lesions have no effect on the ability of monkeys to find a target embedded in distractors, which further suggests that PL does not have a necessary role in attentional gating (Bender & Butter 1987). In fact, the biased competition model predicts results similar to those of pulvinar deactivation from partial lesions in any spatially mapped visual structure that makes a contribution to saliency and hence competitive weight. Such www.annualreviews.org/aronline Annual Reviews Annu. Rev. Neurosci. 1995.18:193-222. Downloaded from arjournals.annualreviews.org by University of California - San Diego on 01/05/07. For personal use only
VISUAL ATTENTION 211 212 DESIMONE DUNCAN one quadrant of the In on study,anim we make eye movements to an od -ma nimpp 1976. around an imaginary ring(Schiller&Lee 1991).If the target was located in eye movements rather than in selection for visual processing.A remarkable the lesion quadrant and if it was dimmer than the other stimuli in the unaffected implication of the fact that these visuomotor cells respond equally to targets parts of the field,the animals were impaired.However,there was no impair and distractors in the absence of eye movements is that visual input to these ment if the target was brighter than the other stimuli,suggesting that the V4 parts of the oculomotor system does not derive from cells in the dorsal and lesion reduced target saliency.In another study,animals were especially im- ventral streams whose responses are gated by spatial attention.Competition paired in discriminating the shape of a target located in the lesion quadrant between stimuli must take place inder ndently within the oculomotor system when a distractor was located in an unaffcc d part of the field:however.there and yet be coordinated with competition within visual pr cessing systems was little impai rment when both the target and distractor were located within Although cells in the substantia nigra and intermediate layer of the col- the lesion adrant (Desimone et al 1990a).In the latter configurat ion,neither licnlus have not yet been tested in this condition of attentio n withou mulus had a co etitive advantage from the lesic lsin Pdm,dorsolateral p cummary,the 0 atit odel affo explanation for 1981.c cem this pure the effects I this tir the pulvin B 19 93b o more likely tha t es to be a cri etal 1985).Of the the po enor pane o the ven tral stream.To pin down th sources wil kely require converging may be th on,as th evidence from lesion and physiological studies simply reflect the ves from s with PET have shown activation parietal c Po parietal cortex PHYSIOLOGICAL STUDIES The classic paradigm for studying cells within the a task involving shifting attention (Corbetta et al 1993).Thus,based on pres presumed control system for spatial attention has been the saccadic enhance ence of the cnhancement citect.both postenor panctal and pretrontal cortex ment paradigm (Goldberg Wurtz 1972.Wurtz Goldberg 1972).In this are possible sources of a spatial-biasing signal to visual cortex. task.the monkey fixates a central stimulus while a second stimulus is presented If the top-down selection of spatial locations for attention typically involves within a cell's receptive field in the periphery.in the experimental condition working memory.as we have suggested,an important clue to the identity of the fixation stimulus is turned off and the nal saccades to the re the relevant cells would be respo nse activation in workine memory tasks in field stimulus when it a rs.In a cont rol condition.the fixation fact.in such tasks cells in the dorsolateral prefrontal and oosterior parictal tays on and the arded fo are tonically active whenever the animal holds"in mind"a location c field stim ng when itdims,ign us The ntrol onng the ehtheexperimentalcoiditionHaso m s is la op dow within a cell's receptive field (in the absence of any stimulus)(Chelazzi et al omatic,or re al 1993 di pelli o Wise 1993a.Funahashi e时a11989 Fuster 1973,Gn 988,0u a&Fuster 1992,Wilson et al o the receptive field stimus in the expermentalo 993).Furth ore,these t om appear to. ted system tion (the target) than in the control (the distractor),a resui usually terme see Goldman-Rakic 1988).These physiol Idata,in conju the enhancement effect(although,in fact,it is often unclear whether the targe with data showing neglect and extinction effects following both prefronta response is enhanced or the distractor response is suppressed).This effect is posterior parietal lesions,argue that both structures may work together in found in the superior colliculus.the substantia nigra.the Pdm nucleus of the generating top-down spatial selection biases. pulvinar,the posterior parietal cortex.the frontal eye fields(Goldberg&Wurtz 1972.Hikosaka wurtz 1983.Lynch et al 1977 Robinson et al 1978.Petersen Sources of Object Selection Bias et al 1985,Wurtz Mohler 1976;also see Colby 1991),and the dorsolateral As with spatial selection,the attentional templates for objects and their features prefrontal cortex (di Pelligrino Wise 1993b).However,in both the superfi- may derive from mechanisms underlying working memory.If so,then the cial layers of the colliculus and the frontal eye fields,the effect is known to prefrontal cortex most likely plays an important role.Just as lesions of the be specific for saccadic eye movements:no enhancement is found when the dorsolateral prefrontal cortex impair working memory for space(see Funahashi
VISUAL ATTENTION 211 an outcome is observed in monkeys with lesions affecting one quadrant of the visual field representation in area V4. In one study, animals were trained to make eye movements to an odd-man-out target in an array of stimuli presented around an imaginary ring (Schiller & Lee 1991). If the target was located the lesion quadrant and if it was dimmer than the other stimuli in the unaffected parts of the field, the animals were impaired. However, there was no impairment if the target was brighter than the other stimuli, suggesting that the V4 lesion reduced target saliency. In another study, animals were especially impaired in discriminating the shape of a target located in the lesion quadrant when a distractor was located in an unaffected part of the field; however, there was little impairment when both the target and distractor were located within the lesion quadrant (Desimonet al 1990a). In the latter configuration, neither stimulus had a competitive advantage from the lesion. In summary, the biased competition model affords a ready explanation for the effects of unilateral or partial lesions on attention. At this time, the pulvinar is no more likely than other structures to be a critical source of gating inputs to the ventral stream. To pin down these sources will likely require converging evidence from lesion and physiological studies. PHYSIOLOGICAL STUDIES The classic paradigm for studying cells within the presumed control system for spatial attention has been the saccadic enhancement paradigm (Goldberg & Wurtz 1972, Wurtz & Goldberg 1972). In this task, the monkey fixates a central stimulus while a second stimulus is presented within a cell’s receptive field in the periphery. In the experimental condition, the fixation stimulus is turned off and the animal saccades to the receptive field stimulus when it appears. In a control condition, the fixation stimulus stays on and the monkey is rewarded for signaling when it dims, ignoring the receptive field stimulus. The control over eye movements is largely top down in this task, although the experimental condition has some automatic, or reflexive, components. Some of the cells in virtually all structures implicated in spatial attention give larger responses to the receptive field stimulus in the experimental condition (the target) than in the control (the distractor), a result usually termed the enhancement effect (although, in fact, it is often unclear whether the target response is enhanced or the distractor response is suppressed). This effect is found in the superior colliculus, the substantia nigra, the Pdm nucleus of the pulvinar, the posterior parietal cortex, the frontal eye fields (Goldberg & Wurtz 1972, Hikosaka & Wurtz 1983, Lynch et al 1977, Robinson et al 1978, Petersen et al 1985, Wurtz & Mohler 1976; also see Colby 1991), and the dorsolateral prefrontal cortex (di Pelligrino & Wise 1993b). However, in both the superficial layers of the colliculus and the frontal eye fields, the effect is known to be specific for saccadic eye movements; no enhancement is found when the www.annualreviews.org/aronline Annual Reviews Annu. Rev. Neurosci. 1995.18:193-222. Downloaded from arjournals.annualreviews.org by University of California - San Diego on 01/05/07. For personal use only. 212 DESIMONE & DUNCAN animal simply attends to the peripheral stimulus and signals when it dims by releasing a bar (Colby et al 1993, Goldberg & Bushnell 1981, Wurtz & Mohler 1976). Thus, these cells appear to be involved in the selection of targets for eye movements rather than in selection for visual processing. A remarkable implication of the fact that these visuomotor cells respond equally to targets and distractors in the absence of eye movements ithat visual input to these parts of the oculomotor system does not derive from cells in the dorsal and ventral streams whose responses are gated by spatial attention. Competition between stimuli must take place independently within the oculomotor system and yet be coordinated with competition within visual processing systems. Although cells in the substantia nigra and intermediate layers of the colliculus have not yet been tested in this condition of attention without eye movements, cells in Pdm, dorsolateral prefrontal cortex, and posterior parietal cortex all show the enhancement effect in this purely attentional condition (Bushnell et al 1981, Colby et al 1993, di Pelligrino & Wise 1993b, Petersen et al 1985). Of these three regions, the posterior parietal and prefrontal cortices may be the most critical for spatial attention, as the enhancement in Pdm may simply reflect the input it receives from the posterior parietal cortex. Furthermore, studies with PET have shown activation of posterior parietal cortex in a task involving shifting attention (Corbetta et al 1993). Thus, based on presence of the enhancement effect, both posterior parietal and prefrontal cortex are possible sources of a spatial-biasing signal to visual cortex. If the top-down selection of spatial locations for attention typically involves working memory, as we have suggested, an important clue to the identity of the relevant cells would be response activation in working memory tasks. In fact, in such tasks cells in the dorsolateral prefrontal and posterior parietal cortexes are tonically active whenever the animal holds "in mind" a location within a cell’s receptive field (in the absence of any stimulus) (Chelazzi et 1993b, Colby et al 1993, di Pelligrino & Wise 1993a, Funahashi et al 1989, Fuster 1973, Gnadt & Andersen 1988, Quintana & Fuster 1992, Wilson et al 1993). Furthermore, these two regions are heavily interconnected anatomically and appear to form part of a distributed system for spatial cognition (for review, see Goldman-Rakic 1988). These physiological data, in conjunction with data showing neglect and extinction effects following both prefrontal and posterior parietal lesions, argue that both structures may work together in generating top-down spatial selection biases. Sources of Object Selection Bias As with spatial selection, the attentional templates for objects and their features may derive from mechanisms underlying working memory. If so, then the prefrontal cortex most likely plays an important role. Just as lesions of the dorsolateral prefrontal cortex impair working memory for space (see Funahashi www.annualreviews.org/aronline Annual Reviews Annu. Rev. Neurosci. 1995.18:193-222. Downloaded from arjournals.annualreviews.org by University of California - San Diego on 01/05/07. For personal use only