Temporally Unfolding Neural Representation of Pictorial Occlusion

More documents

Recommendations

Info

R. Rauschenberger et al.Fig. 1. Stimuli and procedure for the notched and complete conditions.tinguishable, but rather that they receive equivalent internalrepresentations (Kourtzi & Kanwisher, 2001).We determined, at two different points in time (100 and 250ms after stimulus onset), whether a partly occluded object isrepresented as a collection of local features explicitly present inthe retinal image or as a completed shape in which occludedcontours have been recovered. We accomplished this by observingthe magnitude of HDR attenuation when a pictorialocclusionprime was followed either by a stimulus whose localfeatures unambiguously matched those of the prime or by astimulus that unambiguously matched a completed interpretationof the prime. Accordingly, a pictorial-occlusion prime(notched disk abutting a square) was exposed for either100 or 250 ms and was then followed by one of two unambiguousprobe displays (see Fig. 1): either one corresponding to thecompleted interpretation of the preceding disk (complete condition:complete disk overlapping a square) or one correspondingto an unambiguously notched disk (notched condition:notched disk near a square). We refer to these conditions with aletter indicating the form of the probe (C for complete, N fornotched) and the duration of the prime (100 or 250 ms).If there were no evolution of the neural representation of thepictorial-occlusion prime, and the prime were treated as functionallyequivalent to a complete disk even in brief exposures,the HDR evoked in the C100 condition would be comparable tothat in the C250 condition (see Fig. 2a). In other words, therewould be no difference in response to the complete-disk probeacross exposure durations. Furthermore, if the prime’s neuralrepresentation were complete almost immediately—even at100-ms exposures—as has been claimed recently (Lerner et al.,2004), the response evoked in the C100 condition would beattenuated compared with that in the N100 condition (becausethe respective representations of the prime and probe stimuli aremore similar to one another in the former condition than in thelatter; Fig. 2a).If, however, the neural representation of pictorial occlusionevolves over time, as suggested by behavioral data, a differentFig. 2. Predicted functional magnetic resonance imaging (fMRI) response(a) if there is no evolution of the representation of pictorial occlusionsand completion is nearly instantaneous and (b) if the processing ofpictorial occlusions progresses from an initial representation of local imagefeatures to a completed representation of a shape that may includefeatures that are not explicitly present in the retinal image. Predictions areshown only for the three critical conditions: notched probe after 100-msprime (N100), complete probe after 100-ms prime (C100), and completeprobe after 250-ms prime (C250). (See the main text for details.)pattern of results should emerge. In this case, the HDR evokedin the C250 condition should be attenuated relative to that in theC100 condition, because the prime and probe representationsare more similar to one another in the former condition, owing toamodal completion (see Fig. 2b). Similarly, the response in theN100 condition should also be attenuated relative to that in theC100 condition (which is the opposite of what would be predictedif no evolution occurs; cf. Figs. 2a and 2b)—again becausethere is a correspondence of the prime and proberepresentations in the former but not in the latter condition. Inother words, a difference should be observed across exposuredurations for the complete-disk probe, as well as between probetypes at the brief exposure.Consequently, the pattern of results across the C100, C250,and N100 conditions allows us to infer whether the representationof the pictorial-occlusion prime undergoes an evolutionfrom a representation of local image features to a representationof an amodally completed shape. The inferences from the fourthcondition (N250) are less straightforward, because the fate of theearly representation remains controversial. According to onetheory, the completed representation, once available, replacesthe earlier representation (Sekuler & Palmer, 1992). In thiscase, the pictorial-occlusion prime would be represented asVolume 17—Number 4 359
Neural Representation of Pictorial Occlusionequivalent to a notched probe only in short exposures, and theHDR in the N100 condition would be attenuated relative to theHDR in the N250 condition. According to an alternative ambiguityaccount, the pictorial-occlusion prime continues to berepresented as notched at long exposures (in addition to beingrepresented as completed), owing to the cue conflict entailed bypictorial occlusion (Gerbino, 1989). In this case, the pictorialocclusionprime would be treated as equivalent to both thecomplete-disk and the notched-disk probes, and the HDR in thetwo conditions would be comparable. We return to this point inthe Discussion.METHODImaging was performed on a 1.5-T Philips MRI scanner. Foreach subject, a high-resolution anatomic volume was obtainedwith a magnetization-prepared rapid gradient echo (MPRAGE)sequence (12.4-min acquisition; birdcage head coil; TR 5 8.1ms; TE 5 3.7 ms; flip angle 5 81; 256-mm 256-mm field ofview; 256 256 acquisition matrix; 200 slices, 1 mm thick withno gap, yielding 1-mm isotropic resolution). Functional imagingwas performed using an echoplanar imaging sequence (surfacecoil centered over occipital scalp; TR 5 2s;TE5 40 ms; flipangle 5 901; foot-to-head phase encoding; 192-mm 192-mmfield of view; 64 64 acquisition matrix; 33 slices, 3 mm thickwith no gap, yielding 3-mm isotropic resolution). Followingmotion correction, slice-time correction, and functional-to-anatomicco-registration, the functional images and anatomicvolume were transformed into Talairach space (Talairach &Tournoux, 1988). The anatomic volume was then segmented andreconstructed at the gray-matter/white-matter boundary, andsubsequently flattened to yield a surface representation of thecortex (Kriegeskorte & Goebel, 2001). All analyses were performedusing BrainVoyager (Maastricht, The Netherlands) onthis flattened cortical representation.Each of 10 subjects performed two different types of functionallocalizerruns, one retinotopic-mapping run, and six eventrelated(Burock, Buckner, Woldorff, Rosen, & Dale, 1998) experimentalruns (three runs each in the upper left and upper rightquadrants, respectively). In all runs, subjects were instructed tomaintain fixation on a central fixation cross. In the first of the twolocalizer runs, two disks were flashed on one 451 axis in oppositehemifields (duration: 200 ms, stimulus onset asynchrony: 800 ms)for 16 s; these alternated with two disks on the other 451 axis. Thedisks were of the same color, size, and eccentricity as those in theexperimental runs (see the next paragraph). The two disks in theupper visual field were also in the same locations as the disks inthe experimental runs and were used to localize the regions ofinterest (ROIs) in early visual cortex; the two disks in the lowervisual field were used to balance the appearance of the stimulus.Cortical activity was localized using multiple regression by contrastingthe regressor weights associated with each of the twostimulus configurations. Because the experimental stimuli wererestricted to the upper visual field, assessment of activity in earlyvisual areas was restricted to the ventral occipital cortex. Thesecond localizer run was modeled after previous studies (Kourtzi& Kanwisher, 2001) in which the lateral occipital complex (LOC)was localized using a general linear model analysis with a contrastbetween regressors associated with intact versus fragmentedshapes; the stimuli were taken from Kourtzi and Kanwisher(2001). The statistical threshold was set to p < .001 (uncorrected)for contrasts in both localizers. Borders between visual areas wereobtained using an independent retinotopic-mapping scan, usingthe bifield stimulation technique (Slotnick & Yantis, 2003).During the experimental runs, a pictorial-occlusion prime (asquare abutting a notched disk) was presented for either 100 or250 ms, followed by a 400-ms noise mask, and then a 100-msfixation display. The probe (250 ms in duration) was either acomplete disk overlapping a square or a notched disk standingseparate from the square (Fig. 1). Both the diameter of the diskand one side of the square subtended 1.21 of visual angle.Squares were red and disks green, presented on a black background.The probe and prime were both presented in the upperleft or upper right quadrant, with the center of the disk 21 fromfixation. There were four different stimulus orientations (disk attop left, top right, bottom left, or bottom right of square), thusproviding variability in the stimulus. A trailing fixation displaywas presented for either 2,150 or 2,000 ms, depending on theexposure duration of the prime (constraining the duration ofeach trial to be 3,000 ms).Null trials were included in addition to the four experimentalconditions to create between-trial temporal jitter (Kourtzi &Kanwisher, 2001). The five trial types were pseudorandomlyintermixed such that each trial type was preceded and followedequally often by every other trial type. Approximately 48 trialswere presented for each condition in each quadrant. The subject’stask, designed to enhance attention to the stimuli, wasto indicate whether the second stimulus (the probe) contained acomplete or notched disk, by pressing one of two buttons of aresponse pad.For each subject, event-related time courses (expressed aspercentage signal change) for each experimental condition werecalculated in independently defined ROIs, using the activity atevent onset (time 0) as baseline (Slotnick & Schacter, 2004).Event-related activity was interpolated, yielding 1-s temporalresolution. ROIs in the left and right hemispheres were used toobtain event-related averages corresponding to stimuli in theright and left hemifields, respectively. There was no qualitativedifference in data from the two hemispheres; therefore, wecollapsed data from the two hemispheres of each subject andcomputed group mean time courses.RESULTSEarly visual areas (V1, V2, VP, V4v, and LOC) were localizedseparately for each subject and served as ROIs for the analysis of360 Volume 17—Number 4
Page 4 and 5: R. Rauschenberger et al.HDR attenua
Page 6 and 7: R. Rauschenberger et al.whereas the

Temporally Unfolding Neural Representation of Pictorial Occlusion

Create successful ePaper yourself

Delete template?

Save as template?