|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Helmholtz Institute, Department of Physics of Man, Utrecht University, Utrecht, the Netherlands
2 Department of Physiology, University College London, London, UK
3 Helmholtz Institute, Department of Psychonomics, Utrecht University, Utrecht, the Netherlands
| Abstract |
|---|
|
|
|---|
(Received 28 April 2005;
accepted after revision 16 June 2005;
first published online 23 June 2005)
Corresponding author R. van Ee: Helmholtz Institute, Department of Physics of Man, Utrecht University, Utrecht, the Netherlands. Email: r.vanee{at}phys.uu.nl
| Introduction |
|---|
|
|
|---|
Thirty years ago, Julesz (1971) used a random-dot stereogram in which the corresponding dots had polarity-reversed luminance. Subjects were unable to binocularly fuse the retinal images but the colour information aided fusion. Lu & Fender (1972) in a classical study reported that colour differences alone do not give rise to the percept of depth. Nevertheless, stereopsis from isoluminant stimuli was reported in later experiments (Comerford, 1974; de Weert & Sadza, 1983; Grinberg & Williams, 1985). There are, however, methodological problems with the isoluminance procedure. First, the point of isoluminance may vary depending on the location on the retina as a result of the varying density of the different types of cones. Second, chromatic aberrations may provide luminance information in supposedly isoluminant stimuli (Livingstone & Hubel, 1987). Thus, Livingstone and Hubel concluded that depth disappears at isoluminance and proposed that colour is processed by the parvocellular processing stream, whereas depth, motion and orientation are processed by the magnocellular stream. The information of one processing stream is not available to the other until very late in visual processing (Livingstone & Hubel, 1987).
More recently, it has become clear that there is extensive cross-communication between the parvo- and the magnocellular system (Stuart et al. 1992; Gegenfurtner & Hawken, 1996; Gegenfurtner et al. 1997). In addition, in a number of more recent, better-controlled studies sustained depth perception at isoluminance was reported (Tyler & Cavanagh, 1991; Scharff & Geisler, 1992; Simmons & Kingdom, 1997; Kingdom et al. 1999; Kingdom & Simmons, 2000). Even when colour information alone may not be able to induce depth perception, it might still contribute to stereopsis when luminance variations or other cues are present. A colour-contingent depth after-effect has been reported, supporting the existence of colour-dependent depth mechanisms (Domini et al. 2000), and Kingdom and Simmons proposed independent chromatic and achromatic stereopsis mechanisms (Simmons & Kingdom, 1997; Kingdom et al. 1999; Kingdom & Simmons, 2000; Simmons & Kingdom, 2002). However, in the most recent work on this issue, Krauskopf & Forte (2002) reported that disparity discrimination deteriorated in stimuli with an added chromatic component compared to monochrome stimuli, suggesting that colour information does not enhance stereoscopic depth perception.
Despite these mixed results, it remains a valid hypothesis that colour information could improve the identification of binocular disparities to recover the original three-dimensional scene from the two-dimensional retinal images (Jordan & Bovik, 1992). The computation of these disparities, and thus the volumetric aspects of the scene, depends critically on the correct matching of the corresponding features in the two images. Indeed when observers view ambiguous stereograms of vertical bars in which a bar in one eye could be matched to several bars in the other eye, both colour and luminance cues can aid disparity-based depth perception at a threshold level (Jordan et al. 1990). These results imply that colour can be used in simple binocular matching problems. In complex images with many potential matches it is known that the brain uses information such as orientation and motion for binocular matching (van Ee & Anderson, 2001). The presence of cortical neurones in the macaque area V2 that are tuned for both colour and binocular disparity (Ts'o et al. 2001) suggest that colour may also be one of the primitives that the brain uses to solve the matching problem.
Here, we examined the role of colour in binocular matching of complex images. Our protocol allows for volumetric (suprathreshold) measurements rather than forced-choice (threshold) responses that have been used previously. We designed a stimulus in which colour (monochrome/trichrome) and luminance (congruent/incongruent) could be manipulated. This stimulus was an array of fronto-parallel bars that were uniformly distributed inside a volume (cf. van Ee & Anderson, 2001). Observers judged the magnitude of the perceived depth of the volume subtended by the bars in terms of the perceived depth-to-width ratio of the volume (Fig. 1). These stimuli ranged in complexity from a few bars, in which few false matches are possible, to many bars entailing many potential matches. The rationale of using this stimulus for studying binocular matching is as follows. When two arbitrary, uncorrelated images are projected to the two eyes, very little volumetric depth is experienced and the perceived depth shrinks towards the zero-disparity horopter. If a stimulus feature plays a role in binocular matching, the amount of perceived depth in complex stereoscopic displays will be greater when this feature is present than when it is absent. If colour contributes to binocular matching, it should reduce the number of false matches. As a result, the magnitude of perceived depth will be greater in colour than in monochrome stimuli (Fig. 2). Apart from colour, we also manipulated luminance. Luminance, a known primitive to solve the binocular matching problem, was used to validate our protocol. Luminance was either congruent, i.e. corresponding bars had the same luminance levels, or incongruent.
|
|
| Method |
|---|
|
|
|---|
Observers viewed stereograms on a LaCie electron 22 blue IV monitor of 38.7 x 28.2 deg at a constant viewing distance of 60 cm. The resolution was 1600 x 1200 pixels at a refresh rate of 85 Hz. The half-images of the stereogram were presented side by side on the monitor. The half-image of one eye was made invisible to the other eye by using a mirror set-up and a black septum, reducing the width of each eye's visual field to 19.1 deg (for more details see van Ee et al. 2001). The room was completely darkened. The head was stabilized with a chin and forehead rest.
Participants
Nine observers participated, all of whom were tested for normal colour and stereo vision prior to the experiment. All observers wore their corrective glasses, and were naïve with respect to the purpose of the experiment except for observer R.E. The experiment conformed to the standards set by the Declaration of Helsinki and the procedures had been approved by the Helmholtz Institute. Informed consent was obtained prior to the experiment.
Stimuli
The stimuli were stereograms of bright fronto-parallel bars of 5.6 arcmin width on a black background. These bars had disparities that evoked a percept of a three-dimensional volume of bars. Observers matched the perceived depth-to-width ratio evoked by the stimulus with the aspect ratio of the outline rectangle. This task is identical to the task used in previous studies (van Ee & Anderson, 2001; van Ee & Richards, 2002; van Ee, 2003). The bars were equally divided in depth and surrounded by a white rectangle at zero depth to facilitate correct eye posture. The far end of the volume had a relative disparity of 40.7 arcmin, which is consistent with a depth of 80 mm. The volume always receded in depth behind the monitor (uncrossed disparity). The stimuli had a constant height of 23.9 deg and a constant width of 15.8 deg. The length of the bars was varied randomly, between 100% and 80% of the volume's height. The orientation of the bars was random, varying from vertical to 10 deg. The number of bars (6, 24, 42, 72 and 96), their luminance, and colour properties varied across trials in a 2 x 2 factorial design: luminance was either congruent (bar A in the left eye had the same luminance level as bar A in the right eye), or incongruent (the luminance level in one eye did not predict the luminance level in the other eye). Note that the bar contrast polarity for congruent and incongruent luminance conditions is always the same (bright bars on a black background). Furthermore, the stimuli were either monochrome, in which all bars were green, or trichrome, in which the bars were red, blue or green, these being the phosphors of the CRT. Colour was always congruent. Stimuli were presented for 4 s and automatically appeared after a fixation cross disappeared. Each observer participated in 140 trials (5 numbers of bars, 2 x 2 colour luminance1, 7 trial repetitions). All trials were counterbalanced and presented in pseudorandom order.
For the trichrome stimuli, four shades of each of the three colours were used. As a result each bar had one of three different colours and one of 12 (4 x 3) different luminance levels. To create monochrome stimuli that were equal to the trichrome stimuli with respect to luminance information, a near-isoluminant shade of green, corresponding to each shade of red or blue in the trichrome stimuli, was used. Thus each bar in the monochrome stimuli was green and had one of 12 different luminance levels. In other words, compared to the monochrome stimuli, the trichrome stimuli differed with respect to colour information, but were equal with respect to luminance information. The following luminance levels were used for the trichrome stimuli (cd m2): green: max 51.78, min 19.56; red: max 22.34, min 9.07; blue: max 16.92, min 5.52. The difference between the shades of red and blue used in the trichrome stimuli, and the matched shades of green in the monochrome stimuli ranged between 0.34 and 0.17 cd m2 for red/green and between 0.10 and 0.29 cd m2 for blue/green. Luminance levels were measured using a photometer (LightScan, Cambridge Research Systems (Rochester, UK) OP200-E; photo cell: UDT Instruments (Baltimore, MD, USA) model 265).
| Results |
|---|
|
|
|---|
A linear mixed-effect model, or multilevel, analysis was employed to test the influence of complexity, colour and luminance on observed depth. In a linear mixed-effects model, responses from a subject are thought to be the sum (linear) of so-called fixed and random effects. If an effect, such as a colour, affects the population mean, it is fixed. If an effect is associated with a sampling procedure (e.g. subject effect), it is random. Though the fixed effect is most important, it is necessary to model the random effects as well. The linear mixed-effect models solve these problems by estimating fixed and random effects in one model. This statistical procedure allows us to evaluate all the data of all subjects in one analysis (for more information on linear mixed-effects models see Goldstein et al. 1994; Langford et al. 1999; McCulloch & Searle, 2001; Hox, 2002). Observer-specific (random) effects are taken into account at the level of the individual observers; the effects of interest (colour, luminance information and complexity) were investigated at the level of the distinct responses. Five models of increasing complexity were fitted to the data. Each model is an elaboration of a simpler model, investigating the effect of a subsequent factor of interest on the fit of the model. The acceptance of a more complex model and the rejection of the simpler model was based on a comparison of the 2-log likelihoods of the models, using maximum likelihood estimates. Model 1, the reference model, did not contain any parameters of interest and assumes that all variance is error variance around the mean. Model 2 is based on the assumption that all variance is error variance around an individual mean. The models 3 and 4 have been subdivided to separately test the influence of each of the three fixed factors of interest: complexity, colour and luminance. Model 3 includes overall effects, and model 4 includes individual variation within these overall effects. Model 5, finally, investigates possible interactions of the main effects.
To illustrate the mixed-effect modelling process and to demonstrate the increasing fit to the data, the predicted values of the subsequent models are presented in Fig. 3 for the factor complexity (models 1, 2, 3 and 4A), and compared to the raw data. Similar graphs could be created for the factors colour and luminance. In Table 1 the fit of each model is compared to the simpler model. The result of model 2 shows that variation between subjects is significant (P < 0.001). The results of model 3 show that all independent variables modulate the observed depth (complexity (3A), P < 0.001, colour (3B), P < 0.001, luminance (3C), P < 0.05), and from model 4 it becomes clear that perceived depth was more affected by the complexity for some observers than for others (complexity (4A): P < 0.001). This latter effect was not significant for colour or luminance information (colour (4B): P= 0.326, luminance (4C): P= 1). Model 5 did not increase the fit to the data. This means that there is no interaction between any of the independent variables colour, luminance and complexity. In conclusion, model (4A) that fits the data best, includes a random effect at the level of observers, a fixed effect for complexity, colour and luminance, and a random observer-dependent effect of complexity.
|
|
Model 4A shows that all the manipulated variables affected the perceived depth, but does not show the direction of this effect. Figure 4 depicts the raw data as well as the values predicted. Perceived depth ranged from approximately 50 to 150 mm, whereas the actual depth from disparity in all stimuli was 80 mm. Perceived depth increased for the trichrome compared to the monochrome stimuli (F1,966= 13.024, P < 0.001). Moreover, perceived depth increased for stimuli with congruent luminance compared to incongruent luminance stimuli (F1,966= 8.771, P < 0.01). Finally, when the complexity of the stimulus, i.e. the number of bars in the stimulus, increased, less depth was perceived (F1,7= 16.922, P < 0.01). Debriefing after the experiment showed that none of our observers had become aware of the difference between the congruent and incongruent luminance stimuli, neither for the trichrome nor for the monochrome stimuli.
|
| Discussion |
|---|
|
|
|---|
Physiological models of stereopsis and binocular matching are largely based on cortical neurones that can compute disparity (e.g. Cumming & DeAngelis, 2001). In a recent study, Ts'o et al. (2001) reported the existence of single neurones in V2 of macaques that are sensitive to both colour and disparity. This supports the notion that the primate visual system multiplexes disparity and colour as early as V2. Although there is no direct evidence that these cells exist in humans as well, the macaque visual system is generally considered a good model for both human stereopsis and colour vision. These results are in accordance with previously reported observations that colour information is being used to resolve ambiguous stimuli at threshold (Jordan et al. 1990).
In sum, this study shows that colour information may be used to solve the stereo matching problem and modulates the perception of depth. This observation does not support the notion put forward by Livingstone & Hubel (1987), who proposed separate processing of colour and luminance information. In their view, only the latter processing route is involved in depth perception. Our data are in agreement with the study of Krauskopf & Forte (2002) in that colour information modulates the perception of depth. However, in contrast to their findings, we observed an increase in perceived depth. Future research is needed to define the different conditions that increase or reduce depth perception. Our work suggests that one of the functional roles of the cells that multiplex colour and disparity (V2) may be to help solve the binocular matching problem.
| References |
|---|
|
|
|---|
Cumming BG & DeAngelis GC (2001). The physiology of stereopsis. Annu Rev Neurosci 24, 203238.[CrossRef][Medline]
de Weert CMM & Sadza KJ (1983). New data concerning the contribution of colour differences to stereopsis. In Colour Vision: Physiology and psychophysics, ed. Mollon JD & Sharpe LT, pp. 553562. Academic Press, London.
Domini F, Blaser E & Cicerone CM (2000). Color-specific depth mechanisms revealed by a color-contingent depth aftereffect. Vision Res 40, 359364.[CrossRef][Medline]
Gegenfurtner KR & Hawken MJ (1996). Interaction of motion and color in the visual pathways. Trends Neurosci 19, 394401.[CrossRef][Medline]
Gegenfurtner KR, Kiper DC & Levitt JB (1997). Functional properties of neurons in macaque area v3. J Neurophysiol 77, 19061923.
Goldstein H, Healy M & Rasbash J (1994). Multilevel models with applications to repeated measures data. Statistics Med 13, 16431655.[CrossRef]
Grinberg DL & Williams DR (1985). Stereopsis with chromatic signals from the blue-sensitive mechanism. Vision Res 25, 531537.[CrossRef][Medline]
Hox J (2002). Multilevel Analysis: Techniques and Applications. Lawrence Erlbaum Associates, Mahwah, NJ, USA.
Jordan JR & Bovik AC (1992). Using chromatic information in dense stereo correspondence. Pattern Recognition 25, 367383.[CrossRef]
Jordan JR, Geisler WS & Bovik AC (1990). Color as a source of information in the stereo correspondence process. Vision Res 30, 19551970.[CrossRef][Medline]
Julesz B (1971). Foundations of Cyclopean Perception. University Chicago Press, Chicago, USA.
Kingdom FAA & Simmons DR (2000). The relationship between colour vision and stereoscopic depth perception. J Soc for 3D Broadcasting Imaging 1, 1019.
Kingdom FA, Simmons DR & Rainville S (1999). On the apparent collapse of stereopsis in random-dot-stereograms at isoluminance. Vision Res 39, 21272141.[CrossRef][Medline]
Krauskopf J & Forte JD (2002). Influence of chromaticity on vernier and stereo acuity. J Vis 2, 645652.[CrossRef][Medline]
Langford IH, Marris C, McDonald AL, Goldstein H, Rasbash J & O'Riordan T (1999). Simultaneous analysis of individual and aggregate responses in psychometric data using multilevel modelling. Risk Anal 19, 669677.
Livingstone MS & Hubel DH (1987). Psychophysical evidence for separate channels for the perception of form, color, movement, and depth. J Neurosci 7, 34163468.[Abstract]
Lu C & Fender DH (1972). The interaction of color and luminance in stereoscopic vision. Invest Ophthalmol 11, 482490.
McCulloch CE & Searle SR (2001). Generalized, Linear and Mixed Models. Wiley, New York.
Scharff LV & Geisler WS (1992). Stereopsis at isoluminance in the absence of chromatic aberrations. J Opt Soc Am 9, 868876.
Simmons DR & Kingdom FA (1997). On the independence of chromatic and achromatic stereopsis mechanisms. Vision Res 37, 12711280.[CrossRef][Medline]
Simmons DR & Kingom FA (2002). Interactions between chromatic- and luminance-contrast-sensitive stereopsis mechanisms. Vision Res 42, 15351545.[CrossRef][Medline]
Stuart GW, Edwards M & Cook ML (1992). Colour inputs to random-dot stereopsis. Perception 21, 717729.[CrossRef][Medline]
Ts'o DY, Roe AW & Gilbert CD (2001). A hierarchy of the functional organization for color, form and disparity in primate visual area v2. Vision Res 41, 13331349.[CrossRef][Medline]
Tyler CW & Cavanagh P (1991). Purely chromatic perception of motion in depth: Two eyes as sensitive as one. Percept Psychophys 49, 5361.[Medline]
van Ee R (2003). Correlation between stereoanomaly and perceived depth when disparity and motion interact in binocular matching. Perception 32, 6784.[CrossRef][Medline]
van Ee R & Anderson BL (2001). Motion direction, speed and orientation in binocular matching. Nature 410, 690694.[CrossRef][Medline]
van Ee R, Anderson BL & Farid H (2001). Occlusion junctions do not improve stereoacuity. Spatial Vision 15, 4559.[CrossRef][Medline]
van Ee R & Richards W (2002). A planar and a volume tric test for stereoanomaly. Perception 31, 5164.[CrossRef][Medline]
| Acknowledgements |
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |