PhD thesis - Institute for Computer Graphics and Vision - Graz ...

More documents

Recommendations

Info

46 Chapter 4. GPU-based Depth Map Estimation using Plane Sweeping To our knowledge, all published GPU-based dense depth estimation methods use the simple sum of absolute differences (SAD) or squared differences (SSD) for image dissimilarity computation (usually for performance reasons). By contrast, we have a set of GPU-based image correlation functions available, including the SAD, the normalized cross correlation (NCC) and the zero-mean NCC (ZNCC) similarity functions. The NCC and ZNCC implementations optionally use sum tables for an efficient implementation [Tsai and Lin, 2003]. Small row and column sums can be generated directly by sampling multiple texture elements within the fragment shader. Summation over larger regions can be performed using a recursive doubling approach similar to the GPU-based generation of integral images [Hensley et al., 2005]. Full integral image generation is also possible, but precision loss is observed for the NCC and ZNCC similarity functions in this case (see Section 4.2.2.2). For longer image sequences one cannot presume constant brightness conditions across all images, hence an optional prenormalization step is performed, which subtracts the boxfiltered image from the original one to compensate changes in illumination conditions. If this prenormalization is applied, the depth maps obtained using the different correlation functions have similar quality. 4.2.2.1 Efficient Summation over Rectangular Regions The image similarity functions described in the following section can be efficiently implemented by utilizing integral images (also known as summed-area tables in computer graphics). Integral images allow constant-time box filtering regardless of the window size [Crow, 1984]. Given the integral image of a source image any box filtering can be performed in constant time using four image accesses (resp. texture lookups). This efficient box filtering approach can be extended more complex higher-order filtering operations [Heckbert, 1986]. The single-pass procedure to calculate the integral image efficiently on a general purpose processor is slow when mapped on SIMD architectures. Consequently, a different approach using a logarithmic number of passes to generate the integral image on the GPU is much more efficient [Hensley et al., 2005]. Note, that the integral image requires a much higher precision of the color channels than the source image precision. Calculating and using integral images on the GPU is only feasible since the emergence of floating point support on current graphics hardware. Note that for very small window sizes the utilization of bilinear texture fetches available on current graphics hardware essentially for free is usually more efficient than the computation and application of integral images. Bilinear texturing allows the summation of four adjacent pixels by just one texture access, e.g. summing the values inside a 4x4 windows can be done using 4 bilinear texture lookups (instead of 16 individual accesses). Consequently, in order to obtain highest performance suitably customized procedures are best for very small correlation windows.
4.2. Plane Sweep Depth Estimation 47 4.2.2.2 Normalized Correlation Coefficient The widely used (zero-mean) normalized correlation coefficient for window-based local matching of two images X and Y is (where ¯ X and ¯ Y denote the means inside the rectangular region W) ZNCC = � i∈W (Xi − ¯ X) (Yi − ¯ Y ) �� i∈W (Xi − ¯ � X) 2 i∈W (Yi − ¯ Y ) 2 which is invariant under (affine linear) changes of luminance between images, but relatively costly to calculate. Using integral images the ZNCC can be calculated in constant time regardless of the correlation window size [Tsai and Lin, 2003], since ZNCC = � XiYi − ( � Xi) ( � Yi) /N �� X2 i − ( � Xi) 2 � �� /N Y 2 i − ( � Yi) 2 � /N . From the above formula it can be seen that five integral images are requires to calculate the ZNCC: the integral image for � Xi, � Yi, � X 2 i , � Y 2 i and finally � XiYi. The precision requirement for the higher order sums is 8 + 8 + log 2 512 + log 2 512 = 34 bit for 512 × 512 source images. The 32 bit floating point format of current GPUs has a mantissa of 23 bit and artefacts due to precision loss may occur. Figure 4.2 illustrates the reduced precision by depicting a ZNCC error image generated in software on a CPU and another one computed on the GPU. An increasing loss of precision can be seen towards the lower right corner of the image. Since the integral image generations starts from the upper left corner, the lower right portion has the highest precision requirements within the integral image. Note that the precision requirements for the simple sums � Xi and � Yi are 26 bit for 8 bit images with 512 × 512 pixels resolution. By subtracting the image mean in advance from the source image two additional precision bits can be saved: one by halving the magnitude of the source values and another one by exploiting the sign bit in the integral image. Instead of creating full integral images, which allows box filtering with arbitrary window sizes, it is usually sufficient to sum the values with a given specific window, since we do not vary the aggregation window size during similarity score computation. Accumulation of larger windows can be performed using a similar recursive doubling scheme as used for integral image generation. Consequently, the precision requirements on the target buffer storing the aggregated values depend on the window size, and these are substantially lower than the requirements for integral images. ,
Page 1:
Graz University of Technology Insti
Page 5 and 6:
Acknowledgments Writing a PhD thesi
Page 7 and 8:
Contents 1 Introduction 1 1.1 Intro
Page 9 and 10:
CONTENTS ix 6.2.2 Regularization .
Page 11 and 12: List of Figures 1.1 Several reconst
Page 13: LIST OF FIGURES xiii 9.3 Fused 3D m
Page 17 and 18: Chapter 1 Introduction Contents 1.1
Page 19 and 20: 1.2. Using Graphics Processing Unit
Page 21 and 22: 1.3. 3D Models from Multiple Images
Page 27 and 28: 1.4. Overview of this Thesis and Co
Page 29: 1.4. Overview of this Thesis and Co
Page 32 and 33: 16 Chapter 2. Related Work given in
Page 34 and 35: 18 Chapter 2. Related Work lacking
Page 36 and 37: 20 Chapter 2. Related Work to semin
Page 38 and 39: 22 Chapter 2. Related Work algorith
Page 40 and 41: 24 Chapter 2. Related Work Typicall
Page 43 and 44: Chapter 3 Mesh-based Stereo Reconst
Page 45 and 46: 3.2. Overview of Our Method 29 Mesh
Page 47 and 48: 3.2. Overview of Our Method 31 (0,0
Page 49 and 50: 3.3. Implementation 33 3.3 Implemen
Page 51 and 52: 3.3. Implementation 35 to perspecti
Page 53 and 54: 3.4. Performance Enhancements 37 th
Page 55 and 56: 3.5. Results 39 PARAM depth_index =
Page 57 and 58: 3.6. Discussion 41 (a) The key imag
Page 59 and 60: Chapter 4 GPU-based Depth Map Estim
Page 61: 4.2. Plane Sweep Depth Estimation 4
Page 65 and 66: 4.2. Plane Sweep Depth Estimation 4
Page 67 and 68: 4.3. Sparse Belief Propagation 51 i
Page 69 and 70: 4.3. Sparse Belief Propagation 53 p
Page 71 and 72: 4.5. Timing Results 55 spaced in a
Page 73 and 74: 4.5. Timing Results 57 passes to ca
Page 75 and 76: 4.7. Discussion 59 maps, is still u
Page 77 and 78: 4.7. Discussion 61 (a) Without BP (
Page 79 and 80: Chapter 5 Space Carving on 3D Graph
Page 81 and 82: 5.2. Volumetric Scene Reconstructio
Page 83 and 84: 5.3. Single Sweep Voxel Coloring in
Page 85 and 86: 5.3. Single Sweep Voxel Coloring in
Page 87 and 88: 5.4. Extensions to Multi Sweep Spac
Page 89 and 90: 5.6. Discussion 73 prior knowledge
Page 91 and 92: 5.6. Discussion 75 (a) (b) (c) (d)
Page 93 and 94: 5.6. Discussion 77 (a) (b) (c) (d)
Page 95 and 96: Chapter 6 PDE-based Depth Estimatio
Page 97 and 98: 6.2. Variational Techniques for Mul
Page 99 and 100: 6.2. Variational Techniques for Mul
Page 101 and 102: 6.3. GPU-based Implementation 85 6.
Page 103 and 104: 6.3. GPU-based Implementation 87 6.
Page 105 and 106: 6.4. Results 89 Figure 6.2: A recon
Page 107 and 108: 6.4. Results 91 (a) Without back-ma
Page 109 and 110: 6.5. Discussion 93 Nevertheless, th
Page 111 and 112: 6.5. Discussion 95 (a) (b) Figure 6
Page 113 and 114:
Chapter 7 Scanline Optimization for
Page 115 and 116:
7.2. Scanline Optimization on the G
Page 117 and 118:
Page 119 and 120:
Page 121 and 122:
7.3. Cross-Correlation based Multiv
Page 123 and 124:
Page 125 and 126:
Page 127 and 128:
Page 129 and 130:
Page 131 and 132:
7.4. Discussion 115 this method uti
Page 133:
7.4. Discussion 117 (a) left view (
Page 136 and 137:
120 Chapter 8. Volumetric 3D Model
Page 138 and 139:
Page 140 and 141:
Page 142 and 143:
Page 144 and 145:
Page 147 and 148:
Chapter 9 Results Contents 9.1 Intr
Page 149 and 150:
9.2. Synthetic Sphere Dataset 133 (
Page 151 and 152:
9.3. Synthetic House Dataset 135 to
Page 153 and 154:
9.4. Middlebury Multi-View Stereo T
Page 155 and 156:
9.5. Statue of Emperor Charles VI 1
Page 157 and 158:
9.6. Bodhisattva Figure 141 (a) Fro
Page 159 and 160:
9.6. Bodhisattva Figure 143 (a) Fro
Page 161:
9.6. Bodhisattva Figure 145 (a) (b)
Page 164 and 165:
148 Chapter 10. Concluding Remarks
Page 167 and 168:
Appendix A Selected Publications A.
Page 169 and 170:
Bibliography [Akbarzadeh et al., 20
Page 171 and 172:
BIBLIOGRAPHY 155 [Curless and Levoy
Page 173 and 174:
BIBLIOGRAPHY 157 [Grabner, 2003] Gr
Page 175 and 176:
BIBLIOGRAPHY 159 [Klaus et al., 200
Page 177 and 178:
BIBLIOGRAPHY 161 [Mairal and Kerive
Page 179 and 180:
BIBLIOGRAPHY 163 [Pollefeys et al.,
Page 181 and 182:
BIBLIOGRAPHY 165 [Strecha and Van G
Page 183 and 184:
BIBLIOGRAPHY 167 [Yang et al., 2006
show all

PhD thesis - Institute for Computer Graphics and Vision - Graz ...

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?