Gesture-Based Interaction with Time-of-Flight Cameras

CHAPTER 4. SHADING CONSTRAINT

Besides the shading constraint, there are also other ways of fusing range and intensity data. A number of authors exploit the fact that an edge in the intensity data often co-occurs with an edge in the range data. Nadabar and Jain (1995) use a Markov random field (MRF) to identify different types of edges. Diebel and Thrun (2006) use edge strengths estimated on a high-resolution color image to increase the resolution of a low-resolution depth map.

The idea of integrating the shading constraint with other range information has been investigated by a number of researchers. Most of this work focuses on the integration of SfS with stereo. These two techniques complement each other well because SfS works well on uniformly colored areas whereas stereo requires surface texture to find stereo correspondences. Because of this, the fusion of SfS with stereo has a slightly different focus than the fusion of SfS with a TOF range map. In the stereo case, we only have range information in textured areas and need to rely on shading cues in untextured areas. In the TOF case, we have a dense range map and wish to fuse information from TOF and shading at the same pixel.

Many approaches to the fusion of SfS and stereo (see for example the work of Thompson (1993), Fua and Leclerc (1995), and Hartt and Carlotto (1989)) use an objective function that depends directly on the two or more images obtained from a multi-camera setup; for this reason, they do not generalize to settings where a range map has been obtained in some other way than through stereo. Samaras et al. (2000) combine stereo with SfS by using stereo in textured areas and SfS in untextured areas, but they do not perform a fusion of stereo and SfS at the same location. Haines and Wilson (2007, 2008) fuse stereo and SfS in a probabilistic approach based on a disparity map and the shading observed in one of the stereo images.
Because there is a one-to-one correspondence between disparity and range, the approach could also be used with range maps obtained by arbitrary means. However, since color is used to segment areas of different albedo, the approach is not suitable for use with TOF cameras, which typically only deliver a grayscale image.

There are several other approaches that combine shading with a range map obtained by arbitrary means; stereo may be used, but it is not essential to the formulation of the algorithm. Leclerc and Bobick (1991) use a stereo range map to initialize an iterative SfS method. Cryer et al. (1995) use a heuristic that combines low-frequency components from the stereo range map with high-frequency components from the SfS range map. Mostafa et al. (1999) use a neural network to interpolate the difference between the SfS result and a more coarsely sampled range map from a range sensor; the SfS result is corrected using this error estimate. These approaches allow arbitrary range maps to be used, but they are all somewhat ad hoc.

Our approach to improving the accuracy of the range map using the shading constraint is based on a probabilistic model of the image formation process. We obtain a maximum a posteriori estimate for the range map using a numerical minimization technique. The approach has a solid theoretical foundation and incorporates the sensor-based range information and the shading constraint in a single model; for details, see Section 4.2. The method delivers robust estimation results on both synthetic and natural images, as we show in Section 4.3.

4.2 Method

4.2.1 Probabilistic Image Formation Model

We seek to find the range map R that maximizes the posterior probability

$$p(X_R, X_I \mid R, A) \, p(R) \, p(A). \tag{4.1}$$

p(X_R, X_I | R, A) is the probability of observing a range map X_R and an intensity image X_I given that the true range map describing the shape of the imaged object is R and that the parameters of the reflectance model are A. Typically, A is the albedo of the object; we will discuss this in more detail below. p(R) is a prior on the range map, p(A) is a prior on the reflectance model parameters.

The conditional probability p(X_R, X_I | R, A) is based on the following model of image formation: First of all, we assume that p(X_R, X_I | R, A) can be written as follows:

$$p(X_R, X_I \mid R, A) = p(X_R \mid R, A) \, p(X_I \mid R, A). \tag{4.2}$$

In other words, the observations X_R and X_I are conditionally independent given R and A.

We now assume that the observed range map X_R is simply the true range map R with additive Gaussian noise, i.e.

$$p(X_R \mid R, A) = \mathcal{N}(X_R - R \mid \mu = 0, \sigma_R(R, A)). \tag{4.3}$$

Note that the standard deviation σ_R is not constant but can vary per pixel as a function of range and albedo. As we will see in Section 4.2.4, the noise in the range measurement of a TOF camera depends on the amount of light that returns to the camera.
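As a rough illustration of how the factorized model turns into an objective for numerical minimization, the following Python sketch evaluates the negative logarithm of the posterior in Equation 4.1, using the factorization of Equation 4.2 and the Gaussian range term of Equation 4.3. It is a sketch under stated assumptions, not the implementation used in this work: the function names gaussian_nll and neg_log_posterior are hypothetical, the Gaussian form of the intensity term is an assumption (only the range term has been specified so far), and the predicted intensity and the prior term are simply supplied by the caller.

```python
import math


def gaussian_nll(observed, expected, sigma):
    """Sum over pixels of -log N(observed - expected | mu = 0, sigma).

    All three arguments are flat sequences of equal length; sigma holds
    one standard deviation per pixel, since the noise may vary per pixel.
    """
    total = 0.0
    for x, m, s in zip(observed, expected, sigma):
        if s <= 0.0:
            raise ValueError("standard deviation must be positive")
        residual = (x - m) / s
        total += 0.5 * residual ** 2 + math.log(s) + 0.5 * math.log(2.0 * math.pi)
    return total


def neg_log_posterior(x_range, x_intensity, r, predicted_intensity,
                      sigma_r, sigma_i, neg_log_prior=0.0):
    """Negative log of Eq. 4.1, using the factorization of Eq. 4.2.

    predicted_intensity, sigma_i and neg_log_prior are placeholders
    (assumptions): the caller decides how they follow from R and A.
    """
    range_term = gaussian_nll(x_range, r, sigma_r)  # Eq. 4.3
    # ASSUMPTION: Gaussian intensity noise, chosen here only for illustration.
    intensity_term = gaussian_nll(x_intensity, predicted_intensity, sigma_i)
    return range_term + intensity_term + neg_log_prior


# Example: a candidate range map that matches the observation scores lower
# (better) than one that is offset from it.
x_range = [1.00, 1.20, 1.10]
x_intensity = [0.50, 0.40, 0.45]
sigma_r = [0.02, 0.05, 0.02]   # per-pixel range noise (made-up values)
sigma_i = [0.01, 0.01, 0.01]   # per-pixel intensity noise (made-up values)
cost_match = neg_log_posterior(x_range, x_intensity, x_range, x_intensity,
                               sigma_r, sigma_i)
cost_offset = neg_log_posterior(x_range, x_intensity, [1.05, 1.25, 1.15],
                                x_intensity, sigma_r, sigma_i)
```

Working with the negative logarithm turns the product of probabilities into a sum, so maximizing the posterior becomes minimizing this cost. Because each pixel carries its own standard deviation, the same range deviation is penalized less at pixels where the measurement is noisier.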
The shading constraint postulates that a given range map R is associated with