
PhD Thesis Semi-Supervised Ensemble Methods for Computer Vision


Chapter 3. Overview of Semi-Supervised Learning

3.8 Computer Vision Applications

In computer vision, probably the most frequently applied semi-supervised learning algorithm is co-training. For example, Levin et al. [Levin et al., 2003] used co-training to train a car detector. They start with a small number of hand-labeled samples and generate additional labeled examples by co-training two boosted off-line classifiers, one trained on gray-value images and the other on background-subtracted images. Moreover, Javed et al. [Javed et al., 2005] applied an arbitrary number of classifiers and extended the method to on-line learning. In particular, they first generate a seed model by off-line boosting, which is later improved by on-line boosting. If multiple disjoint views exist, co-training can also be applied for tracking, e.g., [Tang et al., 2007, Yu et al., 2008, Liu et al., 2009]. There also exist several approaches based on deep neural networks that improve visual recognition performance using unlabeled data, e.g., [Yu et al., 2008, Mobahi and Collobert, 2009]. Recently, Fergus et al. [Fergus et al., 2009] presented a semi-supervised framework that is able to learn object classifiers from 80 million images. In particular, they propose a graph-based method that scales linearly with the number of samples and thus allows for large-scale usage. Guillaumin et al. [Guillaumin and Schmid, 2010] proposed a multimodal SSL approach for image categorization, where the main idea is to exploit, in addition to the visual information, other sources of information such as the text surrounding images on web pages. Socher and Fei-Fei [Socher and Fei-Fei, 2010] applied semi-supervised learning to image segmentation.
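The generic co-training loop underlying the approaches above can be sketched as follows. This is a minimal illustrative sketch, not any of the cited detectors' actual implementation: the two "views" are single scalar features, and the nearest-centroid base learner and the margin-based confidence measure are placeholder choices standing in for boosted classifiers.

```python
# Minimal co-training sketch: two classifiers, each trained on its own
# feature view, iteratively label the unlabeled samples they are most
# confident about and add them to the shared labeled set.

def train_centroid(X, y):
    """Per-class mean of 1-D features: a stand-in for a real base learner."""
    means = {}
    for label in set(y):
        vals = [x for x, t in zip(X, y) if t == label]
        means[label] = sum(vals) / len(vals)
    return means

def predict_conf(model, x):
    """Return (label, confidence); confidence = margin between the two
    nearest class centroids."""
    d = sorted((abs(x - m), label) for label, m in model.items())
    label = d[0][1]
    margin = d[1][0] - d[0][0] if len(d) > 1 else d[0][0]
    return label, margin

def co_train(view1, view2, labels, u1, u2, rounds=5, per_round=2):
    """view1/view2: labeled features per view; u1/u2: unlabeled features."""
    X1, X2, y = list(view1), list(view2), list(labels)
    U = list(range(len(u1)))          # indices of still-unlabeled samples
    for _ in range(rounds):
        if not U:
            break
        m1 = train_centroid(X1, y)
        m2 = train_centroid(X2, y)
        # score each unlabeled sample by the more confident of the two views
        scored = []
        for i in U:
            l1, c1 = predict_conf(m1, u1[i])
            l2, c2 = predict_conf(m2, u2[i])
            scored.append((max(c1, c2), i, l1 if c1 >= c2 else l2))
        scored.sort(reverse=True)
        # move the most confidently labeled samples into the labeled set
        for _, i, label in scored[:per_round]:
            X1.append(u1[i]); X2.append(u2[i]); y.append(label)
            U.remove(i)
    return train_centroid(X1, y), train_centroid(X2, y)
```

As in [Levin et al., 2003], only a few hand-labeled seeds are needed; the unlabeled pool is absorbed over several rounds, each classifier effectively teaching the other through the samples it labels confidently.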

3.9 SSL from Weakly Related Data

As has been shown above, there exists a large number of methods and algorithms for the semi-supervised learning problem. These approaches often differ only in the assumptions they impose on the unlabeled data (e.g., the manifold assumption or the large margin assumption) and in the supervised learning method they build on, such as SVMs or boosting. Yet, one assumption that most of them have in common is that all samples are drawn i.i.d. from the underlying data distribution P(X, Y), i.e., that the samples are independent and identically distributed. However, in practice, unlabeled data does not necessarily come from the same distribution as the labeled data.
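The i.i.d. assumption can at least be sanity-checked before running an SSL method. A minimal sketch (the helper below is hypothetical, not from the thesis) compares the first moment of a single feature across the labeled and unlabeled sets; a large standardized difference hints that the unlabeled pool was drawn from a different distribution:

```python
import statistics

def distribution_shift_score(labeled, unlabeled):
    """Crude check of the i.i.d. assumption: standardized difference of
    the sample means of one feature (a z-like statistic). Values near 0
    are consistent with both samples sharing one distribution; large
    values suggest the unlabeled pool comes from a different one."""
    m_l = statistics.mean(labeled)
    m_u = statistics.mean(unlabeled)
    # pooled population std. dev.; guard against a degenerate zero spread
    s = statistics.pstdev(labeled + unlabeled) or 1.0
    return abs(m_l - m_u) / s
```

In practice one would compare full feature distributions (e.g., per-dimension statistics or a two-sample test) rather than a single mean, but even this crude score separates a matched unlabeled pool from a clearly shifted one.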

Another problem that occurs in practice but is ignored by most approaches is that, although unlabeled data is usually easy to obtain, unlabeled data containing a sufficient amount of target-class samples is not. For instance, consider the problem of training a visual object detector for alpacas (see Figure 3.3a). For this task, it is difficult
