Recognizing Cars in Aerial Imagery to Improve Orthophotos

… published imagery. Cars are an obvious clutter in a 3D city model, and therefore we want to detect and remove them. Our efficient car detector exploits an on-line boosting algorithm for training. We use on-line learning with a human in the loop to build a good detector from only a few labeled samples, and we can successively improve the detector until its performance is satisfactory. We believe that our procedure can be applied to a range of different objects, not just cars; people are of particular interest.
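As a concrete illustration of the on-line boosting update (in the style of Oza and Russell [16], on which the on-line boosting of [10] builds), the following is a minimal sketch; the class names, the stump design, and the random feature selection are illustrative simplifications, not our actual detector:

```python
import numpy as np

class OnlineStump:
    """Weak learner: thresholds one feature at the midpoint of the
    running per-class means (an illustrative simplification)."""
    def __init__(self, feature):
        self.feature = feature
        self.mean = {+1: 0.0, -1: 0.0}
        self.count = {+1: 0, -1: 0}

    def update(self, x, y):
        self.count[y] += 1
        self.mean[y] += (x[self.feature] - self.mean[y]) / self.count[y]

    def predict(self, x):
        if min(self.count.values()) == 0:
            return 1                      # no evidence for one class yet
        theta = 0.5 * (self.mean[+1] + self.mean[-1])
        sign = 1 if self.mean[+1] >= self.mean[-1] else -1
        return sign if x[self.feature] >= theta else -sign

class OnlineBooster:
    """On-line boosting after Oza and Russell [16]: each new sample is
    propagated through the weak learners with an importance weight
    lambda that grows where the ensemble still errs."""
    def __init__(self, n_features, n_weak=50, seed=0):
        self.rng = np.random.default_rng(seed)
        self.weak = [OnlineStump(int(f))
                     for f in self.rng.integers(0, n_features, n_weak)]
        self.lam_sc = np.full(n_weak, 1e-10)  # weight mass classified correctly
        self.lam_sw = np.full(n_weak, 1e-10)  # weight mass classified wrongly

    def update(self, x, y):
        lam = 1.0
        for m, h in enumerate(self.weak):
            for _ in range(self.rng.poisson(lam)):  # Poisson importance sampling
                h.update(x, y)
            if h.predict(x) == y:
                self.lam_sc[m] += lam
                eps = self.lam_sw[m] / (self.lam_sc[m] + self.lam_sw[m])
                lam *= 1.0 / (2.0 * (1.0 - eps))    # downweight solved samples
            else:
                self.lam_sw[m] += lam
                eps = self.lam_sw[m] / (self.lam_sc[m] + self.lam_sw[m])
                lam *= 1.0 / (2.0 * eps)            # upweight hard samples

    def predict(self, x):
        score = 0.0
        for m, h in enumerate(self.weak):
            eps = self.lam_sw[m] / (self.lam_sc[m] + self.lam_sw[m])
            eps = min(max(eps, 1e-10), 1.0 - 1e-10)
            score += np.log((1.0 - eps) / eps) * h.predict(x)
        return 1 if score >= 0 else -1
```

A human labeler can feed patches to `update` one at a time and query `predict` in between; this sample-by-sample loop is what makes the few-sample, human-in-the-loop training cheap.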

In our experimental work, we demonstrated that we can train the detector with just a few hundred samples. This is far fewer than other approaches need; for example, [27] report results with 7800 (positive and negative) examples. We believe that the training effort can perhaps be reduced to a level where just two samples are needed.

If we scale up our current results, obtained from only two images with 500 cars, to 400 non-redundant images with 100,000 cars, we would achieve an automated detection of 95,000 cars, would have to cope with 5,000 false detections (which do no harm to the inpainting), and would use about 130 hours of computing on a single machine. Scaling the work to 3,000 images for the same area and the same number of cars will become possible once we have started to exploit the high-overlap imagery.
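The scaling estimate above is plain proportional arithmetic; spelled out (the per-image time is simply the 130 hours spread across the 400 images, an assumption rather than a separately measured figure):

```python
n_images  = 400          # non-redundant aerial images
n_cars    = 100_000      # cars expected in that block
det_rate  = 0.95         # detection rate observed on the two test images
fp_ratio  = 0.05         # false detections relative to the car count
hours     = 130          # total computing time on a single machine

detected  = det_rate * n_cars   # 95,000 cars removed automatically
false_pos = fp_ratio * n_cars   # 5,000 harmless extra inpaintings
per_image = hours / n_images    # ~0.33 h, i.e. ~20 min per image
print(f"{detected:.0f} detections, {false_pos:.0f} false positives, "
      f"{per_image * 60:.0f} min/image")
```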

6.3 Next Steps

Our hope for further improvements is nourished by the availability of overlapping images, typically 10 images per ground patch. We envision an approach that automatically improves the detector in an unsupervised fashion. The image overlaps will help to obtain additional positive samples, e.g. when a car is detected in one image but missed in another.
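One way this overlap exploitation could be sketched: a car found in image A is projected into overlapping image B via the known orientation of the aerial block, and if the current detector misses it there, the patch from B becomes a new hard positive sample. The projection callable, the scoring callable, and all names below are illustrative assumptions, not our implementation:

```python
from dataclasses import dataclass

@dataclass
class Detection:
    ground_xy: tuple      # ground-plane position triangulated from image A

def crop(image, u, v, size=64):
    """Square patch centred on pixel (u, v) of an H x W (x C) array."""
    h = size // 2
    return image[int(v) - h:int(v) + h, int(u) - h:int(u) + h]

def harvest_overlap_positives(detections_a, image_b, project_ab, score_b,
                              thresh=0.0):
    """Collect patches of image B where image A saw a car but the
    detector on B did not.

    project_ab -- maps a ground position to (u, v) pixels in image B
                  (assumed known from the block orientation)
    score_b    -- the current detector's confidence on a patch of B
    """
    positives = []
    for det in detections_a:
        u, v = project_ab(det.ground_xy)
        patch = crop(image_b, u, v)
        if patch.size and score_b(patch) < thresh:  # missed in B
            positives.append(patch)                 # -> new positive sample
    return positives
```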

Our optimism is further encouraged by recent unpublished work that employs 3D information from the images to automatically label the false negatives and use them for training. We are also exploring co-training strategies, using extracted height-field features to construct a shape-based car model within an appearance-driven detection process.
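The co-training idea can be sketched as two classifiers over disjoint feature views (appearance vs. height field) that label confident samples for each other; the classifier interface below is hypothetical:

```python
def cotraining_round(unlabeled, appearance_clf, height_clf, conf=0.95):
    """One co-training round: each view labels the samples it is sure
    about and hands them to the other view as training data.
    confidence() is assumed to return P(car | features); update() takes
    (features, label) as in the boosting sketch above."""
    for sample in unlabeled:
        p_app = appearance_clf.confidence(sample.appearance_features)
        p_hgt = height_clf.confidence(sample.height_features)
        if p_app > conf or p_app < 1.0 - conf:       # appearance view is sure
            height_clf.update(sample.height_features,
                              +1 if p_app > 0.5 else -1)
        if p_hgt > conf or p_hgt < 1.0 - conf:       # height view is sure
            appearance_clf.update(sample.appearance_features,
                                  +1 if p_hgt > 0.5 else -1)
```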

We speculate that a combination of approaches will achieve better performance and also a greater level of generalization towards various objects and types of aerial imagery.

We have recently proposed an approach called "conservative learning" [21]. There we demonstrated that, by using a combination of generative and discriminative classifiers and a conservative update strategy, a person detector can be learned in an unsupervised manner.
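The conservative update strategy can be paraphrased as: let the generative model propose labels, but train the discriminative classifier only on samples labeled with very high confidence. A minimal sketch under that reading (the posterior interface is an assumption, not the API of [21]):

```python
def conservative_update(sample, gen_model, booster, accept=0.99):
    """Update the discriminative (boosted) classifier only when the
    generative model is nearly certain, keeping label noise out of the
    unsupervised training loop. gen_model.posterior is assumed to
    return P(person | sample)."""
    p = gen_model.posterior(sample)
    if p > accept:
        booster.update(sample, +1)      # confident positive
    elif p < 1.0 - accept:
        booster.update(sample, -1)      # confident negative
    # otherwise: discard the sample -- the "conservative" part
```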

7. ACKNOWLEDGMENTS

This work has been supported by the APAFA Project No. 813397, financed by the Austrian Research Promotion Agency (http://www.ffg.at), and by the Virtual Earth Academic Research Collaboration funded by Microsoft.

8. REFERENCES

[1] S. Agarwal, A. Awan, and D. Roth. Learning to detect objects in images via a sparse, part-based representation. PAMI, 26(11):1475–1490, 2004.
[2] M. Bertalmio, L. Vese, G. Sapiro, and S. Osher. Simultaneous structure and texture image inpainting. Technical Report 02(47), UCLA CAM Report, 2002.
[3] T. Chan and J. Shen. Mathematical models for local nontexture inpaintings. SIAM J. Appl. Math., 62(3):1019–1043, 2001.
[4] D. Comaniciu and P. Meer. Mean shift analysis and applications. In ICCV, pages 1197–1203, 1999.
[5] N. Cornelis, B. Leibe, K. Cornelis, and L. van Gool. 3D city modeling using cognitive loops. In Third International Symposium on 3D Data Processing, Visualization and Transmission, 3DPVT, 2006.
[6] A. Demiriz, K. Bennett, and J. Shawe-Taylor. Linear programming boosting via column generation. Machine Learning, 46:225–254, 2002.
[7] R. Fergus, P. Perona, and A. Zisserman. Object class recognition by unsupervised scale-invariant learning. In Proc. IEEE Conf. Computer Vision and Pattern Recognition, pages 264–271, 2003.
[8] W. Foerstner. eTRIMS. http://www.ipb.uni-bonn.de/projects/etrims, 2006.
[9] Y. Freund and R. Schapire. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1):119–139, 1997.
[10] H. Grabner and H. Bischof. On-line boosting and vision. In Proc. of CVPR 2006, volume I, pages 260–267. IEEE CS, 2006.
[11] A. Klaus, M. Sormann, and K. Karner. Segment-based stereo matching using belief propagation and a self-adapting dissimilarity measure. In Proc. ICPR 2006, pages 15–18. IEEE CS, 2006.
[12] K. Levi and Y. Weiss. Learning object detection from a small number of examples: The importance of good features. In CVPR, pages 53–60, 2004.
[13] N. Littlestone. Learning quickly when irrelevant attributes abound. Machine Learning, 2:285–318, 1987. (Winnow algorithm.)
[14] T. Ojala, M. Pietikäinen, and T. Mäenpää. Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. PAMI, 24(7):971–987, 2002.
[15] A. Opelt, M. Fussenegger, A. Pinz, and P. Auer. Weak hypotheses and boosting for generic object detection and recognition. In Proc. ECCV 2004, volume II, pages 71–84, 2004.
[16] N. Oza and S. Russell. Online bagging and boosting. In Proceedings Artificial Intelligence and Statistics, pages 105–112, 2001.
[17] P. Viola, M. J. Jones, and D. Snow. Detecting pedestrians using patterns of motion and appearance. In Proceedings of the Ninth IEEE International Conference on Computer Vision (ICCV'03), volume 2, pages 734–741, 2003. (Pedestrian detection using AdaBoost.)
[18] J.-H. Park and Y.-K. Choi. On-line learning for active pattern recognition. IEEE Signal Processing Letters, pages 301–303, 1996.
[19] J. Ponce, M. Hebert, C. Schmid, and A. Zisserman, editors. Toward Category-Level Object Recognition. Springer, 2006.
