The Gixel Array Descriptor (GAD) for Multi-Modal Image Matching
Figure 10. More results for matching with GAD (left) and recall vs. 1 − precision curve comparison (right). (a) Matching with JPEG compression (images 1 and 2 from the "Ubc" series [1]). (b) Matching with illumination change (images 1 and 4 from the "Leuven" series [1]).
likely helps in reducing the impact of JPEG compression.
In Fig. 10(b), with illumination change, the GAD has a recall
rate slightly lower than that of the other descriptors, but it still finds
a large number of correct matches with almost no errors.
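The recall vs. 1 − precision curves in Fig. 10 follow the standard evaluation protocol of Mikolajczyk and Schmid [1]: recall is the fraction of ground-truth correspondences recovered, and 1 − precision is the fraction of returned matches that are false. As a minimal sketch (the match counts below are hypothetical, chosen only for illustration), one point on such a curve can be computed as follows:

```python
def curve_point(num_correct, num_false, num_correspondences):
    """One point on a recall vs. 1-precision curve, as defined in [1].

    recall        = correct matches / ground-truth correspondences
    1 - precision = false matches / total matches returned
    """
    total_matches = num_correct + num_false
    recall = num_correct / num_correspondences if num_correspondences else 0.0
    one_minus_precision = num_false / total_matches if total_matches else 0.0
    return recall, one_minus_precision

# Hypothetical counts: 180 correct and 20 false matches out of
# 300 ground-truth correspondences between the image pair.
r, omp = curve_point(num_correct=180, num_false=20, num_correspondences=300)
# r = 0.6, omp = 0.1
```

Sweeping the descriptor-distance threshold used to accept matches varies these counts and traces out the full curve.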
5.5. Processing Time
GAD's computation is time-consuming compared
to state-of-the-art descriptors, but no efforts at optimization
have been made yet. For example, Fig. 1 (size
512x512) takes GAD 8.9 seconds, while SURF needs 0.7s;
Fig. 10(b) (size 900x600) takes GAD 19.5s, while SURF
needs 1.3s. At this point, speed is not a primary concern in
our research, but we will pursue optimizations in future work.
6. Conclusion
We introduce a novel descriptor unit called a Gixel,
which uses an additive scoring method to extract surrounding
edge information. We show that a circular array of Gixels
samples edge information in overlapping regions, which
makes the descriptor more discriminative, and that it can be made
invariant to rotation and scale. Experiments demonstrate the
superiority of the Gixel array descriptor (GAD) for multi-modal
matching, while it maintains performance comparable
to state-of-the-art descriptors on traditional single-modality
matching.
The GAD still has some limitations in its current state of
development. We have put little effort into optimization,
so the run time is slow. In addition, though GAD exhibits
rotation and scale invariance, large viewpoint changes may
reduce performance, and we have not addressed that issue
yet. Finally, as a feature built solely on edges, GAD may not
perform well in situations where edges are rare. These issues
will be investigated in our future work.
References
[1] K. Mikolajczyk and C. Schmid. A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(10):1615–1630, 2005. 2, 4, 5, 7, 8
[2] H. Bay, A. Ess, T. Tuytelaars, and L. Van Gool. SURF: Speeded Up Robust Features. Computer Vision and Image Understanding, 110(3):346–359, 2008. 2, 5
[3] R. Zabih and J. Woodfill. Non-parametric local transforms for computing visual correspondence. Proceedings of the European Conference on Computer Vision, pp. 151–158, 1994. 2
[4] A. Johnson and M. Hebert. Object recognition by matching oriented points. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 684–689, 1997. 2
[5] S. Belongie, J. Malik, and J. Puzicha. Shape matching and object recognition using shape contexts. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(4):509–522, 2002. 2
[6] D. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2):91–110, 2004. 2, 5
[7] Y. Ke and R. Sukthankar. PCA-SIFT: A more distinctive representation for local image descriptors. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 511–517, 2004. 2
[8] M. Calonder, V. Lepetit, C. Strecha, and P. Fua. BRIEF: Binary Robust Independent Elementary Features. Proceedings of the European Conference on Computer Vision, 2010. 2, 5
[9] E. Rublee, V. Rabaud, K. Konolige, and G. Bradski. ORB: An efficient alternative to SIFT or SURF. Proceedings of the IEEE International Conference on Computer Vision, 2011. 2, 5
[10] F. Tang, S. H. Lim, N. L. Chang, and H. Tao. A novel feature descriptor invariant to complex brightness changes. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2631–2638, 2009. 2
[11] A. Bosch, A. Zisserman, and X. Munoz. Image classification using random forests and ferns. Proceedings of the IEEE International Conference on Computer Vision, 2007. 2
[12] E. Shechtman and M. Irani. Matching local self-similarities across images and videos. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2007. 2
[13] S. Leutenegger, M. Chli, and R. Siegwart. BRISK: Binary Robust Invariant Scalable Keypoints. Proceedings of the IEEE International Conference on Computer Vision, 2011. 2
[14] S. Winder, G. Hua, and M. Brown. Picking the best DAISY. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 178–185, 2009. 3
[15] G. Yang, C. V. Stewart, M. Sofka, and C. L. Tsai. Registration of Challenging Image Pairs: Initialization, Estimation, and Decision. IEEE Transactions on Pattern Analysis and Machine Intelligence, 29(11):1973–1989, 2007. 7