Handwritten Word Spotting in Old Manuscript Images using Shape ...

More documents

Recommendations

Info

The segmentation process experiments evaluate the accuracy of the word segmentation. The segmented word and the labeled word are overlapped in order to check if they are the same word. Different thresholds of overlapping percentage are used to evaluate the accuracy of the segmentation process. The first approach has two types of experiments. The first one evaluates how the clustering process is done. The second one evaluates the accuracy of the retrieval process: • The experiment shows the relation between the basic features chosen by using 2D plots. • By using visual results, we observe the distribution of the observations of our ground truth in the clusters. • We evaluate the accuracy, the homogeneity and completeness of the clustering using Vmeasure (explained in section 9.3). • The accuracy of the retrieval process is evaluated by means of a precision-recall curve. The second approach is evaluated by means of precision-recall curves: • Two experiments are used to assess the accuracy of this approach by using different characteristics pixels (background and foreground pixels). • Both characteristics points are compared in order to evaluated. 9.3 Metrics One drawback of clustering process is the proper selection of the number of clusters. Learning process consist in bunching the observations in different clusters. The ideal solution is achieved when all the instances of the same word are in the same cluster, and each cluster has only instances of only one word. The results of the retrieval process depend on the accuracy in the clustering process. The evaluation of the clustering process has been done using V-measure [22]. V-measure is an entropy-based measure which explicitly measures how successfully the criteria of homogeneity and completeness have been satisfied. V-measure is computed as the “mean” of distinct homogeneity and completeness scores, that is, V-measure can be weighted to favour the contributions of homogeneity or completeness. A clustering result satisfies homogeneity if each one of its clusters contain only data points which are members of a single class, and a clustering result satisfies completeness if all the data points that are members of a given class are elements of the same cluster The retrieval process is evaluated using precision-recall curves: recall = precision = number of relevant items retrieved number of relevant items in collection number of relevant items retrieved total number of items retrieved 22 (4) (5)
9.4 Experiments We present the experiments done in this work. We show the results for the pre-processing step and for the two approaches developed. Pre-processing The pre-processing step has the objective of segmenting the words on the documents. The performance of the next steps will depend on the results obtained from this stage. The segmentation process is evaluated in terms of words found with respect the ground truth. We have used both the complete and reduced ground truth in order to evaluate the segmentation process. After segmenting the words from the document, we have matched these words with the labelled words of the ground truth. Each segmented word is compared with the words of the ground truth observing the percentage of overlapping (Fig. 15). The words that have more of 40% of overlapping of their bounding boxes are considered as the same word. Figure 15: Examples of a correct overlapping (left) and a incorrect overlapping (right). In table 2 we observe the results of applying our method to the different ground truth by using different thresholds. In both, the same performance is obtained: with small overlapping threshold the percentage of words found is high, but when we increase the threshold, the percentage decreases. We observe that, using the ground truth with 50 documents and 20 classes and 0.1 as threshold the accuracy is over 100%. A label of a word could contain part of a next word, then, when we compare the two words segmented, both have the same label. We observe that the results stay stable until we reach a threshold value of 40%, then the accuracy decreases. In order to obtain good results and reduce the number of errors (explained before), we have used 40% as threshold for our experiments. Pixel-based descriptors organized in a hierarchical structure The main problem with a Cluster algorithm is to choose the number of clusters that bunches the observations of the best form. We have done some experiments to obtain which is the best number of cluster. The first layer of our architecture is formed by clusters constructed in terms of basic features. The second layer uses BSM features. A key parameter in the BSM feature computation is the number of bins to obtain the histogram-measure calculated using different number of beans. In the first experiment we evaluate the performance depending on the number of bins. This performance is evaluated in terms of the V-measure. 23
Page 1: MASTER IN COMPUTER VISION AND ARTIF
Page 4 and 5: a step forward towards shortening t
Page 6 and 7: a classification of the training se
Page 8 and 9: (a) 1617: index of volume 69 (b) 17
Page 10 and 11: The models can then be used to retr
Page 12 and 13: Figure 5: We present two approaches
Page 14 and 15: 6.1.1 Binarization The binarization
Page 16 and 17: partial Gaussian derivatives along
Page 18 and 19: 7. Pixel-based descriptors organize
Page 20 and 21: layer uses an automatic method. It
Page 22 and 23: Table 1: Intervals for each directi
Page 26 and 27: Table 2: Pre-processing results. Th
Page 28 and 29: Figure 18: Distribution of the obse
Page 30 and 31: Figure 20: Comparative using differ
Page 32 and 33: Figure 22: Classification process u
Page 34 and 35: use foreground pixels, because the
Page 36 and 37: [15] G. Nagy. Twenty years of docum
Page 38: Figure 27: SOM using characteristic

Handwritten Word Spotting in Old Manuscript Images using Shape ...

Create successful ePaper yourself

Delete template?

Save as template?