an innovative algorithm for key frame extraction in video ...

More documents

Recommendations

Info

The difference between two edge direction histograms (d D ) is computed using theEuclidean distance as such in the case of two wavelet statistics (d W ):ddDW( D , D ) = ( D ( j)− D ( j))tt+1∑( W , W ) = ( W ( j)−W( j))tt+171j=019∑j=0ttt+1t+122(11)where D t and D t+1 are the edge direction histograms and W t and W t+1 are thewavelets statistics for frame F(t) and frame F(t+1).The three resulting values (to simplify the notation we have indicated them asdH, dW, and dD only) are mapped into the range [0,1] and then combined to formthe final frame difference measure (dHWD) as follow:dHWD( d ⋅ d ) + ( d ⋅ d ) + ( d ⋅ d )= (12)HWWThe aim of the frame difference measure is to accentuate dissimilarities inorder to detect changes within the frame sequence. At the same time it isimportant that only when the frames are very different, the measure should reporthigh difference values. As told before, the majority of the key frame selectionmethods exploit just one visual feature which is not sufficient to effectivelydescribe an image contents. If we were to use, for example, only the colorhistogram, a highly dynamic sequence (e.g. one containing fast moving orpanning effects) with frames of the same color contents, would result in a series ofsimilar frame difference values and the motion effects would be lost. Similarly,frames with the same color content but different from the point of view of othervisual attributes are considered similar. The uses of multiple feature can overcomethese issues but pose the problem of their combination. In content-based retrievalsystems, the features are combined by weighing them with suitable factors whichare usually task-dependent [31]. We choose instead to use a different approach:the explicit selection of weight factors is removed by weighing each differenceagainst the other. Moreover, this allow us to register significant differences in thedhwd values only if at least two of the single differences exhibit high values (andthus two of the visual attributes emphasize the frame dissimilarity).4.2 Key frame selectionThe key frame selection algorithm that we propose dynamically selects therepresentative frames by analyzing the complexity of the events depicted in theshot in terms of pictorial changes. The frame difference values initially obtainedare used to construct a curve of the cumulative frame differences which describeshow the visual content of the frames changes over the entire shot, an indication ofthe shot’s complexity: sharp slopes indicate significant changes in the visualcontent due to a moving object, camera motion, or the registration of a highlydynamic event. These cases must be taken into account in selecting the key framesto include in the shot summary. They are identified in the curve of the cumulativeframe differences as those points at the sharpest angles of the curve (curvature orcorner points). The key frames are those corresponding to the mid points betweeneach pair of consecutive curvature points. To detect the high curvature points weuse the algorithm proposed by Chetverikov et al. [35]. The algorithm wasoriginally developed for shape analysis in order to identify salient points in a 2Dshape outline. The high curvature points are detected in a two-pass processing. InDDH10
the first pass the algorithm detects candidate curvature points. The algorithmdefines as a “corner” a location where a triangle of specified size and openingangle can be inscribed in a curve. Using each curve point P as a fixed vertexpoint, the algorithm tries to inscribe a triangle in the curve, and then determinesthe opening angle α(P) in correspondence of P. Different triangles are consideredusing points that fall within a window of a given size w centered in P; the sharpestangle is retained as a possible high curvature point. This procedure is illustrated inFig. 6. Defining the distance between points P and O as d PO , the distance betweenpoints P and R as d PR , and the distance between points O and P as d OP , theopening angle α corresponding to the triangle OPR is computed as:2OP2PROPPR2ORd + d − dα = arccos(13)2 ⋅ d ⋅ dA triangle satisfying the constraints on the distances between points (weconsider only the x-coordinates):ddminmin≤ P − Oxxx≤ P − Rand the constraint on the angle valuesx≤ d≤ dmaxmaxα ≤ α max(15)is called an admissible triangle. The first two constraints represent the operatingwindow; the set of points contained in it are used to define the triangles. The thirdconstraint is used to discard angles that are too flat. The sharpest opening angle ofthe admissible triangles is then assigned to P:α ( P)= min α =α{ OPR ˆ }If a point has no admissible triangles, the point is rejected assigning it an angledefault value of π. In the second pass, those points in the set of the candidate highcurvature points that are sharper than their neighbors (within a certain distance)are classified as high curvature points. A candidate point P is discarded if it has asharper valid neighbor N, that is if:α ( P)> α ( N)(17)(14)(16)A point N is defined to be a neighbor of P if the following constraint is valid:P − N ≤ d (18)xxIn our implementation we have defined the minimum points distance d min asalways equal to 1; consequently the only two parameters that influence the resultsof the algorithm are d max and α max . The most important parameter is α max whichcontrols the set of admissible angles: a high value of α max will result in morepoints included in the set of candidate high curvature points, while a lower valueindicates that only very sharp angles must be considered. This is the same asconsidering worthy of attention only slopes corresponding to sharp changes in thecurve of the cumulative d HWD frame differences.max11
Page 1 and 2: AN INNOVATIVE ALGORITHM FOR KEYFRAM
Page 3 and 4: Section 2 of this paper presents se
Page 5 and 6: into the summarization algorithm to
Page 7 and 8: ... ... S tF(t) F(t+1)F(t+n) F(t+γ
Page 9: histogram. The threshold has been h
Page 13 and 14: Fig. 7. An example of key frame sel
Page 15 and 16: energy feature, the magnitudes of t
Page 17 and 18: was available we also selected the
Page 19 and 20: 6.2 Computational TimeTable 4 shows
Page 21 and 22: performance of the SRDI algorithm e
Page 23 and 24: Table 6. Fidelity measure results c
Page 25 and 26: eeopenFidelityHSTFidelityHWDSRDHSTS
Page 27 and 28: AcknowledgementsThe video indexing

an innovative algorithm for key frame extraction in video ...

Create successful ePaper yourself

Delete template?

Save as template?