12.07.2015 Views

an innovative algorithm for key frame extraction in video ...

an innovative algorithm for key frame extraction in video ...

an innovative algorithm for key frame extraction in video ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>in</strong>to the summarization <strong>algorithm</strong> to the evaluation phase where users arerequested to compare the summary with the orig<strong>in</strong>al sequence. For example, <strong>in</strong>Narasimha et al. [19] <strong>an</strong>d <strong>in</strong> Lagendjik et al. [24] a global subjective evaluation isgiven <strong>for</strong> the goodness of the summary; <strong>in</strong> Liu et al. [21] users are asked to givescores based on their satisfaction mark<strong>in</strong>g the summaries as good, acceptable, orbad. Ngo et al. [25] apply the criteria of “<strong>in</strong><strong>for</strong>mativeness” <strong>an</strong>d “enjoyability”, <strong>for</strong>their evaluation of <strong>video</strong> highlights: “<strong>in</strong><strong>for</strong>mativeness” assesses the capability ofcover<strong>in</strong>g the content while avoid<strong>in</strong>g redund<strong>an</strong>cy; “enjoyability” assesses theper<strong>for</strong>m<strong>an</strong>ce of the <strong>algorithm</strong> <strong>in</strong> select<strong>in</strong>g perceptually agreeable <strong>video</strong> segments<strong>for</strong> summaries.The problem of these approaches is that their evaluation is highly subjective<strong>an</strong>d c<strong>an</strong>not be used to <strong>an</strong>alyze <strong>video</strong> sequences automatically.We have chosen,<strong>in</strong>stead, a more objective, general purpose to summary evaluation, one that doesnot take <strong>in</strong>to account the k<strong>in</strong>d of <strong>video</strong> be<strong>in</strong>g processed, <strong>an</strong>d c<strong>an</strong> be automaticallyapplied to all <strong>video</strong> sequences without requir<strong>in</strong>g the services of <strong>video</strong> experts. Asummary is considered good if the set of <strong>key</strong> <strong>frame</strong>s effectively represents thepictorial content of the <strong>video</strong> sequence. This objective evaluation is validregardless of genre, <strong>an</strong>d c<strong>an</strong> be per<strong>for</strong>med automatically if a suitable qualitymeasure is provided. From the very few works that have addressed the problem ofobjective evaluation of summaries, we have chosen two quality measures. Thefirst, well known <strong>in</strong> the literature, is the Fidelity measure, as proposed by Ch<strong>an</strong>get al. [26]; the second is the Shot Reconstruction Degree (SRD) recently proposedby Liu et al. [27]. These measures were chosen because they apply two differentapproaches: the Fidelity employs a global strategy, while the SRD uses a localevaluation of the <strong>key</strong> <strong>frame</strong>s. Along with the evaluation of the pictorial contentus<strong>in</strong>g these two measures we have also judged the compactness of the summaryon the basis of the Compression Ratio measure.3.1 FidelityThe Fidelity measure, which compares each <strong>key</strong> <strong>frame</strong> <strong>in</strong> the summary with theother <strong>frame</strong>s <strong>in</strong> the <strong>video</strong> sequence, <strong>an</strong>d is def<strong>in</strong>ed as a semi-Hausdorff dist<strong>an</strong>ce.We let a <strong>video</strong> sequence start<strong>in</strong>g at time t <strong>an</strong>d conta<strong>in</strong><strong>in</strong>g γNF<strong>frame</strong>s be{ F(t + n) n = 0,1, K , 1}S = γ (1)tNF −<strong>an</strong>d the set of γNKF<strong>key</strong> <strong>frame</strong>s extracted from the <strong>video</strong> sequence bet{ F (t + n ),F (t + n ), ,F (t + n ) 0 ≤ n < γ }KF = K(2)KF1KF2The dist<strong>an</strong>ce between the set of <strong>key</strong> <strong>frame</strong>s KF t <strong>an</strong>d a <strong>frame</strong> F belong<strong>in</strong>g to the<strong>video</strong> sequence S t c<strong>an</strong> be computed as:d( F t n),KF) = m<strong>in</strong> Diff ( F(t + n)+tjKFNKF{ ,FKF(t+ nj)} j = 1,2, K,γNKF( (3)where Diff( ) is a suitable <strong>frame</strong> difference measure. The dist<strong>an</strong>ce between the<strong>video</strong> sequence S t <strong>an</strong>d the set of <strong>key</strong> <strong>frame</strong>s KF t is f<strong>in</strong>ally def<strong>in</strong>ed as:{ d( F(t + n),KF ) n = 0,1, K,γ 1}d( S , KF ) max− (4)tt=tNFnWe c<strong>an</strong> then compute the Fidelity measure as:Fidelity( St tt t, KF ) = MaxDiff − d(S , KF ) (5)iNF5

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!