12.07.2015 Views

an innovative algorithm for key frame extraction in video ...

an innovative algorithm for key frame extraction in video ...

an innovative algorithm for key frame extraction in video ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

6.2 Computational TimeTable 4 shows the computation time (<strong>in</strong> m<strong>in</strong>utes, seconds <strong>an</strong>d thous<strong>an</strong>d ofseconds) of the <strong>algorithm</strong>s tested <strong>in</strong>clud<strong>in</strong>g the <strong>frame</strong> decod<strong>in</strong>g time <strong>for</strong> the<strong>algorithm</strong>s that required it. We have implemented the <strong>key</strong> <strong>frame</strong>s <strong>extraction</strong><strong>algorithm</strong>s <strong>in</strong> C++ under the Borl<strong>an</strong>d C++ Builder 5.0 development environmentwith the default optimization (<strong>for</strong> faster execution) turned on. The computer used<strong>for</strong> the comparison was <strong>an</strong> AMD Athlon 1Ghz with 512MB of RAM, <strong>an</strong>d runn<strong>in</strong>ga W<strong>in</strong>dows 2000 Professional operat<strong>in</strong>g system. For image process<strong>in</strong>g <strong>algorithm</strong>s,separable filters were used whenever it was possible. Results that c<strong>an</strong> be used <strong>in</strong>subsequent operations were computed only once. A test session was per<strong>for</strong>med byprocess<strong>in</strong>g all the six <strong>video</strong> sequences with a <strong>key</strong> <strong>frame</strong> <strong>extraction</strong> <strong>algorithm</strong>.Be<strong>for</strong>e the next test session the system was restarted. For each <strong>video</strong> thecomputational time reported refers to the average time of three test sessions. TheMP <strong>algorithm</strong> is not <strong>in</strong>cluded s<strong>in</strong>ce its computation is virtually <strong>in</strong>st<strong>an</strong>t<strong>an</strong>eous. The<strong>algorithm</strong>’s computational time are reported <strong>in</strong> relative <strong>for</strong>m with respect to theATS computational time.Table 4. Computational time of the five <strong>algorithm</strong>s on the six <strong>video</strong>s. The times reported <strong>for</strong> theATS <strong>algorithm</strong> are <strong>in</strong> m<strong>in</strong>utes, seconds <strong>an</strong>d thous<strong>an</strong>d of seconds. For the other <strong>algorithm</strong>s, thetimes are <strong>in</strong> relative <strong>for</strong>m with respect to the ATS.Video ATS FR CP SRDI PMEeeopen 00:10:596 1.01 2.86 3.97 9.03nw<strong>an</strong>w1 00:14:152 1.35 5.75 9.16 7.22news 00:10:425 1.34 5.74 9.09 6.66football 00:14:531 1.30 5.81 8.78 7.68bugsbunny 01:38:542 1.13 2.98 4.65 9.41basketball 00:06:570 1.08 3.08 4.55 6.97For the apparently discrep<strong>an</strong>cies <strong>for</strong> the computational time of the ATS <strong>an</strong>dFR <strong>algorithm</strong>s with respect to the CP <strong>algorithm</strong>, it should be noted that thedecod<strong>in</strong>g time take up a signific<strong>an</strong>tly part of the process<strong>in</strong>g time of the two<strong>algorithm</strong>s. For the “eeopen”, “bugsbunny”, <strong>an</strong>d “basketball” <strong>video</strong>s, the decod<strong>in</strong>gtime represents about the 90% (ATS) <strong>an</strong>d 82% (FR) of the total computationaltime. For the rema<strong>in</strong><strong>in</strong>g three <strong>video</strong>s, the decod<strong>in</strong>g time represents the 70%, <strong>an</strong>dthe 50% of the ATS <strong>an</strong>d FR total time respectively. The high computational timeof the SRDI <strong>an</strong>d PME <strong>algorithm</strong>s is ma<strong>in</strong>ly due to the heavy preprocess<strong>in</strong>grequired be<strong>for</strong>e the <strong>key</strong> <strong>frame</strong> c<strong>an</strong> be selected. The SRDI <strong>algorithm</strong> requires pixel<strong>in</strong>terpolation based on motion vectors <strong>an</strong>d pixel background classification.Although the <strong>key</strong> <strong>frame</strong> selection step of the PME <strong>algorithm</strong> is simple, the preprocess<strong>in</strong>gphase per<strong>for</strong>med with the two nonl<strong>in</strong>ear filters penalizes the <strong>algorithm</strong>:much data must be <strong>an</strong>alyzed, <strong>an</strong>d each step of the filter<strong>in</strong>g process also requiresdata reorder<strong>in</strong>g be<strong>for</strong>e the f<strong>in</strong>al result <strong>for</strong> that step c<strong>an</strong> be obta<strong>in</strong>ed.6.3 ResultsIn our prelim<strong>in</strong>ary experiments, the Fidelity <strong>an</strong>d SRD measures <strong>for</strong> the FR<strong>algorithm</strong> were more th<strong>an</strong> 20% lower th<strong>an</strong> those <strong>for</strong> the other <strong>algorithm</strong>s. Asstated be<strong>for</strong>e, <strong>in</strong> its orig<strong>in</strong>al <strong>for</strong>mulation, the FR <strong>algorithm</strong> assigns to each shot <strong>an</strong>umber of <strong>key</strong> <strong>frame</strong>s proportional to the length of the shot. When the totalnumber of <strong>key</strong> <strong>frame</strong>s to be extracted is comparable to the number of shots, thereis a high probability that no <strong>key</strong> <strong>frame</strong>s will be assigned to some of the shots.Thus <strong>for</strong> a fair comparison, <strong>in</strong> subsequent experiments we have modified the19

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!