Download - Academy Publisher
Download - Academy Publisher
Download - Academy Publisher
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
esult is the probability value of each sample belonging to<br />
the whole posture cluster samples, and the posture cluster<br />
is decided by the maximum probability. In this paper<br />
N is 3314 and cluster number M is 3. Some clustering<br />
results are show in Fig.3 and Fig.4.<br />
It intuitively satisfies human senses that sample frames<br />
in Fig.4 are classified into the same cluster, because they<br />
are indeed of similar histogram or main color.<br />
V. EXPERIMENTAL RESULTS<br />
In this section, we describe the experiments aimed at<br />
evaluating the proposed method, which integrated into a<br />
system that was tested using two basketball video<br />
sequences.<br />
A. Shots boundary detection<br />
Our proposed shot boundary detection approach has<br />
been tested on two matches of basketball from the 2009<br />
All Star NBA matches. The first video is 20 minutes long<br />
and contains 127 shots. The second video is of 30 minutes<br />
and contains 233 shots. From table 1 the accuracy of our<br />
shot boundary detection algorithm is about 93.7%. The<br />
false detection may be caused by photo flash, for instance,<br />
if the difference of the histogram of the frames located at<br />
the photo flash will be tremendous, consequently, will<br />
result in incorrect detection. Table 1 lists the performance<br />
in terms of the precision and recall.<br />
The dissimilarity matching on two frames are all<br />
based on the histogram of the frames shown in Fig.2. A<br />
promising performance, Recall 85%-93%, and precision<br />
90%-95%, has been achieved.<br />
TABLE1.<br />
THE RSULTOF OUR SHOT BOUNDARY<br />
DETECTION<br />
manually auto precision recall<br />
Video 1 127 106 96.4% 91%<br />
Video 2 233 212 98% 96%<br />
C. Shot classification in basketball video<br />
The total length of the basketball video is about 60<br />
minutes (153 shots) consisting of NBA All Star match (30<br />
min) and CBA (30 min). Table 2 shows recall and<br />
precision of shot classification. The lower accuracy of the<br />
NBA sequence is caused by close up of players having<br />
skin colors similar to play court.<br />
TABLE 2. THE RESULT OF OUR SHOT CLASSIFICATION<br />
M: Manually A: auto R: recall P: precision<br />
In-play shot<br />
Non-play shot<br />
M A R P M A R P<br />
VI. CONCLUSIONS<br />
We have presented an effective high-level semantic<br />
concept of “semantic shot classes”, which frequently<br />
occurs in broadcast sports video, and compared four fuzzy<br />
c-means algorithms and found serious shortcomings for<br />
fuzzy c-means. In order to detect this concept, we have<br />
proposed a method for semantic shot classification,<br />
consequently we give two improved versions for FRCM,<br />
and we carried out our research on image clustering with<br />
NERFCM effectively. Experiments have shown that an<br />
appropriate construction of mid-level representations can<br />
improve the accuracy and flexibility of shot classification.<br />
We have justified the proposed mid-level<br />
representations through the task of video shot<br />
classification. Our future work includes the evaluation of<br />
individual features for various tasks; extension of the<br />
proposed framework to different sports, such as football,<br />
basketball, and baseball, which require different event and<br />
object detection modules.<br />
REFERENCES<br />
[1] A. Ekin, A. Tekalp, and R. Mehrotra, “Automatic soccer<br />
video analysis and summarization,” IEEE Transactions on<br />
Circuits & Systems for Video Technology, pp. 796-807,<br />
2003.<br />
[2] C. Liang, J. Kuo, W. Chu, and J. Wu, “Semantic units<br />
detection and summarization of baseball videos,” 47th<br />
Midwest Symposium on Circuits and Systems, 2004.<br />
[3] Y. Zhang, C. Xu, Y. Rui, J. Wang. And H. Lu, “Semantic<br />
Event Extraction from Basketball Games using<br />
Multi-Modal Analysis,” IEEE International Conference on<br />
Multimedia and Expo, 2007<br />
[4] A. Hanjalic, “Adaptive extraction of highlights from a sport<br />
video based on excitement modeling,” IEEE<br />
Transactions on Multimedia, pp. 1114-1122, 2005.<br />
[5] R. Ren, J. Jose, and H. Yin.: Affective sports highlight<br />
detection,” European Signal Processing Conference, 2007<br />
[6] A. Hanjalic, “Shot-boundary detection: Unraveled and<br />
resolved,” IEEETrans. Circuits Syst. Video Technol., vol.<br />
12, pp. 90–105, Feb. 2002.<br />
[7] Y. H. Gong, L. T. Sin, C. H. Chuan, H. J. Zhang, and M.<br />
Sakauchi, “Automatic parsing of TV soccer programs,” in<br />
Proc. Int. Conf. Multimedia Computing and Systems,<br />
Washington, DC, May 15, 1995, pp. 167–174<br />
[8] Cheng Yong, Xu De. A method for shot boundary<br />
detection using adaptive threshold [J]. Acta Electronica<br />
Sinca, 2004,32(3): 508 511 (in Chinese)]<br />
[9] G. Millerson, The Technique of Television Production,<br />
12th ed. New York: Focal, March 1990<br />
[10] John C. Russ (2005). Image Analysis Of Food<br />
Microstructure CRC Press. ISBN 0849322413.<br />
[11] RICHARD J. HATHAWAY and JOHN W. DAVENPORT,<br />
Pattern Recognition, Vol. 22, No. 2, pp. 205 212, 1989.<br />
NBA 16 18 100% 88.9% 69 65 92.8% 98.5%<br />
CBA 21 20 90.5% 95% 83 77 90.4% 97.4%<br />
156