12.01.2015 Views

Download - Academy Publisher

Download - Academy Publisher

Download - Academy Publisher

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

esult is the probability value of each sample belonging to<br />

the whole posture cluster samples, and the posture cluster<br />

is decided by the maximum probability. In this paper<br />

N is 3314 and cluster number M is 3. Some clustering<br />

results are show in Fig.3 and Fig.4.<br />

It intuitively satisfies human senses that sample frames<br />

in Fig.4 are classified into the same cluster, because they<br />

are indeed of similar histogram or main color.<br />

V. EXPERIMENTAL RESULTS<br />

In this section, we describe the experiments aimed at<br />

evaluating the proposed method, which integrated into a<br />

system that was tested using two basketball video<br />

sequences.<br />

A. Shots boundary detection<br />

Our proposed shot boundary detection approach has<br />

been tested on two matches of basketball from the 2009<br />

All Star NBA matches. The first video is 20 minutes long<br />

and contains 127 shots. The second video is of 30 minutes<br />

and contains 233 shots. From table 1 the accuracy of our<br />

shot boundary detection algorithm is about 93.7%. The<br />

false detection may be caused by photo flash, for instance,<br />

if the difference of the histogram of the frames located at<br />

the photo flash will be tremendous, consequently, will<br />

result in incorrect detection. Table 1 lists the performance<br />

in terms of the precision and recall.<br />

The dissimilarity matching on two frames are all<br />

based on the histogram of the frames shown in Fig.2. A<br />

promising performance, Recall 85%-93%, and precision<br />

90%-95%, has been achieved.<br />

TABLE1.<br />

THE RSULTOF OUR SHOT BOUNDARY<br />

DETECTION<br />

manually auto precision recall<br />

Video 1 127 106 96.4% 91%<br />

Video 2 233 212 98% 96%<br />

C. Shot classification in basketball video<br />

The total length of the basketball video is about 60<br />

minutes (153 shots) consisting of NBA All Star match (30<br />

min) and CBA (30 min). Table 2 shows recall and<br />

precision of shot classification. The lower accuracy of the<br />

NBA sequence is caused by close up of players having<br />

skin colors similar to play court.<br />

TABLE 2. THE RESULT OF OUR SHOT CLASSIFICATION<br />

M: Manually A: auto R: recall P: precision<br />

In-play shot<br />

Non-play shot<br />

M A R P M A R P<br />

VI. CONCLUSIONS<br />

We have presented an effective high-level semantic<br />

concept of “semantic shot classes”, which frequently<br />

occurs in broadcast sports video, and compared four fuzzy<br />

c-means algorithms and found serious shortcomings for<br />

fuzzy c-means. In order to detect this concept, we have<br />

proposed a method for semantic shot classification,<br />

consequently we give two improved versions for FRCM,<br />

and we carried out our research on image clustering with<br />

NERFCM effectively. Experiments have shown that an<br />

appropriate construction of mid-level representations can<br />

improve the accuracy and flexibility of shot classification.<br />

We have justified the proposed mid-level<br />

representations through the task of video shot<br />

classification. Our future work includes the evaluation of<br />

individual features for various tasks; extension of the<br />

proposed framework to different sports, such as football,<br />

basketball, and baseball, which require different event and<br />

object detection modules.<br />

REFERENCES<br />

[1] A. Ekin, A. Tekalp, and R. Mehrotra, “Automatic soccer<br />

video analysis and summarization,” IEEE Transactions on<br />

Circuits & Systems for Video Technology, pp. 796-807,<br />

2003.<br />

[2] C. Liang, J. Kuo, W. Chu, and J. Wu, “Semantic units<br />

detection and summarization of baseball videos,” 47th<br />

Midwest Symposium on Circuits and Systems, 2004.<br />

[3] Y. Zhang, C. Xu, Y. Rui, J. Wang. And H. Lu, “Semantic<br />

Event Extraction from Basketball Games using<br />

Multi-Modal Analysis,” IEEE International Conference on<br />

Multimedia and Expo, 2007<br />

[4] A. Hanjalic, “Adaptive extraction of highlights from a sport<br />

video based on excitement modeling,” IEEE<br />

Transactions on Multimedia, pp. 1114-1122, 2005.<br />

[5] R. Ren, J. Jose, and H. Yin.: Affective sports highlight<br />

detection,” European Signal Processing Conference, 2007<br />

[6] A. Hanjalic, “Shot-boundary detection: Unraveled and<br />

resolved,” IEEETrans. Circuits Syst. Video Technol., vol.<br />

12, pp. 90–105, Feb. 2002.<br />

[7] Y. H. Gong, L. T. Sin, C. H. Chuan, H. J. Zhang, and M.<br />

Sakauchi, “Automatic parsing of TV soccer programs,” in<br />

Proc. Int. Conf. Multimedia Computing and Systems,<br />

Washington, DC, May 15, 1995, pp. 167–174<br />

[8] Cheng Yong, Xu De. A method for shot boundary<br />

detection using adaptive threshold [J]. Acta Electronica<br />

Sinca, 2004,32(3): 508 511 (in Chinese)]<br />

[9] G. Millerson, The Technique of Television Production,<br />

12th ed. New York: Focal, March 1990<br />

[10] John C. Russ (2005). Image Analysis Of Food<br />

Microstructure CRC Press. ISBN 0849322413.<br />

[11] RICHARD J. HATHAWAY and JOHN W. DAVENPORT,<br />

Pattern Recognition, Vol. 22, No. 2, pp. 205 212, 1989.<br />

NBA 16 18 100% 88.9% 69 65 92.8% 98.5%<br />

CBA 21 20 90.5% 95% 83 77 90.4% 97.4%<br />

156

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!