HMM based structuring of tennis videos using visual and audio cues

Patrick Gros, Guillaume Gravier, Ewa Kijak, Lionel Oisel, Frédéric Bimbot

doi:10.1109/icme.2003.1221310

HMM based structuring of tennis videos using visual and audio cues

DOI オープンアクセス

説明

This paper focuses on the use of hidden Markov models (HMMs) for structure analysis of videos, and demonstrates how they can be efficiently applied to merge audio and visual cues. Our approach is validated in the particular domain of tennis videos. The basic temporal unit is the video shot. Visual features describe the audio events within a video shot. The video structure parsing relies on the analysis of the temporal interleaving of video shots, with respect to prior information about tennis content and editing rules. As a result, typical tennis scenes are identified. In addition, each shot is assigned to a level in the hierarchy described in terms of point, game and set.

収録刊行物

2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698)

2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698) III-309, 2003-01-01

IEEE

HMM based structuring of tennis videos using visual and audio cues

説明

収録刊行物

詳細情報詳細情報について

書き出し

問題の指摘

HMM based structuring of tennis videos using visual and audio cues

説明

収録刊行物

詳細情報 詳細情報について

書き出し

問題の指摘

詳細情報詳細情報について