Paper
1 November 1996 Multichannel video segmentation
Pascal Faudemay, Liming Chen, Claude Montacie, Marie-Jose Caraty, Christine Fernandez-Maloigne, Xiaowei Tu, Mohsen Ardebilian Fard, Jean-Luc Le Floch
Author Affiliations +
Proceedings Volume 2916, Multimedia Storage and Archiving Systems; (1996) https://doi.org/10.1117/12.257295
Event: Photonics East '96, 1996, Boston, MA, United States
Abstract
A video is a multimedia document which is structured in scenes and shots. Scenes are lists of consecutive shots characterized by common visual and audio features. Shots are sets of consecutive frames separated by cuts, which can be easily recognized by existing techniques. Video segmentation into scenes is a new and open problem. It is needed for scenes retrieval, specially in authoring and interactive video applications. We propose a new approach of video segmentation into scenes, which is based on several media and takes into account the film syntax. We characterize a scene by some similarity between color histograms of the current shot, and of one of the most recent previous shots. Similarity between a shot frame and a frame of a previous shot may indicate the presence of alternate shots, which belong to the same scene. Other techniques based on projective geometry are presented in a companion paper. These techniques enable to detect the movement of the camera. We recognize the speakers of a scene by AR vector model techniques, such as the one proposed by some of the authors in the Orphee system, implemented at Laforia. However the speaker recognition problem is much more difficult when applied to the video CD-I, due to several transition types and various types of noise. We present experimental results, based on this approach. Detection of alternate shots is efficient, but speaker recognition needs improvements.
© (1996) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Pascal Faudemay, Liming Chen, Claude Montacie, Marie-Jose Caraty, Christine Fernandez-Maloigne, Xiaowei Tu, Mohsen Ardebilian Fard, and Jean-Luc Le Floch "Multichannel video segmentation", Proc. SPIE 2916, Multimedia Storage and Archiving Systems, (1 November 1996); https://doi.org/10.1117/12.257295
Lens.org Logo
CITATIONS
Cited by 6 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Databases

Image segmentation

Speaker recognition

Autoregressive models

Acoustics

Image retrieval

RELATED CONTENT

MPEG-7 audio-visual indexing test-bed for video retrieval
Proceedings of SPIE (December 15 2003)
Video indexing based on image and sound
Proceedings of SPIE (October 06 1997)
Retrieval based on image content using DC-image
Proceedings of SPIE (September 26 2001)
Video query formulation
Proceedings of SPIE (March 23 1995)

Back to Top