Paper
24 August 1999 Video-assisted segmentation of speech and audio track
Medha Pandit, Yusseri Yusoff, Josef Kittler, William J. Christmas, E. H. S. Chilton
Author Affiliations +
Proceedings Volume 3846, Multimedia Storage and Archiving Systems IV; (1999) https://doi.org/10.1117/12.360456
Event: Photonics East '99, 1999, Boston, MA, United States
Abstract
Video database research is commonly concerned with the storage and retrieval of visual information invovling sequence segmentation, shot representation and video clip retrieval. In multimedia applications, video sequences are usually accompanied by a sound track. The sound track contains potential cues to aid shot segmentation such as different speakers, background music, singing and distinctive sounds. These different acoustic categories can be modeled to allow for an effective database retrieval. In this paper, we address the problem of automatic segmentation of audio track of multimedia material. This audio based segmentation can be combined with video scene shot detection in order to achieve partitioning of the multimedia material into semantically significant segments.
© (1999) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Medha Pandit, Yusseri Yusoff, Josef Kittler, William J. Christmas, and E. H. S. Chilton "Video-assisted segmentation of speech and audio track", Proc. SPIE 3846, Multimedia Storage and Archiving Systems IV, (24 August 1999); https://doi.org/10.1117/12.360456
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Multimedia

Acoustics

Databases

Statistical modeling

Data modeling

Automatic tracking

Back to Top