Paper
20 July 2001 Multiscale audio-video analysis and processing: segmentations and arrangements
Author Affiliations +
Proceedings Volume 4519, Internet Multimedia Management Systems II; (2001) https://doi.org/10.1117/12.434277
Event: ITCom 2001: International Symposium on the Convergence of IT and Communications, 2001, Denver, CO, United States
Abstract
We propose a multi-scale and multi-modal analysis and processing scheme for audio-video data. Using a non-linear scale-space technique audio-video is analyzed and processed such that it is invariant under various imaging and hearing conditions. Degradations due to Lyapunov and structural instabilities are suppressed by this scale-space technique without destroying essential semantic relations. On the basis of an audio-video segmentation its arrangements are quantified in terms of spatio-temporal inclusion relations and dynamic ordening relations by means of scaling connectivity relations. These relations infer a topological structure on top of the audio-video scale-space inducing a unimodal and multi-modal semantics. Our scheme is illustrated separately for video, audio and audio-video material the latter pointing out the added value of integrating audio and video.
© (2001) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Raango Aldershoff and Alfons H. Salden "Multiscale audio-video analysis and processing: segmentations and arrangements", Proc. SPIE 4519, Internet Multimedia Management Systems II, (20 July 2001); https://doi.org/10.1117/12.434277
Lens.org Logo
CITATIONS
Cited by 2 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Multimedia

Video

Image segmentation

Semantic video

Visualization

Computing systems

Databases

Back to Top