First person perspective video activity recognition
22 April 2020
Abstract
The initial development of two First-Person Perspective Video Activity Recognition Systems is discussed. The first system, First Person Fall Detection (UFall), recognizes when a person wearing or holding the mobile vision system has fallen, tackling fall detection from the unique first-person perspective. The second system, the directed CrossWalk System (UCross), detects the user's movement across a crosswalk and is intended to help a low vision person navigate. In both cases, the user wears or holds the camera device to monitor or inspect the environment. This first-person perspective yields unusual fall data, which is captured and used to create the fall detection system. Both systems employ Machine Learning, feeding video input to trained Long Short-Term Memory (LSTM) networks. These first-person perspective video activity recognition systems use the TensorFlow framework [1] and are deployed on mobile phones as a proof of concept. The applications could be useful for low vision people and, in the case of fall detection, for senior citizens, police, construction workers, and others in inspection-oriented jobs, by helping users who have fallen. The successes and challenges encountered with this unique first-person perspective data are presented, along with future avenues of work.
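To make the idea of classifying a video clip with an LSTM concrete, the following is a minimal sketch in plain Python rather than TensorFlow. It is not the authors' architecture: the abstract gives no layer sizes, features, or training details, so the function names (`lstm_step`, `init_params`, `classify_clip`), the per-frame feature vectors, the random weights, and the single sigmoid output read as a "fall" score are all assumptions made for illustration.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(W, v):
    """Multiply matrix W (list of rows) by vector v."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def lstm_step(x, h, c, params):
    """One LSTM time step.

    params maps a gate name ("i", "f", "o", "g") to (W, U, b), where W acts
    on the input x and U acts on the previous hidden state h.
    """
    gates = {}
    for name in ("i", "f", "o", "g"):
        W, U, b = params[name]
        pre = [wx + uh + bi
               for wx, uh, bi in zip(matvec(W, x), matvec(U, h), b)]
        # The candidate gate "g" uses tanh; the other gates use a sigmoid.
        act = math.tanh if name == "g" else sigmoid
        gates[name] = [act(p) for p in pre]
    # New cell state: keep part of the old state, add part of the candidate.
    c_new = [f * cp + i * g
             for f, cp, i, g in zip(gates["f"], c, gates["i"], gates["g"])]
    # New hidden state: output gate applied to the squashed cell state.
    h_new = [o * math.tanh(cn) for o, cn in zip(gates["o"], c_new)]
    return h_new, c_new


def init_params(input_dim, hidden_dim, seed=0):
    """Hypothetical small random weights; a real system would train these."""
    rng = random.Random(seed)

    def mat(rows, cols):
        return [[rng.uniform(-0.1, 0.1) for _ in range(cols)]
                for _ in range(rows)]

    return {name: (mat(hidden_dim, input_dim),
                   mat(hidden_dim, hidden_dim),
                   [0.0] * hidden_dim)
            for name in ("i", "f", "o", "g")}


def classify_clip(frames, params, w_out, b_out):
    """Run the LSTM over per-frame feature vectors and score the clip.

    The final hidden state is fed to a logistic output; the result is a
    value in (0, 1) that this sketch interprets as a fall probability.
    """
    hidden_dim = len(w_out)
    h = [0.0] * hidden_dim
    c = [0.0] * hidden_dim
    for x in frames:
        h, c = lstm_step(x, h, c, params)
    return sigmoid(sum(w * hi for w, hi in zip(w_out, h)) + b_out)


if __name__ == "__main__":
    params = init_params(input_dim=4, hidden_dim=3)
    clip = [[0.1, 0.2, 0.3, 0.4] for _ in range(5)]  # 5 frames, 4 features
    print(classify_clip(clip, params, w_out=[0.5, -0.5, 0.2], b_out=0.0))
```

In a deployed system each frame would first be reduced to a feature vector (for example by a convolutional network), and the weights would be learned from labeled first-person clips rather than drawn at random.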
© (2020) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Lynne Grewe, Chengzhi Hu, Krishna Tank, Aditya Jaiswal, Thomas Martin, Sahil Sutaria, Tran Huynh, and Francis David Bustos, "First person perspective video activity recognition", Proc. SPIE 11423, Signal Processing, Sensor/Information Fusion, and Target Recognition XXIX, 1142310 (22 April 2020); https://doi.org/10.1117/12.2557922
KEYWORDS
Video
Machine vision
Computer vision technology
Machine learning