The HEVC Annotated Regions (AR) SEI message supports object tracking by carrying parameters that define rectangular bounding boxes with unique object identifiers, time-aligned within a video bitstream. An end-to-end distributed video analytics pipeline that uses the AR SEI message has been implemented within the GStreamer framework, with an edge node and a cloud server node. At the edge, lightweight face detection is performed, and the face region parameters are used to create the AR SEI message syntax within an HEVC bitstream. At the cloud server, face regions are extracted from the decoded video, and age and gender classification is performed on them. The HEVC bitstream is then updated to include the additional metadata in the AR SEI message.
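
The edge-to-cloud flow of region metadata described above can be illustrated with a minimal sketch. It models only the information carried (a bounding box, a unique object identifier, and a label added later), not the SEI bitstream syntax or any GStreamer API. All names here (`AnnotatedObject`, `AnnotatedRegions`, `edge_annotate`, `cloud_update`) and the label strings in the usage note are hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, Optional, Tuple


@dataclass
class AnnotatedObject:
    """One tracked object: a rectangular bounding box plus an optional label."""
    object_id: int
    left: int
    top: int
    width: int
    height: int
    label: Optional[str] = None  # unset at the edge, filled in at the cloud


@dataclass
class AnnotatedRegions:
    """Hypothetical in-memory stand-in for the region metadata of one picture."""
    objects: Dict[int, AnnotatedObject] = field(default_factory=dict)

    def add_object(self, obj: AnnotatedObject) -> None:
        # Object identifiers must be unique within the set of regions.
        if obj.object_id in self.objects:
            raise ValueError(f"duplicate object id {obj.object_id}")
        self.objects[obj.object_id] = obj


def edge_annotate(face_boxes: Iterable[Tuple[int, int, int, int]]) -> AnnotatedRegions:
    """Edge step: wrap detector output (left, top, width, height) as regions."""
    regions = AnnotatedRegions()
    for object_id, (left, top, width, height) in enumerate(face_boxes):
        regions.add_object(AnnotatedObject(object_id, left, top, width, height))
    return regions


def cloud_update(regions: AnnotatedRegions,
                 classify: Callable[[AnnotatedObject], str]) -> AnnotatedRegions:
    """Cloud step: classify each region and write the result back as its label."""
    for obj in regions.objects.values():
        obj.label = classify(obj)
    return regions
```

As a usage illustration, `edge_annotate([(10, 20, 64, 64)])` yields one unlabeled object with identifier 0, and `cloud_update(regions, classifier)` then attaches whatever string the supplied classifier returns, mirroring how the cloud node adds metadata to regions that the edge node created.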