As a video encoding standard, High Efficiency Video Coding (HEVC) achieves excellent performance while causing a dramatic increase in coding complexity. Especially, the coding tree unit (CTU) depth decision process is the most complicated section, which takes heavy computation complexity in the entire HEVC intra coding process. Therefore, a deep learning-based method is applied to directly predict the CTU depth level for each frame in this study. In addition, a large-scale dataset that contains the coding unit image files and the corresponding depths was generated by HM16.15 to train and test the deep learning model. Besides, a Convolutional Neural Network called LeNet is fine-tuned by modifying the original architecture, and then the model with a more complicated structure is evaluated and compared on an acquired dataset. The experiments show that the fine-tuned deep learning model has the ability to identify accurately the depth level of CTU, the recognition accuracy reaches over 98.6%.
In High Efficiency Video Coding (HEVC) standard, the best intra prediction mode is decided by choosing the smallest ratedistortion cost of actual encoding among the total of 35 modes with the MPM (Most Probable Mode) scheme for compression purpose of mode encoding with reference to the adjacent reference blocks of the current prediction unit. This causes heavy computational complexity. In this paper, a deep neural network is conceived and experimented as a probable module for the intra prediction mode decision process inside of the HEVC encoding scheme. The neural network is trained and tested with a ground-truth dataset constructed from actual HEVC Intra encoding of original images. For the performance of the test, accuracy is used as the percentage of the correct mode output by the designed neural network to the ground-truth mode. The experimental results show that the neural network does not give good accuracy for the correct mode. However, accuracy increased when similar angle mode is considered as the correct mode. Also, the special modes of DC and Planar for MPM are analyzed in this paper.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.