KEYWORDS: 3D modeling, Visualization, 3D acquisition, Visual compression, 3D image processing, Video, Process modeling, Performance modeling, Visual process modeling, Quantization, Neural networks, Deep learning
This paper provides insight into the 3D scene compression represented by a neural implicit function. The goal of this paper is to introduce the aspects of implicit neural representation for 3D scenes such as NeRF (Neural Radiance Field) and propose a novel compression method for Neural Implicit representation for 3D scenes. We also provide the analysis of compression performance of 3D scene representation by using Neural implicit function.
KEYWORDS: Volume rendering, Video, Video compression, High efficiency video coding, Video coding, Voxels, Image fusion, Image compression, Discontinuities
The versatile video coding (VVC) [1] standard has doubled the number of intra prediction modes and MPM modes in the picture compared to the previous standard, High Efficiency Video Coding (HEVC) [2]. The most probable mode (MPM) is used to efficiently encode the intra prediction mode based on the neighboring intra-coded blocks. The VVC improves the compression performance by increasing the number of intra prediction mode and MPM candidates as the resolution of the video increases, but the texture map may be inefficient because the characteristics of the texture map are different from the general image. In this paper, we propose the efficient MPM candidate derivation on the Truncated Signed Distance Field (TSDF) [3] volume-based mesh property (texture map) for multi-view images. The proposed method shows 0.92% BD-rate performance gain for luma component in the random-access configuration [4].
An efficient biprediction decision scheme of high efficiency video coding (HEVC) is proposed for fast-encoding applications. For low-delay video applications, bidirectional prediction can be used to increase compression performance efficiently with previous reference frames. However, at the same time, the computational complexity of the HEVC encoder is significantly increased due to the additional biprediction search. Although a some research has attempted to reduce this complexity, whether the prediction is strongly related to both motion complexity and prediction modes in a coding unit has not yet been investigated. A method that avoids most compression-inefficient search points is proposed so that the computational complexity of the motion estimation process can be dramatically decreased. To determine if biprediction is critical, the proposed method exploits the stochastic correlation of the context of prediction units (PUs): the direction of a PU and the accuracy of a motion vector. Through experimental results, the proposed method showed that the time complexity of biprediction can be reduced to 30% on average, outperforming existing methods in view of encoding time, number of function calls, and memory access.
KEYWORDS: Video, Scalable video coding, Signal to noise ratio, Video compression, Video processing, Temporal resolution, Video coding, Image quality, Space operations, Multimedia
In universal media access (UMA) environment, because of the heterogeneous networks and terminals, flexible video adaptation, that is performed according to the network conditions and terminal capabilities as well as user preferences, is required to maximize consumer experience and ensure Quality of Service (QoS). MPEG-21 Digital Item Adaptation (DIA) support an interoperable framework for effective and efficient video adaptation. Among MPEG-21 DIA tools, utility function that describes the relations among the feasible adaptation operation, resource constraint, and utility plays the most important role in adaptation process because the optimal adaptation operation is decided among the feasible adaptation operations with given constraints. Therefore, in this paper, the overall concept of MEPG-21 DIA based adaptation framework and formulation of utility function are presented. In addition, the feasibility of the adaptation framework is presented by applying it to a few use cases for generating utility function and applications to specific adaptation scenarios involving nonscalable and scalable video.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.