In this paper, the concept of non-desynchronizing bits (NDBs) is defined in the context of H.264 video as a bit whose inversion does not cause desynchronization at the bitstream level or change the number of decoded macroblocks. We established that, on the whole, NDBs make up about a third (about 30%) of a bitstream, and that their flipping effect on visual quality is mostly insignificant. In most cases (90%), the PSNR value obtained when modifying an NDB is very close to the intact value. The performance of the proposed non-desync-based decoding framework, which retains a corrupted packet, under the condition of not causing desynchronization, has been compared to the JM-FC and a state-of-the-art concealment approach using the STBMA approach, and on average, respectively, provides 3.5 dB and 1.42 dB gain over them. Index Terms-video transmission, H.264, syntax elements, nondesynchronizing bit, concealment This work was funded by the Natural Sciences and Engineering Research Council of Canada (NSERC) Discovery Grant.
3D and multi-view 360-degree videos offer 3D immersive visual experience of the real world on head-mounted displays (HMDs). They are used in extended reality applications (including virtual reality, augmented reality and mixed reality) such as gaming, entertainment, education and medicine. However, these applications require a very high rate of compression and a great deal of computational resources. Compression efficiency and coding complexity are highly influenced by 360 projections used to map spherical frames to rectangular frames. To study the effect of this mapping process, in this paper, we evaluate the performance of various projections in 3D and multi-view 360-degree videos in terms of coding efficiency, quality and complexity. The evaluation is conducted for two coding scenarios: stereo coding scenario where the encoder considers the interview dependency for disparity estimation and simulcast coding scenario where two views are encoded separately. This performance evaluation approach provides valuable insights on using 360-degree projections in 3D and multi-view coding.
The latest video coding standards, H.264 and H.265, are highly vulnerable in error-prone networks. Reconstructed packets may exhibit significant degradation in terms of PSNR and visual quality. This paper presents a novel list decoding approach exploiting the receiver side user datagram protocol (UDP) checksum. The proposed method identifies the possible locations of errors by analyzing the pattern of the calculated UDP checksum. This permits to considerably reduce the number of candidate bitstreams in comparison to conventional list decoding approaches. When a packet composed of N bits contains a single-bit error, instead of considering N candidate bitstreams, as is the case in conventional list decoding approaches, the proposed approach considers N/32 candidate bitstreams, leading to a reduction of 97% of the number of candidates. For a two-bit error, the reduction increases to 99.6%. The method's performance is evaluated using H.264 and H.265 test model software. Our simulation results reveal that, on average, the error was corrected perfectly 80 to 90% of the time (the original bitstream was recovered). In addition, the proposed approach provides, on average, a 2.79 dB gain over frame copy (FC) error concealment using the Joint Model (JM) and a 3.57 dB gain over our implementation of FC error concealment in the HEVC Test Model (HM).
In this paper, we introduce a novel cyclic redundancy check (CRC)-based single error correction method which we apply to robust H.264 Baseline video decoding. Unlike stateof-the-art methods, the proposed correction algorithm does not require lookup tables as it determines the error location based on binary operations using the computed link layer CRC syndrome. Since multiple errors can lead to the same CRC syndrome as a single error, verification of the corrected packet is performed through a non-desynchronizing bits validation (NDBV), which forwards only compliant packets to the video decoder. Simulations on the H.264 Baseline profile show an average gain of 3.04 dB and 2.36 dB over stateof-the-art spatio-temporal error concealment (STBMA) and NDBV+STBMA reconstruction methods, respectively, at a residual bit error rate of 10 −6 .
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.