2023
DOI: 10.3389/frsip.2023.1137006
|View full text |Cite
|
Sign up to set email alerts
|

MRET: Multi-resolution transformer for video quality assessment

Abstract: No-reference video quality assessment (NR-VQA) for user generated content (UGC) is crucial for understanding and improving visual experience. Unlike video recognition tasks, VQA tasks are sensitive to changes in input resolution. Since large amounts of UGC videos nowadays are 720p or above, the fixed and relatively small input used in conventional NR-VQA methods results in missing high-frequency details for many videos. In this paper, we propose a novel Transformer-based NR-VQA framework that preserves the hig… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 32 publications
0
1
0
Order By: Relevance
“…Apart from exploiting the temporal redundancy of the video, other proposals also take advantage of the spatial redundancy of the image, using regions of interest for feature extraction or downsampled images. The NR-VQA model proposed in [111] uses a systematic sampling of the three spatiotemporal planes, and the one proposed in [112] combines frame sampling strategy with a multi-resolution patch sampling mechanism to maintain the high-resolution quality information. The work done in [113] integrates the fusion of temporal statistics of local and global image features.…”
Section: No-reference Quality Assessmentmentioning
confidence: 99%
“…Apart from exploiting the temporal redundancy of the video, other proposals also take advantage of the spatial redundancy of the image, using regions of interest for feature extraction or downsampled images. The NR-VQA model proposed in [111] uses a systematic sampling of the three spatiotemporal planes, and the one proposed in [112] combines frame sampling strategy with a multi-resolution patch sampling mechanism to maintain the high-resolution quality information. The work done in [113] integrates the fusion of temporal statistics of local and global image features.…”
Section: No-reference Quality Assessmentmentioning
confidence: 99%