2021
DOI: 10.1007/s11554-021-01151-6
|View full text |Cite
|
Sign up to set email alerts
|

Real time video summarizing using image semantic segmentation for CBVR

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 9 publications
(6 citation statements)
references
References 23 publications
0
6
0
Order By: Relevance
“…e three parameters of spatial contrast sensitivity function, brightness adaptive factor, and contrast masking factor of the digital multimedia video image are calculated, respectively. According to the product of the three parameters, the transform domain JND model of the digital multimedia video image is established, and the model is used for the quantization and coding of multimedia video multiresolution image [10,11].…”
Section: Fault Tolerant Digital Video Transmission Systemmentioning
confidence: 99%
“…e three parameters of spatial contrast sensitivity function, brightness adaptive factor, and contrast masking factor of the digital multimedia video image are calculated, respectively. According to the product of the three parameters, the transform domain JND model of the digital multimedia video image is established, and the model is used for the quantization and coding of multimedia video multiresolution image [10,11].…”
Section: Fault Tolerant Digital Video Transmission Systemmentioning
confidence: 99%
“…The author (Jain et al, 2021 ) proposed a technique for effectively summarizing videos in real time. To identify common objects, the Mask R-CNN model trained on the COCO dataset is employed.…”
Section: Literature Surveymentioning
confidence: 99%
“…Because summary evaluation is a subjective process, manual comparisons are difficult to achieve reliable results. In the video summarizing bibliography, many datasets stand out: SumMe (Choudhary et al, 2017 ; Taylor and Qureshi, 2018 ), TVSum, ADL (Yousefi et al, 2018 ), TRECKVID'08 (Ren and Jiang, 2009 ), and COCO (Jain et al, 2021 ) rushes require a thorough video summary. The dataset used in Table 2 is shown in Figure 7 .…”
Section: Literature Surveymentioning
confidence: 99%
“…We expanded on previous work from (Machireddy et al, 2021), where authors showed that a sparsely manually annotated dataset, typically around 1% of the image stack, was sufficient to train models to segment the whole volume. While state-of-the-art in semantic segmentation has been dominated by attention-based models for natural images (Jain et al, 2021), convolutional architectures remain main stream with EM data, and were used in (Machireddy et al, 2021), and the companion paper within this journal volume. In this paper, we compared architectures as well as training frameworks to find the most suitable one for the task of semantic segmentation in the aforementioned specific context of FIB-SEM images.…”
Section: Introductionmentioning
confidence: 99%