Kazuki Omi scite author profile

Kazuki Omi

5Publications

2Citation Statements Received

153Citation Statements Given

How they've been cited

How they cite others

153

Affiliations

Nagoya Institute of Technology

Publications

Order By: Most citations

On the Performance Evaluation of Action Recognition Models on Transcoded Low Quality Videos

Otani¹,

Hashiguchi²,

Omi³

et al. 2022

Preprint

View full text Add to dashboard Cite

In the design of action recognition models, the quality of videos in the dataset is an important issue, however the trade-off between the quality and performance is often ignored. In general, action recognition models are trained and tested on high-quality videos, but in actual situations where action recognition models are deployed, sometimes it might not be assumed that the input videos are of high quality. In this study, we report qualitative evaluations of action recognition models for the quality degradation associated with transcoding by JPEG and H.264/AVC. Experimental results are shown for evaluating the performance of pre-trained models on the transcoded validation videos of Kinetics400. The models are also trained on the transcoded training videos. From these results, we quantitatively show the degree of degradation of the model performance with respect to the degradation of the video quality.

show abstract

Model-Agnostic Multi-Domain Learning with Domain-Specific Adapters for Action Recognition

Omi

Kimata

Tamaki

2022

IEICE Trans. Inf. & Syst.

View full text Add to dashboard Cite

In this paper, we propose a multi-domain learning model for action recognition. The proposed method inserts domain-specific adapters between layers of domain-independent layers of a backbone network. Unlike a multi-head network that switches classification heads only, our model switches not only the heads, but also the adapters for facilitating to learn feature representations universal to multiple domains. Unlike prior works, the proposed method is model-agnostic and doesn't assume model structures unlike prior works. Experimental results on three popular action recognition datasets (HMDB51, UCF101, and Kinetics-400) demonstrate that the proposed method is more effective than a multi-head architecture and more efficient than separately training models for each domain.

show abstract

Performance Evaluation of Action Recognition Models on Low Quality Videos

et al. 2022

View full text Add to dashboard Cite

In the design of action recognition models, the quality of videos is an important issue; however, the trade-off between the quality and performance is often ignored. In general, action recognition models are trained on high-quality videos, hence it is not known how the model performance degrades when tested on low-quality videos, and how much the quality of training videos affects the performance. The issue of video quality is important, however, it has not been studied so far. The goal of this study is to show the trade-off between the performance and the quality of training and test videos by quantitative performance evaluation of several action recognition models for transcoded videos in different qualities. First, we show how the video quality affects the performance of pre-trained models. We transcode the original validation videos of Kinetics400 by changing quality control parameters of JPEG (compression strength) and H.264/AVC (CRF). Then we use the transcoded videos to validate the pre-trained models. Second, we show how the models perform when trained on transcoded videos. We transcode the original training videos of Kinetics400 by changing the quality parameters of JPEG and H.264/AVC. Then we train the models on the transcoded training videos and validate them with the original and transcoded validation videos. Experimental results with JPEG transcoding show that there is no severe performance degradation (up to −1.5%) for compression strength smaller than 70 where no quality degradation is visually observed, and for larger than 80 the performance degrades linearly with respect to the quality index. Experiments with H.264/AVC transcoding show that there is no significant performance loss (up to −1%) with CRF30 while the total size of video files is reduced to 30%. In summary, the video quality doesn't have a large impact on the performance of action recognition models unless the quality degradation is severe and visible. This enables us to transcode the training and validation videos and reduce the file sizes to one-third of the original videos.

show abstract

Model-agnostic Multi-Domain Learning with Domain-Specific Adapters for Action Recognition

Omi¹,

Tamaki²

2022

Preprint

View full text Add to dashboard Cite

In this paper, we propose a multi-domain learning model for action recognition. The proposed method inserts domain-specific adapters between layers of domainindependent layers of a backbone network. Unlike a multihead network that switches classification heads only, our model switches not only the heads, but also the adapters for facilitating to learn feature representations universal to multiple domains. Unlike prior works, the proposed method is model-agnostic and doesn't assume model structures unlike prior works. Experimental results on three popular action recognition datasets (HMDB51, UCF101, and Kinetics-400) demonstrate that the proposed method is more effective than a multi-head architecture and more efficient than separately training models for each domain.

show abstract

On the instability of unsupervised domain adaptation with ADDA

Omi

Tamaki

2022

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Kazuki Omi

On the Performance Evaluation of Action Recognition Models on Transcoded Low Quality Videos

Model-Agnostic Multi-Domain Learning with Domain-Specific Adapters for Action Recognition

Performance Evaluation of Action Recognition Models on Low Quality Videos

Model-agnostic Multi-Domain Learning with Domain-Specific Adapters for Action Recognition

On the instability of unsupervised domain adaptation with ADDA

Contact Info

Product

Resources

About