Localizing actions in video is a core task in computer vision. The weakly supervised temporal localization problem investigates whether this task can be adequately solved with only video-level labels, significantly reducing the amount of expensive and error-prone annotation that is required. A common approach is to train a frame-level classifier where frames with the highest class probability are selected to make a video-level prediction. Frame-level activations are then used for localization. However, the absence of frame-level annotations causes the classifier to impart class bias on every frame. To address this, we propose the Action Selection Learning (ASL) approach to capture the general concept of action, a property we refer to as "actionness". Under ASL, the model is trained with a novel class-agnostic task to predict which frames will be selected by the classifier. Empirically, we show that ASL outperforms leading baselines on two popular benchmarks, THUMOS-14 and ActivityNet-1.2, with 10.3% and 5.7% relative improvement, respectively. We further analyze the properties of ASL and demonstrate the importance of actionness. Full code for this work is available here: https://github.com/layer6ai-labs/ASL.
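The two training signals described in the abstract (top-k frame selection for the video-level classification loss, and a class-agnostic "actionness" head trained to predict which frames the classifier selects) can be sketched roughly as follows. This is a minimal illustrative sketch, not the authors' official implementation (see the linked repository); the layer sizes, the top-k ratio, and the exact selection target are assumptions.

```python
# Minimal sketch of the ASL training signals, assuming pre-extracted frame
# features and a simple linear classifier / actionness head. Hyperparameters
# (feat_dim, k_ratio) are illustrative, not the paper's values.
import torch
import torch.nn as nn
import torch.nn.functional as F


class ASLSketch(nn.Module):
    def __init__(self, feat_dim=2048, num_classes=20):
        super().__init__()
        self.classifier = nn.Linear(feat_dim, num_classes)  # frame-level class scores
        self.actionness = nn.Linear(feat_dim, 1)             # class-agnostic actionness score

    def forward(self, feats):
        # feats: (batch, frames, feat_dim) pre-extracted frame features
        cls_logits = self.classifier(feats)                  # (B, T, C)
        act_logits = self.actionness(feats).squeeze(-1)      # (B, T)
        return cls_logits, act_logits


def asl_losses(cls_logits, act_logits, video_labels, k_ratio=0.125):
    """video_labels: (B, C) multi-hot video-level labels."""
    B, T, C = cls_logits.shape
    k = max(1, int(T * k_ratio))

    # Video-level prediction: average the top-k frame scores per class.
    topk_scores, topk_idx = cls_logits.topk(k, dim=1)        # (B, k, C)
    video_logits = topk_scores.mean(dim=1)                   # (B, C)
    cls_loss = F.binary_cross_entropy_with_logits(video_logits, video_labels)

    # Class-agnostic selection target: mark a frame as "selected" if the
    # classifier placed it in the top-k for any ground-truth class.
    selected = torch.zeros(B, T, device=cls_logits.device)
    for b in range(B):
        gt = video_labels[b].nonzero(as_tuple=True)[0]
        selected[b].scatter_(0, topk_idx[b, :, gt].reshape(-1), 1.0)
    act_loss = F.binary_cross_entropy_with_logits(act_logits, selected)

    return cls_loss, act_loss
```

At test time, the actionness scores can be combined with the class activations for localization, which is where the class-agnostic signal helps counteract the classifier's per-frame class bias.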
We present a novel Cross-Class Relevance Learning approach for the task of temporal concept localization. Most localization architectures rely on feature extraction layers followed by a classification layer that outputs class probabilities for each segment. However, in many real-world applications classes can exhibit complex relationships that are difficult to model with this architecture. In contrast, we propose to incorporate target class and class-related features as input, and learn a pairwise binary model to predict general segment-to-class relevance. This facilitates learning of shared information between classes, and allows for arbitrary class-specific feature engineering. We apply this approach to the 3rd YouTube-8M Video Understanding Challenge together with other leading models, and achieve first place out of over 280 teams. In this paper we describe our approach and present empirical results.
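The core idea of scoring (segment, class) pairs with a single binary relevance model can be sketched as below. This is a rough illustration under assumed choices (a learned class embedding standing in for class-related features, simple MLP scorer, made-up dimensions), not the competition model.

```python
# Sketch of a pairwise segment-to-class relevance model: segment features and
# target-class features are concatenated and scored with one shared network.
import torch
import torch.nn as nn


class PairwiseRelevance(nn.Module):
    def __init__(self, seg_dim=1152, num_classes=1000, class_dim=128, hidden=512):
        super().__init__()
        self.class_emb = nn.Embedding(num_classes, class_dim)  # class-related features (assumed learned embedding)
        self.scorer = nn.Sequential(
            nn.Linear(seg_dim + class_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, seg_feats, class_ids):
        # seg_feats: (N, seg_dim) segment features; class_ids: (N,) target class per pair
        pair = torch.cat([seg_feats, self.class_emb(class_ids)], dim=-1)
        return self.scorer(pair).squeeze(-1)  # binary relevance logit per (segment, class) pair


# Usage: score one segment against several candidate classes.
model = PairwiseRelevance()
seg = torch.randn(3, 1152)            # the same segment repeated for 3 candidate classes
classes = torch.tensor([5, 17, 42])
logits = model(seg, classes)          # relevance of the segment to each candidate class
```

Because the scorer is shared across all classes, information learned for one class can transfer to related classes, and any hand-engineered class-specific features can simply be appended to the class input.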