The navigation problem is classically approached in two steps: an exploration step, where map-information about the environment is gathered; and an exploitation step, where this information is used to navigate efficiently. Deep reinforcement learning (DRL) algorithms, alternatively, approach the problem of navigation in an end-to-end fashion. Inspired by the classical approach, we ask whether DRL algorithms are able to inherently explore, gather and exploit map-information over the course of navigation. We build upon the work of Mirowski et al. [2017] and introduce a systematic suite of experiments that vary three parameters: the agent's starting location, the agent's target location, and the maze structure. We choose evaluation metrics that explicitly measure the algorithm's ability to gather and exploit map-information. Our experiments show that when trained and tested on the same maps, the algorithm successfully gathers and exploits map-information. However, when trained and tested on different sets of maps, the algorithm fails to transfer the ability to gather and exploit map-information to unseen maps. Furthermore, we find that when the goal location is randomized and the map is kept static, the algorithm is able to gather and exploit map-information but the exploitation is far from optimal. We open-source our experimental suite in the hope that it serves as a framework for the comparison of future algorithms and leads to the discovery of robust alternatives to classical navigation methods.
Consider multi-goal tasks that involve static environments and dynamic goals. Examples of such tasks, such as goal-directed navigation and pick-and-place in robotics, abound. Two types of Reinforcement Learning (RL) algorithms are used for such tasks: model-free and model-based. Each approach has limitations. Model-free RL struggles to transfer learned information when the goal location changes, but achieves high asymptotic accuracy in single-goal tasks. Model-based RL can transfer learned information to new goal locations by retaining the explicitly learned state dynamics, but is limited by the fact that small errors in modelling these dynamics accumulate over long-term planning. In this work, we address the limitations of model-free RL in multi-goal domains. We do this by adapting the Floyd-Warshall algorithm to RL and call the adaptation Floyd-Warshall RL (FWRL). The proposed algorithm learns a goal-conditioned action-value function by constraining the value of the optimal path between any two states to be greater than or equal to the value of paths via intermediary states. Experimentally, we show that FWRL is more sample-efficient and learns higher-reward strategies in multi-goal tasks than Q-learning, model-based RL and other relevant baselines in a tabular domain. * Several weeks after submitting this work, we found work by Kaelbling (1993) whose main contribution is similar to ours. We have highlighted the minor differences in the related work section.
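For illustration, here is a minimal tabular sketch of the relaxation described above. It is a simplification under stated assumptions: it uses a state-to-goal value table V[s, g] rather than the full goal-conditioned action-value function, and the state count, transition list, and variable names are hypothetical, not the authors' implementation.

```python
import numpy as np

# Minimal sketch (assumptions: 5 states on a line, unit step cost, deterministic
# transitions). V[s, g] estimates the return for reaching goal g from state s.
N = 5
NEG = -np.inf
V = np.full((N, N), NEG)        # unknown state-goal pairs start at -inf
np.fill_diagonal(V, 0.0)        # zero return for a goal already reached

# Observed one-step transitions (state, next_state, reward) -- illustrative data.
transitions = [(0, 1, -1.0), (1, 2, -1.0), (2, 3, -1.0), (3, 4, -1.0)]
for s, s_next, r in transitions:
    V[s, s_next] = max(V[s, s_next], r)

# Floyd-Warshall-style relaxation: the value of the best path from s to g is
# constrained to be at least the value of any path passing through an
# intermediate state w.
for w in range(N):
    for s in range(N):
        for g in range(N):
            if V[s, w] > NEG and V[w, g] > NEG:
                V[s, g] = max(V[s, g], V[s, w] + V[w, g])

print(V[0, 4])  # -4.0: a composed estimate for a state-goal pair never observed directly
```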
We present Iterative Vision-and-Language Navigation (IVLN), a paradigm for evaluating language-guided agents navigating in a persistent environment over time. Existing Vision-and-Language Navigation (VLN) benchmarks erase the agent's memory at the beginning of every episode, testing the ability to perform cold-start navigation with no prior information. However, deployed robots occupy the same environment for long periods of time. The IVLN paradigm addresses this disparity by training and evaluating VLN agents that maintain memory across tours of scenes consisting of up to 100 ordered instruction-following Room-to-Room (R2R) episodes, each defined by an individual language instruction and a target path. We present discrete and continuous Iterative Room-to-Room (IR2R) benchmarks comprising about 400 tours each in 80 indoor scenes. We find that extending the implicit memory of high-performing transformer VLN agents is not sufficient for IVLN, but agents that build maps can benefit from environment persistence, motivating a renewed focus on map-building agents in VLN. * Equal contributions. Preprint. Under review.
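To make the tour protocol concrete, the sketch below contrasts standard episodic VLN evaluation (memory erased before every episode) with iterative evaluation over an ordered tour (memory reset only once per scene). The `Agent` and `Episode` classes and their methods are hypothetical placeholders for illustration, not the benchmark's API.

```python
class Episode:
    """One instruction-following episode: a language instruction and a target path."""
    def __init__(self, instruction, target_path=None):
        self.instruction = instruction
        self.target_path = target_path

    def run(self, agent):
        # Placeholder for a real rollout in the simulator: record whether the
        # agent had prior memory, then fold this episode into its memory.
        had_prior_memory = agent.memory is not None
        agent.memory = (agent.memory or []) + [self.instruction]
        return {"instruction": self.instruction, "had_prior_memory": had_prior_memory}


class Agent:
    def __init__(self):
        self.memory = None          # e.g., a map or other persistent state

    def reset_memory(self):
        self.memory = None


def evaluate_episodic(agent, tour):
    # Standard VLN: cold start, memory erased before every episode.
    results = []
    for episode in tour:
        agent.reset_memory()
        results.append(episode.run(agent))
    return results


def evaluate_iterative(agent, tour):
    # IVLN: memory is reset once per tour and persists across the ordered episodes.
    agent.reset_memory()
    return [episode.run(agent) for episode in tour]


tour = [Episode(f"instruction {i}") for i in range(3)]
print(evaluate_episodic(Agent(), tour))   # had_prior_memory is always False
print(evaluate_iterative(Agent(), tour))  # becomes True after the first episode
```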