This paper tests the hypothesis that modeling a scene in terms of entities and their local interactions, as opposed to modeling the scene globally, provides a significant benefit in generalizing to physical tasks in a combinatorial space the learner has not encountered before. We present object-centric perception, prediction, and planning (OP3), which to the best of our knowledge is the first entity-centric probabilistic dynamic latent variable framework for model-based reinforcement learning that acquires entity representations from raw visual observations without supervision and uses them to predict and plan. OP3 enforces entity abstraction, the symmetric processing of each entity representation with the same locally-scoped function, which enables it to scale to different numbers and configurations of objects than those seen in training. Our approach to the key technical challenge of grounding these entity representations in actual objects in the environment is to frame this variable binding problem as an inference problem, and we develop an interactive inference algorithm that uses temporal continuity and interactive feedback to bind information about object properties to the entity variables. On block-stacking tasks, OP3 generalizes to novel block configurations and more objects than observed during training, outperforming an oracle model that assumes access to object supervision and achieving two to three times better accuracy than a state-of-the-art video prediction model that lacks entity abstraction.
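To make the entity-abstraction idea concrete, here is a minimal Python sketch of a transition model that processes every entity latent symmetrically with one shared, locally-scoped function. This is an illustration only, not OP3's actual architecture: the function names, the linear/tanh parametrization, and the summed pairwise interaction terms are all assumptions.

```python
import numpy as np

def pairwise_effect(z_i, z_j, W_pair):
    """Hypothetical effect of entity j on entity i."""
    return np.tanh(W_pair @ np.concatenate([z_i, z_j]))

def transition(entities, action, W_pair, W_node):
    """Apply one locally-scoped transition function to every entity.

    entities: (K, D) array of K entity latents. Because the same
    weights process every slot, the model is symmetric in K and can
    be run with more entities than it saw during training.
    """
    K, D = entities.shape
    out = np.empty_like(entities)
    for i in range(K):
        # Aggregate pairwise effects from all other entities on entity i.
        effect = np.zeros(D)
        for j in range(K):
            if j != i:
                effect += pairwise_effect(entities[i], entities[j], W_pair)
        # Shared per-entity update: identical weights for every slot.
        out[i] = np.tanh(W_node @ np.concatenate([entities[i], effect, action]))
    return out

# Usage with arbitrary dimensions (all values are placeholders):
D, A, K = 8, 2, 5  # latent dim, action dim, number of entities
rng = np.random.default_rng(0)
W_pair = rng.normal(size=(D, 2 * D))
W_node = rng.normal(size=(D, 2 * D + A))
z_next = transition(rng.normal(size=(K, D)), rng.normal(size=A), W_pair, W_node)
```

Note that nothing in `transition` depends on the value of K, which is the property that lets an entity-abstracted model generalize to scenes with more objects.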
We propose a method for meta-learning reinforcement learning algorithms by searching over the space of computational graphs that compute the loss function for a value-based model-free RL agent to optimize. The learned algorithms are domain-agnostic and can generalize to new environments not seen during training. Our method can both learn from scratch and bootstrap off known existing algorithms, like DQN, enabling interpretable modifications that improve performance. Learning from scratch on simple classical control and gridworld tasks, our method rediscovers the temporal-difference (TD) algorithm. Bootstrapping from DQN, we highlight two learned algorithms that generalize well to other classical control tasks, gridworld-style tasks, and Atari games. Analysis of the learned algorithms' behavior shows that they resemble recently proposed RL algorithms that address overestimation in value-based methods.
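As an illustration of what such a searched loss graph can look like, below is a hedged sketch of a tiny expression-graph evaluator together with a graph that encodes the TD(0) loss the abstract reports rediscovering. The node encoding and operation set here are hypothetical, not the paper's actual search space or DSL.

```python
import numpy as np

# Each graph node is (name, op, argument_names); nodes are listed in
# topological order and the last node is the loss output.
OPS = {
    "add": lambda a, b: a + b,
    "sub": lambda a, b: a - b,
    "mul": lambda a, b: a * b,
    "max": lambda a, b: np.maximum(a, b),
    "square": lambda a: a ** 2,
}

def evaluate(graph, inputs):
    """Evaluate a loss graph on one transition's quantities."""
    values = dict(inputs)
    for name, op, args in graph:
        values[name] = OPS[op](*(values[a] for a in args))
    return values[graph[-1][0]]

# A graph equivalent to the TD(0) loss the search can rediscover:
# loss = (r + gamma * max_a' Q(s', a') - Q(s, a))^2
td_graph = [
    ("scaled_next", "mul", ("gamma", "q_next_max")),
    ("target", "add", ("r", "scaled_next")),
    ("delta", "sub", ("target", "q")),
    ("loss", "square", ("delta",)),
]

loss = evaluate(td_graph, {"r": 1.0, "gamma": 0.99,
                           "q": 2.5, "q_next_max": 3.0})
print(loss)  # (1 + 0.99 * 3.0 - 2.5)^2 = 2.1609
```

Mutating, growing, and scoring graphs like `td_graph` across training environments is the kind of search loop the abstract describes; bootstrapping from DQN amounts to initializing the population with the TD graph rather than random graphs.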
We introduce RL-DARTS, one of the first applications of Differentiable Architecture Search (DARTS) in reinforcement learning (RL), used to search for convolutional cells on the Procgen benchmark. We outline the initial difficulties of applying neural architecture search techniques in RL, and demonstrate that, by simply replacing the image encoder with a DARTS supernet, our search method is sample-efficient, requires minimal extra compute, and is compatible with both off-policy and on-policy RL algorithms, needing only minor changes to preexisting code. Surprisingly, we find that the supernet can be used as an actor for inference to generate replay data in standard RL training loops, enabling end-to-end training. Throughout this training process, we show that the supernet gradually learns better cells, leading to alternative architectures that are highly competitive with manually designed policies; the discovered cells also validate previous design choices for RL policies.
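To illustrate the "replace the image encoder with a DARTS supernet" idea, here is a hedged PyTorch sketch of a mixed operation with learnable architecture weights used as a drop-in convolutional encoder for an RL policy. The candidate-op set, cell structure, and all names are assumptions, not the actual RL-DARTS search space.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MixedOp(nn.Module):
    """Softmax-weighted sum of candidate ops (DARTS-style relaxation)."""
    def __init__(self, channels):
        super().__init__()
        self.ops = nn.ModuleList([
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.Conv2d(channels, channels, 5, padding=2),
            nn.Identity(),  # skip-connection candidate
        ])
        # Architecture parameters, trained jointly with the RL loss.
        self.alpha = nn.Parameter(torch.zeros(len(self.ops)))

    def forward(self, x):
        w = F.softmax(self.alpha, dim=0)
        return sum(wi * op(x) for wi, op in zip(w, self.ops))

class SupernetEncoder(nn.Module):
    """Drop-in replacement for a fixed conv encoder in an RL agent."""
    def __init__(self, in_channels=3, channels=32, n_cells=2):
        super().__init__()
        self.stem = nn.Conv2d(in_channels, channels, 3, stride=2, padding=1)
        self.cells = nn.ModuleList([MixedOp(channels) for _ in range(n_cells)])

    def forward(self, obs):
        x = F.relu(self.stem(obs))
        for cell in self.cells:
            x = F.relu(cell(x))
        return x.flatten(1)  # feature vector for the policy/value heads

encoder = SupernetEncoder()
feats = encoder(torch.randn(8, 3, 64, 64))  # a batch of observations
```

Because the supernet is an ordinary differentiable module, it can act in the environment to generate replay data while its `alpha` weights drift toward the stronger candidate ops, which is the end-to-end behavior the abstract highlights.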