Anna Koop scite author profile

Anna Koop

5 Publications

36 Citation Statements Received

62 Citation Statements Given

Affiliations

University of Alberta

Publications

On the role of tracking in stationary environments

Sutton

Koop

Silver

2007

It is often thought that learning algorithms that track the best solution, as opposed to converging to it, are important only on nonstationary problems. We present three results suggesting that this is not so. First we illustrate in a simple concrete example, the Black and White problem, that tracking can perform better than any converging algorithm on a stationary problem. Second, we show the same point on a larger, more realistic problem, an application of temporal-difference learning to computer Go. Our third result suggests that tracking in stationary problems could be important for meta-learning research (e.g., learning to learn, feature selection, transfer). We apply a meta-learning algorithm for step-size adaptation, IDBD (Sutton, 1992a), to the Black and White problem, showing that meta-learning has a dramatic long-term effect on performance whereas, on an analogous converging problem, meta-learning has only a small second-order effect. This small result suggests a way of eventually overcoming a major obstacle to meta-learning research: the lack of an independent methodology for task selection.

Learning to Generalize through Predictive Representations: A Computational Model of Mediated Conditioning

Ludvig

Koop

Finding Useful Predictions by Meta-gradient Descent to Improve Decision-making

Kearney

Koop

Günther

et al. 2021

Preprint

What’s a good prediction? Challenges in evaluating an agent’s knowledge

2022

Constructing general knowledge by learning task-independent models of the world can help agents solve challenging problems. However, both constructing and evaluating such models remain an open challenge. The most common approach to evaluating models is to assess their accuracy with respect to observable values. However, the prevailing reliance on estimator accuracy as a proxy for the usefulness of the knowledge has the potential to lead us astray. We demonstrate the conflict between accuracy and usefulness through a series of illustrative examples including both a thought experiment and an empirical example in Minecraft, using the General Value Function framework (GVF). Having identified challenges in assessing an agent’s knowledge, we propose an alternate evaluation approach that arises naturally in the online continual learning setting: we recommend evaluation by examining internal learning processes, specifically the relevance of a GVF’s features to the prediction task at hand. This paper contributes a first look into evaluation of predictions through their use, an integral component of predictive knowledge which is as of yet unexplored.

What's a Good Prediction? Challenges in evaluating an agent's knowledge

Kearney

Koop

Pilarski

2020

Preprint
