2012
DOI: 10.1007/978-3-642-24647-0_3
|View full text |Cite
|
Sign up to set email alerts
|

Trading Value and Information in MDPs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
91
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 68 publications
(92 citation statements)
references
References 11 publications
1
91
0
Order By: Relevance
“…The theory was originally developed in applied settings for the development of telecommunications and cryptography systems [16,18]. However, it has seen wide application in fields related to cognitive science, including theoretical neuroscience [19], statistical complexity theory [20,21], and models of rational action under information processing constraints [22,23]. Fruitful applications of information theory to language are now possible due to large datasets and computational power that make it possible to estimate information-theoretic quantities such as entropy from language data [24].…”
Section: What Is Communicative Efficiency?mentioning
confidence: 99%
“…The theory was originally developed in applied settings for the development of telecommunications and cryptography systems [16,18]. However, it has seen wide application in fields related to cognitive science, including theoretical neuroscience [19], statistical complexity theory [20,21], and models of rational action under information processing constraints [22,23]. Fruitful applications of information theory to language are now possible due to large datasets and computational power that make it possible to estimate information-theoretic quantities such as entropy from language data [24].…”
Section: What Is Communicative Efficiency?mentioning
confidence: 99%
“…Following the work of (Simon, 1972) decision-making with limited information-processing resources has been studied extensively in psychology, economics, political science, industrial organization, computer science, and artificial intelligence research. In this paper, we use an information-theoretic model of decisionmaking under resource constraints (McKelvey and Palfrey, 1995;Kappen, 2005;Wolpert, 2006;Todorov, 2009;Peters et al, 2010;Theodorou et al, 2010;Rubin et al, 2012). In particular, Braun et al (2011) and Ortega and Braun (2011) present a framework in which gain in expected utility is traded off against the adaptation cost of changing from an initial behavior to a posterior behavior.…”
Section: Discussionmentioning
confidence: 99%
“…In this study, we use an information-theoretic model of bounded rational decision-making Braun, 2012, 2013;Braun and Ortega, 2014; that has precursors in the economic literature (McKelvey and Palfrey, 1995;Mattsson and Weibull, 2002;Sims, 2003Sims, , 2005Sims, , 2006Sims, , 2010Wolpert, 2006) and that is closely related to recent advances in the information theory of perception-action systems (Todorov, 2007(Todorov, , 2009Still, 2009;Friston, 2010;Peters et al, 2010;Tishby and Polani, 2011;Daniel et al, 2012Daniel et al, , 2013Kappen et al, 2012;Rawlik et al, 2012;Rubin et al, 2012;Neymotin et al, 2013;Tkačik and Bialek, 2014;Palmer et al, 2015). The basis of this approach is formalized by a free energy principle that trades off expected utility, and the cost of computation that is required to adapt the system accordingly in order to achieve high utility.…”
Section: Introductionmentioning
confidence: 99%
“…A recent theory has suggested that random choice behavior results from computational limitations on the policy [ 46 ]. In this framework, optimal policy is assumed to be a tradeoff between expected reward and expected information cost.…”
Section: Discussionmentioning
confidence: 99%