Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence 2021
DOI: 10.24963/ijcai.2021/348
|View full text |Cite
|
Sign up to set email alerts
|

State-Based Recurrent SPMNs for Decision-Theoretic Planning under Partial Observability

Abstract: The sum-product network (SPN) has been extended to model sequence data with the recurrent SPN (RSPN), and to decision-making problems with sum-product-max networks (SPMN). In this paper, we build on the concepts introduced by these extensions and present state-based recurrent SPMNs (S-RSPMNs) as a generalization of SPMNs to sequential decision-making problems where the state may not be perfectly observed. As with recurrent SPNs, S-RSPMNs utilize a repeatable template network to model sequences of arbitrary len… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 4 publications
0
1
0
Order By: Relevance
“…Similar to mapl-cirup, SPUDD (Hoey et al 1999) is a variation of the classic value iteration algorithm using Although SPUDD was introduced over two decades ago, it is still considered the state-of-the-art approach for solving factored MDPs exactly, as mentioned in multiple recent publications on approximate methods (Hayes et al 2021;Heß, Sundermann, and Thüm 2021;Moreira et al 2021;Dudek, Shrotri, and Vardi 2022;Tan and Nejat 2022). Vlasselaer et al (2016) investigate how dynamic Bayesian networks (Murphy 2002) benefit from knowledge compilation techniques.…”
Section: Related Workmentioning
confidence: 99%
“…Similar to mapl-cirup, SPUDD (Hoey et al 1999) is a variation of the classic value iteration algorithm using Although SPUDD was introduced over two decades ago, it is still considered the state-of-the-art approach for solving factored MDPs exactly, as mentioned in multiple recent publications on approximate methods (Hayes et al 2021;Heß, Sundermann, and Thüm 2021;Moreira et al 2021;Dudek, Shrotri, and Vardi 2022;Tan and Nejat 2022). Vlasselaer et al (2016) investigate how dynamic Bayesian networks (Murphy 2002) benefit from knowledge compilation techniques.…”
Section: Related Workmentioning
confidence: 99%