“…Recently, there is a growing line of research in the statistics literature for policy learning and/or evaluation in infinite horizons. Some references include Chen et al (2022), Ertefaie and Strawderman (2018), Liao et al (2020), Liao et al (2021), Li et al (2022), Luckett et al (2020), Ramprasad et al (2022), Shi et al (2022, and Xu et al (2020). In the computer science literature, there is a huge literature on developing reinforcement learning (RL) algorithms in infinite horizons.…”