Bensoussan, Li, Nguyen, et al. 2020.
“Machine Learning and Control Theory.” arXiv:2006.05604 [Cs, Math, Stat].
Berrueta, Pinosky, and Murphey. 2024.
“Maximum Diffusion Reinforcement Learning.” Nature Machine Intelligence.
Brockman, Cheung, Pettersson, et al. 2016.
“OpenAI Gym.” arXiv:1606.01540 [Cs].
Chen, Lu, Rajeswaran, et al. 2021.
“Decision Transformer: Reinforcement Learning via Sequence Modeling.” In
Advances in Neural Information Processing Systems.
Clifton, and Laber. 2020.
“Q-Learning: Theory and Applications.” Annual Review of Statistics and Its Application.
Drori. 2022a. “Deep Reinforcement Learning.” In The Science of Deep Learning.
———. 2022b. “Reinforcement Learning.” In The Science of Deep Learning.
Fellows, Mahajan, Rudner, et al. 2019.
“VIREL: A Variational Inference Framework for Reinforcement Learning.” In
Advances in Neural Information Processing Systems.
Hu, Com, and Wellman. n.d. “Nash Q-Learning for General-Sum Stochastic Games.”
Jaakkola, Singh, and Jordan. 1995.
“Reinforcement Learning Algorithm for Partially Observable Markov Decision Problems.” In
Advances in Neural Information Processing Systems.
Kaelbling, Littman, and Moore. 1996.
“Reinforcement Learning: A Survey.” Journal of Artifical Intelligence Research.
Kochenderfer, Wheeler, and Wray. 2022. Algorithms for decision making.
Krakovsky. 2016.
“Reinforcement Renaissance.” Commun. ACM.
Krishnamurthy, Agarwal, and Langford. 2016.
“Contextual-MDPs for PAC-Reinforcement Learning with Rich Observations.” arXiv:1602.02722 [Cs, Stat].
Lehman, Gordon, Jain, et al. 2022.
“Evolution Through Large Models.”
Oberst, and Sontag. 2019.
“Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models.” In
Proceedings of the 36th International Conference on Machine Learning.
Parisotto, and Salakhutdinov. 2017.
“Neural Map: Structured Memory for Deep Reinforcement Learning.” arXiv:1702.08360 [Cs].
Schulman, Wolski, Dhariwal, et al. 2017.
“Proximal Policy Optimization Algorithms.”
Shibata, Yoshinaka, and Chikayama. 2006.
“Probabilistic Generalization of Simple Grammars and Its Application to Reinforcement Learning.” In
Algorithmic Learning Theory. Lecture Notes in Computer Science 4264.
Silver, Singh, Precup, et al. 2021.
“Reward Is Enough.” Artificial Intelligence.
Sutton, Richard S, and Barto. 1998.
Reinforcement Learning.
Sutton, Richard S., McAllester, Singh, et al. 2000.
“Policy Gradient Methods for Reinforcement Learning with Function Approximation.” In
Advances in Neural Information Processing Systems.