Our paper has been accepted to 31st International Conference on Neural Information Processing (ICONIP 2024). This paper proposes an adaptation method of trust region for trust region policy optimization (TRPO).
- Shoma Shimizu, Kento Uchida, Atsuo Maki, and Shinichi Shirakawa: Adaptive Trust Region Radius for Robust Policy Optimization, 31st International Conference on Neural Information Processing (ICONIP 2024), Auckland, New Zealand, December 2-6, 2024.