Zainal, Muhammad Muqri Muazam, et al. “A Comparative Study of On-Policy and Off-Policy Tabular RL in the Taxi-V3 Path-Planning Task”. International Journal of Autonomous Robotics and Intelligent Systems (IJARIS), vol. 1, no. 2, Dec. 2025, pp. 143-56, https://ejournal.unimap.edu.my/index.php/ijaris/article/view/2639.