Zainal, Muhammad Muqri Muazam, Kamarulzaman Kamarudin, Nasr Abdalrahman, Wan Rahiman, M Aizat Abu Bakar, and M Rizal Manan. “A Comparative Study of On-Policy and Off-Policy Tabular RL in the Taxi-V3 Path-Planning Task”. International Journal of Autonomous Robotics and Intelligent Systems (IJARIS) 1, no. 2 (December 29, 2025): 143–156. Accessed January 2, 2026. https://ejournal.unimap.edu.my/index.php/ijaris/article/view/2639.