Zainal, Muhammad Muqri Muazam, Kamarulzaman Kamarudin, Nasr Abdalrahman, Wan Rahiman, M Aizat Abu Bakar, and M Rizal Manan. 2025. “A Comparative Study of On-Policy and Off-Policy Tabular RL in the Taxi-V3 Path-Planning Task.” International Journal of Autonomous Robotics and Intelligent Systems (IJARIS) 1 (2): 143–56. https://ejournal.unimap.edu.my/index.php/ijaris/article/view/2639.