Zainal, M. M. M., Kamarudin, K., Abdalrahman, N., Rahiman, W., Abu Bakar, M. A., & Manan, M. R. (2025). A Comparative Study of On-Policy and Off-Policy Tabular RL in the Taxi-v3 Path-Planning Task. International Journal of Autonomous Robotics and Intelligent Systems (IJARIS), 1(2), 143–156. Retrieved from https://ejournal.unimap.edu.my/index.php/ijaris/article/view/2639