[1] M. M. M. Zainal, K. Kamarudin, N. Abdalrahman, W. Rahiman, M. A. Abu Bakar, and M. R. Manan, “A Comparative Study of On-Policy and Off-Policy Tabular RL in the Taxi-v3 Path-Planning Task,” IJARIS, vol. 1, no. 2, pp. 143–156, Dec. 2025.