1.
Zainal MMM, Kamarudin K, Abdalrahman N, Rahiman W, Abu Bakar MA, Manan MR. A Comparative Study of On-Policy and Off-Policy Tabular RL in the Taxi-v3 Path-Planning Task. IJARIS [Internet]. 2025 Dec. 29 [cited 2026 Jan. 2];1(2):143-56. Available from: https://ejournal.unimap.edu.my/index.php/ijaris/article/view/2639