Reinforcement Learning for Solving the Knight’s Tour Problem