Firstly enter start and target points and start learning algorithm.
After learning environment with Q-learning algorithm it shows the most beneficial route to target based on Q matrice.
After showing the most beneficial route to target based on Q matrice, it will draw episode via cost and episode via step graphs.