The Bellman Equation and Markov Decision Processes in Reinforcement Learning: Making Smarter Selections