Explorations In The Foundations Of Value-Based Reinforcement Learning