Reinforcement Learning Design