Economic hierarchical Q-learning