Reinforcement learning of simple indirect mechanisms