Multiple-Target Reinforcement Learning with a Single Policy