Reinforcement learning - LIVETUTU