Multi-agent Reinforcement Learning Algorithm