In this paper, we focus on how values are backpropagated in the search tree and apply complex return strategies from the Reinforcement Learning (RL) literature to MCTS, producing four new MCTS variants.
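
To make the notion of a complex return concrete, the sketch below computes the standard λ-return from the RL literature along a single simulated trajectory. It is illustrative only and is not claimed to be one of the four variants: the function name `lambda_returns`, its parameters, and the assumption that each visited state already carries a value estimate are all hypothetical choices made for this example.

```python
from typing import List


def lambda_returns(
    rewards: List[float],
    values: List[float],
    lam: float,
    gamma: float = 1.0,
) -> List[float]:
    """Compute the lambda-return for every step of one trajectory.

    rewards[t] is the reward received on the transition out of state t.
    values[t] is the current value estimate of state t; values must hold
    one more entry than rewards, the last being the estimate for the final
    state (use 0.0 if that state is terminal).
    These conventions are assumptions of this sketch.
    """
    if len(values) != len(rewards) + 1:
        raise ValueError("values must have exactly len(rewards) + 1 entries")
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")

    returns = [0.0] * len(rewards)
    g = values[-1]  # bootstrap from the final state's estimate
    # Walk backward from the end of the trajectory, as a tree backup would.
    for t in reversed(range(len(rewards))):
        # G_t = r_t + gamma * ((1 - lam) * V(s_{t+1}) + lam * G_{t+1})
        g = rewards[t] + gamma * ((1.0 - lam) * values[t + 1] + lam * g)
        returns[t] = g
    return returns
```

With `lam = 1` this reduces to the plain Monte Carlo return that a standard averaging backup would propagate, and with `lam = 0` it reduces to a one-step bootstrapped target; intermediate values blend the two.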