A New Softmax Operator for Reinforcement Learning - GA将？開発日記～王理のその先へ～