2015-02-14 Parameter-exploring Policy Gradients 後で読む http://www.is.tuebingen.mpg.de/fileadmin/user_upload/files/publications/Neural-Networks-2010-Sehnke_[0].pdf