Follow
Soichiro Nishimori
Soichiro Nishimori
Verified email at g.ecc.u-tokyo.ac.jp
Title
Cited by
Cited by
Year
Pgx: Hardware-accelerated parallel game simulators for reinforcement learning
S Koyamada, S Okano, S Nishimori, Y Murata, K Habara, H Kita, S Ishii
Advances in Neural Information Processing Systems 36, 2024
82024
Mjx: A framework for Mahjong AI research
S Koyamada, K Habara, N Goto, S Okano, S Nishimori, S Ishii
2022 IEEE Conference on Games (CoG), 504-507, 2022
22022
Leveraging Domain-Unlabeled Data in Offline Reinforcement Learning across Two Domains
S Nishimori, XQ Cai, J Ackermann, M Sugiyama
arXiv preprint arXiv:2404.07465, 2024
2024
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees
T Kitamura, T Kozuno, M Kato, Y Ichihara, S Nishimori, A Sannai, ...
arXiv preprint arXiv:2401.17780, 2024
2024
End-to-End Policy Gradient Method for POMDPs and Explainable Agents
S Nishimori, S Koyamada, S Ishii
arXiv preprint arXiv:2304.09769, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–5