Title | Authors | Venue | Cited by | Year |
Shapley Q-value: A local reward approach to solve global reward games | J Wang, Y Zhang, TK Kim, Y Gu | Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 7285-7292, 2020 | 105 | 2020 |
Modelling hierarchical structure between dialogue policy and natural language generator with option framework for task-oriented dialogue system | J Wang, Y Zhang, TK Kim, Y Gu | arXiv preprint arXiv:2006.06814, 2020 | 45 | 2020 |
Shaq: Incorporating shapley value theory into multi-agent q-learning | J Wang, Y Zhang, Y Gu, TK Kim | Advances in Neural Information Processing Systems 35, 5941-5954, 2022 | 25 | 2022 |
WULAI-QA: Web understanding and learning with AI towards document-based question answering against COVID-19 | Y Zhang, X Zhang, Y Hu, G Wang, R Yan | Proceedings of the 14th ACM International Conference on Web Search and Data …, 2021 | 5 | 2021 |
Constrained Reinforcement Learning with Smoothed Log Barrier Function | B Zhang, Y Zhang, L Frison, T Brox, J Bödecker | arXiv preprint arXiv:2403.14508, 2024 | 3 | 2024 |
Robust reinforcement learning in continuous control tasks with uncertainty set regularization | Y Zhang, J Wang, J Boedecker | Conference on Robot Learning, 1400-1424, 2023 | 3 | 2023 |
Geometric regularity with robot intrinsic symmetry in reinforcement learning | S Yan, Y Zhang, B Zhang, J Boedecker, W Burgard | arXiv preprint arXiv:2306.16316, 2023 | 3 | 2023 |
Improving the Efficiency and Efficacy of Multi-Agent Reinforcement Learning on Complex Railway Networks with a Local-Critic Approach | Y Zhang, U Deekshith, J Wang, J Boedecker | Proceedings of the International Conference on Automated Planning and …, 2024 | | 2024 |
Learning Continuous Control with Geometric Regularity from Robot Intrinsic Symmetry | S Yan, B Zhang, Y Zhang, J Boedecker, W Burgard | 2024 IEEE International Conference on Robotics and Automation (ICRA), 49-55, 2024 | | 2024 |
UDUC: An Uncertainty-driven Approach for Learning-based Robust Control | Y Zhang, J Hoffmann, J Boedecker | arXiv preprint arXiv:2405.02598, 2024 | | 2024 |
Open Ad Hoc Teamwork with Cooperative Game Theory | J Wang, Y Li, Y Zhang, W Pan, S Kaski | arXiv preprint arXiv:2402.15259, 2024 | | 2024 |
Incorporating Recurrent Reinforcement Learning into Model Predictive Control for Adaptive Control in Autonomous Driving | Y Zhang, J Boedecker, C Li, G Zhou | arXiv preprint arXiv:2301.13313, 2023 | | 2023 |