Follow
Mridul Agarwal
Title
Cited by
Cited by
Year
Multi-agent multi-armed bandits with limited communication
M Agarwal, V Aggarwal, K Azizzadenesheli
arXiv preprint arXiv:2102.08462, 2021
152021
Transferring dexterous surgical skill knowledge between robots for semi-autonomous teleoperation
MM Rahman, N Sanchez-Tamayo, G Gonzalez, M Agarwal, V Aggarwal, ...
2019 28th IEEE International Conference on Robot and Human Interactive …, 2019
152019
Reinforcement learning for joint optimization of multiple rewards
M Agarwal, V Aggarwal
arXiv preprint arXiv:1909.02940, 2019
14*2019
Stochastic Top K-Subset Bandits with Linear Space and Non-Linear Feedback with Applications to Social Influence Maximization
M Agarwal, V Aggarwal, AK Umrawal, CJ Quinn
ACM/IMS Transactions on Data Science (TDS) 2 (4), 1-39, 2022
11*2022
Achieving zero constraint violation for constrained reinforcement learning via primal-dual approach
Q Bai, AS Bedi, M Agarwal, A Koppel, V Aggarwal
Proceedings of the AAAI Conference on Artificial Intelligence 36 (4), 3682-3689, 2022
92022
Sartres: a semi-autonomous robot teleoperation environment for surgery
MM Rahman, MV Balakuntala, G Gonzalez, M Agarwal, U Kaur, ...
Computer Methods in Biomechanics and Biomedical Engineering: Imaging …, 2021
82021
Grasping region identification in novel objects using microsoft kinect
A Rai, PK Patchaikani, M Agarwal, R Gupta, L Behera
International Conference on Neural Information Processing, 172-179, 2012
72012
Communication efficient parallel reinforcement learning
M Agarwal, B Ganguly, V Aggarwal
Uncertainty in Artificial Intelligence, 247-256, 2021
62021
On the approximation of cooperative heterogeneous multi-agent reinforcement learning (marl) using mean field control (mfc)
WU Mondal, M Agarwal, V Aggarwal, SV Ukkusuri
Journal of Machine Learning Research 23 (129), 1-46, 2022
52022
Regret Guarantees for Model-Based Reinforcement Learning with Long-Term Average Constraints
M Agarwal, Q Bai, V Aggarwal
The 38th Conference on Uncertainty in Artificial Intelligence, 2022
4*2022
Concave Utility Reinforcement Learning with Zero-Constraint Violations
M Agarwal, Q Bai, V Aggarwal
arXiv preprint arXiv:2109.05439, 2021
42021
Deserts: Delay-tolerant semi-autonomous robot teleoperation for surgery
G Gonzalez, M Agarwal, MV Balakuntala, MM Rahman, U Kaur, ...
2021 IEEE International Conference on Robotics and Automation (ICRA), 12693 …, 2021
42021
Joint Optimization of Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm
Q Bai, M Agarwal, V Aggarwal
arXiv preprint arXiv:2105.14125, 2021
42021
Escaping saddle points for zeroth-order non-convex optimization using estimated gradient descent
Q Bai, M Agarwal, V Aggarwal
2020 54th Annual Conference on Information Sciences and Systems (CISS), 1-6, 2020
42020
Multi-Objective Reinforcement Learning with Non-Linear Scalarization
M Agarwal, V Aggarwal, T Lan
Proceedings of the 21st International Conference on Autonomous Agents and …, 2022
32022
Reinforcement Learning for Mean-Field Game
M Agarwal, V Aggarwal, A Ghosh, N Tiwari
Algorithms 15 (3), 73, 2022
32022
Dart: Adaptive accept reject algorithm for non-linear combinatorial bandits
M Agarwal, V Aggarwal, AK Umrawal, C Quinn
Proceedings of the AAAI Conference on Artificial Intelligence 35 (8), 6557-6565, 2021
3*2021
Blind decision making: Reinforcement learning with delayed observations
M Agarwal, V Aggarwal
Pattern Recognition Letters 150, 176-182, 2021
22021
Encoders and Decoders for Quantum Expander Codes Using Machine Learning
S Chadaga, M Agarwal, V Aggarwal
arXiv preprint arXiv:1909.02945, 2019
12019
REINFORCEMENT LEARNING FOR CONCAVE OBJECTIVES AND CONVEX CONSTRAINTS
M Agarwal
Purdue University Graduate School, 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–20