Tabish Rashid

Cited by

	All	Since 2019
Citations	4651	4600
h-index	10	10
i10-index	10	10

1500

750

375

1125

201820192020202120222023202419 132 308 678 1059 1493 925

Public access

View all

5 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, WaymoVerified email at cs.ox.ac.uk
Mikayel SamvelyanMeta AI, UCLVerified email at meta.com
Gregory FarquharDeepMindVerified email at google.com
Christian Schroeder de WittUniversity of OxfordVerified email at robots.ox.ac.uk
Jakob FoersterAssociate Professor, University of OxfordVerified email at eng.ox.ac.uk
Philip TorrProfessor, University of OxfordVerified email at eng.ox.ac.uk
Chia-Man HungUniversity of OxfordVerified email at robots.ox.ac.uk
Nantas NardelliStealthVerified email at arbitrarygravitas.com
Jason R.C. NurseReader in Cyber Security, University of KentVerified email at kent.ac.uk
Ioannis AgrafiotisComputer Science Department, University of OxfordVerified email at cs.ox.ac.uk

Tabish Rashid

Microsoft Research

Verified email at microsoft.com


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning T Rashid, M Samvelyan, CS De Witt, G Farquhar, J Foerster, S Whiteson Journal of Machine Learning Research 21(178):1−51, 2020, 2020	2334	2020
The StarCraft Multi-Agent Challenge M Samvelyan, T Rashid, CS de Witt, G Farquhar, N Nardelli, TGJ Rudner, ... AAMAS 2019, 2019	975	2019
Maven: Multi-agent variational exploration A Mahajan, T Rashid, M Samvelyan, S Whiteson Advances in Neural Information Processing Systems, 7613-7624, 2019	389	2019
Weighted QMIX: Expanding Monotonic Value Function Factorisation T Rashid, G Farquhar, B Peng, S Whiteson Advances in Neural Information Processing Systems 33, 2020, 2020	339*	2020
Facmac: Factored multi-agent centralised policy gradients B Peng, T Rashid, C Schroeder de Witt, PA Kamienny, P Torr, W Böhmer, ... Advances in Neural Information Processing Systems 34, 12208-12221, 2021	194	2021
A new take on detecting insider threats: exploring the use of hidden markov models T Rashid, I Agrafiotis, JRC Nurse Proceedings of the 8th ACM CCS International Workshop on Managing Insider …, 2016	185	2016
Imitating human behaviour with diffusion models T Pearce, T Rashid, A Kanervisto, D Bignell, M Sun, R Georgescu, ... arXiv preprint arXiv:2301.10677, 2023	120	2023
Optimistic Exploration even with a Pessimistic Initialisation T Rashid, B Peng, W Boehmer, S Whiteson International Conference on Learning Representations, 2019	47	2019
Exploration with unreliable intrinsic reward in multi-agent reinforcement learning W Böhmer, T Rashid, S Whiteson arXiv preprint arXiv:1906.02138, 2019	30	2019
Regularized softmax deep multi-agent q-learning L Pan, T Rashid, B Peng, L Huang, S Whiteson Advances in Neural Information Processing Systems 34, 1365-1377, 2021	27	2021
Estimating α-Rank by Maximizing Information Gain T Rashid, C Zhang, K Ciosek Proceedings of the AAAI Conference on Artificial Intelligence 35 (6), 5673-5681, 2021	9	2021
Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games L Schäfer, L Jones, A Kanervisto, Y Cao, T Rashid, R Georgescu, ...	2	2023
Aligning Agents like Large Language Models A Jelley, Y Cao, D Bignell, S Devlin, T Rashid		2023
Exploration and value function factorisation in single and multi-agent reinforcement learning T Rashid University of Oxford, 2021		2021
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning T Rashid, M Samvelyan, CS de Witt, G Farquhar, J Foerster, S Whiteson Proceedings of the 35th International Conference on Machine Learning, 2018		2018

The system can't perform the operation now. Try again later.

Articles 1–15

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors