Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment Y Liu, Y Yao, JF Ton, X Zhang, RGH Cheng, Y Klochkov, MF Taufiq, H Li arXiv preprint arXiv:2308.05374, 2023 | 94 | 2023 |
Conformal Off-Policy Prediction in Contextual Bandits MF Taufiq, JF Ton, R Cornish, YW Teh, A Doucet Conference on Neural Information Processing Systems (NeurIPS 2022), 2022 | 11 | 2022 |
Manifold Restricted Interventional Shapley Values MF Taufiq, P Blöbaum, L Minorics Conference on Artificial Intelligence and Statistics (AISTATS 2023), 2023 | 3 | 2023 |
Dataset Fairness: Achievable Fairness on Your Data With Utility Guarantees MF Taufiq, JF Ton, Y Liu arXiv preprint arXiv:2402.17106, 2024 | | 2024 |
Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits MF Taufiq, A Doucet, R Cornish, JF Ton Conference on Neural Information Processing Systems (NeurIPS 2023), 2023 | | 2023 |
Causal Falsification of Digital Twins R Cornish, MF Taufiq, A Doucet, C Holmes arXiv preprint arXiv:2301.07210, 2023 | | 2023 |