Trustworthy LLMs: A survey and guideline for evaluating large language models' alignment Y Liu, Y Yao, JF Ton, X Zhang, R Guo, H Cheng, Y Klochkov, MF Taufiq, ... arXiv preprint arXiv:2308.05374, 2023 | 226 | 2023 |
Conformal Off-Policy Prediction in Contextual Bandits MF Taufiq, JF Ton, R Cornish, YW Teh, A Doucet Conference on Neural Information Processing Systems (NeurIPS 2022), 2022 | 19 | 2022 |
Manifold Restricted Interventional Shapley Values MF Taufiq, P Blöbaum, L Minorics Conference on Artificial Intelligence and Statistics (AISTATS 2023), 2023 | 10 | 2023 |
Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits MF Taufiq, A Doucet, R Cornish, JF Ton Conference on Neural Information Processing Systems (NeurIPS 2023), 2023 | 3 | 2023 |
Understanding Chain-of-Thought in LLMs through Information Theory JF Ton, MF Taufiq, Y Liu arXiv preprint arXiv:2411.11984, 2024 | | 2024 |
Achievable Fairness on Your Data With Utility Guarantees MF Taufiq, JF Ton, Y Liu Conference on Neural Information Processing Systems (NeurIPS 2024), 2024 | | 2024 |
Causal Falsification of Digital Twins R Cornish, MF Taufiq, A Doucet, C Holmes arXiv preprint arXiv:2301.07210, 2023 | | 2023 |