Measuring and mitigating name biases in neural machine translation J Wang, B Rubinstein, T Cohn Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022 | 39 | 2022 |
Putting words into the system's mouth: A targeted attack on neural machine translation using monolingual data poisoning J Wang, C Xu, F Guzmán, A El-Kishky, Y Tang, BIP Rubinstein, T Cohn arXiv preprint arXiv:2107.05243, 2021 | 35 | 2021 |
Targeted poisoning attacks on black-box neural machine translation C Xu, J Wang, Y Tang, F Guzmán, BIP Rubinstein, T Cohn arXiv preprint arXiv:2011.00675, 2020 | 34* | 2020 |
Mitigating backdoor poisoning attacks through the lens of spurious correlation X He, Q Xu, J Wang, B Rubinstein, T Cohn arXiv preprint arXiv:2305.11596, 2023 | 13 | 2023 |
Mitigating data poisoning in text classification with differential privacy C Xu, J Wang, F Guzmán, B Rubinstein, T Cohn Findings of the Association for Computational Linguistics: EMNLP 2021, 4348-4356, 2021 | 10 | 2021 |
As easy as 1, 2, 3: Behavioural testing of NMT systems for numerical translation J Wang, C Xu, F Guzmán, A El-Kishky, BIP Rubinstein, T Cohn arXiv preprint arXiv:2107.08357, 2021 | 8 | 2021 |
IMBERT: Making BERT immune to insertion-based backdoor attacks X He, J Wang, B Rubinstein, T Cohn arXiv preprint arXiv:2305.16503, 2023 | 7 | 2023 |
TuBA: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning X He, J Wang, Q Xu, P Minervini, P Stenetorp, BIP Rubinstein, T Cohn arXiv preprint arXiv:2404.19597, 2024 | 4 | 2024 |
Backdoor Attack on Multilingual Machine Translation J Wang, Q Xu, X He, BIP Rubinstein, T Cohn arXiv preprint arXiv:2404.02393, 2024 | 3 | 2024 |
Don't Throw Away Data: Better Sequence Knowledge Distillation J Wang, E Briakou, H Dadkhahi, R Agarwal, C Cherry, T Cohn arXiv preprint arXiv:2407.10456, 2024 | 2 | 2024 |
Foiling training-time attacks on neural machine translation systems J Wang, X He, B Rubinstein, T Cohn Findings of the Association for Computational Linguistics: EMNLP 2022, 5906-5913, 2022 | 2 | 2022 |
Detecting Backdoors in Deep Text Classifiers Y Guo, J Wang, T Cohn arXiv preprint arXiv:2210.11264, 2022 | 2 | 2022 |
Seep: Training dynamics grounds latent representation search for mitigating backdoor poisoning attacks X He, Q Xu, J Wang, BIP Rubinstein, T Cohn Transactions of the Association for Computational Linguistics 12, 996-1010, 2024 | | 2024 |