MetaBackdoor: Exploiting Positional Encoding as a Backdoor Attack Surface in LLMs
Rui Wen, Mark Russinovich, Andrew Paverd, Jun Sakuma, Ahmed Salem
May 2026
arXiv | May 2026
Shoaib Ahmed Siddiqui, Radhika Gaonkar, Boris Köpf, David Krueger, Andrew Paverd, Ahmed Salem, Shruti Tople, Lukas Wutschitz, Menglin Xia, Santiago Zanella-Béguelin
Transactions on Machine Learning Research (TMLR) | October 2025
Manuel Costa, Boris Köpf, Aashish Kolluri, Andrew Paverd, Mark Russinovich, Ahmed Salem, Shruti Tople, Lukas Wutschitz, Santiago Zanella-Béguelin
May 2025
arXiv | May 2025
Giovanni Cherubin, Boris Köpf, Andrew Paverd, Shruti Tople, Lukas Wutschitz, Santiago Zanella-Béguelin
USENIX Security Symposium | August 2024
Marlon Tobaben, Aliaksandra Shysheya, John Bronskill, Andrew Paverd, Shruti Tople, Santiago Zanella-Béguelin, Richard Turner, Antti Honkela
Transactions on Machine Learning Research | December 2023, Volume 2023
Santiago Zanella-Béguelin, Lukas Wutschitz, Shruti Tople, Ahmed Salem, Victor Ruehle, Andrew Paverd, Mohammad Naseri, Boris Köpf, Daniel Jones
International Conference on Machine Learning | July 2023
Editors: Barbara Engelhardt, Emma Brunskill, Kyunghyun Cho
Ahmed Salem, Giovanni Cherubin, David Evans, Boris Köpf, Andrew Paverd, Anshuman Suri, Shruti Tople, Santiago Zanella-Béguelin
IEEE Symposium on Security and Privacy | May 2023
Avinash Sudhodanan, Andrew Paverd
31st USENIX Security Symposium | August 2022
Santiago Zanella-Béguelin, Shruti Tople, Andrew Paverd, Boris Köpf
International Conference on Machine Learning | July 2021
Editors: Marina Meila and Tong Zhang
Santiago Zanella-Béguelin, Lukas Wutschitz, Shruti Tople, Victor Ruehle, Andrew Paverd, Olga Ohrimenko, Boris Köpf, Marc Brockschmidt
ACM Conference on Computer and Communication Security (CCS) | November 2020