SentinelBench: A Benchmark for Long-Running Monitoring Agents
Matheus Kunzler Maldaner, Adam Fourney, Amanda Swearngin, Hussein Mozannar, Gagan Bansal, Maya Murad, Rafah Hosn, Saleema Amershi
June 2026
arXiv | June 2026
Gagan Bansal, Wenyue Hua, Zezhou Huang, Adam Fourney, Amanda Swearngin, Will Epperson, Tyler Payne, Jake Hofman, Brendan Lucier, Chinmay Singh, Markus Mobius, Akshay Nambi, Archana Yadav, Kevin Gao, David Rothschild, Aleksandrs Slivkins, Daniel G. Goldstein, Hussein Mozannar, Nicole Immorlica, Maya Murad, Matthew Vogel, Subbarao Kambhampati, Eric Horvitz, Saleema Amershi
MSR-TR-2025-50 | October 2025
Published by Microsoft
Hussein Mozannar, Gagan Bansal, Cheng Tan, Adam Fourney, Victor Dibia, Jingya Chen, Jack Gerrits, Tyler Payne, Matheus Kunzler Maldaner, Madeleine Grunde-McLaughlin, Eric Zhu, Griffin Bassman, Jacob Alber, Peter Chang, Ricky Loynd, Friederike Niedtner, Ece Kamar, Maya Murad, Rafah Hosn, Saleema Amershi
MSR-TR-2025-40 | July 2025
Published by Microsoft
Will Epperson, Gagan Bansal, Victor Dibia, Adam Fourney, Jack Gerrits, Erkang (Eric) Zhu, Saleema Amershi
CHI 2025 | April 2025
Helena Vasconcelos, Gagan Bansal, Adam Fourney, Q. Vera Liao, Jennifer Wortman Vaughan
ToCHI | April 2025, Vol 32(1)
Omar Shaikh, Hussein Mozannar, Gagan Bansal, Adam Fourney, Eric Horvitz
ACL | March 2025
Gagan Bansal, Jennifer Wortman Vaughan, Saleema Amershi, Eric Horvitz, Adam Fourney, Hussein Mozannar, Victor Dibia, Daniel S. Weld
MSR-TR-2024-53 | December 2024
Published by Microsoft
Adam Fourney, Gagan Bansal, Hussein Mozannar, Cheng Tan, Eduardo Salinas, Erkang (Eric) Zhu, Friederike Niedtner, Grace Proebsting, Griffin Bassman, Jack Gerrits, Jacob Alber, Peter Chang, Ricky Loynd, Robert West, Victor Dibia, Ahmed Awadallah, Ece Kamar, Rafah Hosn, Saleema Amershi
MSR-TR-2024-47 | November 2024
Published by Microsoft
Victor Dibia, Jingya Chen, Gagan Bansal, Suff Syed, Adam Fourney, Erkang (Eric) Zhu, Chi Wang, Saleema Amershi
August 2024
Preprint
Qingyun Wu, Gagan Bansal, Jieyu Zhang, Yiran Wu, Beibin Li, Erkang (Eric) Zhu, Li Jiang, Xiaoyun Zhang, Shaokun Zhang, Ahmed Awadallah, Ryen W. White, Doug Burger, Chi Wang
COLM 2024 | August 2024
Best Paper, LLM Agents Workshop ICLR'24
Hussein Mozannar, Gagan Bansal, Adam Fourney, Eric Horvitz
CHI 2024 | May 2024
Honorable Mention, CHI 2024
Hussein Mozannar, Gagan Bansal, Adam Fourney, Eric Horvitz
AAAI 2024 | January 2024
Valerie Chen, Q. Vera Liao, Jennifer Wortman Vaughan, Gagan Bansal
Proc. ACM Hum.-Comput. Interact. | October 2023, Vol 7
Article 370
Victor Dibia, Adam Fourney, Gagan Bansal, Forough Poursabzi, Han Liu, Saleema Amershi
ACL 2023 | June 2023
Helena Vasconcelos, Gagan Bansal, Adam Fourney, Q. Vera Liao, Jennifer Wortman Vaughan
NeurIPS Workshop on Human-Centered AI | October 2022
Victor Dibia, Adam Fourney, Gagan Bansal, Forough Poursabzi, Han Liu, Saleema Amershi
ACL 2023 | October 2022
Gagan Bansal, Tongshuang Wu, Joyce Zhou, Raymond Fok, Besmira Nushi, Ece Kamar, Marco Tulio Ribeiro, Daniel S. Weld
CHI | May 2021
Gagan Bansal, Besmira Nushi, Ece Kamar, Eric Horvitz, Dan Weld
AAAI 2021 | February 2021
Gagan Bansal, Besmira Nushi, Ece Kamar, Walter Lasecki, Dan Weld, Eric Horvitz
HCOMP 2019 | October 2019
Gagan Bansal, Besmira Nushi, Ece Kamar, Dan Weld, Walter Lasecki, Eric Horvitz
AAAI Conference on Artificial Intelligence | January 2019
Daniel S. Weld, Gagan Bansal
Communications of ACM | October 2018