ExCyTIn-Bench: Evaluating LLM agents on Cyber Threat Investigation
Yiran Wu, Mauricio Velazco, Andrew Zhao, Manuel Ra'ul Mel'endez Luj'an, Srisuma Movva, Yogesh K Roy, Quang-Huy Nguyen, Roberto Rodriguez, Qingyun Wu, Michael Albada, Julia Kiseleva, Anand Mudgerikar
2026 ICML 2026 | May 2026