Insights into the Challenges and Opportunities of Large Multi-Modal Models for Blind and Low Vision Users: CLIP
PARIKSHA: A Scalable, Democratic, Transparent Evaluation Platform for Assessing Indic Large Language Models
Publication MemGym: a Long-Horizon Memory Environment for LLM Agents Wujiang Xu, Yu Wang, Kai Mei, Kaiqu Liang, Zhenting Wang, Ming Jin, Han Zhang, Shi-Xiong Zhang, Wenyue Hua, Sambit Sahu, Dimitris N. Metaxas May 2026 arXiv | May 2026
Publication STAR-P’olyaMath: Multi-Agent Reasoning under Persistent Meta-Strategic Supervision Jiaao Wu, Xian Zhang, Han Liu, Sophia Zhang, Fan Yang, Yinpeng Dong May 2026 arXiv | May 2026
Publication Multi-agent AI systems outperform human teams in creativity Tiancheng Hu, Yixuan Jiang, Haotian Li, Jos'e Hern'andez-Orallo, Xing Xie, Nigel Collier, David Stillwell, Luning Sun May 2026 arXiv | May 2026
Publication GroupMemBench: Benchmarking LLM Agent Memory in Multi-Party Conversations Jingbo Yang, Kwei-Herng Lai, Xiaowen Wang, Shiyu Chang, Y. Harari, Evgeniy Gabrilovich May 2026 arXiv | May 2026
Publication Auditing Agent Harness Safety Chengzhi Liu, Yicheng Guo, Yepeng Liu, Yuzhe Yang, Qianqi Yan, Xuandong Zhao, Wenyue Hua, Shengchao Liu, Sharon Li, Yuheng Bu, Xin Wang May 2026 arXiv | May 2026
Publication LIFT: Last-Mile Fine-Tuning for Table Explicitation Divij Khaitan, Ashish Tiwari May 2026 arXiv | May 2026
Publication Inducing Artificial Uncertainty in Language Models Sophia Hager, Simon Zeng, Nicholas Andrews May 2026 arXiv | May 2026
Publication FIND: Toward Multimodal Financial Reasoning and Question Answering for Indic Languages Sarmistha Das, V. Vishal, Syed Ibrahim Ahmad, Manish Gupta, Sriparna Saha May 2026 arXiv | May 2026
Publication GEAR: Granularity-Adaptive Advantage Reweighting for LLM Agents via Self-Distillation Sijia Li, Yuchen Huang, Zifan Liu, Yanping Li, Jingjing Fu, Li Zhao, Jiang Bian, Ling Zhang, Jun Zhang, Rui Wang May 2026 arXiv | May 2026
Publication Agent-BRACE: Decoupling Beliefs from Actions in Long-Horizon Tasks via Verbalized State Uncertainty Joykirat Singh, Zaid Khan, Archiki Prasad, Junyan Chen, Akshay Nambi, Hyunji Lee, Elias Stengel-Eskin, Mohit Bansal May 2026 arXiv | May 2026