Publication Online Experiential Learning for Language Models Tianzhu Ye, Li Dong, Qingxiu Dong, Xun Wu, Shaohan Huang, Furu Wei arXiv: Computation and Language | March 2026, Vol 2603(16856)
Publication MSCCL++: Rethinking GPU Communication Abstractions for AI Inference Changho Hwang, Peng Cheng, Roshan Dathathri, Abhinav Jangda, Saeed Maleki, Madan Musuvathi, Olli Saarikivi, Aashaka Shah, Ziyue Yang, Binyang Li, Caio Rocha, Qinghua Zhou, Mahdieh Ghazimirsaeed, Sreevatsa Anantharamu, Jithin Jose ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS) | March 2026 Best Paper – Honorable Mention
Publication Reasoning-Driven Multimodal LLM for Domain Generalization Zhipeng Xu, Zilong Wang, Xinyang Jiang, Dongsheng Li, De Cheng, Nannan Wang February 2026 arXiv | February 2026
Publication AI Evaluation Should Require Standardized Item-Level Data Releases Hang Jiang, Susu Zhang, Dongyao Zhu, Yuzhuo Bai, Sang Truong, Xiaoyuan Yi, Sanmi Koyejo, Xing Xie, Ziang Xiao February 2026 arXiv | February 2026
Publication A Tale of Two Graphs: Separating Knowledge Exploration from Outline Structure for Open-Ended Deep Research Zhuofan Shi, M. Ma, Zekun Yao, Fangkai Yang, Jue Zhang, Dongge Han, Victor Ruhle, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang ICML 2026 | February 2026
Publication Evaluating LLM Reasoning Beyond Correctness and CoT Soheil Abbasloo February 2026 February 2026
Publication Position: Beyond Prediction: Toward Verifiable Physiological Waveform Reasoning with Foundation Models and Agentic LLMs Xiaoda Wang, Ching Chang, Defu Cao, Kaiqiao Han, Fang Sun, Yue Huang, Minxiao Wang, Chang Xu, Xiao Luo, Runze Yan, Xiangliang Zhang, Xiao Hu, Yan Liu, Yizhou Sun, Wei Wang, Carl Yang 2026 ICML 2026 | February 2026
Publication Temperature as a Meta-Policy: Adaptive Temperature in LLM Reinforcement Learning Haoran Dang, Cuiling Lan, Hai Wan, Xibin Zhao, Yan Lu ICLR 2026 | February 2026
Publication TestExplora: Benchmarking LLMs for Proactive Bug Discovery via Repository-Level Test Generation Steven Liu, Jane Luo, Xin Zhang, Aofan Liu, Hao Liu, J. Wu, Ziyang Huang, Yangyu Huang, Yu Kang, Scarlett Li February 2026 arXiv | February 2026 Github
Publication Beyond Correctness: Learning Robust Reasoning via Transfer Hyunseok Lee, Soheil Abbasloo, Jihoon Tack, Jinwoo Shin February 2026 February 2026