Publication | From Articulated Kinematics to Routed Visual Control for Action-Conditioned Surgical Video Generation | Bohan Li, Shuo Yang, Bao Peng, Xianda Guo, Erli Zhang, Youqi Tao, Junfeng Duan, Daguang Xu, Qi Dou, Xin Jin, Wenjun Zeng, Hao Zhao, Yueming Jin | arXiv | May 2026
Publication | Unifying Scientific Communication: Fine-Grained Correspondence Across Scientific Media | Megha Mariam K.M, Vineeth N Balasubramanian, C. V. Jawahar | CVPR 2026 | May 2026
Publication | Morphology prediction of small nanoparticles in any orientation from single electron micrographs | Henrik Eliasson, Fangjinhua Wang, Xi (Ada) Wang, Dániel Baráth, Marc Pollefeys, Rolf Erni | npj Computational Materials | May 2026
Publication | Understanding Annotator Safety Policy with Interpretability | Alexander X. Oesterling, Donghao Ren, Yannick Assogba, Dominik Moritz, Sunnie S. Y. Kim, Leon Gatys, Fred Hohman | arXiv | May 2026
Publication | Audio-Visual Intelligence in Large Foundation Models | Youxuan Qin, Kaihong Liu, Shengqiong Wu, Kai Wang, Shijian Deng, Yapeng Tian, Junbin Xiao, Yazhou Xing, Yinghao Ma, Bobo Li, Roger Zimmermann, Lei Cui, Furu Wei, Jiebo Luo, Hao Fei | arXiv | May 2026
Publication | WATCH: Wide-Area Archaeological Site Tracking for Change Detection | Girmaw Abebe Tadesse, Titien Bartette, Andrew Hassanali, Allen Kim, Jonathan Chemla, Andrew Zolli, Yves Ubelmann, Caleb Robinson, Inbal Becker-Reshef, Juan M. Lavista Ferres | arXiv | May 2026
Publication | Exploring Data-Free LoRA Transferability for Video Diffusion Models | Yuchen Wang, Wenliang Zhong, Lichen Bai, Zikai Zhou, Shitong Shao, Bo-Xun Cheng, Shuo Chen, Shuo Yang, Zeke Xie | arXiv | May 2026
Publication | Modeling Subjective Urban Perception with Human Gaze | Lin Che, Xi (Ada) Wang, Marc Pollefeys, Konrad Schindler, M. Raubal, Peter Kiefer | arXiv | May 2026
Publication | Moving Beyond Sparse Grounding with Complete Screen Parsing Supervision | A. Said Gurbuz, Sunghwan Hong, Ahmed Nassar, Marc Pollefeys, Peter Staar | ICML 2026 | May 2026
Publication | Scaling Video Understanding via Compact Latent Multi-Agent Collaboration | Kerui Chen, Jinglu Wang, Jianrong Zhang, Ming Li, Yang Lu, Hehe Fan | arXiv | May 2026