Publication TextGround4M: A Prompt-Aligned Dataset for Layout-Aware Text Rendering Dongxing Mao, Yilin Wang, Linjie Li, Zhengyuan Yang, Alex Jinpeng Wang AAAI Conference on Artificial Intelligence | March 2026
Publication Towards On-Policy SFT: Distribution Discriminant Theory and its Applications in LLM Training Miaosen Zhang, Yishan Liu, Shuxia Lin, Xu Yang, Qi Dai, Chong Luo, Wei Jiang, Peng-Fei Hou, Anxiang Zeng, Xin Geng, Baining Guo ICML 2026 | March 2026
Publication Spatially Grounded Long-Horizon Task Planning in the Wild Sehun Jung, Hyunjee Song, Donghyun Kim, Reuben Tan, Jianfeng Gao, Yong Jae Lee, Donghyun Kim March 2026 arXiv | March 2026
Publication FlashMotion: Few-Step Controllable Video Generation with Trajectory Guidance Quanhao Li, Zhen Xing, Rui Wang, Haidong Cao, Qiaofei Dai, Daoguo Dong, Zuxuan Wu March 2026 arXiv | March 2026
Publication ActPrompt: In-Domain Feature Adaptation via Action Cues for Video Temporal Grounding Yubin Wang, Xinyang Jiang, De Cheng, Dongsheng Li, Cairong Zhao IEEE Transactions on Image Processing | March 2026, Vol 35: pp. 2714-2726
Publication StreamReady: Learning What to Answer and When in Long Streaming Videos Shehreen Azad, Vibhav Vineet, Y. Rawat CVPR 2026 | March 2026
Publication X-AVDT: Audio-Visual Cross-Attention for Robust Deepfake Detection Youngseo Kim, Kwan Yun, Seokhyeon Hong, Sihun Cha, Colette Suhjung Koo, J. Noh March 2026 arXiv | March 2026
Publication Compression as Adaptation: Implicit Visual Representation with Diffusion Foundation Models Jiajun He, Zongyu Guo, Zhaoyang Jia, Xiaoyi Zhang, Jiahao Li, Xiao Li, Bin Li, José Miguel Hernández-Lobato, Yan Lu 2026 ICML 2026 | March 2026
Publication Can Vision Language Models Assess Graphic Design Aesthetics? A Benchmark, Evaluation, and Dataset Perspective Arctanx An, Shizhao Sun, Danqing Huang, Mingxi Cheng, Yang Gao, Ji Li, Yu Qiao, Jiang Bian March 2026 arXiv | March 2026