Abstract: Zero-shot semantic segmentation continues to face challenges in effectively handling unseen object classes, despite its critical applications in medical imaging, autonomous driving, and ...
A fundamental challenge for GUI agents is robustly grounding natural language instructions, which requires not only precise spatial alignment (locating elements accurately) but also correct semantic ...
InstaScene: Towards Complete 3D Instance Decomposition and Reconstruction from Cluttered Scenes. Zesong Yang, Bangbang Yang, Wenqi Dong, Chenxuan Cao, Liyuan Cui, Yuewen Ma, Zhaopeng Cui, Hujun Bao. It ...
Abstract: In recent years, the semantic segmentation of multimodal remote-sensing images using convolutional methods has received significant attention. Owing to the localized nature of convolutional ...