Abstract: Referring Multi-Object Tracking (RMOT) aims to dynamically track an arbitrary number of referred targets in a video sequence according to a language expression. Previous methods mainly ...
Abstract: With the growing demand for underwater recognition systems, underwater visual object tracking has recently received wide attention. The visible light scene analysis in a liquid environment ...
This is the official repo for Visual Contrastive Decoding, a simple, training-free method for mitigating hallucinations in LVLMs during decoding without utilizing ...