Visual Basic Component Object Model Object

Visual-Linguistic Feature Alignment With Semantic and Kinematic Guidance for Referring Multi-Object Tracking

Abstract: Referring Multi-Object Tracking (RMOT) aims to dynamically track an arbitrary number of referred targets in a video sequence according to the language expression. Previous methods mainly ...

IEEE

Fine-Grained Underwater Visual Object Tracking via Long-Term Template Refinement

Abstract: With the growing demand for underwater recognition systems, underwater visual object tracking has recently received wide attention. The visible light scene analysis in a liquid environment ...

GitHub

VCD: Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

This is the official repo for Visual Contrastive Decoding, a simple, training-free method for mitigating hallucinations in LVLMs during decoding without utilizing ...
