Abstract: Recent advances in LiDAR representation learning with limited annotations show strong promise. Existing well-performing methods mainly focus on distilling the 2D representation into the 3D ...
Abstract: Recent neural models for video captioning are typically built using a framework that combines a pre-trained visual encoder with a large language model (LLM) decoder. However, large language ...
This repository provides code and workflows to test several state-of-the-art deep learning algorithms for vehicle detection (including YOLOX, SalsaNext, and RandLA-Net) on a Flash Lidar dataset. The ...