iclr2022专题

【多模态】18、ViLD | 通过对视觉和语言知识蒸馏来实现开集目标检测（ICLR2022）

文章目录一、背景二、方法2.1 对新类别的定位 Localization2.2 使用 cropped regions 进行开放词汇检测2.3 ViLD 三、效果论文：Open-vocabulary Object Detection via Vision and Language Knowledge Distillation 代码：https://github.com/t