Open-world text-specified object counting
Our objective is open-world object counting in images, where the target object class is specified by a text description. To this end, we propose CounTX, a class-agnostic, single-stage model using a transformer decoder counting head on top of pre-trained joint text-image representations. CounTX is ab...
Main Authors: | , , , |
---|---|
Format: | Conference item |
Language: | English |
Published: |
British Machine Vision Association
2023
|