Pošalji tekstualnu poruku: What, where and how many? Combining object detectors and CRFs