Duygulu, P., Bastan, M. & Forsyth, D. (2006) Translating images to words for recognizing objects in large image and video collections. TOWARD CATEGORY-LEVEL OBJECT RECOGNITION.