Published January 1, 2016 | Version v1
Conference paper Open

The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents

  • 1. Univ Paris Saclay, Univ Paris Sud, CNRS, LIMSI, F-91405 Orsay, France
  • 2. Univ Grenoble Alpes, LIG, Grenoble, France
  • 3. LIST, Esch Sur Alzette, Luxembourg
  • 4. ITU, Istanbul, Turkey
  • 5. IMMI CNRS, Orsay, France
  • 6. UPC, Barcelona, Spain

Description

In this paper, we describe the organization and implementation of the CAMOMILE collaborative annotation framework for multimodal, multimedia, multilingual (3M) data. Given the variety of analyses that can be performed on 3M data, the structure of the server was kept intentionally simple to preserve its generality, and it relies on standard Web technologies. Layers of annotations, where an annotation is data associated with a media fragment from the corpus, are stored in a database and can be managed through standard interfaces with authentication. Interfaces tailored to a specific task can then be developed in an agile way, relying on simple but reliable services for managing the centralized annotations. We then present our implementation of an active learning scenario for person annotation in video, relying on the CAMOMILE server; during a dry-run experiment, the manual annotation of 716 speech segments was propagated to 3504 labeled tracks. The code of the CAMOMILE framework is distributed as open source.
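The notion of an annotation layer described above can be illustrated with a minimal in-memory sketch. This is not the CAMOMILE API: the class names (`Annotation`, `Layer`), the field names, the medium identifiers and the fragment format (start and end times in seconds) are all assumptions made for illustration. The actual server stores layers in a database and exposes them through authenticated interfaces.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class Annotation:
    """Data attached to a fragment of one medium (hypothetical model)."""
    medium: str               # hypothetical medium identifier, e.g. "video_01"
    fragment: Dict[str, Any]  # assumed format: {"start": ..., "end": ...} in seconds
    data: Any                 # free-form payload, e.g. a person name


@dataclass
class Layer:
    """A named collection of annotations of the same kind (hypothetical model)."""
    name: str
    annotations: List[Annotation] = field(default_factory=list)

    def add(self, medium: str, fragment: Dict[str, Any], data: Any) -> Annotation:
        """Create an annotation, store it in this layer, and return it."""
        annotation = Annotation(medium=medium, fragment=fragment, data=data)
        self.annotations.append(annotation)
        return annotation

    def for_medium(self, medium: str) -> List[Annotation]:
        """Return the annotations of this layer that refer to the given medium."""
        return [a for a in self.annotations if a.medium == medium]


# Usage: a "speaker" layer holding person labels for speech segments.
speakers = Layer(name="speaker")
speakers.add("video_01", {"start": 12.0, "end": 15.5}, "person_A")
speakers.add("video_02", {"start": 3.0, "end": 4.2}, "person_B")
print(len(speakers.for_medium("video_01")))
```

Keeping the payload (`data`) and the fragment free-form mirrors the stated design goal of a simple, generic server, leaving task-specific structure to the client interfaces.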
