Advances in Intelligent MultiMedia: MultiModal semantic representation

P McKevitt

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Intelligent MultiMedia or MultiModal systems involve the computer processing, understanding and production of inputs and outputs from at least speech, text, and visual information in terms of semantic representations. One of the central questions for these systems is what form of semantic representation should be used. Here, we look at current trends in multimodal semantic representation which are mainly XML and frame- based, relate our experiences in the development of multimodal systems (CHAMELEON and CONFUCIUS) and conclude that producer/consumer, intention (speech acts), semantic-content, and timestamps are four important components of any multimodal semantic representation.
Original languageEnglish
Title of host publicationUnknown Host Publication
EditorsH Sakaki
Place of PublicationMeisei University (Hino Campus), Hino-shi, Tokyo, Japan
PublisherMeisei University
Pages2-13
Number of pages12
Publication statusPublished - Aug 2005
EventProc. of the Pacific Rim International Conference on Computational Linguistics (PACLING-05) - Meisei University (Hino Campus), Hino-shi, Tokyo, Japan
Duration: 1 Aug 2005 → …

Conference

ConferenceProc. of the Pacific Rim International Conference on Computational Linguistics (PACLING-05)
Period1/08/05 → …

Fingerprint Dive into the research topics of 'Advances in Intelligent MultiMedia: MultiModal semantic representation'. Together they form a unique fingerprint.

  • Cite this

    McKevitt, P. (2005). Advances in Intelligent MultiMedia: MultiModal semantic representation. In H. Sakaki (Ed.), Unknown Host Publication (pp. 2-13). Meisei University.