Aggregation and Personalization of Infotainment – An Architecture Illustrated with a Collaborative Scenario

A user-centric architecture of infotainment content adaptation to the context is presented. The architecture uses component technologies in term of business logic and functionalities offered by social web (OpenID, FOAF) and semantic descriptions of MPEG-7 and MPEG-21. Technological alternatives are discussed and adapted to the specificity of vehicle applications in terms of scalability and platform mobility. The requirements of the architecture are motivated by the presentation of a scenario.

Community-Aware Semantic Multimedia Tagging – From Folksonomies to Commsonomies

Tagging is an extremely popular mechanism in many Web 2.0 applications to create metadata supporting search and retrieval of arbitrary multimedia information like digital images, video or audio. However, compared to the syndicated multimedia information itself, the metadata are still “sticky”. They cannot be accessed across several Web 2.0 applications, their semantic enrichment is not possible and they cannot be embedded in the local practices of communities of practice. Here, we present a multimedia tagging mechanism based on the international standard MPEG-7 for community-aware, standard compliant tagging of semantically enriched metadata implemented in the M7MT proof-of-concept application.

Virtual Campfire – A Mobile Social Software for Cross-Media Communities

Multimedia creation, annotation and sharing are challenging tasks especially of interdisciplinary, intercultural and intergenerational communities. We present the mobile social software Virtual Campfire to provide cross-media and cross-community support for de- and recontextualization of multimedia content, employing Web Services, the MPEG-7 standard and Web 2.0 technologies etc. Virtual Campfire can enable communities to set up and maintain multimedia community information systems quickly and easily.

MPEG-7 for Video Quality Description and Summarisation

Manual quality control of audiovisual content in the different steps of the media production, delivery and archiving process causes significant costs. Semi-automatic quality control requires automatisation of quality analysis, quality metadata interoperability and efficient visualisation tools. In this paper we propose the use of MPEG-7 for standard compliant description of media quality metadata and a quality summary visualisation tool which facilitates efficient exploration of visually impaired content by the user.

Imagesemantics: User-Generated Metadata, Content Based Retrieval & Beyond

With the advent of Web 2.0 technologies a new attitude towards processing contents in the Internet has emerged. Nowadays it is a lot easier to create, share and retrieve multimedia contents on the Web. However, with the increasing amount in contents the retrieval process becomes more complex and often leads to inadequate search results. One main reason is summarized easily: Approaches to image clustering and retrieval usually either stick solely to the images’ low-level features or their semantic tags. However, this is frequently inappropriate since the “real” semantics of an image can only be  derived from the combination of low-level features and semantic tags. Consequently, we investigated a more holistic view on semantics based on a system called Imagesemantics that tries to close the gap between both approaches by combining them.

Moides – Mobile Image Description

Rapid improvements in the domain of mobile devices have enabled the processing of multimedia data on mobile camera phones. In this context, new applications and scenarios for these devices have been developed. This paper introduces Moides as a system for mobile camera phones which enables a user to take a picture, use some feature extraction algorithms (e.g., an object recognition algorithm and/or an edge detection algorithm) and send the extracted meta-data in form of a MPEG-7 document to a server which returns semantic information about the content of the image (e.g., a specific building has been detected, etc.).

LAS: A Lightweight Application Server for MPEG-7 Services in Community Engines

The success of multimedia enabled community engines depends on a careful design of the digital media and the related communication/collaboration tools. However, the semantics of the multimedia contents in community communication and collaboration is hard to capture and complex to compute. With the opportunities given by the combination of metadata descriptions standards like MPEG-7 and server-side content-based computations, the manageability of multimedia semantics in community engines becomes more feasible. As a proof of concept, we introduce a Lightweight Application Server for MPEG-7 Services (LAS) which is deployed for a web-based high-level semantic annotation tool for arbitrary images.

Using the MPEG-7 Colour Structure Descriptor for Human Identification in the POLYMNIA System

The POLYMNIA project aims to develop an intelligent cross-media platform for personalized leisure and entertainment in theme parks or recreation venues. The visitors of the theme park are – on request – identified, tracked and recorded individually in order to create personalised photos and videos documenting the visit. One specific problem in this system is the identification of humans across different cameras and under varying environmental conditions. We use the MPEG-7 Colour Structure Descriptor (CSD) for this purpose which has been reported to perform well for this application. We propose a new distance function for the CSD, the weighted city block distance. Evaluation shows that the new matching function yields better results than the distance proposed in the standard.

Virtual Entrepreneurship Lab 2.0: Sharing Entrepreneurial Knowledge by Non-Linear Story-Telling

Digital non-linear story-telling is an ideal approach to share knowledge in large distributed communities and networks. Story-telling combines semantic knowledge and episodic knowledge by explicit problem solving strategies. To facilitate knowledge sharing authoring environments based on multimedia repositories as well as advanced story players are needed. Interoperability with existing information systems can be enhanced by relying on multimedia description standards like MPEG-7 and the use of W3C standard conform community hosting engines. We present an environment for sharing entrepreneurial knowledge by high quality video taped interview sequences from well-known high-tech entrepreneurs.

MEDINA: A Semi-Automatic Dublin Core to MPEG-7 Converter for Collaboration and Knowledge Management in Multimedia Repositories

Knowledge creation processes in the cultural sciences have a discursive nature. The cultural scientists participating in these discourses build a community of practice. Reifications in information systems should support four concepts: (1) Content description by metadata, (2) coverage within standards, (3) repository technologies, and (4) platform independence of applications. By the combination of these concepts information systems support cultural scientists to extract and manage knowledge about high-level semantics of multimedia artifacts in open repositories with metadata annotations. Basic support for the latter can be achieved by a loose classification scheme as in Dublin Core, but with more sophisticated MPEG-7 description elements for time based media. We present a system called MEDINA for semi-automatic Dublin Core to MPEG-7 conversion to maintain media already annotated in Dublin Core.