Given the strong impact of standardization in the sector of networked multimedia, TEMICS, in partnership with industrial companies, seeks to promote its results in standardization (IETF, JPEG, MPEG). While aiming at generic approaches, some of the solutions developed are applied to practical problems in partnership with industry (Thomson, France Télécom) or in the framework of national projects (RNRT COSOCATI, DIPHONET, EIRE, VIP, COSINUS, RIAM-COPARO, RNTL DOMUS-VIDEUM, ANR-ESTIVALE, ANR-MEDIEVALS, ANR-ESSOR, ANR-ICOS-HD, Futurimages) and European projects (IST-BUSMAN, IST-OZONE, IST-DANAE, IST-DISCOVER, IST-SIMILAR, IST-NEWCOM++). The application domains addressed by the project are networked multimedia applications (on wired or wireless Internet) via their various requirements and needs in terms of compression, of resilience to channel noise, or of advanced functionalities such as navigation, protection and authentication.
TEMICS has five Cifre contracts with industrial partners:
TEMICS also supervises DRT projects in collaborations with industrial partners:
The project objective is to design error resilient video coding solutions and joint sourcechannel coding techniques for robust transmission of video signals over wireless IP networks. TEMICS contributes to VIP by designing estimation algorithms for robust decoding of arithmetic codes in presence of channel noise, and by integrating these techniques in a fine grain scalable video coder and decoder. TEMICS also studies redundant wavelet transforms for erasure-resilient coding and decoding of video signals.
The aim of the Diphonet project is to develop protection and tracing tools for applications of professional images delivery over the Internet.The watermarking technique is used to insert copyright information as well as meta-data in order to trace the origin of a picture. The watermarking technique based on game theory reported in sub-section tatouage has been evaluated and optimized against intentional attacks and potential de-synchronizations.
The aim of the project is to develop tools for indexing, content-based access and for advanced visualization of videos. We designed a technique for structuring a video sequence in a set of hyper-scenes, where each hyper-scene gathers similar scenes. This method is based on an initial scene shot decomposition assumed to be available. The criteria used to merge the initial shots are based on the use of 1-D mosaic representations (each initial shot is represented using two 1-D mosaic images). The similarity between two scenes is therefore evaluated by comparing their mosaic images. Such comparisons are done here using global statistical similarities, and a region-based matching criterion. These different criteria are embedded in a decision process. The tool has been applied to MPEG-2 compressed content. The 1-D mosaic images are thus computed using the MPEG-2 motion vectors. Then they are approximated by a polygonal representation in order to simplify the comparison process. The video structure is therefore obtained using a clustering algorithm. Experiments show that satisfactory hyper-scene structuration can be obtained.
The project objectives are to develop image and video compression algorithms with optimized rate-allocation algorithms and supporting fine grain scalability. TEMICS focuses on the scalable video compression aspects. The algorithm setting the basis for our contributions is based on a 3-D spatio-temporal decomposition of each Group Of Frames (GOF). A motion estimation based on a quadtree decomposition is incorporated in the temporal filtering in order to obtain a more efficient temporal de-correlation. The spatial decomposition is similar to the technique used in JPEG-2000. The obtained quantized coefficients are therefore coded using a 3-D EBCOT coding method. Furthermore, two GOF coding modes (Intra and Inter) are introduced to take into account the correlation between two successive GOF. The GOF length is chosen in function of the temporal variations between the successive GOF.
ESTIVALE is a project dealing with the diffusion of video on demand in several contexts: from personal use to professionnal use. People involved in the project are from different communities: signal processing and security, economists and jurist. The goal of the project is to design technical solutions for securing this delivery, through DRM and watermarking tools, and to remain consistent with the economical and juridical studies and demands. In 2009, the TEMICS project-team has contributed on the design of a practical efficient fingerprinting scheme for video.
MEDIEVALS is a project dealing with the diffusion of video or audio on demand. MEDIALIVE developed a software to secure this delivery through visual encryption, and the goal of the project is to add watermarking/fingerprinting in the process to improve the security of the delivered content. In 2009, the TEMICS project-team has contributed on the rationale of the new software architecture. We performed tests to study the best articulation between visual encryption, watermarking and anti-collusion codes.
Compared with predictive coding, distributed video compression holds a number of promises for mobile applications: a more flexible coder/decoder complexity balancing, increased error resilience, and the capability to exploit inter-view correlation, with limited inter-camera communication, in multiview set-ups. However, despite the growing number of research contributions in the past, key questions remain to bring monoview and multi-view DVC to a level of maturity closer to predictive coding: estimating at encoder or decoder the {\em virtual} correlation channel from unknown - or only partially known - data; finding the best SI at the decoder for data not - or only partially - known. Solutions to the above questions have various implications on coder/decoder complexity balancing, on delay and communication topology, and rate-distortion performance. These questions are being addressed by the ANR-ESSOR project. The TEMICS project-team more specifically contributes on the design of Slepian-Wolf and Wyner-Ziv coding tools as well as on the design of robust and joint source-channel distributed coding strategies. More specifically, in 2009, the TEMICS project-team has pursued the development of a practical distributed video compression schemes based on LDPC codes, improving its performance by designing estimation and decoding algorithms which allow us exploiting both the non-uniformity distribution and the memory of the correlated sources present in the video signal.
The Futurim@ages project studies coding, distribution and rendering aspects of future television video formats: 3DTV, high-dynamic range videos, and full-HD TV. In this context, TEMICS focuses on compact representations and restitution of multi-view videos. Multi-view videos provide interesting 3D functionalities, such as 3DTV (visualization of 3D videos on auto-stereoscopic screen devices) or Free Viewpoint Video (FFV, i.e. the ability to change the camera point of view while the video is visualized). However, multi-view videos represent a huge amount of redundant data compared with standard videos, hence the need to develop efficient compression algorithms. Stereoscopic or auto-stereoscopic devices display very specific camera viewpoints, which should be generated even if they do not correspond to acquisition viewpoints. Artifacts such as ghosting or bad modelled occlusions must be dealt with to render high quality 3D videos. In the Futurim@ages project, we also address the problem of depth map retrieval from multi-view and from monocular videos of static scenes. For the latter case, we rebuilt the Structure from Motion part of the 3D video codec developed in the team by integrating the lattest state-of-the-art vision algorithms. As for the multi-view case, we developed a new depth map estimation algorithm which aims at preserving depth discontinuities, which is necessary to render correct virtual views of the scene. In the near future, we plan to compare this software to the Depth Estimation Reference Software (DERS) from the MPEG 3DV group, in terms of virtual views synthesis quality.
The objective of the project is to develop new solutions of scalable description for High Definition video content to facilitate their editing, their acces via heterogeneous infrastructures (terminals, networks). The introduction of HDTV requires adaptations at different levels of the production and delivery chain. The access to the content for editing or delivery requires associating local or global spatio-temporal descritors to the content. The TEMICS project-team contributed in particular on the study of new forms of signal representation amenable to both compression and feature extraction (see Section \ref{description}). A new concept called visual sentences which can be seen as a sparse extension of the concept of visual words has been introduced and assessed for image retrieval in large databases. Methods for computing a similarity measure between descriptors has been designed for the approximate nearest neighbor search task required in content based image retrieval systems. A novel method of dictionary construction has also been developed for this problem of joint compression/description problem.
BUSMAN develops and integrates indexing and watermarking techniques to ease the search and use of video content. TEMICS contribution is focused on watermarking techniques to enrich video content by hiding meta-data. Different levels of robustness to a range of attacks such as transcoding required for transmission in heterogeneous networks, and compression at various rates must be provided. A first version of the software based on state-of-the-art techniques has been delivered for integration in the BUSMAN demonstrator comprising an authoring tool as well as a client-server delivery architecture over fixed and mobile networks. The technique should evolve to incorporate side-informed embedding solutions in order to improve the performance.
The goal of the OZONE project is to develop a pervasive computing and communication framework which will bring relevant information and services to the individual, anywhere and at anytime. The OZONE project can be viewed as the first step towards concrete ambient intelligence applications. Our contributions to the OZONE project is related to the transmission of the video data throughout the OZONE network. High quality performance of the video transmission system is essential to guarantee the quality of service required by users needs in the context of such applications. Our contribution is related to the study and development of a video transmission platform incorporating mechanisms in support of end-to-end QoS such as congestion control and loss control. The software corresponding to the video transmission loss and network congestion control has been delivered to the project's partners.
The TEMICS team is involved in the STREP DANAE addressing issues of dynamic and distributed adaptation of scalable multimedia content in a context-aware environment. Its objectives are to specify, develop, integrate and validate in a testbed a complete framework able to provide end-to-end quality of (multimedia) service at a minimal cost to the end-user. TEMICS contributes on the aspects of fine grain scalable video coding and on the study of new source codes for increasing the error resiliency of the scalable video coder while preserving its compression and scalable properties. In collaboration with other DANAE partners, TEMICS contributes to different core experiments defined in the context of MPEG-21/SVC: a core experiment on spatial transforms, on error resilience and on coding with multi-rate adaptability.
ECRYPT aims at stimulating European research in the area of cryptography and data hiding. It is splitted into several virtual labs, the one concerning us being called WAVILA: WAtermarking VIrtual LAb. Our contribution is the design of a practical efficient fingerprinting scheme for video content.
Video coding solutions so far have been adopting a paradigm where it is the task of the encoder to explore the source statistics, leading to a complexity balance where complex encoders interact with simpler decoders. This paradigm is strongly dominated and determined by applications such as broadcasting, video on demand, and video streaming. Distributed Video Coding (DVC) adopts a completely different coding paradigm by giving the decoder the task to exploit - partly or wholly - the source statistics to achieve efficient compression. This change of paradigm also moves the encoder-decoder complexity balance, allowing the provision of efficient compression solutions with simple encoders and complex decoders. This new coding paradigm is particularly adequate to emerging applications such as wireless video cameras and wireless low-power surveillance networks, disposable video cameras, certain medical applications, sensor networks, multi-view image acquisition, networked camcorders, etc., where low complexity encoders are a must because memory, computational power, and energy are scarce. The objective of DISCOVER is to explore and to propose new video coding schemes and tools in the area of Distributed Video Coding with a strong potential for new applications, targeting new advances in coding efficiency, error resiliency, scalability, and model based-video coding. The TEMICS project-team is coordinating - and contributing to - the workpackage dealing with the development of the theoretical framework and the development of Wyner-Ziv specific tools. In 2007, we have in particular designed optimum prediction filters for the Wyner-Ziv scenario, developed Slepian-Wolf codes based on quasi-arithmetic codes and on channel codes. The TEMICS project-team also contributes to the development of algorithmic tools for the complete coding/decoding architecture and to the integration of the complete video codec. In that context, methods of side information extraction, of rate control and optimal MMSE signal reconstruction in presence of side information have been developed.
The NEWCOM++ project proposal (Network of Excellence in Wireless COMmunication) intends to create a trans-European virtual research centre on the topic ``The Network of the Future''. It was submitted to Call 1 of the VII Framework Program under the Objective ICT-2007.1.1: The Network of the Future, mainly in its target direction ``Ubiquitous network infrastructure and architectures''. We participate in the workpackage WPR7 - Joint source and channel co-decoding which we now coordinate together with the task TR7.3 Tools for multi-terminal JSCC/D. WPR7 addresses issues related to the robust transmission of multimedia, and essentially video, over wireless channels (possibly terminating a wired IP network). Such issues are : (i) solving the compatibility problem with the classical OSI layers separation (to what extent can we keep this separation ?) (ii) providing new tools (and expanding existing ones) for Joint Source and Channel Coding/decoding (JSCC/D) in classical one to one, one to many (broadcast), or distributed contexts (iii) providing new tools for analysing the efficiency of these tools (iv) working on practical, long term situations, which will be used as test-beds.
This ARC (Action de Recherche Coopérative) aims at creating a synergy in the area of geometric objects transmission over networks, and more specifically to study the representation of geometric objects for their transmission over heterogeneous networks. TEMICS contributes by providing compression algorithms for unstructured surface meshes, and techniques for progressive and scalable compression taking into account visual quality criteria.
Fabriano is an ACI (Action Concertée Incitative) dedicated to the study of technical solutions to the problem of security based on watermarking and steganography. In particular, this action aims at developing a theoretical framework for stegano-analysis to be applied for the design of algorithms that will allow to detect the presence of a message within a signal in the respect of rights and ethical issues.
Nebbiano studies the security and the reliability of watermarking techniques. It is decomposed into 3 axis of research. Two of them stem from the past national project ACI FABRIANO: Investigations of the role of the Independent Component Analysis in data hiding, and the design of new secure and robust watermarking techniques. A third topic has been introduced in the above section: the reliability of watermarking techniques, especially the assessment of extremely low probability of errors.
This is a collaboration with N.\ Sellami (ISECS, Sfax, Tunisia) and I.\ Fijalkow (ETIS, Cergy France). The goal of the proposed project is the analysis of turbo-like receiver in order to allocate the resources (power, training sequence length...) of the system. The grant supports travel and living expenses of investigators for short visits to partner institutions abroad.