Wednesday, July 27, 2011

MPEG news: a report from the 97th meeting, Torino, Italy

The 97th MPEG meeting in Torino brought a few interesting news which I'd like to report here briefly. Of course, as usual, there is the official press release, however, I'd like to report on some interesting topics as follows:
  • MPEG Unified Speech and Audio Coding (USAC) reached FDIS status
  • Call for Proposals: Compact Descriptors for Visual Search (CDVS)
  • Call for Proposals: Internet Video Coding (IVC)
  • DIS on MPEG Dynamic Adaptive Streaming over HTTP (DASH)
MPEG Unified Speech and Audio Coding (USAC) reached FDIS status

ISO/IEC 23003-3 aka Unified Speech and Audio Coding (USAC) reached FDIS status and soon will be an International Standard. The FDIS itself won't be publicly available but the Unified Speech and Audio Coding Verification Test Report in September 2011 (most likely here). 

Call for Proposals: Compact Descriptors for Visual Search (CDVS)

I reported previously about that and here comes the final CfP including the evaluation framework.

MPEG is planning standardizing technologies that will enable efficient and interoperable design of visual search applications. In particular we are seeking technologies for visual content matching in images or video. Visual content matching includes matching of views of objects, landmarks, and printed documents that is robust to partial occlusions as well as changes in vantage point, camera parameters, and lighting conditions.

There are a number of component technologies that are useful for visual search, including format of visual descriptors, descriptor extraction process, as well as indexing, and matching algorithms. As a minimum, the format of descriptors as well as parts of their extraction process should be defined to ensure interoperability.

It is envisioned that a standard for compact descriptors will:
  • ensure interoperability of visual search applications and databases, 
  • enable high level of performance of implementations conformant to the standard,
  • simplify design of descriptor extraction and matching for visual search applications, 
  • enable hardware support for descriptor extraction and matching in mobile devices,
  • reduce load on wireless networks carrying visual search-related information.
It is envisioned that such standard will provide a complementary tool to the suite of existing MPEG standards, such as MPEG-7 Visual Descriptors. To build full visual search application this standard may be used jointly with other existing standards, such as MPEG Query Format, HTTP, XML, JPEG, JPSec, and JPSearch.

The Call for Proposals and the Evaluation Framework is publicly available. From a research perspective, it would be interesting to see how technologies submitted as an answer to the CfP compete with existing approaches and applications/services.

In this context, it is probably worth looking at IEEE Multimedia Jul.-Sep. 2011 issue which is dedicated to visual content: identification and search including an overview about this new MPEG standard.

Call for Proposals: Internet Video Coding (IVC)

I reported previously about that and the final CfP for Internet Video Coding Technologies is available here. The requirements reveal some interesting issues the call is about:
  • Real-time communications, video chat, video conferencing,
  • Mobile streaming, broadcast and communications,
  • Mobile devices and Internet connected embedded devices 
  • Internet broadcast streaming, downloads
  • Content sharing.
Requirements fall into the following major categories:
  • IPR requirements
  • Technical requirements
  • Implementation complexity requirements 
Clearly, this work item has an optimization towards IPR but others are not excluded. In particular,
It is anticipated that any patent declaration associated with the Baseline Profile of this standard will indicate that the patent owner is prepared to grant a free of charge license to an unrestricted number of applicants on a worldwide, non-discriminatory basis and under other reasonable terms and conditions to make, use, and sell implementations of the Baseline Profile of this standard in accordance with the ITU-T/ITU-R/ISO/IEC Common Patent Policy. 

MPEG Dynamic Adaptive Streaming over HTTP (MPEG-DASH)

For all DASH enthusiast, the latest - and probably almost final - version of DASH-related standards can be found here. Please note that DASH has been reorganized into MPEG-DASH referred to as ISO/IEC DIS 23009-1.2, Part 1: Media presentation description and segment formats. Additionally, you might be interested in the following draft:
  • ISO/IEC 14496-12:2008/DAM 3, Part 12: ISO base media file format, AMENDMENT 3: DASH support and RTP reception hint track processing
  • ISO/IEC FDIS 23001-7, Part 7: Common encryption format for ISO base media file format
All these DASH-related documents are publicly available here. In terms of implementation, the interested reader might check out the ITEC-DASH VLC-based implementation and GPAC (which provides basic support for DASH) respectively.

Further information you may find at the MPEG Web site, specifically under the hot news section and the press release. Working documents of any MPEG standard so far can be found here. If you want to join any of these activities, the list of Ad-hoc Groups (AhG) is available here (soon also here) including the information how to join their reflectors.

2 comments:

Joomla application development said...

This is the first time i am visiting your blog. And i gathered many information from your blog. Thanks for sharing this information.

Fitness Equipments India said...

Yes, well written article. I could find out some useful information on your blog guys. This is really useful for everyone. I love this post. Thanks for sharing it.