Inside MPEG-7

Before starting this page I must warn the reader that MPEG-7 is more abstract than previous MPEG standards, which may make this page harder to read than others. With this warning, let’s start from some definitions that will hopefully facilitate understanding of the MPEG-7 standard.

| Element | Definition | Examples |
|---|---|---|
| Data | The audio-visual information to be described using MPEG-7 | MPEG-4 elementary streams, Audio CDs containing music, hard disks containing MP3 files, synthetically generated pictures or drawings on a piece of paper |
| Feature | A distinctive characteristic of a Data item that means something to somebody | Colour of a picture, particular rhythm of a piece of music, camera movement in a video or cast of a movie |
| Descriptor (D) | A representation of a Feature. It defines the syntax and semantics of the representation of the Feature. Different Descriptors may very well represent the same Feature | Feature “colour” that can be represented as a histogram or as a frequency spectrum |
| Descriptor Value | An instantiation of a Descriptor for a given Data set. Descriptor Values are combined through the Description Scheme mechanism to form a Description | |
| Description Scheme (DS) | The specification of the structure and semantics of relationships among its components. These can be Descriptors or, recursively, Description Schemes. The distinction between a DS and a D is that a D just contains basic data types and does not make reference to any other D (and, obviously, DS) | A movie that is temporally structured in scenes, with textual descriptions at scene level and some audio descriptors of dialogues and background music |

In the following, Ds and DSs are collectively called Description Tools (DT).
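The relationship between these elements can be sketched as a small data model. This is only an illustration of the definitions above: the class and field names are hypothetical and do not come from the MPEG-7 schema.

```python
from dataclasses import dataclass, field
from typing import List, Union


@dataclass
class Descriptor:
    """A D: represents one Feature using basic data types only."""
    feature: str          # e.g. "colour"
    representation: str   # e.g. "histogram"


@dataclass
class DescriptorValue:
    """An instantiation of a Descriptor for a given piece of Data."""
    descriptor: Descriptor
    value: list


@dataclass
class DescriptionScheme:
    """A DS: structures Descriptor Values and, recursively, other DSs."""
    name: str
    components: List[Union[DescriptorValue, "DescriptionScheme"]] = field(
        default_factory=list
    )

    def count_values(self) -> int:
        """Count the Descriptor Values reachable from this DS."""
        total = 0
        for component in self.components:
            if isinstance(component, DescriptionScheme):
                total += component.count_values()
            else:
                total += 1
        return total


# A movie structured in scenes, one of which carries a colour histogram value.
colour_hist = Descriptor("colour", "histogram")
scene1 = DescriptionScheme("Scene 1", [DescriptorValue(colour_hist, [0.5, 0.3, 0.2])])
movie = DescriptionScheme("Movie", [scene1, DescriptionScheme("Scene 2")])
```

Note that `Descriptor` holds only basic types, while `DescriptionScheme` may nest other schemes, which mirrors the D/DS distinction in the table.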

The figure below represents the main elements making up the MPEG-7 standard.


Figure 1 – The main MPEG-7 elements

MPEG-7 provides a wide range of low-level descriptors.

MPEG-7 Visual Tools consist of basic structures and Ds that cover the following basic visual features: Color, Texture, Shape, Motion and Localisation. 

The Color feature has multiple Ds. Some of them are: 

| Name | Descriptor |
|---|---|
| Color Quantization | Expresses colour histograms keeping the flexibility of linear and non-linear quantisation and look-up tables |
| Dominant Color(s) | Represents features where a small number of colours suffice to characterise the colour information in the region of interest |
| Scalable Color | Is useful for image-to-image matching and retrieval based on colour features |
| Color Structure | Captures both colour content (similar to a colour histogram) and information about the structure of this content. Its intended use is still-image retrieval, because its main functionality is image-to-image matching |
| Color Layout | Specifies the spatial distribution of colours. It can be used for image-to-image matching and video-clip-to-video-clip matching, or for layout-based retrieval by colour, such as sketch-to-image matching |
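To make histogram-based image-to-image matching concrete, here is a minimal sketch. It assumes uniform RGB quantisation and an L1 distance; these are simplifications chosen for illustration and are not the normative extraction or matching procedures of any MPEG-7 colour D.

```python
from typing import List, Sequence, Tuple

Pixel = Tuple[int, int, int]  # (R, G, B), each component in 0..255


def colour_histogram(pixels: Sequence[Pixel], levels: int = 4) -> List[float]:
    """Normalised histogram over levels**3 uniformly quantised RGB bins."""
    step = 256 // levels
    hist = [0.0] * (levels ** 3)
    for r, g, b in pixels:
        # Clamp so that values near 255 stay in the last bin for any `levels`.
        qr = min(r // step, levels - 1)
        qg = min(g // step, levels - 1)
        qb = min(b // step, levels - 1)
        hist[(qr * levels + qg) * levels + qb] += 1.0
    count = len(pixels)
    return [h / count for h in hist] if count else hist


def l1_distance(h1: Sequence[float], h2: Sequence[float]) -> float:
    """Sum of absolute bin differences: 0 for identical, 2 for disjoint histograms."""
    return sum(abs(a - b) for a, b in zip(h1, h2))


red_image = [(255, 0, 0)] * 4
blue_image = [(0, 0, 255)] * 4
same = l1_distance(colour_histogram(red_image), colour_histogram(red_image))
different = l1_distance(colour_histogram(red_image), colour_histogram(blue_image))
```

A retrieval system would rank database images by increasing distance to the query histogram.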

The Texture feature has 3 Ds. 

| Name | Descriptor |
|---|---|
| Homogeneous Texture | Is used for searching and browsing through large collections of similar-looking patterns. An image can be considered as a mosaic of homogeneous textures, so that the texture features associated with the regions can be used to index the image data. Agricultural areas and vegetation patches are examples of homogeneous textures commonly found in aerial and satellite imagery |
| Texture Browsing | Provides a perceptual characterisation of texture, similar to a human characterisation, in terms of regularity, coarseness and directionality. The computation of this descriptor proceeds similarly to the Homogeneous Texture D. First, the image is filtered with a bank of special filters. From the filtered outputs, two dominant texture orientations are identified. Then the regularity and coarseness are determined by analysing the filtered image projections along the dominant orientations |
| Edge Histogram | Represents the spatial distribution of five types of edges: four directional edges and one non-directional edge. Edge Histogram can retrieve images with similar semantic meaning, since edges play an important role in image perception |
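The five edge types of the Edge Histogram D can be illustrated with a sketch that classifies 2x2 blocks of mean intensities. The filter coefficients below are the ones commonly quoted in the literature on this descriptor, and the threshold value is arbitrary; both should be treated as assumptions rather than as the normative definition.

```python
from math import sqrt
from typing import Dict, Iterable, Optional, Tuple

Block = Tuple[float, float, float, float]  # (top-left, top-right, bottom-left, bottom-right)

# Assumed 2x2 filter coefficients for the five edge types.
FILTERS: Dict[str, Block] = {
    "vertical": (1.0, -1.0, 1.0, -1.0),
    "horizontal": (1.0, 1.0, -1.0, -1.0),
    "diagonal_45": (sqrt(2), 0.0, 0.0, -sqrt(2)),
    "diagonal_135": (0.0, sqrt(2), -sqrt(2), 0.0),
    "non_directional": (2.0, -2.0, -2.0, 2.0),
}


def classify_block(block: Block, threshold: float = 11.0) -> Optional[str]:
    """Return the edge type with the strongest response, or None if too weak."""
    responses = {
        name: abs(sum(c * v for c, v in zip(coeffs, block)))
        for name, coeffs in FILTERS.items()
    }
    best = max(responses, key=responses.get)
    return best if responses[best] >= threshold else None


def edge_histogram(blocks: Iterable[Block]) -> Dict[str, int]:
    """Count how many blocks fall into each of the five edge types."""
    counts = {name: 0 for name in FILTERS}
    for block in blocks:
        edge = classify_block(block)
        if edge is not None:
            counts[edge] += 1
    return counts


hist = edge_histogram([
    (10, 0, 10, 0),   # bright left column
    (10, 10, 0, 0),   # bright top row
    (10, 0, 0, 10),   # checkerboard pattern
    (5, 5, 5, 5),     # flat block, no edge
])
```

In a full descriptor these counts would be collected separately for each sub-image so that the spatial distribution of edges is preserved.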

The Shape feature has 4 Ds. Region-Based Shape is a Descriptor that can describe any shape. This is a complex task because the shape of an object may consist of a single region, of several disjoint regions, or of regions that contain holes.

The Motion feature has 4 Ds: camera motion, object motion trajectory, parametric object motion, and motion activity. 

| Name | Descriptor |
|---|---|
| Camera Motion | Characterises motion parameters of a camera in a 3-D space. This motion parameter information can be extracted automatically or generated by capture devices |
| Motion Trajectory | Describes the motion trajectory of an object, defined as the localisation, in time and space, of one representative point of this object. In surveillance, alarms can be triggered if some object has a trajectory identified as dangerous (e.g. passing through a forbidden area, being unusually quick, etc.). In sports, specific actions (e.g. tennis rallies taking place at the net) can be recognised |
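The surveillance example can be sketched as follows, assuming a trajectory is available as a list of (time, x, y) samples of the representative point. The function names, the rectangular area and the speed limit are hypothetical, and the area test only looks at the sampled points, not at the segments between them.

```python
from math import hypot
from typing import List, Sequence, Tuple

Keypoint = Tuple[float, float, float]      # (time, x, y)
Box = Tuple[float, float, float, float]    # (x_min, y_min, x_max, y_max)


def enters_area(trajectory: Sequence[Keypoint], area: Box) -> bool:
    """True if any sampled position lies inside the rectangular area."""
    x_min, y_min, x_max, y_max = area
    return any(x_min <= x <= x_max and y_min <= y <= y_max for _, x, y in trajectory)


def max_speed(trajectory: Sequence[Keypoint]) -> float:
    """Largest average speed between consecutive samples."""
    speeds = [
        hypot(x1 - x0, y1 - y0) / (t1 - t0)
        for (t0, x0, y0), (t1, x1, y1) in zip(trajectory, trajectory[1:])
        if t1 > t0
    ]
    return max(speeds, default=0.0)


def alarms(trajectory: Sequence[Keypoint], forbidden: Box, speed_limit: float) -> List[str]:
    """Collect the alarm conditions raised by one trajectory."""
    raised = []
    if enters_area(trajectory, forbidden):
        raised.append("forbidden area")
    if max_speed(trajectory) > speed_limit:
        raised.append("too fast")
    return raised


path = [(0.0, 0.0, 0.0), (1.0, 3.0, 4.0), (2.0, 3.0, 5.0)]
raised = alarms(path, forbidden=(2.0, 3.0, 4.0, 6.0), speed_limit=4.0)
```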

MPEG-7 Audio Tools. There are seventeen low-level Audio temporal and spectral Ds that may be used in a variety of applications. While low-level audio Ds in general can serve many conceivable applications, the Spectral Flatness D specifically supports the functionality of robust matching of audio signals. Applications include audio fingerprinting, identification of audio based on a database of known works and, thus, locating metadata for legacy audio content without metadata annotation. 
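Spectral flatness is generally defined as the ratio between the geometric mean and the arithmetic mean of the power spectrum. The sketch below shows only that basic ratio for a single list of power values; it does not reproduce the band structure or other details of the MPEG-7 descriptor.

```python
import math
from typing import Sequence


def spectral_flatness(power: Sequence[float]) -> float:
    """Geometric mean over arithmetic mean of strictly positive power values.

    Values close to 1 indicate a flat, noise-like spectrum;
    values close to 0 indicate a peaky, tonal spectrum.
    """
    if not power or any(p <= 0 for p in power):
        raise ValueError("power values must be non-empty and strictly positive")
    # Averaging logarithms avoids overflow when multiplying many values.
    log_mean = sum(math.log(p) for p in power) / len(power)
    geometric_mean = math.exp(log_mean)
    arithmetic_mean = sum(power) / len(power)
    return geometric_mean / arithmetic_mean


noise_like = spectral_flatness([1.0, 1.0, 1.0, 1.0])
tone_like = spectral_flatness([1000.0, 0.001, 0.001, 0.001])
```

Because the ratio changes little under moderate distortions of the signal, a sequence of such values can serve as a compact signature for matching.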

Four sets of audio Description Technologies – roughly representing application areas – are integrated in the standard: sound recognition, musical instrument timbre, spoken content, and melodic contour. Timbre is defined as the perceptual features that make two sounds with the same pitch and loudness sound different.

| Name | Descriptor |
|---|---|
| Musical Instrument Timbre | Describes perceptual features of instrument sounds. The aim of the Timbre D is to describe the perceptual features with a reduced set of Ds that relate to notions such as “attack”, “brightness” or “richness” of a sound |
| Sound Recognition | Indexes and categorises general sounds, with immediate application to sound effects |
| Spoken Content | Describes words spoken within an audio stream. This trades compactness for robustness of search, because current Automatic Speech Recognition (ASR) technologies have their limits, and one will always encounter out-of-vocabulary utterances. To accomplish this, the tools represent the output and what might normally be seen as intermediate ASR results. The tools can be used for two broad classes of retrieval scenario: indexing into and retrieval of an audio stream, and indexing of multimedia objects annotated with speech |

One can easily see, from this cursory presentation, the range of tools that are being offered to content owners and to application developers. The MPEG-7 Ds have been designed for describing a wide range of information types: low-level audio-visual features such as color, texture, motion, audio energy, and so forth, as illustrated above; high-level features of semantic objects, events and abstract concepts; content management processes; information about the storage media; and so forth. It is expected that most Ds corresponding to low-level features will be extracted automatically, whereas more human intervention is likely to be required for producing higher-level Ds. 

The MPEG-7 Multimedia Description Schemes part of the standard defines a set of DTs dealing with generic as well as multimedia entities. Generic entities are features which are used in audio, visual, and text descriptions, and are therefore “generic” to all media. These are, for instance, “vector”, “time”, etc. More complex DTs are also standardised. They are used whenever more than one medium needs to be described (e.g. audio and video). These DTs can be grouped into 6 different classes according to their functionality, as in the following figure.


Figure 2 – The MPEG-7 Multimedia Description Schemes 

| Elements | Definition |
|---|---|
| Basic Elements | Facilitate creation and packaging of descriptions |
| Content description | Represents perceivable information |
| Content management | Information about the media features, the creation and the usage of the AV content |
| Content organization | Represents the analysis and classification of several AV contents |
| Navigation and access | Specifies summaries and variations of the AV content |
| User Interaction | Describes user preferences and usage history |

Basic Elements define a number of Schema Tools that facilitate the creation and packaging of MPEG-7 descriptions, a number of basic data types and mathematical structures, such as vectors and matrices, which are important for audio-visual content description. There are also constructs for linking media files and localising segments, regions, and so forth. Many of the basic elements address specific needs of audio-visual content description, such as the description of time, places, persons, individuals, groups, organisations, and other textual annotation. 

Content Description describes the Structure (regions, video frames, and audio segments) and Semantics (objects, events, abstract notions). Structural aspects describe the audio-visual content from the viewpoint of its structure. Conceptual aspects describe the audio-visual content from the viewpoint of real-world semantics and conceptual notions. 

The Content Management DTs allow the description of the life cycle of the content, from content to consumption, including media coding, storage and file formats, and content usage.

| Name | Description Tools |
|---|---|
| Creation Information | Describes the creation and classification of the audio-visual content and other material that is related to the audio-visual content: Title (which may itself be textual or another piece of audio-visual content), Textual Annotation, and Creation Information such as creators, creation locations and dates |
| Classification Information | Describes how the audio-visual material is classified into categories such as genre, subject, purpose, language, and so forth. It also provides review and guidance information such as age classification, subjective review, parental guidance, and so forth |
| Related Material Information | Describes whether other audio-visual material exists that is related to the content being described |
| Usage Information | Describes the usage information related to the audio-visual content such as usage rights (through links to the rights holders and other information related to rights management and protection), availability, usage record, and financial information |
| Media Description | Describes the storage media such as the compression, coding and storage format of the audio-visual data |

Content Organisation organises and models collections of audio-visual content and of descriptions. 

Navigation and Access facilitates browsing and retrieval of audio-visual content by defining summaries, partitions, decompositions, and variations of the audio-visual material. 

| Name | Description Tools |
|---|---|
| Summaries | Provide compact summaries of the audio-visual content to enable discovery, browsing, navigation, visualization and sonification of audio-visual content |
| Partitions and Decompositions | Describe different decompositions of the audio-visual signals in space, time and frequency |
| Variations | Provide information about different variations of audio-visual programs, such as summaries and abstracts; scaled, compressed and low-resolution versions; and versions with different languages and modalities – audio, video, image, text, and so forth |

User Interaction describes user preferences and usage history pertaining to the consumption of the multimedia material. This allows, for example, matching between user preferences and MPEG-7 content descriptions in order to facilitate personalization of audio-visual content access, presentation and consumption. 

The main tools used to implement MPEG-7 descriptions are DDL, DSs, and Ds. Ds bind a feature to a set of values. DSs are models of the multimedia objects and of the universes that they represent. They specify the types of the Ds that can be used in a given description, and the relationships between these Ds or between other DSs. The DDL provides the descriptive foundation by which users can create their own DSs and Ds and defines the syntactic rules to express and combine DSs and Ds. 

The Description Definition Language satisfies the requirement of being able to express spatial, temporal, structural, and conceptual relationships between the elements of a DS, and between DSs. It provides a rich model for links and references between one or more descriptions and the data that they describe. The DDL Parser is also capable of validating Description Schemes (content and structure) and D data types, both primitive (integer, text, date, time) and composite (histograms, enumerated types). MPEG-7 adopted XML Schema Language as the DDL but added certain extensions in order to satisfy all requirements. The DDL can be broken down into the following logical normative components: the XML Schema structural language components, the XML Schema datatype language components, and the MPEG-7 specific extensions. 
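As a rough illustration of what a textual description and a composite data type check might look like, the sketch below parses a small XML fragment and verifies a histogram value. The element and attribute names (`Description`, `Scene`, `ColourHistogram`, `bins`) are hypothetical and are not taken from the MPEG-7 schema, and the hand-written check merely stands in for the validation that an XML Schema based DDL parser would perform.

```python
import xml.etree.ElementTree as ET

# Hypothetical description; NOT the normative MPEG-7 element names.
TEXT = """
<Description>
  <Scene id="s1">
    <Annotation>Opening dialogue</Annotation>
    <ColourHistogram bins="4">12 0 3 1</ColourHistogram>
  </Scene>
</Description>
"""


def check_histogram(elem: ET.Element) -> bool:
    """Composite type check: the declared bin count must match the number
    of values, and every value must be a non-negative integer."""
    try:
        values = [int(v) for v in (elem.text or "").split()]
        declared = int(elem.get("bins", "-1"))
    except ValueError:
        return False
    return len(values) == declared and all(v >= 0 for v in values)


root = ET.fromstring(TEXT.strip())
results = {
    scene.get("id"): all(check_histogram(h) for h in scene.iter("ColourHistogram"))
    for scene in root.iter("Scene")
}
```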

The information representation specified in the MPEG-7 standard provides the means to represent coded multimedia content description information. The entity that makes use of such coded representation of the multimedia content is an “MPEG-7 terminal”. This may be a standalone application or a part of an application system. The architecture of such a terminal is depicted in the figure below. 


Figure 3 – Model of an MPEG-7 terminal

The Delivery layer, placed at the bottom of the figure, provides MPEG-7 elementary streams to the Systems layer. MPEG-7 elementary streams consist of consecutive individually accessible portions of data named Access Units. An access unit (AU) is the smallest data entity to which timing information can be attributed. MPEG-7 elementary streams contain Schema information, which defines the structure of the MPEG-7 description, and Description information. The latter can be either the complete description of the multimedia content or fragments of the description. 

MPEG-7 data can be represented in textual format, in binary format, or in a mixture of the two, depending on application requirements. A unique mapping between the binary format and the textual format is defined by the standard. A bi-directional loss-less mapping between the textual representation and the binary representation is possible, but this need not always be used. Some applications may not want to transmit all the information contained in the textual representation and may prefer to use a more bit-efficient binary lossy transmission. The syntax of the textual format is defined by the DDL. The syntax of the binary format, called Binary format for MPEG-7 data (BiM), was originally defined in Part 1 (Systems) of the standard but was later moved to Part 1 of MPEG-B. 

At the compression layer, the flow of AUs (either textual or binary) is parsed, and the content description is reconstructed. The MPEG-7 binary stream can be either parsed by the BiM parser, transformed into textual format and then transmitted in textual format for further reconstruction processing, or the binary stream can be parsed by the BiM parser and then transmitted in proprietary format for further processing. 

AUs are further structured as commands encapsulating the schema or the description information. Commands allow a description to be delivered in a single chunk or to be fragmented in small pieces. They allow basic operations such as updating a D, deleting part of the description or adding a new DDL structure. The reconstruction stage of the compression layer updates the description information and associated schema information by consuming these commands. Further structure of the schema or description is out of the scope of the MPEG-7 standard in its current form.
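The way a terminal consumes commands to rebuild a description can be sketched as follows. The command names ("add", "update", "delete"), the path-keyed dictionary and the function name are hypothetical simplifications; the actual command syntax is defined by the standard and is not reproduced here.

```python
from typing import Dict, List, Tuple

Command = Tuple[str, str, object]  # (operation, path, payload)


def apply_commands(description: Dict[str, object],
                   commands: List[Command]) -> Dict[str, object]:
    """Return a new description after consuming one access unit's commands."""
    result = dict(description)
    for operation, path, payload in commands:
        if operation == "add":
            result[path] = payload
        elif operation == "update":
            if path not in result:
                raise KeyError(f"cannot update missing node: {path}")
            result[path] = payload
        elif operation == "delete":
            # Remove the node and every node below it.
            doomed = [k for k in result if k == path or k.startswith(path + "/")]
            for key in doomed:
                del result[key]
        else:
            raise ValueError(f"unknown command: {operation}")
    return result


# Two access units: the first delivers fragments, the second revises them.
au1 = [("add", "movie/scene1/text", "Opening"),
       ("add", "movie/scene1/colour", [1, 2])]
au2 = [("update", "movie/scene1/text", "Opening dialogue"),
       ("delete", "movie/scene1/colour", None)]
state = apply_commands(apply_commands({}, au1), au2)
```

Delivering the description as a sequence of small commands lets a terminal start using a partial description before all of it has arrived.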
