MPEG-7

INTRODUCTION:

• Its formal name is Multimedia Content Description Interface

• Its objective is to serve the need for audiovisual content-based
retrieval (or audiovisual object retrieval)

• It focuses on audiovisual objects, which allows effective and
efficient retrieval for various applications
• It supports multimedia applications whose data may include still
pictures, graphics, 3D models, audio, speech, video, and
composition information

• It consists of 7 parts: Systems, DDL, Visual, Audio, MDS
(Multimedia Description Schemes), RS (Reference Software), and
Conformance

• The Systems part specifies the syntax of the binary format, and
the DDL specifies the syntax of the textual format
COMPARISON:

• The MPEG-1 and MPEG-2 standards made interactive video on
CD-ROM and digital television possible

• MPEG-4 is the multimedia standard for the fixed and mobile web,
enabling integration of multiple paradigms

• MPEG-7 is a standard for describing multimedia content data in a
way that can be passed to, or accessed by, a device or computer code
ADVANTAGES:

• It is not aimed at any one application in particular; rather, its
elements support as broad a range of applications as possible

• It provides a rich set of standardized tools to describe
multimedia content. It offers a comprehensive set of audiovisual
Description Tools to create descriptions for applications, enabling
effective and efficient access to multimedia content
• MPEG-7 calls its working model the Experimentation Model (XM)
IMPORTANT TERMS:
• Feature: A characteristic of the data
• Description: A set of instantiated Ds and DSs that describes the
structural and conceptual information of the content, the storage
and usage of the content, and so on
• Descriptor (D): A definition (syntax and semantics) of the feature
• Description Scheme (DS): A specification of the structure and
relationships between Ds and DSs
• Description Definition Language (DDL): Syntactic rules to express
and combine DSs and Ds
• The first set of experiments in MPEG-7 is called the Core
Experiments
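
The relationship between these terms can be pictured as a small data model. The following is a minimal Python sketch, not anything defined by the standard: the class names, fields, and sample values ("StillRegion", "DominantColor", "Sunset") are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Any, List, Union


@dataclass
class Descriptor:
    """A D: names a feature and holds one instantiated value for it."""
    feature: str  # the characteristic of the data being described
    value: Any    # instantiated value (hypothetical representation)


@dataclass
class DescriptionScheme:
    """A DS: structures Ds and other DSs and relates them to each other."""
    name: str
    children: List[Union["Descriptor", "DescriptionScheme"]] = field(default_factory=list)

    def descriptors(self) -> List[Descriptor]:
        """Collect every D nested anywhere below this DS, in document order."""
        found: List[Descriptor] = []
        for child in self.children:
            if isinstance(child, Descriptor):
                found.append(child)
            else:
                found.extend(child.descriptors())
        return found


# A description is then a set of instantiated Ds and DSs.
region = DescriptionScheme("StillRegion", [Descriptor("DominantColor", (200, 30, 30))])
description = DescriptionScheme("Image", [Descriptor("Title", "Sunset"), region])
```

Calling `description.descriptors()` walks the nested DSs and returns the two Ds in order, which mirrors how a DS combines Ds and other DSs.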
DESCRIPTOR:

• Descriptors are designed to describe both low-level features,
such as color, texture, shape, and motion, and high-level features,
such as events and abstract concepts. They are chosen based on
their performance, efficiency, and size
• Color
- Color space: RGB, YCbCr, HSV, HMMD, 3D color space, monochrome
- Color quantization: Linear, nonlinear, lookup tables
- Dominant colors: A small number of representative colors in each
region or image
- Scalable color: A color histogram in HSV color space. It is
encoded by a Haar transform and hence is scalable
- Color layout: Spatial distribution of colors for
color-layout-based retrieval
- Color structure: The frequency of a color structuring element
describes both the color content and its structure in the image
- GoF/GoP color (group of frames / group of pictures): Applied to a
video segment or a group of still images
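
To see why Haar encoding makes a color histogram scalable, a simplified sketch may help. It uses an unnormalized sum/difference Haar transform on a tiny 4-bin histogram; the bin count, the function names, and the lack of any quantization are assumptions of this example and do not reproduce the standard's actual encoding.

```python
def haar_1d(values):
    """Unnormalized Haar transform: [overall sum, coarse diffs..., fine diffs]."""
    n = len(values)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    sums = list(values)
    details = []
    while len(sums) > 1:
        pairs = [(sums[i], sums[i + 1]) for i in range(0, len(sums), 2)]
        # Differences of this level go in front of the finer ones found so far.
        details = [a - b for a, b in pairs] + details
        sums = [a + b for a, b in pairs]
    return sums + details


def inverse_haar_1d(coeffs):
    """Invert haar_1d; zeroed detail coefficients yield a coarser histogram."""
    values = list(coeffs[:1])
    pos = 1
    while pos < len(coeffs):
        n = len(values)
        diffs = coeffs[pos:pos + n]
        expanded = []
        for s, d in zip(values, diffs):
            expanded.extend([(s + d) / 2, (s - d) / 2])
        pos += n
        values = expanded
    return values


histogram = [4, 2, 6, 8]           # hypothetical 4-bin color histogram
coeffs = haar_1d(histogram)        # [20, -8, 2, -2]
coarse = inverse_haar_1d(coeffs[:2] + [0, 0])  # fine details dropped
```

Keeping all coefficients reproduces the histogram exactly, while dropping the finest differences (as in `coarse`) averages neighbouring bins, so a decoder can work at whatever resolution it needs.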
• Texture
- Homogeneous texture: Uses orientation- and scale-tuned Gabor
filters that quantitatively represent regions of homogeneous texture
- Texture browsing: Describes the regularity, coarseness, and
directionality of edges, used to represent and browse homogeneous
textures
- Edge histogram: Represents the spatial distribution of four
types of directional edges and one non-directional edge
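
The edge histogram idea can be sketched by classifying small image blocks into edge types and counting them. The 2x2 filter coefficients below are the ones commonly cited for this descriptor, but they are an assumption here, as are the threshold value and the function names; treat this as an illustration rather than the normative extraction procedure.

```python
import math

SQRT2 = math.sqrt(2.0)

# 2x2 filters for four directional edges and one non-directional edge.
# Assumed coefficients; verify against the standard before relying on them.
EDGE_FILTERS = {
    "vertical": ((1.0, -1.0), (1.0, -1.0)),
    "horizontal": ((1.0, 1.0), (-1.0, -1.0)),
    "diagonal_45": ((SQRT2, 0.0), (0.0, -SQRT2)),
    "diagonal_135": ((0.0, SQRT2), (-SQRT2, 0.0)),
    "non_directional": ((2.0, -2.0), (-2.0, 2.0)),
}


def classify_block(block, threshold=11.0):
    """Return the strongest edge type of a 2x2 block, or None if too weak."""
    strengths = {
        name: abs(sum(f[r][c] * block[r][c] for r in range(2) for c in range(2)))
        for name, f in EDGE_FILTERS.items()
    }
    best = max(strengths, key=strengths.get)
    return best if strengths[best] >= threshold else None


def edge_histogram(blocks, threshold=11.0):
    """Count edge types over the 2x2 blocks of one image region."""
    counts = {name: 0 for name in EDGE_FILTERS}
    for block in blocks:
        label = classify_block(block, threshold)
        if label is not None:
            counts[label] += 1
    return counts


blocks = [((10, 0), (10, 0)), ((10, 10), (0, 0)), ((5, 5), (5, 5)), ((10, 0), (0, 10))]
histogram = edge_histogram(blocks)
```

Computing one such histogram per image region, rather than once for the whole image, is what would capture the spatial distribution of the edges.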
• Shape
- Region-based shape: A set of angular radial transform (ART)
coefficients is used to describe an object's shape
- Contour-based shape: Uses a curvature scale space (CSS)
representation that is invariant to scale and rotation and robust
to nonrigid motion
- 3D shape: Describes 3D mesh models and shape index
• Motion
- Camera motion: Fixed, pan, tilt, roll, dolly, track, boom (see
fig)
- Object motion trajectory: A list of keypoints (x, y, z, t),
optionally with interpolation parameters that specify the
acceleration along the path
- Parametric object motion: The basic model is the 2D affine model
for translation, rotation, scaling, shearing, and combinations of
these
- Motion activity: Provides descriptions such as the intensity,
pace, mood, and so on, of the video
fig: Camera Motions
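
A 2D model that combines translation, rotation, scaling, and shearing can be written as a single affine transform. The sketch below is only an illustration: the parameter names and the composition order (scale, then shear along x, then rotate, then translate) are assumptions of this example, not the descriptor's actual parameterization.

```python
import math


def affine_matrix(tx=0.0, ty=0.0, angle=0.0, sx=1.0, sy=1.0, shear=0.0):
    """Build ((a, b, tx), (c, d, ty)): scale, shear along x, rotate, translate."""
    cos_a, sin_a = math.cos(angle), math.sin(angle)
    # Shear applied after scaling: [[1, shear], [0, 1]] @ [[sx, 0], [0, sy]].
    m00, m01 = sx, shear * sy
    m10, m11 = 0.0, sy
    # Rotation applied to the scaled and sheared axes.
    a = cos_a * m00 - sin_a * m10
    b = cos_a * m01 - sin_a * m11
    c = sin_a * m00 + cos_a * m10
    d = sin_a * m01 + cos_a * m11
    return ((a, b, tx), (c, d, ty))


def apply_affine(matrix, point):
    """Map a point (x, y) through the affine transform."""
    (a, b, tx), (c, d, ty) = matrix
    x, y = point
    return (a * x + b * y + tx, c * x + d * y + ty)


moved = apply_affine(affine_matrix(tx=2.0, ty=3.0, sx=2.0, sy=2.0), (1.0, 1.0))
rotated = apply_affine(affine_matrix(angle=math.pi / 2), (1.0, 0.0))
```

Six numbers are enough to describe the motion of a whole region between frames, which is why a parametric model is far more compact than storing a displacement for every pixel.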
• Localisation
- Region locator: Specifies the localization of regions in images
with a box or a polygon
- Spatiotemporal locator: Describes spatiotemporal regions in video
sequences, using one or more sets of descriptors of regions and
their motions
• Others
- Face recognition: A normalised face image is represented as a 1D
vector, then projected onto a set of 49 basis vectors representing
all possible face vectors
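
The projection step in the face recognition descriptor amounts to one dot product per basis vector. The following sketch uses a 3-element "face" and 2 basis vectors purely to keep the numbers readable; the function name and the toy values are hypothetical, and a real descriptor would use the 49 basis vectors mentioned above.

```python
def project(face_vector, basis_vectors):
    """Project a 1D face vector onto each basis vector via dot products."""
    for basis in basis_vectors:
        if len(basis) != len(face_vector):
            raise ValueError("basis and face vectors must have equal length")
    return [sum(b * f for b, f in zip(basis, face_vector)) for basis in basis_vectors]


# Toy data: a flattened "image" of 3 pixels and 2 orthonormal basis vectors.
face = [3.0, 4.0, 5.0]
basis = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
features = project(face, basis)
```

The result has one coefficient per basis vector, so two faces can be compared by comparing these short feature vectors instead of the full images.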
DESCRIPTION SCHEME (DS):

• Basic elements
- Data types and mathematical structures: Vectors, matrices,
histograms…
- Constructs: Link media files and localize segments, regions, and
so on
- Schema tools: Include root elements, top-level elements, and
package tools

• Content Management
- Media Description: Involves a single DS, the MediaInformation DS,
composed of a MediaIdentification D and one or more MediaProfile Ds
- Creation and Production Description: Includes information about
creation, classification, and related materials
- Content Usage Description: Provides information about usage
rights, usage record, availability, and finance
• Content Description
- Structural Description: Described by a Segment DS. A segment is a
section of an audiovisual object. The Segment DS has five
subclasses: AudioVisual Segment DS, Audio Segment DS, Still Region
DS, Moving Region DS, and Video Segment DS.
- Conceptual Description: Involves higher-level descriptions of the
content, such as the Event DS, Object DS, State DS, and Concept DS
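
The five segment subclasses can be pictured as a small class hierarchy. This is a hedged Python sketch only: the class names follow the list above, but the `label` and `parts` fields, the `walk` method, and the idea of nesting segments inside one another are assumptions of this example rather than the DS definitions themselves.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Tuple


@dataclass
class Segment:
    """A section of an audiovisual object (base of the five subclasses)."""
    label: str
    parts: List["Segment"] = field(default_factory=list)  # assumed nesting

    def walk(self, depth: int = 0) -> Iterator[Tuple[int, "Segment"]]:
        """Yield (depth, segment) pairs for this segment and its parts."""
        yield depth, self
        for part in self.parts:
            yield from part.walk(depth + 1)


class AudioVisualSegment(Segment):
    pass


class AudioSegment(Segment):
    pass


class StillRegion(Segment):
    pass


class MovingRegion(Segment):
    pass


class VideoSegment(Segment):
    pass


clip = VideoSegment("news clip", [StillRegion("studio logo"), MovingRegion("news ticker")])
outline = [(depth, type(seg).__name__, seg.label) for depth, seg in clip.walk()]
```

Here `outline` lists the video segment at depth 0 and its two regions at depth 1, which is the kind of structural view a Segment DS is meant to convey.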
• Navigation and access
- Summaries: Provide a video summary for quick browsing and
navigation of the content by presenting only the keyframes. The
supported DSs are the Summarization DS, HierarchicalSummary DS,
HighlightLevel DS, and SequentialSummary DS
- Partitions and decompositions: These refer to view partitions and
view decompositions. The view partitions describe different space
and frequency views of the audiovisual data, such as a spatial
view, temporal view, frequency view, and so on. The view
decomposition DSs specify different tree decompositions, such as
the SpaceTree DS

• Content Organisation
- Collections: The CollectionStructure DS groups audiovisual
contents into clusters.
- Models: Model DSs include a ProbabilityModel DS, AnalyticModel DS,
and Classifier DS, which extract the models and statistics of the
attributes and features of the collections

• User Interaction
- User preference: DSs describe user preferences in the consumption
of audiovisual content, such as content types, browsing modes, and
privacy characteristics
DESCRIPTION DEFINITION LANGUAGE (DDL):
• MPEG-7 adopted the XML Schema Language as its DDL.
The MPEG-7 DDL has the following components:

• XML Schema structure components
- The Schema: the wrapper around definitions and declarations
- Primary and secondary structural components
- Helper components
• XML Schema data type components
- Primitive and derived data types, and mechanisms for the user to
derive new data types
• MPEG-7 extensions
- Array and matrix data types, multiple data types, enumerated data
types, and Intellectual Property Management and Protection for Ds
and DSs
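
Since the DDL is based on XML Schema, a descriptor definition looks like an ordinary schema with a wrapper, type definitions, and element declarations. The sketch below embeds a small schema in Python and lists what it declares. The names `ExampleDominantColorType`, `ExampleDominantColor`, `ColorSpace`, and `Value` are invented for illustration and are not taken from the MPEG-7 schema; only the XML Schema namespace is real.

```python
import xml.etree.ElementTree as ET

XS = "http://www.w3.org/2001/XMLSchema"  # standard XML Schema namespace

# Hypothetical descriptor definition; not copied from the MPEG-7 schema.
SCHEMA_TEXT = f"""<xs:schema xmlns:xs="{XS}">
  <xs:complexType name="ExampleDominantColorType">
    <xs:sequence>
      <xs:element name="ColorSpace" type="xs:string"/>
      <xs:element name="Value" type="xs:integer" minOccurs="1" maxOccurs="8"/>
    </xs:sequence>
  </xs:complexType>
  <xs:element name="ExampleDominantColor" type="ExampleDominantColorType"/>
</xs:schema>"""


def declared_names(schema_text):
    """Return (type names, element names) declared directly under the schema."""
    root = ET.fromstring(schema_text)
    types = [e.get("name") for e in root.findall(f"{{{XS}}}complexType")]
    elements = [e.get("name") for e in root.findall(f"{{{XS}}}element")]
    return types, elements


types, elements = declared_names(SCHEMA_TEXT)
```

The `xs:schema` element plays the role of the wrapper around definitions and declarations, the `xs:complexType` is a type definition, and `xs:string` and `xs:integer` are primitive and derived built-in data types.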
