


Zoran BOJKOVIĆ (1) , Dragorad MILOVANOVIĆ (2) , Andreja SAMČOVIĆ (1)

(1) Faculty for Traffic and Transport Engineering, Belgrade, Yugoslavia, Vojvode Stepe 305; phone: +381-11-3091-217, fax: +381-11-466-294, e-mail: bojkovic@net.yu
(2) Faculty for Electrical Engineering, Belgrade, Yugoslavia

Abstract: In this paper, it is outlined that multimedia - an integrated and interactive presentation of speech, audio, video, graphics and text - has become a major theme in today’s information technology, merging the practices of communications, computing and information processing into an interdisciplinary field. The challenge of multimedia communications is to provide services that integrate text, sound, image and video information, and to do so in a way that preserves the ease of use and interactivity. A brief description of the elements of multimedia systems is presented. User and network requirements are discussed, together with the packet transfer concept. An overview of multimedia communication standards is given. We also discuss multimedia transport over ATM and IP networks. Issues concerning multimedia digital subscriber lines are outlined, together with multimedia over wireless, mobile and broadcasting networks, as well as the digital TV infrastructure for interactive multimedia services.

Keywords: multimedia, standard, network, communication, system, user, requirement, asynchronous transfer mode, terminal, Internet, protocol

I. Introduction

The past years have seen an explosion in the use of digital media. Industry is making significant investments to deliver digital audio, image and video information to consumers and customers. A new infrastructure of digital audio, image and video recorders and players, on-line services, and electronic commerce is rapidly being deployed. At the same time, major corporations are converting their audio, image and video archives to electronic form. Digital media offer several distinct advantages over analog media. The quality of digital audio, image and video signals is higher than that of their analog counterparts. Editing is easy because one can access the exact discrete locations that should be changed. Copying is simple, with no loss of fidelity: a copy of digital media is identical to the original. Digital audio, images and video are easily transmitted over networked information systems. These advantages have opened up many new possibilities.



Multimedia can be informally defined as a set of multimedia data plus a set of interactions. Multimedia data is informally considered as the collection of three M's: multi-source, multi-type and multi-format data. The interactions among the multimedia components consist of complex relationships without which multimedia would be a simple set of visual, audio and other data. Multimedia and multimedia communication can be globally viewed as a hierarchical system. The multimedia software and applications provide a direct interactive environment for users.

When a computer requires information from remote computers or servers, multimedia information must travel through computer networks. Since the amount of information involved in the transmission of video and audio can be substantial, the multimedia information must be compressed before it is sent through the network, in order to reduce the communication delay. Constraints such as limited delay and jitter are used to ensure a reasonable video and audio effect at the receiving end. Therefore, communication networks are undergoing constant improvements in order to provide multimedia communication capabilities. Local area networks are used to connect local computers and other equipment, while wide area networks and the Internet connect the local area networks together. Better standards are constantly being developed in order to provide a global information superhighway over which multimedia information will travel.

Multimedia communications is the field referring to the representation, storage, retrieval and dissemination of machine-processable information expressed in multiple media, such as text, images, graphics, speech, audio, video, animation, handwriting and data files. With the advent of high-capacity storage devices, powerful yet economical computer workstations and high-speed integrated services digital networks, providing a variety of multimedia communication services is becoming not only technically but also economically feasible. In addition, the broadband integrated services digital network (BISDN) has been given special attention as a next-generation communication network infrastructure capable of transmitting full-motion pictures and high-speed data at 150 and 600 Mb/s, as well as voice and data, throughout the world [1].

At first, the concept of multimedia communication modeling will be described, together with user and network requirements, the packet transfer concept and multimedia terminals. The second part deals with multimedia communication standards. Finally, we will concentrate on multimedia communications over networks. Multimedia transfer over ATM networks is described, followed by multimedia over IP networks. Special issues relating to multimedia over wireless networks will also be discussed.
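The argument for compression above can be made concrete with a back-of-the-envelope calculation. The sketch below is illustrative and not from the paper: the frame size, the assumed 100:1 compression ratio and the 150 Mb/s access rate (the lower BISDN rate mentioned in the text) are example values only.

```python
# Why video must be compressed before transmission: a rough estimate.

def raw_video_bitrate(width, height, bits_per_pixel, fps):
    """Bit rate of uncompressed digital video, in bits per second."""
    return width * height * bits_per_pixel * fps

# A CCIR-601-like frame, 24 bits/pixel, 25 frames/s (illustrative values).
raw = raw_video_bitrate(720, 576, 24, 25)   # about 249 Mbit/s
compressed = raw / 100                      # assumed ~100:1 compression ratio

link = 150e6  # the 150 Mb/s BISDN access rate mentioned in the text

print(f"raw:        {raw / 1e6:.0f} Mbit/s")
print(f"compressed: {compressed / 1e6:.1f} Mbit/s")
# The raw stream alone exceeds the whole 150 Mb/s link, while the
# compressed stream fits with ample room for other services.
print(f"raw fits link:        {raw <= link}")
print(f"compressed fits link: {compressed <= link}")
```

Even with generous link capacity, uncompressed video saturates the channel; compression is what makes multiplexing many video streams onto one access feasible.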



II. LIST OF ABBREVIATIONS

ADPCM - Adaptive Differential Pulse Code Modulation
ADSL - Asymmetric Digital Subscriber Line
AD - Area Directors
ATDM - Asynchronous Time-Division Multiplex
ATM - Asynchronous Transfer Mode
BISDN - Broadband Integrated Services Digital Network
CATV - Cable Television
C&C - Command and Control
CLEC - Competitive Local Exchange Carriers
CTI - Complete Timing Information
DBS - Direct Broadcast Satellite
DDL - Description Definition Language
DSL - Digital Subscriber Line
DSM - Digital Storage Media
GSM - Global System for Mobile
HDTV - High Definition Television
HFC - Hybrid Fiber Coax
IAB - Internet Architecture Board
IEC - International Electrotechnical Commission
IESG - Internet Engineering Steering Group
IETF - Internet Engineering Task Force
IP - Internet Protocol
IPN - Integrated Packet Network
ISO - International Organization for Standardization
ISOC - Internet Society
JPEG - Joint Photographic Experts Group
LAN - Local Area Network
LMDS - Local Multipoint Distribution Service
MAC - Medium Access Control
MBS - Mobile Broadband System
MPEG - Moving Picture Experts Group
MVPD - Multichannel Video Program Distribution
NTI - Null Timing Information
NTSC - National Television System Committee
PAL - Phase Alternating Line
PC - Personal Computer
PCM - Pulse Code Modulation
PVR - Packet Voice Receiver
PVT - Packet Voice Transmitter
QoS - Quality of Service
RFC - Request for Comments
SHDSL - Single-pair High-speed DSL
SIF - Source Input Format
TCP - Transmission Control Protocol
TR - Technical Report
UDP - User Datagram Protocol
UMTS - Universal Mobile Telecommunication System
VBR - Variable Bit Rate
VoDSL - Voice over DSL
VPN - Virtual Private Network
WAN - Wide Area Network
WATM - Wireless ATM
WG - Working Group
WLAN - Wireless LAN
WWW - World Wide Web

III. MULTIMEDIA COMMUNICATION MODEL

Multimedia best suits the human being’s complex perception, communicating behaviors and ways of acting. Namely, it will not only provide communication capabilities and information sharing among people, irrespective of location and time, but also easy and immediate access to widely distributed information banks and information processing centers. Applications in medicine, education, travel, real estate, banking, insurance, administration and publishing are emerging at a fast pace. These applications are characterized by large multimedia documents that must be communicated within very short delays. Computer-controlled cooperative work, whereby a group of users can jointly view, create, edit and discuss multimedia documents, is going to be characteristic of many transactions [2]. Some attractive applications of multimedia processing include distance learning, virtual library access and living books. In distance learning, we learn and interact with instructors remotely over a broadband communication network. Virtual library access means that we instantly have access to all of the published material in the world, in its original form and format, and can browse, display, print and even modify the material instantaneously. Living books supplement the written word and the associated pictures with animations and hyperlink access to supplementary material [3, 4, 5]. Applications that are enabled or enhanced by video are often seen as the primary justification for the development of multimedia networks. Trends towards multimedia communication are represented in Fig. 1.

Much of the work on packet video has considered fairly homogeneous networking scenarios [6]. This would be appropriate if a single type of video service dominated in the networks. However, this is not a valid assumption for traffic purposes. First, video will not constitute a uniform service with easily determined behavior and requirements. Secondly, video will not share resources only with streams of the same type. This means that multiplexing in the network should be evaluated for a heterogeneous mix of traffic types. In business areas, there is a potential need for various kinds of new communication systems, such as high-speed data networks between geographically distributed local area networks (LANs), high-definition still-picture communication, and TV conferencing or corporate cable TV services. The new paradigm of a BISDN application system, resulting from the integration of multimedia processing by workstations and multimedia communication by BISDN, is shown in Fig. 2. A multimedia communication model is strongly influenced by the manufacturer-dependent solutions for personal computers and workstations, including application software, on the one hand, and by the intelligent network concept on the other [7,8,9,10]. A layered model for future multimedia communication comprises five constituents:

Figure 1. Multimedia communication: convergence of audio communication (telephony, sound broadcasting), data, text and image communication (data transfer, fax) and video communication (video telephony) into integrated multimedia communication

Figure 2. New information system paradigm using BISDN and workstations: the integration of multimedia processing (workstations, PCs) with integrated communication (broadband ISDN)






Figure 3. Elements of multimedia systems used in (a) person-to-person communications and (b) person-to-machine modes: both comprise a user interface and transport, with (b) adding processing, storage and retrieval

a) Partitioning of complex information objects into distinct information types for the purpose of easier communication, storage and processing. This comprises data, video or audio, with the integration of different information types not being excluded.

b) Standardization of service components per information type, possibly with several levels of quality per information type.

c) Creation of platforms at two levels: a network service platform, and a multimedia communication platform. The first level hides the transport networks and network building blocks from an application designer’s or user’s point of view. The second level provides communication support on the basis of information structure and information exchange building blocks for a large number of applications.

d) Definition of “generic applications” for multiple use in various multimedia environments and different branches meeting common widespread needs.

e) Specific applications, such as electronic shopping, teletraining and remote maintenance, based on special information building blocks and utilizing the network service platform and the multimedia communication platform, as well as including generic applications.

With regard to the capability of the available resources in each case, multimedia communication applications must be scalable in order to run in a consistent manner across different network and terminal types and capabilities. There are two key communication modes in which multimedia systems are generally used: person-to-person communications and person-to-machine communications. The key elements of multimedia systems are presented in Fig. 3. As can be seen, both of these modes have a lot of commonality, as well as some differences.

In the person-to-person mode shown in Fig. 3(a), there is a user interface that provides the mechanisms for all users to interact with each other and a transport layer that moves the multimedia signal from one user location to some or all other user locations associated with the communications. The user interface creates the multimedia signal and allows users to interact with the multimedia signal in an easy-to-use manner. The transport layer preserves the quality of the multimedia signals so that all users receive what they perceive to be high-quality signals at each user location. Examples of applications for the person-to-person mode are teleconferencing, videophones, distance learning and shared workspace scenarios. In the person-to-machine mode, shown in Fig. 3(b), there is again a user interface for interacting with the machine, along with a transport layer for moving the multimedia signal from the storage location to the user, as well as a mechanism for storage and retrieval of multimedia signals that are either created by the user or

requested by the user. The storage and retrieval mechanisms involve browsing and searching to find existing multimedia data, as well as storage and archiving in order to move user-created multimedia data to the appropriate place for access by others. Examples of applications for the person-to-machine mode include the creation and access of business meeting notes, and access of broadcast video and document archives from a digital library or other repositories.


IV. USER AND NETWORK REQUIREMENTS

The user needs a multimedia communication system which prepares and presents the information of interest, allows for the dynamic control of applications and provides a natural interface. From a user’s point of view, the most important requirements of multimedia communications are:

Fast preparation and presentation of the different information types of interest, taking into account the capabilities of available terminals and services.

Dynamic control of multimedia applications with respect to connection interactions and quality on demand combined with user-friendly human/machine interfaces.

Intelligent support of users, taking into consideration their individual capabilities.


User requirements in terms of services are defined by the media, the transmission content, the type of communication, as well as the ability to combine the three. On the other hand, multimedia communication services can be classified as being local (interactive or non-interactive), remote non-interactive or remote interactive, and also as being for residential, business or mobile use. The context in which multimedia services can be used is shown in Table 1.

Service usage conditions can be defined by their use, place, independence and degree of urgency. Services can be for private or business use. The terminal and services are usually used in the office, the home, the car or a public place. Independence could be defined by the portability of the terminal and its independence of a given infrastructure as perceived by the user. The degree of independence varies from one type of terminal to another. On the other hand, the degree of urgency, from the user’s point of view, determines whether the service should be provided in real time or an off-line service is sufficient. A number of key requirements are common to the new multimedia services:

Instant availability

Real-time information transfer

Service always on line

Table 1. Context in which multimedia services can be used.

Residential use:
- Local: leisure (TV), the Arts, games
- Remote non-interactive: video on demand
- Remote interactive: enhanced telephones, videophones, home shopping, remote consultation

Business use:
- Local: multimedia presentation, training, database consultation
- Remote non-interactive: remote security, monitoring
- Remote interactive: video meeting, videoconferencing, distance learning, project management, contract negotiation, remote diagnostics

Users should be able to access their services from any terminal (mobile point of delivery)

Whereas traditional voice services already have these characteristics, data services over the Internet (including voice over data) have typically been limited to basic bit transport, with no service guarantees, no guaranteed availability, and rather frequent service interruptions. With new data services emerging, such as virtual private networks (VPNs) and interconnection services between two network service providers, priorities in the data networking domain have to change. In order to build robust multimedia networks, it is natural that operators will seek to base their data networks on the proven service delivery capability currently deployed in leading-edge voice networks. This will provide the flexibility, functionality and reliability required to meet the new demands of future users. It will also enable operators to offer, in the multimedia domain, the sophisticated services currently provided for voice. Multimedia applications have several requirements with respect to the service offered to them by the communication system. These requirements depend on the type of the application and on its usage scenario. For instance, a nonconversational application for the retrieval of audio-visual data has different needs than a conversational application for live audio-visual communication (i.e., a conferencing tool). The usage scenario influences the criticality of the demands. From the network point of view, the most important requirements of multimedia communications are:

High speed and changing bit rates

Several virtual connections over the same access

Synchronization of different information types

Suitable standardized services and supplementary services supporting multimedia applications

The requirements of applications regarding communication services can be divided into traffic and functional requirements. The traffic requirements include transmission bandwidth, delay and reliability. They depend on the kind, number and quality of the data streams used. The traffic requirements can be satisfied by the use of resource management mechanisms. These establish a relationship between transmitted data and resources, and ensure that the audio-visual data are transmitted in a timely manner. For this, during the transmission of data, the information about the resource needs must be available at all nodes participating in the distributed applications, i.e., end systems and centers. Hence, resources must be reserved and state must be created in these nodes, which basically means that a connection is established. The functional requirements are multicast transmission and the ability to define coordinated sets of unidirectional streams.

Current fixed and mobile networks are built on mature architectures with strong traffic management, configuration capabilities, service platforms and well-defined points of interconnection between the networks of different operators. A key requirement is that the same high-quality network services should exist when building integrated networking platforms for voice, data and multimedia services [11]. A future multimedia network must be organized to support heavy traffic flows, a wide variety of service mixes, and different traffic patterns, both in terms of routing the traffic efficiently and in terms of scaling for overload. The network must adapt quickly to constantly changing traffic conditions. Reliable security features and firewalls must be in place for interworking between the many operators that will be competing in the market as a result of deregulation.
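The reservation idea described above - resource information made available at every participating node, with state created along the way so that a connection effectively exists - can be sketched as a toy, all-or-nothing admission along a path. This is not a real reservation protocol; node names and capacities are invented for illustration.

```python
# Toy sketch of per-node bandwidth reservation along a path of end
# systems and network nodes, as described in the text.

class Node:
    def __init__(self, name, capacity_mbps):
        self.name = name
        self.capacity = capacity_mbps
        self.reserved = 0.0   # state created in this node by admitted connections

    def can_admit(self, demand):
        return self.reserved + demand <= self.capacity

    def reserve(self, demand):
        self.reserved += demand

def establish_connection(path, demand_mbps):
    """Reserve `demand_mbps` on every node of `path`, all-or-nothing."""
    if all(node.can_admit(demand_mbps) for node in path):
        for node in path:
            node.reserve(demand_mbps)
        return True
    return False  # some node lacks resources: the connection is refused

path = [Node("end-system-A", 100), Node("switch-1", 155), Node("end-system-B", 100)]
print(establish_connection(path, 40))   # admitted: state created in all nodes
print(establish_connection(path, 80))   # refused: 40 + 80 exceeds the 100 Mb/s ends
```

The key property is that reservation state lives in every node on the path, which is exactly why the text equates successful reservation with connection establishment.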


V. PACKET TRANSFER CONCEPT

Today’s fiber technology offers a transmission capability that can easily handle high bit rates such as those required for video transmission. This leads to the development of networks which integrate all types of information services. By basing such a network on packet switching, the services (video, voice and data) can be dealt with in a common format. Packet switching is more flexible than circuit switching in that it can emulate the latter, while vastly different bit rates can be multiplexed together. In addition, the network’s statistical multiplexing of variable-rate services may yield a higher utilization of the channel capacity than is obtainable with fixed capacity allocation. Many years ago, most of these arguments were verified in a number of projects [10,12,13,14]. Compared to circuit switching, packet switching offers dynamic allocation of bandwidth and switching resources, as well as the elimination of the channel structure. Packet networks allow integrated service transport: they can carry voice, video and data using the same hardware and protocols. Furthermore, packet communication does not require users to allocate a fixed channel or bandwidth before data transmission. Since users send packets only when necessary, and since many users can send packets over the same shared channel, resources in packet networks are utilized more efficiently than in circuit-switched networks.

Video signals are especially well suited for packet transmission. Images usually contain regions of high detail and low detail, as well as periods of rapid motion and little motion. Thus, effective video coders should be able to produce data streams with variable bit rates that change with local characteristics [15]. Packet networks can carry variable bit rate (VBR) signals directly: no buffering or rate control feedback is necessary at the transmitter. Asynchronous transfer mode (ATM) networks are based on virtual circuit switching: all fixed-size packets of a circuit follow the same route [16]. The task of packet video transfer over an asynchronous time-division multiplexed network or the Internet is to code and transfer the digital video signal under quality constraints, as shown in Fig. 4. In Internet protocol (IP) networks the packets are of variable length and there is no pre-established route, so they may arrive out of order at the destination. During transfers in ATM and IP networks, delay and some packet loss are unavoidable [18,19].
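The statistical-multiplexing argument can be made tangible with a small simulation. The sketch below is illustrative only: it assumes 50 independent VBR sources whose instantaneous rates are uniformly distributed between zero and a 6 Mbit/s peak, and compares the peak allocation a circuit-switched network would need against the capacity that covers 99.9% of sampled instants.

```python
# Statistical multiplexing gain of VBR sources versus peak-rate allocation.
import random

random.seed(1)

N = 50       # VBR video sources sharing one link (illustrative)
PEAK = 6.0   # Mbit/s peak rate per source; mean here is PEAK/2

# Circuit switching must allocate every source its peak rate.
circuit_capacity = N * PEAK

# Sample the instantaneous aggregate rate of the independent sources.
samples = sorted(
    sum(random.uniform(0, PEAK) for _ in range(N)) for _ in range(10_000)
)
# Capacity sufficient at 99.9% of the sampled instants (probabilistic guarantee).
statistical_capacity = samples[int(0.999 * len(samples))]

print(f"peak allocation:      {circuit_capacity:.0f} Mbit/s")
print(f"99.9th-pct aggregate: {statistical_capacity:.0f} Mbit/s")
print(f"multiplexing gain:    {circuit_capacity / statistical_capacity:.1f}x")
```

Because the sources rarely peak simultaneously, the aggregate stays well below the sum of peaks, which is the higher channel utilization the text attributes to statistical multiplexing.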











Figure 4. Digital video signal transport: the video coder performs transformation, data structuring, quantization, entropy coding, bit rate control and the addition of overhead (FEC); the network provides error detection, retransmission, error correction and erasure correction; the decoder at the user side performs resynchronization, dequantization, entropy decoding, loss detection, inverse transformation, loss concealment and postprocessing
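Two of the decoder-side functions listed in Figure 4, resynchronization and loss detection, can be sketched as a simple receive buffer that reorders packets by sequence number and reports gaps to the loss-concealment stage. This is a minimal illustration, not a real transport implementation; packet contents are dummy strings.

```python
# Receive-side resequencing and loss detection for packet video.

def resequence(received, expected_count):
    """Reorder (seq, payload) packets and detect missing sequence numbers."""
    by_seq = {seq: payload for seq, payload in received}
    stream, lost = [], []
    for seq in range(expected_count):
        if seq in by_seq:
            stream.append(by_seq[seq])
        else:
            lost.append(seq)      # handed to the loss-concealment stage
            stream.append(None)   # placeholder for concealed data
    return stream, lost

# Packets arrive out of order over an IP network, and one (seq 2) was dropped.
arrivals = [(1, "B"), (0, "A"), (3, "D"), (4, "E")]
stream, lost = resequence(arrivals, 5)
print(stream)  # ['A', 'B', None, 'D', 'E']
print(lost)    # [2]
```

In ATM networks cells of one virtual circuit arrive in order, so the reordering step is unnecessary there; over IP, both reordering and gap detection are needed, as the text notes.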

The generic functions of a network in packet transfer from source to user are routing and multiplexing. Routing provides connectivity and does not depend on the information type used in the transfer. Multiplexing determines much of the transfer quality in the network and is highly dependent on the traffic characteristics, the quality requirements and the users’ applications.

Statistical multiplexing with quality guarantees is the best choice for video transfer [20]. In order to offer probabilistic guarantees, a network must know its current flow of traffic, based on already accepted connections or on measurements of the actual network load. New connections are allowed if they can be guaranteed the quality they request and if their characteristics do not jeopardize the quality of already accepted connections.
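The admission rule just described can be sketched as a small connection-acceptance check: a new connection is admitted only while the load it would add stays under a utilization target that leaves headroom for the statistical guarantee. The 155 Mb/s capacity and the 0.9 target are illustrative assumptions, not values from the paper.

```python
# Sketch of declared-rate admission control with a utilization target.

class Link:
    def __init__(self, capacity_mbps, utilization_target=0.9):
        self.capacity = capacity_mbps
        self.admitted = []                  # declared mean rates of accepted connections
        self.target = utilization_target    # headroom for the probabilistic guarantee

    def admit(self, mean_rate_mbps):
        projected = sum(self.admitted) + mean_rate_mbps
        if projected <= self.target * self.capacity:
            self.admitted.append(mean_rate_mbps)
            return True
        return False  # would jeopardize already accepted connections

link = Link(155.0)  # e.g. one 155 Mb/s access link (assumed)
decisions = [link.admit(rate) for rate in [40, 40, 40, 40]]
print(decisions)    # the fourth request is refused: 160 > 0.9 * 155
```

A measurement-based variant would replace `sum(self.admitted)` with the measured network load, matching the alternative the text mentions.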

VI. MULTIMEDIA TERMINALS

Every major advance in networking has been preceded by an advance in the user interface that has precipitated the acceptance and growth of the networking advance. For example, the invention of the telephone preceded the growth of switched networks, the invention of the television preceded the growth of TV networks and cable television (CATV), the radio telephone led to the cellular network, the PC led to the LAN/WAN network, and the browser led to the growth of the Internet and World Wide Web (WWW). For multimedia, new smart terminals need to be created in order to facilitate the display, access, indexing, browsing and searching of multimedia content in a convenient and easy-to-use manner.

Multimedia itself denotes the integrated manipulation of at least some information represented as continuous media data, as well as some information encoded as discrete media data (text and graphics). Here we have the acts of capturing, processing, communicating, presenting and/or storing. Multimedia terminals are needed to retrieve, analyze, store and broadcast the new forms of written, sound and visual content. The architecture of these systems can be defined according to different approaches based on telecommunications, data processing and audiovisual technology. By incorporating voice, data as well as still and moving pictures into their communications, businesses gain increasingly sophisticated functions to improve access to distributed resources and save valuable time in the decision process. Remote dialogue, discussion, information production, maintenance and inspection are now possible with the new multimedia systems, and at operating costs that are continuing to fall. Existing solutions offer two types of terminal: multifunction office or computer workstations, and dedicated equipment such as enhanced telephone terminals, videophones or shared teleconferencing systems.
Multimedia communication requires powerful terminals: upgraded personal computers (PCs), desktop workstations or video computers. Today’s terminals are enhanced for broadband multimedia applications: for example, PCs by the addition of telecommunication and video/audio capabilities, and TV receivers by the addition of intelligence and interactivity. At the same time, high definition television (HDTV) is in development, leading the way towards all-digital TV. HDTV is a technology driver for memories, image/video processors and flat screens [20]. Multimedia terminal equipment also comprises suitable cameras, scanners, printers and mass storage. Special equipment is necessary for editing multimedia information, i.e., the creation, alteration and deletion of content and structures. Three-dimensional (3D) display devices and speech recognition systems will further facilitate faster and easier human interaction with multimedia applications or editors. PC and workstation architectures are considered for the interconnection of the system components, based on star configurations and using the ATM principle. This could make the integration of all information types easier and provide the necessary high bit rates. This concept supports the extension of a PC or workstation into an ATM-oriented desk area network, comprising cameras, printers and other special-purpose systems or subsystems offering interfaces to ATM networks.

VII. MULTIMEDIA COMMUNICATION STANDARDS

In a broad sense, multimedia is assumed to be a general framework for interaction with information available from different sources. With the digital revolution, it became possible to further exploit a well-known concept: the more that is known about the content, the better can be its representation, processing, etc., in terms of efficiency and allowed functionalities.

A multimedia standard is expected to provide support for a large number of applications. These applications translate into specific sets of requirements which may be very different from one another. One theme common to most applications is the need to support interactivity with different kinds of data. Communications mean standards, but the production of standards for multimedia communications is beset by the problem that the many industries having a stake in it have radically different approaches to standardization. Standards play a major role in the multimedia revolution because they provide interoperability between hardware and software provided by multiple vendors.

The success of the Moving Picture Experts Group (MPEG) [21] is based on a number of concurrent elements. MPEG appeared at a time when coding algorithms for audio and video were reaching asymptotic performance. By relying on the support, in terms of technical expertise, of all industries interested in digital audio and video applications, MPEG contributed to the practical acceptance of the audio-visual representation layer, independent of the delivery system. A last element of success has been the focus on the decoder instead of the traditional encoder-decoder approach. Therefore, MPEG could provide the standard solution to the major players who were considering the use of digital coding of audio and video for innovative mass-market products, and allow a faster achievement of a critical mass than would have been possible without it.

The different industries have been diverging, but multimedia communications necessarily need some convergence zone, which can only be achieved by standardization in key areas. Putting every stakeholder together and producing communication standards accepted by all is a big task. After the great success of the MPEG-1 and MPEG-2 standards, which opened the digital frontiers to audiovisual information and allowed the deployment of high-performance services, MPEG is striking again with the emerging MPEG-4 standard. The MPEG-4 standard is the acknowledgement by MPEG, the leading standardization body in audiovisual representation technology, that the data models underpinning MPEG-1 and MPEG-2 were limited and could not fulfill the new needs of emerging multimedia applications, such as hyperlinking, interaction, and natural and synthetic data integration. MPEG-4 is the answer to the requirements coming from the new ways in which audio-visual information is nowadays produced, delivered and consumed. To reach this target, MPEG-4 follows an object-based representation approach, where an audio-visual scene is coded as a composition of objects, natural as well as synthetic, providing the first powerful hybrid playground. The objective of MPEG-4 is thus to provide an audiovisual representation standard supporting new ways of communication, access and interaction with digital audiovisual data, and offering a common technical solution to various services. It also extends to layered coding (scalabilities), multiview (stereoscopic video), shape/texture/motion coding of objects, and animation. Its role extends to the Internet, Web TV, large databases (storage, retrieval and transmission), and mobile networks [22]. MPEG-4 Version 1 became an international standard in February 1999, and Version 2 in November 1999. Version 2, with extended functionalities, is backward compatible with Version 1.

Multimedia databases on the market today allow searching for pictures using characteristics such as color, texture, and information about the shape of objects in the picture. MPEG started a new work item to provide a solution to this problem. One of the members of the MPEG family, called "Multimedia Content Description Interface", is MPEG-7 [23, 24, 25]. It extends the limited current search capabilities to include more information types, such as video, image, audio, graphics and animation. In other words, MPEG-7 specifies a standardized description of various types of multimedia information. This description is associated with the content itself, to allow fast and efficient searching for multimedia that is of interest to users. The description can be attached to any kind of multimedia material, no matter what the format of the material is. Stored material that has this information attached to it can be indexed, searched, and retrieved.
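The central idea - descriptions attached to multimedia items, independent of the items' own coding format, making the material indexable and searchable - can be illustrated with a toy catalog. This sketch does not use the actual MPEG-7 description schemes or DDL; all field names and items are invented for illustration.

```python
# Toy catalog of multimedia items, each carrying an attached description.

catalog = [
    {"title": "match_highlights", "format": "mpeg2-video",
     "description": {"type": "video", "keywords": ["sport", "football"]}},
    {"title": "interview_audio", "format": "mp3-audio",
     "description": {"type": "audio", "keywords": ["news", "interview"]}},
    {"title": "stadium_photo", "format": "jpeg-image",
     "description": {"type": "image", "keywords": ["sport", "stadium"]}},
]

def search(catalog, media_type=None, keyword=None):
    """Search only the attached descriptions, never the coded content itself."""
    hits = []
    for item in catalog:
        desc = item["description"]
        if media_type and desc["type"] != media_type:
            continue
        if keyword and keyword not in desc["keywords"]:
            continue
        hits.append(item["title"])
    return hits

print(search(catalog, keyword="sport"))     # ['match_highlights', 'stadium_photo']
print(search(catalog, media_type="audio"))  # ['interview_audio']
```

Note that the search never inspects the `format` field: a query for "sport" finds both an MPEG-2 video and a JPEG image, which is exactly the format independence the text attributes to MPEG-7 descriptions.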

In the original ITU-T work plan, the goal was to define a "near-term" recommendation in 1996, followed by a "long-term" recommendation several years later. The near-term recommendation is referred to as H.263. The long-term recommendation H.26L (previously called H.263L) is scheduled for standardization in the year 2002 and may adopt a completely new compression algorithm. After H.263 was completed, it became apparent that incremental changes could be made to H.263 that would visibly improve its compression performance. Thus, it was decided in 1996 that a revision to H.263 would be created that incorporated these incremental improvements. This is H.263 "plus" with several new features; hence the name H.263+ (now called H.263 Version 2). H.263+ contains approximately 12 new features that do not exist in H.263. These include new coding modes that improve compression efficiency, support for scalable bit streams, several new features to support packet networks and error-prone environments, added functionality, and support for a variety of video formats [7].


The first standard developed by the group, nicknamed MPEG-1, was the coding of the combined audio-visual signal at a bit rate around 1.5 Mbit/s. This was motivated by the prospect, which was becoming apparent in 1988, of storing video signals on a compact disc with a quality comparable to VHS cassettes. In 1988, coding of video at such low bit rates had become possible thanks to decades of research in video coding algorithms. These algorithms, however, had to be applied to subsampled pictures - a single field from a frame and only half of the samples in a line - to show their effectiveness. Also, coding of audio, as separate from speech, allowed reduction to about 1/6 of the PCM bit rate, typically 256 kbit/s for a stereo source, with virtual transparency. Encoded audio and video streams, with the constraint of having a common time base, were combined into a single stream by the MPEG system layer. As previously indicated, MPEG-1, formally known as ISO/IEC 11172, is standardized in five parts. The first three parts are Systems, Video and Audio. Two more parts complete the suite of MPEG-1 standards: Conformance Testing, which specifies the methodology for verifying claims of conformance to the standard by manufacturers of equipment and producers of bitstreams, and Software Simulation, a full C-language implementation of the MPEG-1 standard (encoder and decoder) [26].


The MPEG-2 family of standards outlines the compression technologies and bit stream syntax that enable transmission of audio and video in broadband networks. These standards also describe the aspects needed to multiplex programs, enable clock synchronization and set up logical network links carrying video and audio content. MPEG-2 is, in many cases, associated only with video compression, which is certainly one of the most important parts of its functionality [27, 28, 29, 30]. In total, there are eight different parts of MPEG-2, covering the different aspects of digital video and audio delivery and representation [31]. Table 2 contains the different MPEG-2 parts [32].

Table 2 Parts of the MPEG-2 standards

ISO/IEC 13818 MPEG-2:

Systems

Video

Audio

Conformance Testing

Software Simulation

Digital Storage Media - Command and Control (DSM-CC)

Real-time Interface for System Decoders

DSM Reference Script Format

Basically, MPEG-2 can be seen as a superset of the MPEG-1 coding standard and was designed to be backward compatible with MPEG-1: every MPEG-2 compatible decoder can decode a valid MPEG-1 bit stream. Many video coding algorithms were integrated into a single syntax to meet diverse application requirements. New coding features were added by MPEG-2 to achieve sufficient functionality and quality; thus, prediction modes were developed to support efficient coding of interlaced video. In addition, scalable video coding extensions were introduced to provide additional functionalities, such as embedded coding of digital TV and HDTV, and graceful quality degradation in the presence of transmission errors. For comparison, typical MPEG-1 and MPEG-2 coding parameters are shown in Table 3 [33].

Table 3 MPEG-1 and MPEG-2 coding parameters

Parameter                    MPEG-1                                MPEG-2
Main application             Digital video on CD-ROM               Digital TV (and HDTV)
Spatial resolution           SIF format (1/4 TV), 288x360 pixels   TV, 576x720 (4xTV: 1152x1440)
Temporal resolution          25/30 frames/s                        50/60 fields/s (100/120 fields/s)
Bit rate                     1.5 Mbit/s                            4 Mbit/s (20 Mbit/s)
Quality                      Comparable to VHS                     Comparable to NTSC/PAL for TV
Compression ratio over PCM   -                                     -

However, implementation of the full syntax may not be practical for most applications. MPEG-2 has therefore introduced the concept of Profiles and Levels to stipulate conformance for equipment not supporting the full implementation. Profiles and Levels provide means for defining subsets of the syntax and thus the decoder capabilities required to decode a particular bit stream. As a general rule, each Profile defines a new set of algorithms added as a superset to the algorithms in the Profile below. A Level specifies the range of parameters supported by the implementation (e.g., image size, frame rate and bit rate). The MPEG-2 core algorithm at MAIN Profile features non-scalable coding of both progressive and interlaced video sources. It is expected that most MPEG-2 implementations will at least conform to the MAIN Profile at MAIN Level, which supports non-scalable coding of digital video with approximately digital TV parameters: a maximum sample density of 720 pixels per line and 576 lines per frame, a maximum frame rate of 30 frames per second and a maximum bit rate of 15 Mbit/s.
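As a quick illustration, the MAIN Profile @ MAIN Level limits quoted above can be expressed as a simple parameter check. This is a toy sketch: the dictionary of limits and the function name are illustrative inventions, and a real conformance check involves many more constraints (buffer sizes, chroma formats, and so on).

```python
# Toy conformance check for the MPEG-2 MAIN Profile @ MAIN Level limits
# quoted in the text (720 pixels/line, 576 lines/frame, 30 frames/s,
# 15 Mbit/s). Names and structure are illustrative, not from the standard.

MAIN_AT_MAIN = {
    "max_width": 720,            # samples per line
    "max_height": 576,           # lines per frame
    "max_frame_rate": 30,        # frames per second
    "max_bit_rate": 15_000_000,  # bits per second
}

def conforms_to_main_at_main(width, height, frame_rate, bit_rate):
    """Return True if the stream parameters fit within MAIN@MAIN."""
    limits = MAIN_AT_MAIN
    return (width <= limits["max_width"]
            and height <= limits["max_height"]
            and frame_rate <= limits["max_frame_rate"]
            and bit_rate <= limits["max_bit_rate"])

# Standard-definition PAL TV fits; 1152-line HDTV does not.
print(conforms_to_main_at_main(720, 576, 25, 6_000_000))     # True
print(conforms_to_main_at_main(1440, 1152, 25, 20_000_000))  # False
```

HDTV-class parameters fall outside MAIN@MAIN, which is why MPEG-2 defines higher Levels (e.g., HIGH Level) for such streams.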



MPEG-4 enables users to communicate audiovisual information that [34]:

is natural, synthetic, or both,

is real time and nonreal time,

supports different functionalities responding to the user's needs,

flows to and from different sources simultaneously,

does not require the user to bother with the specifics of the communications channel, but uses a technology that is aware of it,

gives users the possibility to interact with the different information elements,

lets the user present the results of his interaction with content in the way that suits his needs.

To reach its target, MPEG-4 follows an object-based representation approach where an audiovisual scene is coded as a composition of objects, natural as well as synthetic, providing the first powerful hybrid playground. Thus, the objective of MPEG-4 is to provide an audiovisual representation standard supporting new ways of communication, access, and interaction with digital audiovisual data, and offering a common technical solution to various service paradigms - telecommunications, broadcast, and interactive - among which the borders are disappearing. MPEG-4 will supply an answer to the emerging needs of application fields such as video on the Internet, multimedia broadcasting, content-based audiovisual database access, games, audiovisual home editing, advanced audiovisual communications, notably over mobile networks, teleshopping, and remote monitoring and control [35]. The fully backward compatible extensions under the title of MPEG-4 Version 2 were frozen at the end of 1999, to acquire formal International Standard status early in 2000. Some work on extensions in specific domains is still in progress. MPEG-4 builds on the proven success of three fields [36]:

Digital television,

Interactive graphics applications (synthetic content),

Interactive multimedia (World Wide Web, distribution of and access to content).


With the increasing use of multimedia communication systems, image compression requires higher performance and new features. JPEG-2000 is an emerging standard for still image compression. It is intended not only to provide rate-distortion and subjective image quality performance superior to existing standards, but also to provide functionality that current standards either cannot address efficiently or cannot address at all. The compression advantages of JPEG-2000 are a direct result of the inclusion in the standard of a number of advanced and attractive features, including progressive recovery, lossy/lossless compression, and region of interest capabilities. These features lay the foundation for JPEG-2000 to provide tremendous benefits to a range of industries. Some of the applications that will benefit directly from JPEG-2000 are: image archiving, the Internet, Web browsing, document imaging, digital photography, medical imaging, and remote sensing [37].



Some of the features supported by JPEG-2000 are:

Component precision: 1 to 127 bits/sample (signed or unsigned)

Each component may have different precision and subsampling factor

Lossy and lossless compression

Progressive recovery by fidelity or resolution

Error resilience

Region of interest coding

Random access to image in spatial domain

Image compression must not only reduce the necessary storage and bandwidth requirements, but also allow extraction for editing, processing and targeting particular devices and applications. JPEG-2000 allows extraction of different resolutions, pixel fidelities, regions of interest, components, and more, all from a single compressed bitstream. This allows an application to manipulate or transmit only the essential information for any target device from any JPEG-2000 compressed source image.

Some of the technology highlights for JPEG-2000 are:

Wavelet subband coding

Reversible integer-to-integer and nonreversible real-to-real wavelet transforms

Reversible integer-to-integer and nonreversible real-to-real multicomponent transforms

Bit-plane coding

Arithmetic coding

Code stream syntax similar to JPEG

File format syntax.
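The idea behind wavelet subband coding and the resolution scalability listed above can be sketched with a toy one-level Haar transform: the low-pass subband alone is a half-resolution version of the signal, while both subbands together reconstruct it exactly. This is only an illustration of the principle; JPEG-2000 itself uses the reversible 5/3 and irreversible 9/7 filters applied in two dimensions.

```python
# A toy one-level 1-D Haar transform illustrating how wavelet subband
# coding yields resolution scalability: the low-pass band alone is a
# half-resolution version of the signal. This is a sketch of the idea,
# not the JPEG-2000 transform.

def haar_forward(x):
    """Split x (even length) into low-pass and high-pass subbands."""
    low = [(x[2*i] + x[2*i+1]) / 2 for i in range(len(x) // 2)]
    high = [(x[2*i] - x[2*i+1]) / 2 for i in range(len(x) // 2)]
    return low, high

def haar_inverse(low, high):
    """Perfectly reconstruct the signal from both subbands."""
    x = []
    for l, h in zip(low, high):
        x.extend([l + h, l - h])
    return x

signal = [10, 12, 14, 18, 20, 16, 8, 6]
low, high = haar_forward(signal)
print(low)                      # half-resolution preview: [11.0, 16.0, 18.0, 7.0]
print(haar_inverse(low, high))  # exact reconstruction of the signal
```

Decoding only the low-pass band at each level is what lets a JPEG-2000 codestream serve thumbnails and full-resolution images from the same compressed source.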


MPEG-7, formally named "Multimedia Content Description Interface", is the standard that describes multimedia content so users can search, browse and retrieve the content more efficiently and effectively than they could by using existing, mainly text-based search engines [38]. It is a standard for describing the features of multimedia content. The words "features" or "descriptions" represent a rich concept that can be related to several levels of abstraction. Descriptions vary according to the types of data. Furthermore, different types of descriptions are necessary for different purposes of categorization. MPEG-7 will specify a standard set of descriptors that can be used to describe various types of multimedia information. Also, MPEG-7 will standardize ways to define other descriptors as well as structures for the descriptors and their relationships. This description will be associated with the content to allow fast and efficient searching for material of the user's interest. A language to specify description schemes, i.e. a description definition language (DDL), will be standardized, too. Audiovisual (AV) material that has MPEG-7 data associated with it can be indexed and searched for. This material includes:

still pictures, graphics, 3D models, audio, speech, video and information about how these elements are combined in a multimedia presentation. Special cases of these general data types may include facial expressions and personal characteristics [39].

There are people who want to use audiovisual information for various purposes. However, before the information can be used, it must be located. At the same time, the increasing availability of potentially interesting material makes this search more difficult. This challenging situation led to the need for a solution to the problem of quickly and efficiently searching for various types of multimedia material of interest to the user. The MPEG-7 standard aims to answer this need by providing this solution [40].

MPEG-7 is rather different from the other MPEG standards, because it does not define a way to represent data with the objective of reconstructing the data as faithfully as possible, as MPEG-1, MPEG-2 and MPEG-4 did. The increasingly pervasive role that audiovisual sources are destined to play in our lives, and the growing need to have these sources further processed, make it necessary to develop forms of audiovisual information representation that go beyond the simple waveform or sample-based, frame-based (such as MPEG-1 and MPEG-2) or even object-based (such as MPEG-4) representations. This necessitates forms of representation that allow some degree of interpretation of the information's meaning, which can be passed on to, or accessed by, a device or a computer code. The people active in defining the MPEG-7 standard represent broadcasters, equipment and chip manufacturers, digital content creators and managers, telecommunication service providers, publishers and intellectual property rights managers, as well as researchers.
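The kind of description-based retrieval MPEG-7 targets can be sketched with a toy example: each database item carries a precomputed descriptor (here, an invented 4-bin color histogram), and a query is answered by ranking items by descriptor distance. The descriptors, file names, values and distance measure below are illustrative inventions, not MPEG-7 normative tools.

```python
# A toy content-retrieval sketch in the spirit of MPEG-7: content is
# indexed by attached descriptors, and search operates on descriptors
# rather than on the media samples themselves.

def l1_distance(a, b):
    """Sum of absolute bin differences between two histograms."""
    return sum(abs(x - y) for x, y in zip(a, b))

# Hypothetical database: item name -> 4-bin color histogram descriptor.
database = {
    "sunset.jpg": [0.70, 0.20, 0.05, 0.05],  # mostly warm tones
    "forest.jpg": [0.05, 0.15, 0.70, 0.10],  # mostly green
    "ocean.jpg":  [0.05, 0.10, 0.15, 0.70],  # mostly blue
}

def search(query_descriptor, db):
    """Return item names ranked by descriptor similarity to the query."""
    return sorted(db, key=lambda name: l1_distance(db[name], query_descriptor))

# A warm-toned query image retrieves the warm-toned database item first.
print(search([0.65, 0.25, 0.05, 0.05], database)[0])  # sunset.jpg
```

The point of standardizing the descriptors (rather than the search engine) is exactly this separation: any engine that understands the descriptor format can index and rank the content.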


The aims of starting MPEG-21 are:

to understand if and how various components fit together,

to discuss which new standards may be required if gaps in the infrastructure exist, and,

once the above two points have been reached, to actually accomplish the integration of the different standards.

The digital market place, which is founded upon ubiquitous international communication networks such as the Internet, replaces existing business models for trading physical goods with new models for distributing and trading digital content electronically. In this new market place, it is becoming increasingly difficult to separate the different intellectual property rights which are associated with multimedia content.

The latest MPEG project, MPEG-21 Multimedia Framework, has been started with the goal of enabling transparent and augmented use of multimedia resources across a wide range of networks and devices.

The basic elements of the framework are:

Digital Items: structured digital objects with a standard representation, identification and metadata within the MPEG-21 framework.

Users: all entities that interact in the MPEG-21 environment or make use of MPEG-21 Digital Items.

The meaning of User in MPEG-21 is very broad and is by no means restricted to the end user. Therefore, an MPEG-21 User can be anybody who creates, provides, archives, rates, enhances, delivers, aggregates, syndicates, sells, consumes, subscribes to, or regulates content, or who facilitates or regulates transactions that occur from any of the above.


MPEG has identified seven key technologies that are needed to achieve the MPEG-21 goals. They are [40]:

Digital Item Declaration: a uniform and flexible abstraction and interoperable schema for declaring Digital Items

Content Representation: how the data is represented as different media.

Digital Item Identification and Description: a framework for identification and description of any entity regardless of its nature, type or granularity.

Content Management and Usage: the provision of interfaces and protocols that enable creation, manipulation, search, access, storage, delivery, and (re)use of content across the content distribution and consumption value chain.

Intellectual Property Management and Protection: the means to enable content to be persistently and reliably managed and protected across a wide range of networks and devices.

Terminals and Networks: the ability to provide interoperable and transparent access to content across networks and terminal installations.

Event Reporting: the metrics and interfaces that enable Users to understand precisely the performance of all reportable events within the framework.








To carry out the necessary tasks, MPEG has identified the following method of work. First, it is necessary to define a framework supporting the vision. This is being done by drafting a Technical Report (TR) that describes the complete scope of the multimedia framework and identifies the critical technologies of the framework. The TR will help understand how the components of the framework are related and identify the goals that are not currently filled by existing standardized technologies. The next step is the involvement of other relevant bodies in this effort. This is necessary because some technologies needed for MPEG-21 are not MPEG specific and are better dealt with by other bodies. For each of the technologies that is not yet available, MPEG will either

develop it itself, if MPEG has the necessary expertise, or

work with the other relevant bodies that do.

MPEG-21 intends to address the traditional audiovisual content representation issues, but with one major difference. Content can no longer be seen as "essence" (what the user has traditionally consumed), or as "metadata" (the description of the essence), but as an integrated "whole".

A second area of the MPEG-21 standard is explained by the consideration that the way the user interacts with this complex world is difficult to separate from the way the user acquires the right to access the content. MPEG-21 will therefore also identify the interfaces with content access in such a way that content protection is transparent to the end user.


For multimedia communication, there are two major standards organizations: the International Telecommunication Union - Telecommunication Standardization Sector (ITU-T) and the International Organization for Standardization (ISO). For example, recent video coding standards defined by these two organizations are summarized in Table 4. These standards differ mainly in their operating bit rates, due to the applications they were originally designed for, although all standards can essentially be used for all applications at a wide range of bit rates. In terms of coding algorithms, all standards follow a similar framework [41].

Table 4 Video coding standards

Standards organization   Video coding standard       Typical range of bit rates   Typical applications
ITU-T                    H.261                       px64 kbits/s, p=1,2,…,30     ISDN video phone
ISO                      MPEG-1 Video (IS 11172-2)   1.2 Mbits/s                  Video on CD-ROM
ISO                      MPEG-2 Video (IS 13818-2)   4-80 Mbits/s                 Digital TV (and HDTV)
ITU-T                    H.263                       64 kbits/s or below          PSTN video phone
ISO                      MPEG-4 Video (CD 14496-2)   24-1024 kbits/s              Video on Internet and mobile networks
ITU-T                    H.263 Version 2             < 64 kbits/s                 PSTN video phone
ITU-T                    H.26L                       < 64 kbits/s                 -

Most video compression and coding standards were developed with a specific application and networking infrastructure in mind. For example, ITU-T Recommendation H.261 was optimized for use with interactive audiovisual communication equipment, e.g., a videophone [42], and in conjunction with the H.320 series of recommendations as multiplex and control protocols on top of ISDN [43]. Consequently, the H.261 designers made various design choices that limit the applicability of H.261 to this particular environment. The original H.263 was developed for video compression at rates below 64 kbits per second. It was the first international standard for video compression that would permit video communications at such low rates [44]. After H.263 was completed, it became apparent there were incremental changes that could be made to H.263 that would visibly improve its compression performance. It was thus decided in 1996 that a revision to H.263 would be created which incorporated these incremental improvements. ITU-T Recommendation H.263 Version 2 (abbreviated as H.263+) is the very first international standard in the area of video coding which is specifically designed to support the full range of both circuit-switched and packet-switched networks [45, 46]. H.263+ contains functionalities that improve the quality of video transmission in error-prone environments and nonguaranteed quality of service (QoS) networks. H.26L is an ongoing standardization activity that is searching for advanced coding techniques that can be fundamentally different from H.263.


The Internet Engineering Task Force (IETF) is a loosely self-organized group of people who make technical and other contributions to the engineering and evolution of the Internet and its technologies. It is open to any interested individual. The actual technical work of the Internet is mostly done inside the IETF: it is the principal body engaged in the development of new Internet Standard specifications, although it is not itself a part of the Internet Society.

The Internet Society (ISOC) was officially formed in January 1992. It was formed by a number of people with long-term involvement in the IETF, in order to provide an institutional home and financial support for the Internet standardization process. Today, the Internet Society is a non-profit, non-governmental, international, professional membership society with more than 100 organizational and 6000 individual members in over 100 countries. It provides leadership in addressing issues that confront the future of the Internet, and is the organizational home for the groups responsible for Internet infrastructure standards, including the IETF and the Internet Architecture Board (IAB).

ISOC aims to assure the beneficial, open evolution of the global Internet and its related interworking technologies through leadership in standards, issues, and education. The Society's individual and organizational members are bound by a common stake in maintaining the viability and global scaling of the Internet. The Society is governed by a Board of Trustees elected by its membership around the world, and the Board is responsible for approving appointments to the IAB from among the nominees submitted by an IETF nominating committee.

The IAB is the technical advisory group of the Internet Society. It is chartered to provide oversight of the architecture of the Internet and its protocols, and to serve, in the context of the Internet Standards process, as a body of final appeal. The IAB is responsible for approving appointments to the Internet Engineering Steering Group (IESG) from among the nominees submitted by the IETF nominating committee.

The Internet Engineering Steering Group (IESG) is responsible for technical management of IETF activities and the Internet Standards process. As part of the Internet Society, it administers the Internet Standards process according to the established rules and procedures. The IESG is directly responsible for the actions associated with entry and movement along the "standards track", including final approval of specifications as Internet Standards. The IESG is composed of the IETF Area Directors (ADs) and the chairperson of the IETF, who also serves as the chairperson of the IESG. Representative of the increasingly larger span of the Internet is the fact that the IESG has established formal liaisons with the ATM Forum and the ITU-T.

The IETF is composed of individual Working Groups (WG), which are organized by topic into several Areas, each of which is coordinated by one or more Area Directors. These are the members of the IESG. Nominations to the IAB and the IESG are made by nominating committee members.

At present the IETF is organized into the following areas:

Applications Area - issues related to applications, other than security and networks.

General Area - internal IETF organizational issues.

Internet Area - improvements on the TCP/IP protocols for increased usage and versatility.

Operations and Management Area - concerned with management and operational control of the Internet.

Routing Area - Internet routing protocol issues.

Security Area - provide support for security across all areas.

Transport Area - handles the transport of different payloads over IP, and the IP transport by other media.

User Services Area - a forum for people interested in all levels of user services and in the quality of information available to users of the Internet.

Each area is further divided into working groups, ranging in number from a couple to several dozen.

The Internet standardization process is managed by the IESG. The existence of interoperable running implementations is the key requirement for advancement of the process. There are two paths that a document may take in order to become an RFC. The first path is through the IETF [47].

The very first step, however, is for a document to become an Internet Draft, so that it may be distributed, read and commented on. These Drafts, as well as all IETF documents, should be focused, handling only a few points of doubt. If required, a subject can be separated into different components, and each treated separately in a different WG. If it is required, a working group can be created very quickly, after an initial session in order to assess its interest. When created, a WG has a very well defined charter, and publishes its goals and milestones. There is no formal voting process inside the WG, and the results are achieved by consensus, often after discussing results of different demonstrations.

Working groups are loosely coordinated through their ADs, besides whatever mutual interests their participants may share. Most of the work is done by volunteers, and the IETF policy of accepting only working implementations for final standards makes the final approval of a particular WG extremely dependent on its real utility to the overall Internet community. Thus, Internet standards are always de facto standards, although their widespread usage in the Internet may vary strongly. The whole structure is based on the active participation and interest of its volunteers, regardless of their individual motivations; it is an extremely fluid process when confronted with more traditional telecommunications standards.


MULTIMEDIA TRANSPORT OVER NETWORKS

The increase in communication of multimedia information over the past decades has resulted in many new multimedia processing and communication systems being put into service. The growing availability of optical fiber links and rapid progress in VLSI circuits and systems have fostered a tremendous interest in developing sophisticated multimedia services at an acceptable cost. Today's fiber technology offers a transmission capacity that can easily handle high bit rates. This leads to the development of networks which integrate all types of information services [48]. By basing such a network on packet switching, the services (video, voice and data) can be dealt with in a common format. Packet switching is more flexible than circuit switching in that it can emulate the latter, while vastly different bit rates can be multiplexed together. In addition, the network's statistical multiplexing of variable rate sources may yield a higher utilization than fixed capacity allocation [49, 50].


In comparison to circuit-switched networks, packet switching offers several potential advantages in terms of performance. One advantage is efficient utilization of channel capacity, particularly for "bursty" traffic. Although not as bursty as interactive data, speech exhibits some burstiness in the form of talkspurts [51]. Average talkspurt duration depends on the sensitivity of the speech detector, but it is well known that individual speakers are active only about 35-45 percent of the time in typical telephone conversations. By sending voice packets only during talkspurts, packet switching offers a natural way to multiplex voice calls as well as voice with data. Another advantage is that call blocking can be a function of the required average bandwidth rather than the required peak bandwidth. In addition, packet switching is flexible. For example, packet voice is capable of supporting point-to-multipoint connections and priority traffic. Furthermore, since packets are processed in the network, network capabilities in traffic control, accounting and security are enhanced. However, packet voice is not without difficulties. Continuous speech of acceptable quality must be reconstructed from voice packets that experience variable delays through the network. The reconstruction process involves compensating for the variable delay component by imposing an additional delay. Hence, packets should be delivered with low average delay and delay variability.

Speech can tolerate a certain amount of distortion (e.g., compression, clipping) but is sensitive to end-to-end delay. The exact amount of maximum tolerable delay is subject to debate; it is generally accepted to be in the range of 100-600 ms. For example, the public telephone network has a maximum specification of 600 ms. In order to minimize packetization and storage delays, it has been proposed that voice packets should be relatively short, on the order of 200-700 bits, and generally contain less than 10-50 ms of speech [52, 53]. Network protocols should be simplified to shorten voice packet headers (e.g., on the order of 4-8 bytes), although timestamps and sequence numbers are likely needed. Since a certain amount of distortion is tolerable, error detection, acknowledgements and retransmissions are unnecessary in networks with low error rates. Flow control can be exercised end-to-end by blocking calls. In addition, network switches can possibly discard packets under heavy traffic conditions. In this case, embedded coding has been proposed, whereby speech quality degrades gracefully with the loss of information [54].
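The packet sizes quoted above translate directly into packetization delay: a packet cannot be sent until it is filled, so a packet of N bits at a codec rate of R bit/s buffers N/R seconds of speech. A minimal sketch, assuming 64 kbit/s PCM as an illustrative codec rate:

```python
# Packetization delay for a voice packet: the time needed to fill a
# packet of a given size at a given codec bit rate. The 64 kbit/s rate
# below (standard telephony PCM) is chosen for illustration.

def packetization_delay_ms(packet_bits, codec_rate_bps):
    """Milliseconds of speech carried by (and buffered for) one packet."""
    return 1000.0 * packet_bits / codec_rate_bps

for bits in (200, 400, 700):
    print(bits, "bits ->", packetization_delay_ms(bits, 64_000), "ms")
# 200 bits -> 3.125 ms; 700 bits -> 10.9375 ms. At lower codec rates the
# same packet sizes hold proportionally more speech, which is why the
# text bounds packet content at 10-50 ms.
```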

Packet voice is shown in Fig. 5. It can be seen that the packets are generated at regular intervals during talkspurts at the packet voice transmitter (PVT). The reconstruction process at the packet voice receiver (PVR) must compensate for the variable delay component by adding a controlled delay before playing out each packet. This delay is constrained by some value D_max, such that at most a specified maximum percentage of packets may be "lost", i.e., miss their playout. In addition to buffering voice packets, it might be desirable for the PVR to attempt to detect lost packets and recover their information [55].

There are two basic approaches to the reconstruction process. In the null timing information (NTI) scheme, reconstruction does not use timing information (i.e., timestamps) to determine packet delays through the network. The PVR adds a fixed delay D to the first packet of each talkspurt, as shown in Fig. 6 [56].

If D_0 is the transit delay of the first packet through the network and D_g is the packet generation time (assumed to be constant), then the total delay of the first packet from entry into the network to playout is

D_t = D_0 + D.

Subsequent packets in the talkspurt are played out at intervals of D_g after the first packet. Therefore, sequence numbers are required to indicate the relative positions of packets in the talkspurt. If a packet is not present at the PVR at its playout time, it is considered "lost". The choice of D involves a tradeoff. Increasing D reduces the percentage of lost packets but increases total end-to-end delays and the size of the queue at the PVR. D cannot be too large, due to the constraint from D_max, nor too small, due to the resulting packet loss probability P_loss. Since D_0 is random, the silence intervals between talkspurts are not reconstructed accurately.
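The NTI playout rule described above can be sketched in a few lines: the first packet of a talkspurt is played out a fixed delay D after it arrives, packet k is played k·D_g later, and any packet arriving after its playout instant is lost. The arrival times and parameter values below are invented for illustration.

```python
# Minimal simulation of NTI playout: first packet played D ms after its
# arrival, packet k played k*Dg ms later; a packet that has not arrived
# by its playout instant is "lost". All times in milliseconds; the
# network delays are made-up example values.

def nti_lost_packets(arrivals, D, Dg):
    """Return indices of packets that miss their playout instant."""
    playout_first = arrivals[0] + D
    return [k for k, a in enumerate(arrivals)
            if a > playout_first + k * Dg]

# Packets generated every 20 ms; the last one suffers a large delay.
arrivals = [40, 65, 95, 110, 155]          # arrival times of packets 0..4
print(nti_lost_packets(arrivals, D=30, Dg=20))  # [4]: packet 4 is too late
```

Increasing D in this sketch pushes every playout instant later, rescuing late packets at the cost of extra end-to-end delay, which is exactly the tradeoff described in the text.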






Figure 5. Packet voice

Reconstruction of silences in the null timing information (NTI) scheme is shown in Fig. 7. Let d and d' denote the values of D_0 for the talkspurts preceding and following a silence interval s. Suppose that d and d' are identically distributed with variance σ² and have some positive correlation r. Then the error in the length of the reconstructed silence is

ε = d' - d



If delay variability can be significant, a more elaborate reconstruction is necessary. In the complete timing information (CTI) approach, the reconstruction process uses full timing information in the form of timestamps to accurately determine each packet's delay through the network, denoted D_v. The PVR adds a controlled delay D_r, so that the total entry-to-playout delay D_t is

D_t = D_v + D_r
Figure 6. Null timing information (NTI) reconstruction

The silence reconstruction error ε has the variance

var(ε) = 2σ²(1 - r),

which is directly proportional to the variance of packet delays. Evidently, the NTI scheme would be adequate only if a small delay variance could be guaranteed.
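Since d and d' are identically distributed with variance σ² and correlation r, the variance of the silence error d' - d equals 2σ²(1 - r), a standard result for the difference of two correlated, identically distributed variables. This can be checked numerically with a small Monte Carlo sketch; the correlated-pair construction and all parameter values below are illustrative.

```python
# Numerical check that var(d' - d) = 2*sigma^2*(1 - r) for identically
# distributed d, d' with variance sigma^2 and correlation r. Correlated
# normal pairs are built as d = s*z0, d' = s*(r*z0 + sqrt(1-r^2)*z1),
# with z0, z1 independent standard normals.

import math
import random

def silence_error_variance(sigma, r, n=200_000, seed=1):
    """Sample variance of d' - d over n correlated delay pairs."""
    rng = random.Random(seed)
    errors = []
    for _ in range(n):
        z0, z1 = rng.gauss(0, 1), rng.gauss(0, 1)
        d = sigma * z0
        d_prime = sigma * (r * z0 + math.sqrt(1 - r * r) * z1)
        errors.append(d_prime - d)
    mean = sum(errors) / n
    return sum((e - mean) ** 2 for e in errors) / n

sigma, r = 10.0, 0.5
print(silence_error_variance(sigma, r))  # close to the theoretical value
print(2 * sigma * sigma * (1 - r))       # 100.0
```

Note how positive correlation r shrinks the error variance: if consecutive talkspurts see similar network delays, NTI silence reconstruction is more accurate.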

Since the scheme depends on the first packet of each talkspurt, the loss of a first packet might cause confusion at the PVR.

Figure 7. Reconstruction of silence in the null timing information (NTI) scheme

is as uniform as possible for all packets. In addition to

time-stamps, sequence numbers are also desirable for

detecting lost packets.
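The CTI computation can be sketched in a few lines, assuming each packet carries an entry timestamp and that sender and receiver clocks are synchronized (a simplifying assumption); the function name is illustrative.

```python
# Sketch of complete timing information (CTI) playout: each packet's
# network delay D_v is computed from its timestamp, and the PVR adds a
# controlled delay D_r so that D_t = D_v + D_r is constant for all packets.

def cti_playout(entry_ts, arrival_ts, d_t):
    """Return (playout_time, added_delay D_r), or None if the packet
    arrived after its playout deadline and must be treated as lost."""
    d_v = arrival_ts - entry_ts   # network delay, from the timestamp
    d_r = d_t - d_v               # controlled delay added by the PVR
    if d_r < 0:                   # arrived later than entry + D_t
        return None
    return entry_ts + d_t, d_r
```

A packet entering at t = 0 and arriving at t = 30 with a target D_t = 100 is buffered for D_r = 70 and played at t = 100; a packet delayed by more than D_t yields None, i.e., it is discarded as late.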


The economies and flexibility of integrated networks make them very attractive, and packet network architectures have the potential for realizing these advantages. However, the effective integration of speech and other signals such as graphics, image and video into an integrated packet network (IPN) can rearrange network design priorities. Although processing speeds will continue to increase, it will also be necessary to minimize the nodal per-packet processing requirements imposed by the network design. Data signals must generally be received error-free in order to be useful. The inherent structure of speech and image signals and the way in which they are perceived allows for some loss of information without significant quality impairment. This presents the possibility of purposely discarding limited information to achieve some other goal, such as the control of temporary congestion. One of the goals in an integrated packet network is to construct a model which considers the entire IPN (transmitters, packet multiplexers and receivers) as a system to be optimized for higher speeds and capabilities [57]. In order to simplify the processing at network nodes, more complex processing at the network edges can be allowed. The transmitter forms packets from coded speech segments. The transmitter subsystem is shown in Fig. 8. The transmitter first classifies speech segments according to models of the speech production process (voiced sounds, fricatives and plosives).



Figure 8. Transmitter subsystem

This model-based classification is used to remove redundancy during coding, to assign delivery priorities and to regenerate discarded speech packets. After classification, the transmitter removes redundancy from the speech using a coding algorithm based on the determined model. For example, voiced sounds (vowels) could be coded with a block-oriented pitch prediction coder. After coding, the transmitter assigns a delivery priority to each packet based on the quality of regeneration possible at the receiver. In forming packets from speech segments, the delivery priority would be included in the network portion of the packet header. The classification and any coding parameters would be included in the end-to-end portion of the header. Multiplexers exist at each outgoing link of each network node as well as at each multiplexed network access point. The packet multiplexer subsystem with arriving packets discarded is shown in Fig. 9. Here, λ is the effective arrival rate, while µ represents the effective service rate. Each packet multiplexer monitors local overload and discards packets according to packet delivery priority (read from the network portion of the packet header) and a locally determined measure of the overload level. It is assumed that arriving packets are discarded; it is also possible to discard already-queued packets. In addition, if error checking is performed by the nodes, any packet (data or speech) found to have an error is discarded.
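The discard policy described above can be sketched as a small queueing model. The class, thresholds, and two-level priority test are illustrative assumptions; the point is only that the per-packet work at the node reduces to a header read and a comparison.

```python
from collections import deque

# Sketch of a packet multiplexer that discards arriving packets by
# delivery priority under local overload (names are illustrative).

class Multiplexer:
    def __init__(self, capacity, overload_threshold):
        self.queue = deque()
        self.capacity = capacity              # maximum queued packets
        self.threshold = overload_threshold   # queue length signalling overload

    def arrive(self, packet):
        """packet = (priority, payload); lower number = higher priority.
        Returns True if queued, False if discarded."""
        overloaded = len(self.queue) >= self.threshold
        if len(self.queue) >= self.capacity:
            return False                      # buffer full: discard
        if overloaded and packet[0] > 1:      # overload: shed low priority
            return False
        self.queue.append(packet)
        return True
```

With a threshold of 2, low-priority packets are shed as soon as the queue reaches that level, while high-priority packets are still admitted, which is exactly the overload-control behaviour the text attributes to the multiplexer.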

The receiver decodes the samples in speech packets delivered to it based on the classification and coding parameters contained in the end-to-end header. It also determines the appropriate time to play them out. The receiver subsystem is shown in Fig. 10. The receiver synchronization approach requires only packet sequence numbers: global synchronization is administratively difficult, and relative timestamps would have to be modified at each packet multiplexer, requiring additional per-packet processing [56]. Potential speech detector impairments, such as clipping, are eliminated whenever the network is not overloaded. Even during periods of considerable overload, the received quality may be better if at least a few "background noise" packets are delivered and then used to regenerate noise which is similar in character to the actual noise. If a packet is lost for any reason (for example, discarded by the network because of overload or errors, or excessively delayed in the network), the receiver must first detect the loss by inspecting the sequence numbers of those packets which have been received. It must further determine the class of each lost packet, so that the appropriate regeneration model can be applied using previous header and sample history [57].

Figure 9. Packet multiplexer subsystem with arriving packet discarded



Figure 10. Receiver subsystem

In summary, the advantages gained by taking a total system approach to an integrated packet network are as follows:

A powerful overload control mechanism is provided.

The structure of speech is effectively exploited.

Extremely simple per-packet processing for overload control is allowed.

Only one packet per speech segment is required.

Receiver speech synchronization is simplified.

Reduced per-packet error processing at packet multiplexers is possible.


Asynchronous transfer of video, often referred to as "packet video", can be defined as the transfer of video signals over asynchronous time-division multiplexed (ATDM) networks, such as IP and ATM. The video may be transferred for instantaneous viewing or for storage and replay at a later time. The former case imposes requirements on pacing so that the received video data can be displayed in a perceptually continuous sequence. The latter case can be seen as a large data transfer with no inherent time constraints. In addition to the requirement on pacing, there may also be bounds on the maximal transfer delay from camera to monitor if the video is part of an interactive conversation or conference. These limits are set by human perception and determine when the delay starts to disturb the information exchange. Parts of the signal may be lost or corrupted by errors during the transfer. This will reduce the quality of the reconstructed video and, if the degradation is serious enough, it may cause the viewer to reject the service. Thus, the general topic of packet video is to code and asynchronously transfer video signals under quality constraints.

The asynchronous transfer mode combines the circuit-switched routing of telephony networks with the asynchronous multiplexing of packet switching. This is accomplished by establishing a connection (fixed route) through the network before accepting any traffic. The information is then sent in 53-octet cells. The switches route cells according to address information contained in each cell's 5-octet header. Traffic on a particular link consists of randomly interleaved cells belonging to different calls. The network guarantees that all cells of a call follow the same route and, hence, are delivered in the same order as sent. The intention is that ATM networks should be able to guarantee the quality of service in terms of cell loss and maximum delay, as well as maximum delay variation [58].
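The fixed cell format described above (53 octets = 5-octet header + 48-octet payload) can be illustrated with a toy segmenter. The header here is only a placeholder carrying a connection identifier; it does not reproduce the real ATM header layout (GFC/VPI/VCI/PT/CLP/HEC fields).

```python
# Sketch of segmenting a payload into 53-octet ATM cells:
# 5-octet header (placeholder) + 48-octet payload per cell.

CELL_SIZE, HEADER_SIZE = 53, 5
PAYLOAD_SIZE = CELL_SIZE - HEADER_SIZE   # 48 octets

def to_cells(conn_id, data):
    # Simplified 5-octet header: 4-byte connection id + 1 spare byte.
    header = conn_id.to_bytes(4, "big") + b"\x00"
    cells = []
    for i in range(0, len(data), PAYLOAD_SIZE):
        chunk = data[i:i + PAYLOAD_SIZE]
        chunk = chunk.ljust(PAYLOAD_SIZE, b"\x00")   # pad the last cell
        cells.append(header + chunk)
    return cells
```

Because every cell on a link is the same size and carries its connection identifier in the header, a switch can interleave cells from different calls and still route each one by a simple header lookup, which is what the text means by randomly interleaved cells on a link.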

The Internet protocol differs in two major respects from ATM: there is no pre-established route, and the packets are of variable length (up to 65535 octets). IP does not give any guarantees on the delivery of packets, and they may even arrive out of order if the routing decision is changed during the session. These issues are addressed by the introduction of IPng in conjunction with the resource reservation protocol RSVP. In IPng, often called Internet Protocol version 6, packets contain a 24-bit flow identifier in addition to the source and destination addresses, which can be used in routers for operations like scheduling and buffer management to provide service guarantees. Delay and some loss are inevitable during transfers across both ATM and IP networks. The delay is chiefly caused by propagation and queuing. The queuing delay depends on the dynamic load variations on the links and must be equalized before video can be reconstructed. Bit errors can occur in the optics and electronics of the physical layer through thermal and impulsive noise. Loss of information is mainly caused by multiplexing overload of such magnitude and duration that buffers in the nodes overflow.


As previously stated, multimedia denotes the integrated manipulation of at least some information represented as continuous media data, as well as some information encoded as discrete media data (text and graphics). Multimedia communication deals with the transfer, protocols, services, and mechanisms of discrete media data and continuous media data (audio, video) over digital networks. Such communication requires that all involved components be capable of handling a well-defined quality of service (QoS). The most important QoS parameters are the required capacities of the involved resources, compliance with end-to-end delay and jitter as timing restrictions, and restriction of the loss characteristics. A protocol designed to reserve capacity for continuous media data, transmitted in conjunction with discrete media data over, for example, an asynchronous transfer mode local area network (ATM-LAN), is certainly a multimedia communication issue [59]. The success of ATM for multimedia communications depends on the successful standardization of its signaling mechanisms, its ability to attract the development of native ATM applications, and the integration of ATM with other communications systems. The integration of ATM into the Internet world is under investigation. If there are ATM applications such as video on demand, then there is also the need for a "side-by-side" integration of ATM and Internet protocols. The success of wireless ATM (WATM) relies on the success of ATM/BISDN in wired networks; when ATM networks become a standard in the wired area, the success of WATM can follow.


Due to the success of ATM on wired networks, wireless ATM (WATM) has become the direct result of the ATM "anywhere" movement. WATM can be viewed as a solution for next-generation personal communication networks, or a wireless extension of BISDN networks. There has been a great deal of interest recently in the area of wireless networking. Issues such as bit error rates and cell loss rates are even more important when transmitting video over a wireless network. A very high performance wireless local area network operating in the 60 GHz millimeter-wave band can experience cell loss rates of 10^-4 to 10^-2. To provide adequate picture quality to the user, some form of error correction or concealment must be employed. One option is to use the MPEG-2 error resilience techniques and to modify the MPEG-2 standard slightly when it is used over wireless ATM networks. This technique is known as macroblock resynchronization [60]. In macroblock resynchronization, the first macroblock in every ATM cell is coded absolutely rather than differentially. This allows resynchronization of the video stream much more often than would be possible if resynchronization could only take place at the slice level. It would be relatively simple to incorporate this method with the existing MPEG-2 coding standard by adding an interworking adapter at the boundary between the fixed and wireless networks [61]. A second proposal for improving error resilience in wireless networks is to use forward error correction (FEC) methods. In addition, improved performance can be achieved by using a two-layer scalable MPEG-2 coding scheme rather than one layer [62].
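The FEC proposal can be illustrated with the simplest possible channel code: one XOR parity packet per group, which recovers any single lost packet in the group. Real wireless systems use stronger codes (e.g., Reed-Solomon); this sketch only shows the recovery principle.

```python
# Sketch of XOR-parity forward error correction across a group of
# equal-length packets: one parity packet recovers one lost packet.

def xor_bytes(a, b):
    return bytes(x ^ y for x, y in zip(a, b))

def make_parity(packets):
    """Parity packet = XOR of all packets in the group."""
    parity = packets[0]
    for p in packets[1:]:
        parity = xor_bytes(parity, p)
    return parity

def recover(received, parity):
    """received: list with exactly one None marking the lost packet.
    XORing the parity with all surviving packets restores the lost one."""
    lost = received.index(None)
    acc = parity
    for i, p in enumerate(received):
        if i != lost:
            acc = xor_bytes(acc, p)
    return lost, acc
```

The cost is one extra packet per group (the low redundancy the text refers to); the benefit is that an isolated cell loss in the 10^-4 to 10^-2 range can be repaired without retransmission.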


Multimedia has become a major theme in today's information technology that merges the practices of communications, computing, and information processing into an interdisciplinary field. In this Internet era, IP-based data networks have emerged as the most important infrastructure, reaching millions of people anytime, anywhere. They serve as an enabling technology that creates a whole new class of applications to enhance productivity, reduce costs, and increase business agility. Anticipating that multimedia over IP will be one of the major driving forces behind the emerging broadband communications of the 21st century, we address the challenges facing the delivery of multimedia applications over IP in a cost-effective, ubiquitous, and quality-guaranteed manner.


The problem of sending video over IP has essentially two main components: video data compression and the design of communication protocols, as illustrated in Fig. 11 [63].



Figure 11. Structure of a video streaming system

One approach consists of designing a low bit rate coder, protecting the resulting bit stream with channel codes, and using one of the standard Internet transport protocols to transmit the resulting data stream. If the source bit rate is low enough and the channel is not too congested, then it is possible to use TCP, in which case no errors occur and therefore there is no need for channel codes. Otherwise, the user datagram protocol (UDP) is used with a constant packet injection rate, and low-redundancy channel codes are used to protect against infrequent packet losses.
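The transport decision just described can be sketched as a simple rule. The function name, the congestion flag, and the 10% redundancy figure are illustrative assumptions, not values taken from the text.

```python
# Sketch of the transport choice for video over IP: TCP when the source
# rate fits comfortably and the channel is uncongested, otherwise UDP at
# a constant injection rate plus low-redundancy channel coding.

def choose_transport(source_kbps, available_kbps, congested):
    """Return (protocol, redundancy_fraction)."""
    if source_kbps <= available_kbps and not congested:
        return ("TCP", 0.0)    # reliable delivery, no channel codes needed
    redundancy = 0.1           # assumed ~10% parity overhead over UDP
    return ("UDP", redundancy)
```

The design point is that TCP retransmissions make channel codes redundant but add delay, while UDP keeps the injection rate constant and repairs the occasional loss with the parity overhead instead.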


The Internet with all its applications is changing the way we work, live and spend our time. However, today the Internet is facing a major problem: growing demand for access has produced bottlenecks and traffic jams, which are slowing the Internet down. In an attempt to overcome these restrictions, the demand for access has pushed the technology of traditional telephony to new and innovative heights with the emergence of Asymmetric Digital Subscriber Line (ADSL) technology. High-speed ADSL eliminates the bottleneck, giving all subscribers quick and reliable access to Internet content. Telecom service providers have yet to realize the full potential of ADSL. Traditional telephone and Internet services are only the beginning, while the ability to offer broadcast video services is already a reality. Cable TV operators are beginning to offer voice and data services, and there is increasing competition from Competitive Local Exchange Carriers (CLEC) and other carriers, making it imperative that traditional telecom service providers offer video services. By offering a range of services, established service providers can generate additional revenue and protect their installed base. Direct Broadcast Satellite (DBS) providers, particularly in Europe and Asia, are offering a compelling Multichannel Video Program Distribution (MVPD) service [64].

A key factor contributing to the successful deployment of Asymmetric Digital Subscriber Line (ADSL) access systems has been the facility for overlaying data services on top of the existing voice service without interfering with it. For the user, this basically offers:

Always-on service capability. There is no need to dial up, as the Internet Protocol (IP) connection is always available, matching the office networking model in which network resources are available all the time.

Virtual second voice line. Unlike when the user is connected through a modem, the voice line remains available for incoming and outgoing calls.

For the operator, the service overlay allows ADSL to be installed throughout the network, irrespective of what types of narrowband switches are installed. After the initial success of ADSL, it became apparent that it could be used to offer multiple phone lines together with a greater range of services (e.g., Virtual Private Networks, VPN) targeted at specific markets. This has been made possible by the high bandwidth of ADSL, backed up by progress in voice compression, echo canceling and digital signal processing technologies. ADSL offers a high data bandwidth, of which a portion can be used to offer additional voice services integrated with the data services. Symmetric Digital Subscriber Line (DSL) techniques, such as Single-pair High-Speed DSL (SHDSL), cannot be deployed as an overlay to existing analog telephone services, so the delivery of voice and data services using a single facility requires voice to be carried directly on the DSL link. The techniques used to transport voice and data in an integrated way over DSL, whether ADSL or SHDSL, are referred to as Voice over DSL (VoDSL).

With VoDSL, two main market segments are of interest to service providers. The first is small to medium-sized businesses, a significant percentage of which need to be able to send and receive data at around 500 kbit/s. The voice needs of these customers are typically met by 4 to 12 outgoing phone lines. Using, for example, Adaptive Differential Pulse Code Modulation (ADPCM) voice coding, at peak times these phone lines consume only 128 to 256 kbit/s of the ADSL bandwidth, which is typically in excess of 2 Mbit/s downstream and more than 500 kbit/s upstream. The second market interested in VoDSL services is residential users, who will appreciate the extra two to four voice lines that VoDSL offers [65].
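The bandwidth figures above can be checked with a line of arithmetic. Assuming 32 kbit/s per ADPCM-coded line (an assumption consistent with the quoted numbers: 4 lines give 128 kbit/s, 8 lines give 256 kbit/s), the voice overlay leaves most of the downstream capacity for data.

```python
# Worked arithmetic for the VoDSL example: ADPCM voice at an assumed
# 32 kbit/s per line, over an ADSL link with a given downstream rate.

ADPCM_KBPS = 32  # assumed per-line ADPCM rate

def voice_bandwidth(lines):
    """Peak voice bandwidth in kbit/s for the given number of lines."""
    return lines * ADPCM_KBPS

def remaining_downstream(total_kbps, lines):
    """Downstream capacity left for data once all lines are active."""
    return total_kbps - voice_bandwidth(lines)
```

On a 2 Mbit/s downstream link, even eight simultaneously active voice lines consume only 256 kbit/s, leaving over 1.7 Mbit/s for data, which is why the overlay is attractive to small businesses.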

ADSL will be delivering multimedia services to millions of users. The transmission of digital multimedia data requires the existing systems to be augmented with functions that can handle more than ordinary data. In addition, the high volume of multimedia data can be handled efficiently only if all available system services are carefully optimized.


Asymmetric digital subscriber line (ADSL) offers asymmetric rates of data transfer to and from the Internet. The uplink rates can go up to 768 kbit/s, and downlink rates are 6-8 Mbit/s, depending on the length and condition of the local loop - the wiring between the customer's premises and the telco central office.

Cable companies bring analog TV signals over optical fibers to their neighborhood distribution points, the head-ends, whence the signals are distributed to residences by coaxial cables. The combination of fiber and coaxial cable, which can carry high-speed data as well as TV signals, is known as hybrid fiber coax (HFC). Each distribution point typically serves 200-500 residences. The extent of the network of a cable TV operator is measured in terms of homes passed - that is, the number of homes adjacent to which the operator's cable passes, regardless of whether those homes have been signed up as customers. Realistically, cable modems are capable of passing data upstream at speeds of 200 kbit/s to 2 Mbit/s, and downstream at speeds up to about 10 Mbit/s.

Cable modems, capable of operating at higher speeds than ADSL, have some serious drawbacks. The cable link to a residence is shared among many users, so that if many of them decide to log onto the Internet at the same time, achievable communications speeds may plunge. Because the lines are shared, a hacker may be able to eavesdrop on a neighbor's connection to the Internet or to an intranet - a security problem that may be serious for some users. Furthermore, a customer who happens to be a road warrior will be unable to get access to the Internet at airports or hotels through his laptop computer at his usual data rate; if he is able to connect at all, it will be through a dial-up modem at a much lower speed.
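The sharing effect described above is easy to quantify. As a sketch (the numbers are illustrative, not from the text), the downstream capacity of one distribution point is divided among the homes actively using it at the same moment.

```python
# Sketch of shared cable capacity: the per-home rate is the downstream
# capacity of the distribution point divided by the active homes.

def per_home_kbps(downstream_kbps, active_homes):
    """Approximate fair-share rate per simultaneously active home."""
    if active_homes == 0:
        return downstream_kbps
    return downstream_kbps / active_homes
```

With a 10 Mbit/s downstream channel and 50 of the 200-500 homes passed active at once, each home sees only about 200 kbit/s, which is why achievable speeds "plunge" at busy hours while an unshared ADSL loop does not degrade this way.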

The most valuable benefit of wireless services is that they make access possible for people who are on the move. They are also attractive in certain cases where the user is stationary. Digital cellular telephones are quickly becoming the main communications tool for people on the move. While they are good for retrieving e-mail and checking stock quotes, the present state of the technology does not permit multimedia communications. Multimedia communications are available, but at present only to stationary users. Satellite broadcasts, for example, allow fast download of Internet content with a return path (that is, the uplink from the user's computer to an Internet service provider) over a dial-up modem. For rural users, it is possible to use local multipoint distribution services (LMDS). This option uses millimeter-wave radio at frequencies of about 30 GHz. A typical installation has a central base station with an omnidirectional antenna serving many residences, each of which has a directional dish aimed at the base station.


The explosion of technological advancements and the success of the second-generation digital cellular systems (e.g., Global System for Mobile communications - GSM and Personal Digital Cellular - PDC) have established wireless communications as indispensable in modern life. Because of the low-cost and low-consumption characteristics of emerging wireless products targeted at low to medium bit rate services, these products are expected to play an important role in wireless communications in the next few years. Wireless multimedia communications (e.g., audiovisual telephony and videoconferencing) require medium to high bit rate channels (64 kbit/s to 2 Mbit/s per user). Therefore, for these applications, it will be necessary to have broadband wireless networks that support bit rates in excess of 2 Mbit/s per radio channel, where each radio channel could be shared by multiple users or sessions. In addition, these services have to be provided with some quality of service (QoS) guarantees over their respective, error-prone wireless connections. In order to achieve these goals, one has to address the following key issues:

how to increase the capacity of wireless channels

how to provide QoS in a cost-effective way

how to combat the wireless channel impairments.

In addition to the continuing interest in the wireless audiovisual communication applications mentioned above, a great deal of interest has emerged in higher-end wireless multimedia services. However, current wireless networks, which are primarily low bit rate narrowband systems targeted at voice or data, are inadequate for supporting audiovisual communication applications or high-end multimedia services.

Most multimedia services tend to be real-time in nature, i.e., the data being transported need to reach the destination by a certain time in order to be useful. This implies the need to develop techniques for call admission, bandwidth allocation and the handling of real-time variable rate streams. These are problems that apply to wired networks as well and are therefore not unique to wireless multimedia communication systems.

The two major protocol-related problems in wireless multimedia concern medium access and quality of service. Wireless systems are inherently multiple-access in nature and therefore need to have a reliable medium access control (MAC) layer that also supports QoS [66].

Audio, video and graphics need to be compressed before transport over a bandwidth-constrained wireless channel. Given the emphasis on mobile wireless systems in the past, the media element that has received the most attention in the context of wireless multimedia is speech [67]. This is natural, because the most widely deployed wireless multimedia system today is cellular telephony, which is a fairly limited-bandwidth system. There has also been a great deal of interest in wireless video, given the increased bandwidth capabilities of universal mobile telecommunications systems (UMTS). The two video compression standards that are most relevant to these systems are MPEG-4 and H.263, both of which have been evaluated for use in GSM systems. Because of the unreliable nature of wireless networks, it has become important to build source coding schemes that are robust to channel errors. Scalable compression schemes that offer graceful degradation with loss of data have become popular.

Audio and graphics are two source elements that have not received extensive research in the context of wireless systems. There has, however, been some work on handwriting coding [66].

Even with scalable and multiple-description-based source coding schemes, there will still be lost data on wireless systems. Error recovery and concealment at the receiver is therefore an important topic and has received some attention, again primarily for video. These error concealment techniques rely to a large extent on knowing the underlying source compression technique and exploiting some of the tools that are used therein [68].

Most of the wireless systems today also support mobility. The velocity associated with mobility has been one of the key parameters that affect system design. For this reason, many of the approaches to solving channel-related problems associated with mobility have been developed for specific classes of mobile systems - pedestrian (velocities of a few meters/sec), vehicular (velocities of about 100 kilometers/hour), and high-speed (velocities of hundreds of kilometers/hour). Mobility also affects routing and addressing, which have received a significant amount of attention.

Depending on the application, there are two distinct approaches to the development of wireless broadband communication systems (WBCS): the wireless local area network (WLAN) and the mobile broadband system (MBS). Although the core network dilemma between IP and ATM for broadband multimedia services is still going strong, almost all WBCS technology demonstrations are based on ATM technology. ATM as a broadband infrastructure has been designed for multimedia communications to accommodate a variety of data rates, QoS requirements, and connection-oriented and connectionless paradigms. It is quite natural to assume a combination of wireless and ATM-based services at the consumer end of a wired network. In order to deliver multimedia traffic over broadband wireless networks, we need to have sufficient bandwidth and be able to support service-specific QoS requirements concerning delay, delay variation, and packet loss on a per-connection basis.


The radio physical layer is essentially the soul of any wireless network. Ideally, one wants to find a radio physical layer technology that is spectrum efficient, minimizes the radio overhead and is robust in both indoor and outdoor environments. Because of various channel impairments, it is very hard to get an optimal radio physical layer.

The wireless broadband air interface will demand a relatively large frequency band to support bit rates in excess of 2 Mbit/s. This type of allocation is hard to find below 3 GHz; bandwidth becomes more readily available at higher frequencies, but at the cost of complex and expensive techniques. Because at higher frequencies the path loss is greater, line-of-sight operation becomes important and wall penetration becomes a challenge for WLANs.
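The frequency-dependence of path loss can be made concrete with the standard free-space (Friis) formula, FSPL(dB) = 20·log10(d_km) + 20·log10(f_MHz) + 32.44; this is a textbook model, not one given in the text, and real indoor channels add further losses.

```python
import math

# Free-space path loss sketch, illustrating why higher carrier
# frequencies suffer greater path loss at the same distance.

def fspl_db(distance_km, freq_mhz):
    """Friis free-space path loss in dB (d in km, f in MHz)."""
    return 20 * math.log10(distance_km) + 20 * math.log10(freq_mhz) + 32.44
```

Moving the carrier from 3 GHz to 30 GHz at the same distance multiplies the frequency by 10 and therefore adds 20·log10(10) = 20 dB of path loss, a factor of 100 in power, which is why line-of-sight operation and wall penetration become the dominant concerns for millimeter-wave WLANs.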


Research on telecommunication networks is focused on post-ISDN architectures and capabilities such as the integrated packet network and broadband ISDN. The economics and flexibility of integrated networks make them very attractive, and packet network architectures have the potential for realizing these advantages. However, the effective integration of speech and other signals such as graphics, image and video into an integrated packet network (IPN) can rearrange network design priorities. Although processing speeds will continue to increase, it will also be necessary to minimize the nodal per-packet processing requirements imposed by the network design. This is a motivation for new switching concepts like fast packet switching and ATM. Data signals must generally be received error-free in order to be useful, but the inherent structure of speech and image signals and the way in which they are perceived allows for some loss of information without significant quality impairment. This presents the possibility of purposely discarding limited information to achieve some other goal, such as the control of temporary congestion.

Multimedia communication standards have to rely on compromises between what is theoretically possible and what is technologically feasible. Standards can only be successful in the marketplace if the cost-performance ratio is well balanced. This is specifically true in the field of audio/video coding, where a large variety of innovative coding algorithms exist but may be too complex for implementation. Streaming video over the Internet faces many technological as well as business challenges, and new codecs, protocols, players and subsystems are being developed to address them. Since its introduction in the early 1990s, the concept of streaming media has experienced dramatic growth and transformation from a novel technology into one of the mainstream ways in which people experience the Internet today.
The concept of streaming media came at a time when basic multimedia technologies had already established themselves on desktop PCs. Streaming media is a technology that enables the user to experience a multimedia presentation on-the-fly while it is being downloaded from the Internet. The provision of bandwidth on demand with strict QoS guarantees is a fundamental property of ATM networks that makes them especially suitable for carrying real-time multimedia traffic. Statistical multiplexing of VBR connections within the backbone network allows effective aggregation and capacity engineering. Anticipating that multimedia over IP will be one of the major driving forces behind the emerging broadband communications, addressing the challenges facing the delivery of multimedia applications over IP is of great importance.

In order for the Internet to allow applications to request network packet delivery characteristics according to their needs, sources are expected to declare the offered traffic characteristics. Admission control rules have to be applied to ensure that requests are accepted only if sufficient network resources are available. Moreover, service-specific policing actions have to be employed within the network to ensure that nonconforming data flows do not affect the QoS commitments for already active data flows. One generic framework that addresses both the video coding and networking challenges associated with Internet video is scalability. Any scalable Internet video coding solution has to enable a very simple and flexible streaming framework. The fine-grained scalable framework strikes a good balance between coding efficiency and scalability while maintaining a very flexible and simple video coding structure.

With the advent of common use of the Internet, the demands for real-time and low-rate voice over IP applications are growing rapidly. Since the delivery of packets is not guaranteed in IP networks, it is necessary to deal with the audible artifacts caused by bursts of packet losses. Packet loss seriously degrades the speech quality of analysis-by-synthesis coders, since the lost parameters not only affect the current speech frame but also produce the so-called error propagation problem resulting from corrupted filter memory. This packet loss problem can be addressed by using different model parameters.

Digital Subscriber Line (DSL) technology offers unprecedented scalability for interactive video services. It is the basis for the point-to-point architecture that is the key to providing a combination of interactive video and broadcast services. The implementation of video services is a high priority for telecom providers. Delivering voice services over digital subscriber lines offers a lucrative opportunity for both established and emerging service providers. Data broadcasting in support of multimedia applications requires efficient use of bandwidth resources in order to maximize the availability of playout content.
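The admission-control rule mentioned above (accept a request only if sufficient resources remain for it, so that already active flows keep their QoS commitments) can be sketched in a few lines; the class and the single-resource bandwidth model are illustrative simplifications.

```python
# Sketch of admission control on a single link: a new flow is admitted
# only if its declared rate fits within the remaining link capacity.

class AdmissionController:
    def __init__(self, link_kbps):
        self.free_kbps = link_kbps   # capacity not yet committed to flows

    def request(self, declared_kbps):
        """Admit the flow and reserve its declared rate, or reject it."""
        if declared_kbps <= self.free_kbps:
            self.free_kbps -= declared_kbps
            return True
        return False
```

Rejecting the over-committing request protects the flows already admitted; policing (not shown) would additionally verify that each admitted flow actually stays within its declared rate.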
From the data delivery point of view, the digital TV infrastructure provides a broadband digital distribution network, data transport protocols and digital terminals on the user premises.


[1] H.T.Mouftah, “Multimedia communications: An overview”, IEEE Comm. Magazine, vol.30, pp.18-19, May 1992.

[2] R.V.Cox et al., “On the applications of multimedia processing to communications”, Proc. of the IEEE, vol.86, pp.755-824, May 1998.

[3] J.Rosenberg et al., “Multimedia communications for users”, IEEE Comm. Magazine, vol.30, pp.20-36, May 1992.

[4] Z.S.Bojkovic, “Multimedia communication system: modeling, standardization, requirements”, in Proc. Int. Conference on multimedia technology and digital telecommunication services, ICOMPT’96, pp.5-13, Budapest, Hungary, Oct. 1996.

[5] L.C.Wolf, C.Griwodz and R.Steinmetz, “Multimedia communication”, Proc. of the IEEE, vol.85, pp.1915-1933, Dec. 1997.

[6] Z.S.Bojkovic, “Image decomposition and compression in digital multimedia systems”, in Proc. IX Int. Conference on signal processing applications and technology, ICSPAT’95, pp.940-944, Boston, MA, USA, Oct. 1995.

[7] K.R.Rao and Z.S.Bojkovic, Packet video communications over ATM networks, Prentice-Hall PTR, Upper Saddle River, NJ, 2000.

[8] K.R.Rao, Z.S.Bojkovic and D.A.Milovanovic, Multimedia communication systems: techniques, standards and networks, Prentice-Hall PTR, Upper Saddle River, NJ, 2002.


[9] A.Hoshi et al., “An integrated multimedia desktop communication and collaboration platform for broadband ISDN”, Proc. IEEE Multimedia’92, pp.28-37, April 1992.

[10] A.Watabe et al., “Distributed desktop conferencing system with multimedia interface”, IEEE J. Selected Areas in Comm., vol.9, pp.531-539, May 1991.

[11] E.Fouques, “Switching and routing: providing value with multimedia services”, Alcatel Telecomm. Review, pp.198-204, 3rd Quarter 1999.

[12] A.A.Lazar et al, “MAGNET: Columbia’s integrated network testbed”, IEEE J. Selected Areas in Comm., vol.11, pp.859-871, Nov. 1993.

[13] P.Gonet, “Fast packet approach to integrated broadband networks”, Networks, vol.9, pp.292-298, Dec. 1986.

[14] I.Cidon and I.S.Gopal, “PARIS: An approach to integrated high-speed private networks”, Int. J. Digital Analog Cable Syst., vol.1, pp.77-85, 1988.

[15] Z.S.Bojkovic, “Some issues in packet video: modeling, coding and compression”, 2nd Int. Workshop on Image and Signal Processing: Theory, methodology, systems and applications, pp.2-23, Budapest, Hungary, Nov.
[16] Z.S.Bojkovic, “Recent trends in packet video transmission on ATM networks”, in Proc. Basis of Electronics Workshop ’94, pp.17-105, Cluj, Romania, Oct. 1994.

[17] G.Karlsson, Asynchronous transfer of video, Swedish Institute of Computer Science (SICS), Research Report R95:14, Kista, Sweden, 1997.

[18] G.Karlsson, “ATM adaptation for video”, in Proc. of 6th Int. Workshop on Packet Video, pp.E.3.1-5, Portland, OR.
[19] P.White and J.Crowcroft, “The integrated services in the Internet: State of the art”, Proc. of the IEEE, vol.85, pp.1934-1946, Dec. 1997.

[20] Z.S.Bojkovic, “Digital HDTV system compression”, Journal on Communications, vol.45, pp.2-10, May- June 1994.

[21] K.R.Rao and J.J.Hwang, Techniques and standards for image, video and audio coding, Upper Saddle River, NJ: Prentice Hall PTR, 1996.

[22] T.Sikora, "MPEG digital video-coding standards", IEEE Signal Processing Magazine, vol.14, pp.82-100, Sept. 1997.
[23] MPEG Requirements Group, MPEG-7: Context and objectives, ISO/IEC SC29/WG11 Doc.N1678, MPEG Bristol Meeting, April 1997.

[24] F.Pereira, First proposal for MPEG-7 visual requirements, ISO/IEC SC29/WG11 Doc.M1941, MPEG Bristol Meeting, April 1997.

[25] S.F.Chang, "Content-based indexing and retrieval of visual information", IEEE Signal Processing Magazine, vol.14, pp.45-48, July 1997.

[26] ISO/IEC IS 11172 (MPEG-1), Information technology - Coding of moving pictures and associated audio for digital storage media up to about 1.5 Mbit/s, 1993.

[27] B.L.Tseng et al., IEEE Trans. CSVT, vol.6, pp.414-419, Aug. 1996.

[28] T.Homma, MPEG contribution: Report of the ad hoc group on MPEG-2 applications for multi-view point pictures, ISO/IEC SC29/WG11 Doc. 861, March 1995.

[29] A.Puri, "Video coding using the MPEG-2 compression standard", SPIE/VCIP, Boston, MA, pp.1701-1713, Nov. 1993.

[30] J.Ni, T.Yang and D.H.K.Tsang, "CBR transportation on VBR MPEG-2 video traffic for video-on-demand in ATM networks", Proc. IEEE ICC, pp.1391-1395, Dallas, TX, June 1996.

[31] M.Orzessek and P.Sommer, ATM and MPEG-2 integration of digital video into broadband networks, Prentice Hall PTR, Upper Saddle River, NJ, 1998.

[32] ISO/IEC IS 13818-1, Generic coding of moving pictures and associated audio, Part 1: System, 1995.

[33] ISO/IEC IS 13818-2, Generic coding of moving pictures and associated audio, Part 2: Video, 1995.

[34] L.Chiariglione, "MPEG and multimedia communications", IEEE Trans. CSVT, vol.7, pp.5-17, Feb. 1997.

[35] F.Pereira, "Tutorial issue on the MPEG-4 standard", Signal Processing: Image Communication, vol.15, pp.269-270, Jan. 2000.

[36] ISO/IEC JTC1/SC29/WG11, MPEG-4 Overview v18.0, Singapore, March 2001.

[37] A.N.Skodras, C.A.Christopoulos and T.Ebrahimi, "JPEG2000: the upcoming still image compression standard", Proc. of the 11th Portuguese Conference on Pattern Recognition, pp.359-366, Porto, Portugal, May 2000.
[38] R.Koenen and F.Pereira, "MPEG-7: A standardized description of audiovisual content", Signal Processing: Image Communication, vol.16, pp.5-13, Sept. 2000.

[39] ISO/IEC JTC1/SC29/WG11, Doc.N3752, Overview of the MPEG-7 standard, La Baule, Oct. 2000.

[40] ISO/IEC N4041, MPEG-21 Overview, Singapore, March 2001.
[41] S.Okubo, et al., "ITU-T standardization of audiovisual communication systems in ATM and LAN environments", IEEE J. Selected Areas in Comm., vol.15, pp.965-982, Aug. 1997.

[42] ITU-T Recommendation H.261, Video codec for audiovisual services at p × 64 kbit/s, March 1993.

[43] ITU-T Recommendation H.320, Narrow-band visual telephone systems and terminal equipment, March.
[44] ITU-T Recommendation H.263, Video coding for low bit rate communication, March 1996.

[45] ITU-T Draft Recommendation H.263 Version 2, H.263+: Video coding for low bitrate communication, Sept. 1997.
[46] ITU-LBC-97-094, ITU-T LBC experts group, Nice, France, Feb. 1997.

[47] S.Bradner, Ed., The Internet standards process - Revision 3, RFC2026, Oct. 1996.

[48] N.Jayant, "High quality networking of audio-visual information", IEEE Comm. Magazine, vol.31, pp.84-95, Sept. 1993.

[49] I.Cidon and I.S.Gopal, "PARIS: An approach to integrated high-speed private networks", Int. J. Digital Analog Cable Syst., vol.1, pp.77-85, Jan.-March 1988.

[50] D.Minoli, "Optimal packet length for packet voice communication", IEEE Trans. Commun., vol.COM-27, 1979.
[51] P.Brady, "A model for generating on-off patterns in two- way communications", Bell Syst. Tech. J., vol.48, pp.2445-2472, Sept. 1969.

[52] M.Listanti and F.Villani, "An X.25 compatible protocol for packet voice communications", Comput. Commun., vol.6, pp.23-31, Feb. 1983.

[53] B.Gold, "Digital speech networks", Proc. of the IEEE, vol.65, pp.1630-1658, Dec. 1977.

[54] T.Bially et al., "A technique for adaptive voice flow control in integrated packet networks", IEEE Trans. Commun., vol.COM-28, pp.325-333, March 1980.

[55] T.S.Chen, J.Walrand and D.G.Messerschmitt, "Dynamic priority protocols for packet voice", IEEE J. Selected Areas in Commun., vol.7, pp.632-643, June 1989.

[56] W.A.Montgomery, "Techniques for packet voice synchronization", IEEE J. Selected Areas in Commun., vol.1, pp.1022-1028, Dec. 1983.

[57] D.W.Petr, L.A.DaSilva and V.S.Frost, "Priority discarding of speech in integrated packet networks", IEEE J. Selected Areas in Commun., vol.7, pp.644-656, June 1989.
[58] G.Karlsson, Asynchronous transfer of video, Swedish Institute of Computer Science (SICS), Research Report R95:14, Kista, Sweden, 1997.

[59] M.Orzessek and P.Sommer, ATM and MPEG-2: Integrating digital video into broadband networks, Upper Saddle River, NJ: Prentice Hall PTR, 1998.

[60] Y.Zhang et al., “MPEG-2 video services for wireless ATM networks”, IEEE J. Selected Areas in Comm., vol.15, pp.119-128, Jan. 1997.

[61] E.Ayanoglu et al., “Performance improvement in broadband networks using forward error correction for lost packet recovery”, Journal of High-Speed Networks, vol.1, pp.287-303, June 1993.

[62] E.Ayanoglu et al., “Forward error control for MPEG-2 video transport in a wireless ATM LAN”, Proc. IEEE ICIP, vol.2, pp.833-836, Lausanne, Switzerland, 1996.

[63] S.D.Servetto and K.Nahrstedt, “Broadcast quality video over IP”, IEEE Trans. Multimedia, vol.3, pp.162-173, March 2001.

[64] P.Merriman, “Video over DSL architecture”, Alcatel Telecommunications Review, pp.250-257, 4th Quarter 2000.

[65] M.Verhoeyen, “Delivering voice services over DSL”, Alcatel Telecommunications Review, pp.244-249, 4th Quarter 2000.

[66] D.J.Goodman and D.Raychaudhuri, Eds., Mobile multimedia communications, Plenum, New York.

[67] L.Hanzo, “Bandwidth efficient wireless multimedia communications”, Proc. of the IEEE, vol.86, pp.1342-1382, July 1998.

[68] Y.Wang and Q.F.Zhu, “Error control and concealment for video communication: a review”, Proc. of the IEEE, vol.86, pp.974-997, May 1998.

[69] K.Pahlavan et al., “Wideband local access: wireless LAN and wireless ATM”, IEEE Commun. Magazine, vol.35, pp.34-40, Nov. 1997.

[70] J.Mikkonen et al., “Emerging wireless broadband networks”, IEEE Commun. Magazine, vol.36, pp.112-117, Feb. 1998.

Acknowledgement: This paper was supported by the Ministry for Science and Technology, Republic of Serbia.