The increasing demand on high quality digital video broadcasting systems has driven the development of several coding standards, MPEG-4 being the most recent. As these standards only define the bit-stream syntax, such essential features of the coder as controlling the output bit-rate have to be implemented individually.
An important problem arising here is the optimization of the bit-rate-distortion trade-off, since in general the resulting bit-stream has to meet the requirements of a communication channel with limited bandwidth.
The new video compression standard for multimedia applications MPEG-4 (currently being developed by the ISO) poses new problems regarding the control of the encoding process, due to its object-oriented nature. In this paradigm the scene is encoded in terms of multiplexed streams, each of which conveys information of an object, corresponding to a visual entity considered as a semantic unit. Important facts about those objects are that they have a time-varying arbitrary shape, and the frame rate can vary between them.
Here is given an excerpt of the whole project papers to give an idea about theirs contents.
Gzip'ed postscript version of report.For some years a demand exists to realise multimedia communication. This demand led to institute committees and groups for developing needed coding standards. One of these is the MPEG with its successful development of the multi-industry MPEG-1 and MPEG-2 standards.
MPEG-4, the most recent standardisation project of MPEG (Motion Pictures Expert Group), combines some of the typical features of other MPEG standards with new ones, to meet the still increasing demand on high quality digital video broadcasting systems. Its main aim is to provide a coding scheme for small bandwidth channels, as used for video conferencing, picture-phone, and HDTV. Traditional coding technics are not able to target such low bit-rates, provided that a minimum quality criterion is pinpointed. The necessary improvements are only possible if the coding process takes into account the typical characteristics of natural picture sequences: Several objects exist inside the picture scene, which contain a shape and which can move independent from each other.
The first step of an object-oriented video coding is the segmentation of the picture scene in, the content of the scene conformed, sensible areas. The logical components of the pictures then are not any longer pixels or rectangular blocks, but arbitrarily shaped picture-segments.
All MPEG standards only define the syntax of the encoded bit-stream to guarantee a univocal decoding. Therefore, essential features of the coder, such as controlling the output bit-rate, have to be implemented individually, and in a way that the subsequent decoding process is not disturbed.
In Figure the bit-rates of two objects of the weather sequence are shown. The plot on the left shows the objects which represent the background, a weather forecast map. The plot on the right shows the object which represents the forecaster. Well seen is the moment in this scene, beginning with frame 200, when the forecaster steps aside which leads to an increasing bit-rate for both VOPs.
|
|
Caused by the in-depended movement of the objects and their different complexities, each ones bit-rate has its own characteristic. Nevertheless, in case of overlapping objects an interrelation takes place. Therefore, a kind of regulation for the bit-rate of each object, as well as for the whole bit-rate is advisable.
Within this master-thesis work, new algorithms have been designed and tested for the real-time regulation of the MPEG-4 video coding process. Whereas the so far developed regulation systems are either block based or related to the whole bit-stream, these algorithms are giving wide coverage to the object concept and its implications.
The algorithms were integrated and proved in the MPEG-4 VM (Verification Model) version 3.1 (actual in November 1996), which is part of the MoMuSys (MObile MUltimedia SYStems) project. The activities of MoMuSys are closely linked with the development of ISO (International Standardisation Organization) MPEG-4. Thus MoMuSys can be seen as a European initiative to push the development of standardisation, mainly in MPEG-4, but also in the field of mobile multimedia communications.
This section will give a synopsis about the structure of this report and the contents of the several chapters and
sections.
Finally, the report is ended by an extensive glossary, a bibliography, and an index.
Last but not least, this project was used and is mentioned in the bibliography part in a research and conference by the Siemens company.