25.04.2014 Views

TITRE Adaptive Packet Video Streaming Over IP Networks - LaBRI

TITRE Adaptive Packet Video Streaming Over IP Networks - LaBRI

TITRE Adaptive Packet Video Streaming Over IP Networks - LaBRI

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

If we look on PHB selection, it is obvious that for loss tolerant multimedia streams, like<br />

compressed video or audio they will be transmitted with an AF PHB. Moreover, EF PHB will be<br />

more expensive than AF PHB, because it guarantees additional constraints compared to AF PHB<br />

group (minimal latency, minimal jitter, and minimal loss). Once this choice is made, it remains us to<br />

identify the class of AF drop precedence that we use for <strong>IP</strong> packet. We recall that AF PHB group<br />

defines currently 4 AF classes within each class 3 degrees of drop precedence according to the<br />

importance of the packet.<br />

Speech has a greater importance compared to the music or other background sounds in a<br />

scene. Thus, we privilege foreground audio speech compared to other audio. Moreover, using high<br />

quality voice coding tools, speech bit rates are between 2 and 24 Kbps. Signaling MPEG-4<br />

information bit rates are in the same order of magnitude. Both data are essential for video quality<br />

decoding and rendering. Consequently, i.e. low bit rate streams are given higher priority score to<br />

raw video data (i.e. property 2).<br />

MPEG-4 uses hierarchical coding for resolving system scalability and heterogeneity issues.<br />

Different coding modes exist: “spatial scalability” allows the decoder to treat a subset of streams<br />

produced by the coder to rebuild and display textures, images and objects video with a reduced<br />

spatial resolution. For textures, a maximum of 11 levels of spatial scalability is supported, while for<br />

the visual sequence, a maximum of 3 levels is proposed. Temporal scalability permits the decoder to<br />

treat a subset of streams produced by the coder to rebuild and display a video with reduced<br />

temporal resolution. A maximum of 3 levels is supported. With SNR Scalability (Signal to Noise<br />

Ratio), the coder transmits the difference between the original image and the preceding. This allows<br />

improving subjective quality of the final image to get maximum perceived quality (i.e. property 3).<br />

Moreover, MPEG-4 introduces additional data control streams such as: the scene description and<br />

media objects description, all these signaling streams are very sensitive and need to be preserved from<br />

errors and loss during transmission (i.e. property 4). If EF PHB is not implemented, scene<br />

description stream will be marked as an AF PHB with low drop precedence (i.e. AF31).<br />

We also assume that audio and video are carried with two distinct AF classes. This enables<br />

network nodes to better cope with congestion by given some routing privilege to audio compared<br />

to video. However, a problem of media synchronization may appear which can be tackled by using<br />

RTP Timestamps.<br />

In order to generalize this algorithm we used our classification layer. The classification layer<br />

can classify any MPEG-4 AVO in the scene and affect one of the given labels. The label affected<br />

can be in our case the Diffserv PHB. Figure 5-2 gives a description of DVMA using the<br />

classification layer.<br />

112

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!