um.

awn. ..

:4th .. ‘
.J.w...vu.

A, :1 .

V H. .w H“. 7: “‘11“

.4. x

.5... n . ._ In!
$.33 drawn
. . .593 a.
3%.? in
y .9
1.

a. .
Fast
3}.

. n
in
;

.1...

‘Ir Let...

Vitt torn-ohv -I I In}

1! 3 41..
its! .
..

:5
I. ~
2.» 31.»...
. V.12... z z.
‘1 t 1..
\..

....,
thluilx...
: I34
and: :5
02. a:

l)..-
ewrnakqb. .

:uttlz
an is)

if! .16...
tors. ‘
lot. 13.11.! 1!». .2
1. 1‘1)|.l 3
.I!

 

.3}:

Iitixallll .
3 .3 1..

 

LIBRARY
Michigan State
U nive: sity

 

 

 

This is to certify that the
dissertation entitled

ADAPTIVE WIRELESS VIDEO USING CHANNEL STATE
INFORMATION

presented by

Yongju Cho

has been accepted towards fulﬁllment
of the requirements for the

Ph.D. degree in Electrical Engineering

 

 

MW

 

ajor Professor’s Signature

Dec 05, 2008

 

Date

MSU is an Aﬁinnative Action/Equal Opportunity Employer

Co-I-o-o-c—0-0-3-o-o-t-o—t-I-C-I-I-O-I-o-o-I-l-c-l-0-.---v-0-.---l-._-O-I-O-C-D-o-l-'-0-0-l-O-I-I-l-‘-U-O-O-I-t-I-O-O-l-O-I.

 

 

PLACE IN RETURN BOX to remove this checkout from your record.
TO AVOID FINES return on or before date due.
MAY BE RECALLED with earlier due date if requested.

 

DAIEDUE

DAIEDUE

DATE DUE

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

5/08 K;/Prq/Acc&Pres/CIRCIDateDue indd

ADAPTIVE WIRELESS VIDEO USING CHANNEL STATE INFORMATION

By

Yongju Cho

A DISSERTATION

Submitted to
Michigan State University
In partial fulﬁllment of the requirements
For the degree of

DOCTOR OF PHILOSOPHY

Electrical Engineering

2008

ABSTRACT
ADAPTATIVE WIRELESS VIDEO USING CHANNEL STATE INFORMATION
By
Yongiu Cho

In this thesis, we tackle the following research problems: i) accurate channel capacity
estimation and prediction under Cross-Layer Design with Side-information (CLDS)
wireless protocols; ii) an optimal source and channel coding rate prediction and tuning;
iii) Unequal Error Protection (UEP); and iv) Syndrome Partial Decoding (SPD) for
wireless video in multi-hop networks. i), ii), iii) and iv) are essential factors to complete a
rate-adaptive video streaming application over wireless LAN 5. For these problems, we
collected a comprehensive set of datasets (or error traces) which are analyzed and used to
verify our proposed schemes in realistic environments.

We provide analysis of bit-errors at IEEE 802.11b MAC layer and develop channel
Bit Error Rate (BER) prediction scheme. We show that BER can be accurately predicted
by employing multi-tier model which utilizes Received Signal Strength Indication (R881)
and checksum of a packet. On the basis of the channel BER prediction scheme, an
Optimal rate tuning; which leverages both the probability distribution of channel capacity

prediction error process and an RD (quality) function of a video sequence, is developed.

In order to employ the optimal rate timing scheme in a practical wireless environment, we

deduce a Rate Distortion (RD) model for above-capacity video and an operational rate for

a speciﬁc channel code. It is observed that the optimal rate tuning provides excellent rate

prediction performance. In addition, an UEP scheme; which utilizes characteristics of a

LDPC code, is deveIOped, and we show that the UEP scheme reduces the degradation of

video quality in case of burst errors. The optimal rate prediction architecture under CLDS

protocols, ORPACLDS , which combines all the described schemes above, provides

excellent performance in terms of accuracy of rate prediction as well as video quality.

Finally, we exhibit the applicability of the proposed architecture in multi-hop wireless

networks by introducing SPD which mitigates the complexity and memory requirement

of Decoding and Forwarding (DF), while providing reasonably high goodput.

TABLE OF CONTENTS

 

 

 

 

 

 

 

 

 

 

 

 

 

CHAPTER 1 INTRODUCTION 1
1.1. RESEARCH PROBLEM ............................................................................................................................. 2
1.2. OVERVIEW OF CONTRIBUTIONS ............................................................................................................. 7
1.3. THESIS ORGANIZATION ........................................................................................................................ 11

CHAPTER 2 BACKGROUND 13
2.1. LIMITATION OF CURRENT MULTIMEDIA DELIVERY MECHANISM/PROTOCOLS ........... 13
2.2. OVERVIEW OF CLDS . ............................................... 17

2.2.1. CLDS Channel Evaluation .......................................................................................................... 19
2.2.2. FE C for CLD/CLDS Protocols .................................................................................................... 29

CHAPTER 3 MULTI-TIER MODEL FOR BER PREDICTION 35

3.1. TRACE COLLECTION .................................... 36
3.1.1. Data Collection ........................................................................................................................... 38
3.1.2. Average Statistics of the Traces ................................................................................................... 40
3.1.3. BER Behavior at Dlﬂ'erent SSR Values ........................................................................................ 42

3 .2. A MARKOV MODEL FOR PER PREDICTION .......................................................................................... 44
3. 2. I . Correlation of the Packet-Error Process ..................................................................................... 45
3.2.2. The 3rd order Packet-Error Markov Model ................................................................................ 4 7

3 .3. PACKET-LEVEL MODEL FOR SSR PREDICTION .................................................................................... 51

3.4. BER PREDICTION USING BSC MODELS ............................................................................................... 51

3.5. PERFORMANCE EVALUATION OF THE MTM ......................................................................................... 52

CHAPTER 4 RATE PREDICTION AND TUNING ‘8
4.1. LIMITATIONS AND OPTIMAL RATE ADAPTATION PERIOD ...................................................................... 59
4.2. BER AND CHANNEL CAPACITY ESTIMATION ........................................ .. 61
4.3. CHANNEL CAPACITY PREDICTION ....................................................................... 63
4.4. OPTIMAL RATE TUNING .......................................................................................... 64
4.5. PERFORMANCE EVALUATION ............................................................................................................... 68

CHAPTER 5 RATE ADAPTT ION FOR WIRELESS SCALABLE VIDEO 79
5.1. AN RD MODEL FOR ABOVE-CAPACITY VIDEO -- . ........................................ 79

5.1. 1. Operational Rate ......................................................................................................................... 80

iv

5.1.2. The Video Rate-Distortion Model ................................................................................................ 82

 

 

 

 

 

5.1.3. Performance Evaluation .............................................................................................................. 85

5.2. UNEQUAL ERROR PROTECTION ...................................................................................... 91
5.2.1. UEP Utilizing an LDPC Code ..................................................................................................... 91
5.2.2. Performance Evaluation .............................................................................................................. 94
CHAPTER 6 SYNDROME PARTIAL DECODING 96
6.1. LIMITATIONS. - ............................................................... 97
6.2. SYNDROME-BASED PARTIAL DECODING ........................... 99

6. 2. I . Architecture for Rate-Adaptation .............................................................................................. 100
6.2.2. SPD Overview ........................................................................................................................... 103
6.2.3. Goodput Evaluation ................................................................................................................... I 05
6.2.4. Optimization of SPD .................................................................................................................. 108

6.3. PERFORMANCE EVALUATION ............................................................................................................. 110
CHAPTER 7 CONCLUSION 114
APPENDIX 118
REFERENCES 119

 

LIST OF TABLES

 

TABLE I STATISTICS OF TRACES USED IN THIS STUDY ............................ - ..................... 41
TABLE II ERROR STATISTICS FOR VARYING SSR VALUES AT 1 l MBPS ...................................................... 41
TABLE III AVERAGE ABSOLUTE ERROR OF BER PREDICTORS. .................................................................... 56
TABLE IV AVERAGE MSE OF CHANNEL CAPACITY PREDICTION. ............................................................... 75

TABLE V PREDICTION PERFORMANCE (IN TERMS OF OVERALL VIDEO QUALITY) BEFORE THE OPTIMAL
RATE TUNING. ................................................... -- .76

TABLE VI PREDICTION PERFORMANCE (IN TERMS OF OVERALL VIDEO QUALITY) AFTER THE OPTIMAL
RATE TUNING. - - - .. - ............................. .................. 77

 

TABLE VII PREDICTION PERFORMANCE WITH FOREMAN SEQUENCE AND VARIOUS CODECS (PKT RATE = 1

MBPS & PHY. RATE = 11 Maps). -- ..... 78

TABLE VIII PREDICTION PERFORMANCE WITH H.264 AND VARIOUS VIDEO SEQUENCES (PKT RATE = 1
MBPS & PHY. RATE = 11 Maps). .................................................................................... 78

TABLE IX PERFORMANCE OF ORPACLDS WITH AN IDEAL CHANNEL CODE AND WITH A LDPC CODE (IN

TERMS OF OVERALL VIDEO QUALITY). .................................... _. ............... 89
TABLE X PERFORMANCES OF ORPACLDS AND ORPACON (IN TERMS OF OVERALL VIDEO QUALITY)....90

TABLE X1 THE PERFORMANCE OF SPD WITH OPTIMAL PACKET SIZE SELECTION SCHEME IN TERMS OF

GOODPUT (XMTT RATE=500KBPS) .............................................................................. 1 12

LIST OF FIGURES

FIGURE 1 THE ARCHITECTURE OF THE PROPOSED RATE ADAPTATION. ............................................................. 4
FIGURE 2 FUNCTIONAL BLOCK DIAGRAM OF CLDS PROTOCOL BASED CLIENT .............................................. 19
FIGURE 3 A SINGLE LOGIC TRANSMISSION UNIT (LTU) ................................................................................ 20

FIGURE 4 (A) BINARY ERASURE CHANNEL (BEC) REPRESENTING THE UDP CHANNEL (B) HYBRID BINARY

SYMMETRIC/ERASURE CHANNEL (BSEC) (c) BSEC WITH SIDE INFORMATION Z. 25

 

FIGURE 5 COMPARISON OF CHANNEL CAPACITY FOR CON, CLD AND CLDS, 5 = 0.33 ............................... 25

FIGURE 6 MESSAGE PACKET THROUGHPUT :- , AFTER CHANNEL DECODING, FOR RS/LDPC BASED FEC
SCHEMES OVER CON/CLD/CLDS. CHANNEL CONDmONS ARE GIVEN BY 6 = 0.33, 2. = 0.01 . ......... 33

FIGURE 7 TOPOLOGIES USED FOR WIRELESS TRACE COLLECTION. .................................................................. 38

FIGURE 8 NORMALIZED HISTOGRAMS OF THE NUMBER OF BIT-ERRORS IN A CORRUPTED PACKET FOR

VARYING SSR VALUES; HISTOGRAMS ARE AVERAGED OVER ALL 1 1 MBPS RESIDUAL TRACES, AND

 

ONLY NUMBER OF ERRORS BETWEEN [1,200] ARE SHOWN. ......... . . - ................................ 43
FIGURE 9 SAMPLE AUTOCORRELATION COEFFICIENT OF AN 1 1 MBPS TRACE. ................................................ 47
FIGURE 10 THE MULTl-TIER BER PREDICTION MODEL. .................................................................................. 50

FIGUREll ABSOLUTE ERROR IN BER PREDICTION; TIME-WINDOW LENGTH 2' = 50 SECONDS, RESULTS ARE

AVERAGED OVER ALL THE TRACES. . - - - - .................................................................. 54

 

FIGURE 12 ALLAN VARIANCE AS FUNCTION OF TIME WINDOW. ..................................................................... 60

FIGURE 13 TEMPORAL CORRELATION IN CHANNEL CAPACITY FOR 1 1 MBPS TRACES (TRACES FROM 1 TO 5
COLLECTED WITH TRANSMISSION RATE AT 500, 6 To 7 AT 750, 7 To 10 AT 900, AND 1 1 TO 14 AT
1024KBPs). ...... - . . ............. 63

 

vii

 

FIGURE 15 THE CHANNEL CAPACITY PREDICTION (ZOOMED IN) RESULTS ............................. 73

FIGURE 16 (A), (B), AND (O) SHOW THE OPTIMALLY ALLOCATED CHANNEL CAPACITIES (OR CHANNEL

CODING RATES) BASED ON PREDICTIONS FIGURE 15, BY ORPACL D51 , OI'tPACLDS2 AND ORPACON . ....74

FIGURE 17 THE RD (QUALITY) FUNCTION (Q(-) ) OF SVC TEST VIDEO SEQUENCE FOR RATES BELOW
CAPACITY (A) AND THE EMPIRICAL MODEL ( Q '(-) ) FOR VIDEO RATE DISTORTION FOR RATES

EXCEEDING THE CHANNEL CAPACITY (B). - - ............................................................. 84

 

FIGURE 18 THE PROBABILITY OF SUCCESSFUL DECODING, WHICH IS GENERATED WITH LDPC CODES [52] AND

SVC BIT-STREAM [54], DEPENDS ON BOTH THE SIZE OF PACKET AND THE PARAMETER a .................... 92

FIGURE 19 THE PERFORMANCE COMPARISON IN VIDEO QUALITIES WITH THE PROPOSED UEP SCHEME AND

WITHOUT UEP (WHEN ONLY I-FRAME PACKETS ARE LOST, I.E., THE WORST CASE). .............................. 95
FIGURE 20 THE ARCHITECTURE OF THE PROPOSED RATE ADAPTATION ....................................................... 102
FIGURE 21 THE EMPIRICAL PER MODEL. ......................................................................... 109

 

FIGURE 22 PERFORMANCE COMPARISON AMONG FOUR DIFFERENT SCHEMES IN TERMS OF GOODPUT. FOR THIS

 

EXPERIMENT, A PACKET SIZE OF 8000 BTTS IS USED. ........ - - - ....................... 1 1 1

viii

CHAPTER 1
INTRODUCTION

Recently, digital media has had a tremendous impact on the development of the
Internet, telecommunications, and broadcasting areas. The rapid popularization of the
Internet, wireless communication systems, and digital broadcasting networks have led us
to an epochal framework for content services with an end-to-end delivery chain of
content generation, distribution and consumption. Multimedia devices and digital TVS
have accelerated the easy purchase and consumption of a vast number of media content
types. These developments, however, have forced us to consider the reliable delivery of
multimedia content over Wireless networks to diverse types of devices. Especially
Quality of Service (QoS) for multimedia content over mobile wireless networks has been
an issue.

To that end, our research embodies an advanced multimedia content delivery that
satisﬁes the needs of reliability and Q08 over wireless networks in an efﬁcient manner.
Our research on reliable video delivery over Wireless networks demonstrates ﬁve key

aspects: 1) Analyzing the behavior of errors in wireless networks and predicting accurate

Bit Error Rate (BER) by utilizing Cross Layer Design with Side-information (CLDS)
protocols, 2) Predicting an optimal source and channel coding rate, 3) Modeling of an
RD for above-capacity Wireless video, 4) Minimizing the video quality distortion in case
of packet loss (Unequal Error Protection (UEP)), and 5) Mitigating the complexity and
memory usage at each intermediate nodes in an ad-hoc network, while providing
reasonably high goodput. The solution will eventually lead the 'user to a better QOS
experience and enable network providers to beneﬁt from more efﬁcient bandwidth

utilization over the entire wireless networks.

1.1. Research Problem

In many Wireless environments, deteriorated link conditions cause bit-corruptions.
These corrupted packets cause checksum failures and packet drops at wireless receivers.
To reduce packet losses at the receivers, many recent efforts utilize cross-layer protocols
that do not discard corrupted packets [1][19][3l][32]. Consequently, two classes of
wireless multimedia protocols have emerged: (i) Cross-Layer-Design (CLD) [19]
protocols which relay corrupted packets to higher layer for ﬁrrther processing; (ii)

conventional (CON) [l9] protocols which drop any packet that has one or more residue

errors]. Prior studies have Shown that a Signiﬁcant improvement in wireless video
throughput can be achieved by CLD [1][19][3l][32]. Furthermore, it was also exhibited
that side information, which are already available from IEEE 802.11 compliant packets,
is quite valuable for providing channel state information and modeling of the underlying
(effective) video channel. This side information includes Signal to Silence Ratio (SSR)
indicators and MAC-layer checksum, both of which can be used as parameters for
channel estimation [21]. (SSR is a packet-level SNR parameter supported by IEEE
802.11 compliant devices.) This form of CLD protocols that utilize side information have
been referred to as CLDS protocols in prior literature [19] [21][33].

Despite of the demonstrated beneﬁts of CLDS, standard features that can exploit
these beneﬁts for CLDS-based video streaming applications remain unexplored. One
such important feature is video rate-adaptation based under CLDS protocol. Figure 1
illustrates the rate-adaptive video architecture on which our research focuses. In the
architecture a server and a client communicate through a heterogeneous network which
consists of a wired and a wireless network. In the given architecture, both a server and a
client are designed to support source and channel rate adaptation. Note that a great deal

of bit corruptions in packets occur in a wireless network whose link quality ﬂuctuates

 

1 Here, a residue error is an error that is not corrected by the physical layer, and hence it
appears at the MAC layer.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Wireless .
Server Network Cllent
. Video ) Video
Video _> F EC Stream ‘ Decoder
Encoder Encoder Wired ' ‘
A 7 A Network FE C
* ‘ Decoder
- Rn+1 Feedback Side-Info4
Rate Tuner N Channel
C Estimator
n I
I

 

 

 

 

Figure l The architecture of the proposed rate adaptation.

signiﬁcantly due to various interference, fading, multi-path effects, and mobility. Here,

we focus on the wireless channel (as seen by above-PHY layers) for rate estimation and

prediction in the architecture.

The client supports CLDS protocols that leverage residue-error-process and side

information, which can be relayed to an F EC decoder for soft-decoding [37], to estimate

the current channel capacity for a block of packets (or time-window). The current

channel capacity, which is estimated by the channel estimator in the client with the

entropy of the residue error process, is then transmitted to the server as feedback for rate

adaptation. Using the feedback, the rate tuner at the server predicts the optimal source

and channel rates for the next block of multimedia packets to be transmitted.

It is essential to accurately estimate and predict the channel capacity, which is in turn

4

can be used to determine the source and channel coding rates, in real-time, for rate-
adaptive video [42]. In particular, when a channel-coding rate is under-estimated relative
to channel capacity, a client virtually recovers all packets (error-free), but Operating
below channel capacity naturally and negatively effect the overall throughput of a
wireless video streaming session (and hence the video quality). On the other hand, an
over-estimated rate provides a larger amount of video information in packets (and less
amount of redundancy), but a client receives many corrupted packets with more errors
than a channel code can correct. This signiﬁcantly degrades overall video quality; i.e., the
performance of a rate-adaptive Video streaming application depends greatly on the
accuracy of the channel state estimate.

When CLDS is employed, the channel capacity can be estimated from the entropy of
the residue error process [21]. However, accurate estimation/prediction of the residue-
error process entropy using only packet-level Signal strength information is a challenging
task. While these packet level measurements are much coarser than bit-level granularity
required for error entropy computation; bit level signal strength measurements are not
readily available at the MAC layer of current wireless networks. In addition, we need to
adhere to a rate that is strictly below channel capacity to avoid excessive packet drops.

Therefore, if the predicted channel capacity is directly employed, there is a good

likelihood that the predicted capacity (as a random variable) may exceed the actual

channel capacity, which leads to severe distortion of video quality.

For multimedia streaming applications, it is essential to accurately estimate/predict

the channel capacity to provide QOS. However, it is also important to minimize distortion

of video quality when video packets are corrupted due to busty errors by severe

interference which often occurs in wireless channels. A workaround to this problem is to

employ an Unequal Error Protection (UEP) scheme. Under UEP, parts (e.g., Inna-coded

ﬂames) of the Video bit-stream, which signiﬁcantly affects video quality, are provided

with more protection (i.e. a lower coding rate) and vice versa. Therefore, the video

quality depends heavily on how well the video bit-stream is protected for severe

interference in wireless channels.

An ad-hoc network consists of many wireless hops from a server to a client, and each

wireless link between two intermediate nodes experiences interference. Therefore, it is

Obvious that there will be channel capacity drops over each hop, and hence, without

optimal rate adaptation, the video quality can be expected to degrade Signiﬁcantly at any

client located at a far-edge of an ad-hoc network. To overcome this limitation, Decoding

and Forwarding (DF), which fully decodes codewords at each intermediate node, can be

employed to provide the best Video quality. However, complexity and memory usage for

the method are signiﬁcantly high. Furthermore, intermediate nodes, which may be
participating by only forwarding the content toward a receiver further-down a multi-hop
chain, do not have much incentive to perform full decoding-encoding of the channel-
coded wireless video content. For such nodes, we need to minimize their burden in terms
of the operation they need to perform toward the delivery of the video content to the ﬁnal
receiver.

These issues motivate the robust and practical rate adaptation architecture that we

develop in this thesis.

1.2. Overview of Contributions

In this thesis, we provide novel schemes to adaptively provide reliable Video content
delivery over wireless LANS. Our objective is to develop robust and practical rate
prediction architecture to support QOS for wireless scalable video. To that end, Chapter 3
focuses on analysis of bit errors at IEEE 802.11b MAC layer and deveIOps 3 BER
estimation/prediction scheme under CLDS protocols. In Chapter 4, on the basis Of the
BER estimation/prediction scheme (with which the channel capacity is estimated), we

take the RD of video into consideration to predict and optimally tune a source and

channel coding rate. In Chapter 5 we develop the operational rate, the empirical RD
model for above-capacity wireless scalable Video, and UEP to employ our optimal rate
prediction framework to practical rate adaptation architecture. In Chapter 6 we extend
our rate adaptation architecture to an ad-hoc network by employing Syndrome Partial
Decoding (SPD) to efﬁciently manage the overall processing requirements of
intermediate nodes. Finally, Chapter 7 summarizes key conclusions of this thesis.

In Chapter 3 we describe the method of collecting residual error traces which
represents the various and realistic wireless channels and analyze MAC-to-MAC bit-
error characteristics. Based on the analysis, we develop the multi-tier model (MTM) [35];
which leverages a received packet’s SNR and checksum side-information to predict BER,
to accurately predict BER in future packets over a wireless residual channel. It is
Observed that direct inference of BER from SNR results in optimistic estimates because
of the relatively large amounts of error-free data (in comparison with corrupted data)
received on viable wireless networks. Consequently, we propose a model that separates
packet- and bit-error prediction. For packet-error prediction, a 3rd order Markov chain
model is used at the ﬁrst tier, and the current SSR value is used to predict a SSR value of
the next packet at the second tier for bit—error prediction. We demonstrate that the MTM

renders higher prediction accuracy than existing Yule-Walker and fmite-state Markov

chain predictors at any physical data rate (2, 5.5, and 11 Mbps).

Although our channel capacity prediction scheme based on the BER prediction
scheme provides very accurate prediction performance, it can not be free from error. In
addition, it is a well know fact that the rate above channel capacity will results in packet
drops and degradation of Video quality. For these reasons, in Chapter 4, we develop
optimal rate prediction architecture under CLDS protocols (ORPACLDS) [55]. In this
architecture, we leverage the probability distribution of capacity prediction error process
and the RD function of Video to optimally predict and tune a rate that maximizes Peak
Signal-to-Noise Ratio (PSNR) of video contents. We exhibit the efﬁcacy of the proposed
scheme by simulations using actual 802.11b wireless traces, an RD model for the video
source and an ideal FEC model. Simulations using source RD models derived from ﬁve -
different popular Video codecs (including H.264), Show that the proposed framework
provides up-to 5 dB improvements in PSNR when compared with conventional rate-
adaptive schemes.

In Chapter 4 we assume the following assumptions in ORPACLDS: (i) the channel
code achieves the capacity (i.e., an ideal channel coder), and (ii) a block of packets
cannot be recovered at the client if it is coded with an overestimated rate, i.e., a channel

coding rate that exceeds channel capacity. Thus, in Chapter 5 we incorporate i) an

operational rate for a speciﬁc channel code which is not an ideal code, ii) an RD ﬁrnction
for above-capacity video, and iii) UEP which utilizes LDPC code to optimally
realize ORPACL D S in practical rate adaptation architecture. We Show that
ORPACLD S provides the considerably accurate average PSNR to the maximum PSNR
and the higher (at least 6dB in lleps channel) average PSNR to that of ORPACON for
any given channel. This highlights that the proposed schemes bring ORPACLDS to a
certain level of completion in practical wireless environments.

It is obvious that each wireless channel between intermediate nodes faces channel
impairment due to interference, fading, and multi-path effects. Hence, when a multi-hop
wireless network is introduced, it can be easily observed that channel capacity decreases
over each hop, and hence, End-to-End (E2E) channel capacity becomes very low. This
leads to signiﬁcant degradation of goodput and video quality for rate adaptation
applications. The workaround to this problem is to employ Decoding and Forwarding
(DF). For DF, an intermediate node decodes the transmitted packets (or codewords) to
suppress noise on the channel between two intermediate nodes, and re-encodes the
packets, potentially with a different codebook, for transmission towards the destination
[61]. However, complexity and memory usage for the method are Signiﬁcantly high.

Moreover, intermediate nodes, which may be participating by only forwarding the

10

content toward a receiver further-down a multi-hop chain, do not have much incentive to
perform full decoding-encoding of the channel-coded wireless video content. For such
nodes, we need to minimize their burden in terms of the operation they need to perform
toward the delivery of the video content to the ﬁnal receiver. To that ends, in Chapter 6
we develop the new partial processing framework for rate-adaptive wireless Video, which
reduces the overall processing requirements of intermediate nodes. We refer to as
Syndrome Partial Decoding (SPD) architecture. We exhibit that SPD, which reduces the
overall processing requirements of intermediate nodes, provides reasonably high goodput,
when compared to End node decoding and ARQ, and less complexity and memory

requirements, when compared to DF.

1.3. Thesis Organization

The rest of this part is organized as follows. Chapter 2 provides background that is
required to understand the material presented in this thesis. Chapter 3 focuses on
“accurate” BER estimation and prediction of IEEE 802.11b channel. Chapter 4 proposes
optimal rate prediction architecture under CLDS protocols for wireless multimedia. In

Chapter 5 we complete optimal rate prediction architecture to be employed as a real

11

world application. In Chapter 6 we extend optimal rate prediction architecture to an ad-

hoc network by considering efficient processing requirements of intermediate nodes. In

Chapter 7 we summarize key conclusions of this thesis.

12

CHAPTER 2
BACKGROUND

2.1. Limitation of current multimedia delivery

mechanism/protocols

One of the ftmdamental challenges for wireless multimedia systems is the availability
of sufﬁciently high “effective” bandwidth. In principle, “sufﬁciently high” “effective”
bandwidth implies bandwidth that can support the best possible multimedia quality in
presence of packet drops and while resolving access among competing nodes over the
Shared wireless medium. Video applications are usually extremely bandwidth hungry and
can get adversely affected by excessive reduction in the effective bandwidth. Unlike the
traditional wired Internet based communication, the number of packet drops due to bit
errors can be substantial in wireless networks. In typical multi-room Home/Ofﬁce
settings, there exist many clients that do not have a Line of Sight (LoS) communication
path with the wireless Access Point (AP). Such clients often experience signiﬁcant

attenuation in the received Signal. In addition, the fading effects and, the variation in

13

channel quality there of, experienced by such clients is non-trivial. The Video quality
service available to such clients can be severely limited on account of excessive packet
drops due to corruption. The effects of such packet dr0ps can get further exaggerated on
account of interference.

The impact of bit corruptions on the effective bandwidth and thus Video quality can
be substantially reduced by adopting a fresh perspective in the design of Multimedia
Communication Protocols for the wireless links. From Shannon’s Information Theory
(especially the Noisy Channel Coding Theorem), it is well established that it is feasible
to communicate information even in presence of bit corruptions (noise). Thus it can be
intuitively argued that even partially damaged packets may contain information that
should not be completely discarded. This information if retrieved can help alleviate the
impact of poor channel quality on video applications. Signal processing algorithms
associated with channel/source decoding can enable the efﬁcient retrieval of information
from corrupted packets. Thus it makes prudent sense to develop multimedia protocols
that do not drop all of the corrupted packets.

The ﬁrst generation of protocols that relayed corrupted packets to higher layers are
developed by Simply turning off a checksum at the link-layer and/or by employing partial

checksums at the transport layer (and at times other layers). To the best of our knowledge,

14

the author’s were the ﬁrst to: extended this work to 802.1 lb WLAN S [l] . Subsequently,
many researchers have appreciated the utility of not dropping corrupted packets in
802.1 lb networks (e. g. [1 1]-[18]) and wireless networks in general. We refer to all these
cross-layer protocols that are achieved simplistically by employing partial checksums as
Cross-layer Design (CLD).

CLD protocols have shown great promise in improving multimedia delivery, yet they
suffer from some ftmdamental drawbacks. We elaborate upon these drawbacks, by
raising the following three important questions.

I How do we estimate the utility of a corrupted packet?

The information content and the utility of a corrupted packet are closely tied with the
level of corruption in a packet. The CLD schemes do not provide any information about
the corruption level in a packet. AS a matter of fact, they do not even differentiate
between corrupted or uncorrupted packets. Inability to localize the errors can actually be
a major drawback for CLD schemes, at times causing it to perform worse than the
conventional (CON) protocols that drop/discard corrupted packets. Information retrieval
from corrupted packets incurs a processing overhead. The information content in highly
corrupted packets may be tOo little to justify the additional processing, While the

processing overhead can be completely avoided if uncorrupted packets could be

15

identiﬁed.

' How do I retrieve information when packet identities are questionable?

CLD schemes drop packets that have a corruption in the header, Since such

corruptions can modify Critical Header Fields (CHF), which are essential to determining

the identity of a packet (i.e. ﬂow, destination etc.). Such packet drops due to header

corruption could severely diminish the utility of CLD schemes, and thus entirely nullify

the primary motivation for their design.

I How do we eﬂiciently retrieve information from corrupted packets?

Finally it is important to realize, that Forward Error Correction (FEC) schemes used

over CON are not suitable for cross-layer protocols. Channel decoding used with cross-

layer protocols need to cater to errors as well as erasures. The ability of a channel

decoding to correct errors is closely tied to the ability to localize the errors. In a CLD

scheme, the FEC is provided with no information about the correctly received packets.

This can severely affect the ability of the decoding algorithm to localize errors. Channel

State Information (CSI) is critical in improving the performance of channel decoding

algorithms.

16

2.2. Overview of CLDS

In the discussion provided in the previous sub-section the drawback of CLD
schemes were highlighted. In each of the three questions raised above it was noted that
the performance of CLD schemes can deteriorate due to lack of receiver-side channel
state information or Simply Side-information that could provide us more information
about the corruption status of the packet. Thus a key component of our work is to
identify mechanism that can practically facilitate the estimation of the corruption status
of packet. We shall use Signal strength indications, checksums, traffic information,
temporal correlation etc as Side-information to estimate the corruption level. We
generically refer to all cross-layer protocols, which utilize some sort Of side-information
as Cross-Layer Design with Side-Information (CLDS).

The various components of a typical CLDS protocol have been shown in the
schematic in Figure 2. Unlike CLD schemes which turn off a checksum and thus adhere
to a paradigm where the corruption status of a packet is not relayed to higher layers, as
Shown in Figure 2, in a CLD scheme the checksum status and all other Link-Quality
Indications are provided to the higher layers. These link-quality indications can be used

by higher layers for improved channel decoding and subsequently for improved source

17

decoding. The CLDS scheme also employs a Header-estimation block that can detect the

identity of a packet in presence of noise. The Header-estimation block typically employs

mechanisms/models that facilitate channel prediction/estimation. This channel prediction

(possibly in conjunction with other indicators such as SNR, background trafﬁcs) is used

as Side-information for robust detection of headers.

18

[1; - No error in a packet

If];- Only data is corrupted

[f] ;- Header & data are corrupted
MAC & Transport

 

 

 

 

Header

 

 

 

Header
estimation

 

 

 

 

‘---------- —-J

 

 

 

Application 0 o Corrupted or not corrupted RTP
W packets with Side information

 

 

 

 

 

 

 

 

 

 

Video

 

 

 

Bun Video I

 

 

   

Error '

Resilience :

 

 

 

y<
E]
—>

 

 

 

b"- -----—_--—'--‘----- —---_------d

 

Figure 2 Functional block diagram of CLDS protocol based client

2.2.1. CLDS Channel Evaluation

In order to get a taste of the utility of relaying corrupted packets to higher layers and

19

the importance of side-information in such protocols, in this section we consider three
abstract protocols. We Shall characterize the behavior of these protocols with equivalent
channel models and subsequently deduce the capacity of these channel models. The

comparison of these capacities should allow us to develop clear insight for the problem at

 

 

 

 

 

 

 

hand.
CH-HDR
A
ﬁ \
Header Information (APP) DATA PAYLOAD
\ J
V
CH-DATA

Figure 3 A Single Logic Transmission Unit (LTU)

The three communication schemes considered in this paper can be explained by
considering a generic Logic Transmission Unit (LTU) as Shown in Figure 3. The general
packet structure can be segregated into two parts 1) the header information2 and 2) the
data payload. In addition to traditional header information (e. g., node addresses etc), the
LTU header contains two sets of checksums CK-HDR and CK-DATA. The CK-HDR

checksum is applied to and dependent on the header information only; while the CK—

 

2 In many practical implementations the header and CK-HDR might be further

partitioned into multiple headers and checksums. In addition a direct correlation of a
speciﬁc standard/architecture/implementation with the above-considered abstract LTU
may not always be possible. Reallocation (or even addition) of some header ﬁelds might
be required. However, we intentionally do not address such implementation details to
maintain the generic nature of arguments presented in this section.

20

DATA check sum is applied to and dependent on the data payload only. Hence, under

this generic LTU model:

0 CON, the conventional (non-cross layer) protocol drops a packet if either of the

checksums CK-HDR, or CK-DATA is not satisﬁed.

0 CLD, turns the CK-DATA checksum off and drops the packet only if CK-HDR is not

satisﬁed. Therefore, a CLD channel exhibits both erasures (due to CK—HDR violations)

and possible errors in some of the delivered packets. It is important to note that

(without ﬁrrther information or additional parity bits) the CLD channel receiver does

not know which delivered packets are error-free and which packets are corrupted. It

only distinguishes between erasures and delivered packets.

o CLDS is an alternative to the above schemes. Similar to CLD, a CLDS channel drops a

packet only if CK~HDR is not satisﬁed. However, in CLDS the CK-DATA is not turned

off but neither is the decision to drop a packet dependent on this checksum. Moreover

CK—DATA and information about the success or failure of this check-sum is made

available to the application layer as side information. Therefore, and unlike a CLD

receiver, the CLDS receiver can distinguish corrupted packets from error-free packets.

Each of the above schemes presents a different type of charmel to the higher

(application) layer. Before we illustrate, what each of these channel look like, we need to

21

establish certain important parameters that can capture the performance of each of the

above protocols. Thus the channels under consideration can be parameterized by:

I 6' : The probability that at least a single bit is in error in the header and/or the data
payload. Thus 6 is the probability of a packet being dropped in a conventional (non-
cross layer) protocol because at least one of the checksums, CK-HDR and/or CK-
DATA, was not satisﬁed.

I ,1 : The probability that the packet header contains at least a Single bit in error. Thus 2
is the probability of packet being dropped in the cross-layer schemes because the check
CK-I-IDR was not satisﬁed. (Note that this event could occur regardless if there

I is an error within the packet data or not.)

I Z : A discrete random variable that takes on three possible outcomes: S2 = {0,1,?}.
Where, (i) Z = ? if the header contains at least Single bit error and CLDS drops the
packet. Thus p(Z = ?) = 2. (ii) Z = 0 if a packet contains no errors in the header but
contains at least a single bit error in the data payload. Thus p(Z =0) = (6 -/I.), i.e.
6 — 2 represents the probability of a corrupted packet being delivered to a CLD/CLDS
channel receiver. (iii) Z =1 if neither the header nor the data payload contain even a

Single erroneous bit. Thus p(Z =1) = (1 — 6) .

22

I a: The conditional probability of a bit in the data payload being in error given that the
checksum CK—HDR is satisﬁed and checksum CK-DATA has failed. Given a corrupted
packet at a CLD/CLDS receiver, 3 represents the probability of having a random bit
selected from that packet to be in error, i.e. a speciﬁcally represents the probability of
error in corrupted packets relayed to the application layer.

I p: The conditional probability of a bit in the data payload being in error given that
the checksum CK—HDR is satisﬁed. In other words p is the overall probability of bit

error in packets that are received at the application layer (i.e. all packets that don’t get

(6—l)-£

dr0pped due to corruption in the header). Also note that p = (1 A)

Thus,
0 The conventional protocol (CON), which is the simplest among the three (CON, CLD,
and CLDS), can be represented by a Binary Erasure Channel (BBC) as shown in
Figure 4 (a). It is well known [30] that the channel capacity of a BBC is given by

1— 6 ; hence, the capacity of a conventional protocol is given by

CCON =1—6 (1)

23

o The cross-layer protocol, CLD, can be represented as a cascade of a BBC channel
with probability of erasure equal to ,1 followed by a Binary Symmetric Channel
(BSC) with probability of bit error equal to p, as shown in Figure 4 (b). Such a
cascade can hence be termed as Binary Symmetric/Erasure Channel (BSEC). It can
be easily shown that the channel capacity of such a cascade is given by the product of
the channel capacities of the individual channels:

ch = (l-l)-(l-hb (19)) (2)

where hb (p) is the entropy of a binary random variable with parameter p .

o The CLDS protocol can be represented by Figure 4 (c). The channel capacity of the
cross-layer channel in presence of side information Z is as follows: when Z =1
all the bits are transmitted reliably; and in this case, which occurs with probability
(1—6 ), the (conditional) capacity is 1. When Z = ? all the bits get erased and the
conditional capacity is 0 while when Z = 0 the channel reduces to a BSC with a

cross-over probability a and the conditional capacity is (1— hb (6‘)). Thus the

channel capacity of CLDS is given by:

CCLDS = (1 -5)+(5-/1)'(1-hb(8)) (3)

24

 

 

 

   

 

8
041—5 o
(a) (b)
/1(Z=l)
1-8
1 5 ' ’1 ——1‘(z=0)3-———- l-e - 1
\‘w. 8
A \ 9 \__.'jf.-’f
/ . (2"?) fink-a ?

f

A If 8
/ . _ .3.- \-
0“ ’1 —'0 (Z‘Olm-l/o
1-8 \
0 (z=1)

(C)
Figure 4 (a) Binary Erasure Channel (BEC) representing the UDP channel (b) Hybrid Binary
Symmetric/Erasure Channel (BSEC) (c) BSEC with Side information Z.

25

 

0.95» \ ..................

 

  

 

 

 

 

 

 

 

I
b E 5 5 —e—LCON
'8 0_9- "K ............... ; ........... 1 ...... — - CLD x=0.01
cc ; ; — CLDS x=o.01
% g : + CLD 7L=0.05
3 0,35 ........... ; ﬂee CLDS x=0.05
“>1 9
2 i
g (ml -------- ------
.9. .
*5 :
.2 0-75 --------- r ---------- . -------- : ------
‘6 I I K 1‘ I
0.7 ————————— L —————————— : —————————— at x ———————— : —————————— —
: I :\ \ :
rxnnnAnanAAnn/NAAA \nmdxnnnﬁ/N
\J U U U \1/ V \J \J V \j/ V \J V U \f) U U XV‘IJ U W U \l
0.05 0 1 0.15 0.2 0 25

Figure 5 Comparison of channel capacity for CON, CLD and CLDS, 6 = 0.33.
Figure 4 provides a comparison between the capacities for various channel conditions.
Some important conclusions that can be derived from the above Figure are as follows:
I There exist a variety of channel conditions under which the performance of any
cross-layer scheme is better than CON. In Figure 4 observe performance for
a S 0.17 .
I There exist channel conditions under which the performance of CON is better than
CLD. In Figure 4 observe performance fore 2 0.18 .

I The performance of CLDS is always better than either of CON or CLD.

26

I The relative performance of CLDS over CLD increases as corruption level

increases.

Above conclusions are formally proved in a publication associated with this work [19].

The same paper provides extensions of the above conclusions to multi-hop

communication. We encourage the reader to refer to [19] for additional details.

Can we do better?

The above discussion clearly exhibits the utility of side-information from a capacity

perspective. However, the CLDS scheme presented employs a rather simplistic side-

information In the CLDS mechanism all the corrupted packets are treated equal, however

in practice the channel impairments experienced by each packet may be distinct. So a

natural question, to ask, is whether we can identify methodologies that allow us to

differentiate between corrupted packets at ﬁner resolution.

The answer to the above question is a YES!. There are multiple methods of providing

additional side-information, however one effective method that we have identiﬁed is

based on the observation that: Radio hardware used for reception of 802.11b frames is

capable of recording and associating a SSR to each received frame. Thus the relationship

of SSR to BER can be used to provide a robust CSI to the cross-layer error recovery

mechanism.

27

For notational convenience let’s name the CLDS scheme that utilizes SSR as a side-
information as an SSR_aware scheme, while the CLDS scheme that does not utilize the
SSR information can be referred as SSR_unaware scheme. Let f (SSR) denote the
probability of receiving a packet with a particular SSR indication. For a packet received
with a particular SSR indication let 6 (SSR ) represent the probability of a packet being
corrupted and £(SSR) represent the probability of a bit error in the corrupted packet.
Also, for convenience let’s assume that we do not see any packet drops due to header
corruption. Then ﬁom equation (3) we already know that the capacity of the

SSR_unaware scheme is given by:

CSSR_ unaware = [1" 2 f (SSRP(SSR)] (4)

SS?‘

+[zf(SSR)6(SSR)][l-hb [Zf(SSR)6(SSR)£(SSR)]]

35" SS?“

where 6 = 2 f (SSR)6(SSR) & s = 2 f (SSR)6(SSR)£(SSR) represent the overall

SST SS r

probability of a packet being corrupted and the overall probability of bit error in a
corrupted packet respectively.
The capacity of an SSR_aware scheme can be found as:

CSSR_aware = ESSR [CCLDS (5(SSR)’£(SSR))] (5)

where, CCLDS (SSR) is the CLDS capacity for a particular SSR value, which can be

28

obtained by employing equation (3) for channel parameter values obtained for a

particular SSR value, i.e.
CCLDS (6(SSR),£(SSR)) =(1—6(SSR))+6(SSR)-(1—hb(3(SSR)))

Thus, we obtain:

CSSR_aware : (1" 5) + Z f(SSR)°6(SSR)(1—hb (£(SSR))) (6)
SSR

At this stage it should be noted that, equation (4) can be rewritten as
CSSR_unaware = CCLDS (ESSR [5(SSR)]’ESSR [3 (5510]) - (7)
Thus by convexity of the capacity function [30] and comparing (5) and (7) we have:
CSSR_aware Z CSSR_unaware (8)
The above deduction can be repeated for a variety of feasible side-information
mechanisms. Thus at this stage, at least from capacity perspective, the motivation for

developing CLDS protocols has been clearly established.

2.2.2. FEC for CLD/CLDS Protocols

The FEC schemes we use are realized by interleaving codewords across packets.
Thus an FEC scheme can be completely characterized by N , the packet block length,
the code length n , code rate R and the packet size s in bits and the symbol size b

(in bits) on which the code has been implemented. Thus each FEC block consists of

29

(N 12) message packets and (N -s)/ (b.n) codewords. The FEC decoding algorithm
uses erasure decoding when used in conjunction with traditional protocols and erasure-
error decoding when used with cross-layer protocols.

For Reed-Solomon based FEC the symbol size is typically the size of a byte i.e.
b = 8. With the CON scheme we employ the conventional FEC decoding as discussed in
[27]. For CLD and CLDS schemes, we modify the algorithm, the decoding algorithm
used for simultaneous erasure and error decoding of RS code is the Berlekamp-Massey
Algorithm (see e. g. [25], [26] for background).

For cross-layer schemes, 3 signiﬁcant improvement can be achieved by employing
LDPC based FEC schemes. For Low-Density Parity Check (LDPC) code based FEC
schemes the symbol size is typically chosen to be the size of a bit i.e. b=1. The decoding
algorithm used for LDPC is based on the Log-Likelihood Ratio (LLR) domain (see e. g.
[28]-[29] for relevant background on LDPC codes) implementation of the sum-product
decoder.
2.2.2.1. Side-Information for Improved decoding efficiency:

Performance over CLDS can be improved by taking advantage of the side-
information provided by CK-DATA for FEC decoding. This can be done as described

below

30

A. FEC block decoding failure: On occurrence of a block decoding failure the

channel decoder is unable to recover all the dropped/corrupted packets.

However as the FEC block is systematic, message packets that can be

identiﬁed as uncorrupted can still be forwarded to the eventual application. In a

cross layer design like CLD as we do not have information about CK-DATA, if

a decoding failure is encountered, the entire packet block is dropped. However

in a CLDS scheme the CK-DATA information is used to forward the correctly

received message packets to the application layer.

B. Apriori estimates for improved FEC decoding: CK-DATA side-information

can be utilized to acquire improved apriori estimate of channel impairments

and thus improve FEC decoding:

a. In case of RS based F EC, we use this information rather simplistically; if

the number of packets for which CK-DATA fails is less than the total

number of redundant packets, then even if all the corrupted packets are

treated as erasures the entire FEC packet block is recovered by pure

erasure decoding. Thus the CLDS scheme can switch to the pure erasure

decoding whenever possible to avoid decoding failure due to a high

corruption level in the packets.

31

b. In the LDPC decoding algorithm the LLR L(c,-)associated with code bit

c,- is initialized at the start of the decoding on the basis of the received bit

yi and an apriori assumption of the bit error probability of the ith bit

being Pe as

 

L<c.->=(—1)yi 10413012.) (9)

e

The message-passing algorithm (log-domain surn-product algorithm) is then
iteratively used to update L(ci). A bit is decoded to 1 if L(c,-) S 0 and 0 if L(c,-) > 0. An
accurate initialization of LLR plays a key role in the performance of LDPC codes and
thus improved CSI can help in improving the LDPC decoding, in particular if a packet is
received without any errors then L(c,-) corresponding to the bits in that packet is
obtained by setting I; = 0 and similarly if a packet is dropped L(c,-) is obtained by

settingPe = 0.5 . Thus we initialize the LLR for various cross-layer protocols as described

below:
A. CLD:
yo 1'. p . . .
L(c,-) = (-l) 1 log — 1f brt rs not erased,
p (10)
L(c,-) = 0 if bit is erased
B. CLDS:

32

L(Ci) = 0

if bit belongs to a dropped packet
L(Ci) = 00

if bit belongs to an uncorrupted recieved packet
L(Ci) = (-1)y " log (—1 _ a)
a

if bit belongs to a corrupted recieved packet

(11)

 

 

 

 

 

 

 

 

 

 

 

 

1
0%
(1m
9 \
0.88 ‘- 'x'm‘
,i 1 .a - RS-CLD
0-84)e-*--x--X-ll-*"*'H.+H_I +F6-CLDS
' .|
2 I. — UPC/CLDS
0.8 ﬂ T I I ﬁt I r T +1 I l I I I r I I ﬂ
0.01 0&5 0.09 0.13

0.17
8

Figure 6 Message packet throughput 2' , after channel decoding, for RS/LDPC based FEC
schemes over CON/CLD/CLDS. Channel conditions are given by 6 = 0.33, A = 0.01.

Figure 6 shows the performance of RS/LDPC based FEC schemes in which each

FEC block consists of 30 packets (20 message, 10 parity/redundancy) of 500 bytes.
Performance trends predicted by the capacity deductions are observed in F EC
performance also. In particular it can be seen that the performance of CLDS schemes are

the best. Even though CLD schemes can perform better than CON, as the corruption

level increases the performance of CLD is worse than CON.

33

With regards to the performance of a speciﬁc FEC scheme, it can be clearly seen that

LDPC based F EC schemes can provide signiﬁcantly improved performance when

compared with RS based FEC. This improvement at least in part should be attributed to

the fact that we employ a soft-decoding algorithm for LDPC based F EC. When the

corruption level is high, the performance improvement of LDPC with side-information is

signiﬁcantly better than all other combinations. Thus, in practical deployments side-

information may play a pivotal role in increasing the robustness of video communication

architecture.

Just as had been commented in the capacity deduction, we highlight that the CLDS

scheme explored above is a simplistic one. Improved performance can be obtained by

including better indicators of link quality. In particular in SSR_aware CLDS scheme we

initialize the LDPC algorithm as follows:

L(c,-) = 0 if bit belongs to a dropped packet
L(c,~) = 00 if bit belongs to an uncorrupted packet
. 1— X
L(Ci) 2 (.DJ’1 log {—12%} if bit belongs to a corrupted packet (12)
a
with with SSR indication X

34

CHAPTER 3
MULTI-TIER MODEL FOR BER

PREDICTION

In this chapter, we describe the method of collecting residual error traces which
represents various and realistic wireless channels; and we analyze MAC-to-MAC bit-
error characteristics of these wireless MAC-layer channels. Based on our analysis, we
develop a multi-tier model (MTM) for BER estimation and prediction. Here it is
important to explain what we mean by BER estimation and prediction. In contemporary
wireless networks [43], [44], a wireless receiver’s link layer performs a packet-level
checksum to determine whether a packet is error-free or corrupted. Obviously, the BER is
zero if a packet passes the checksum, thereby eliminating the need for BER estimation.
For a packet that fails the checksum, the receiver does not know how many hits within
the packet are corrupted. This knowledge is important analytically and in practice, and it
has a direct implication on the effective channel capacity and the choice of certain cross-
layer wireless protocols [45]-[47]. Thus for a corrupted packet, one needs a scheme that

can render an accurate estimate of the number of errors in the packet. Once the BER of

35

the current packet is estimated, the next problem is to predict the number of errors in the
following packets.

The proposed MTM leverages Signal to Silence Ratio (SSR) indication and
checksum side-information to estimate the BER in the current packet and to predict the

BER in future packets.

3.1. Trace Collection

Wireless network simulators often rely on channel models to recreate the
complexities of the real-world. Channel models that fully take into account channel
memory that produces persistence and clustering of errors in transmissions are often
based on Markov chains [8]-[10]. Unfortunately, the complexity of Markov models
grows as 0(n2) with increase in the memory length n of wireless channel errors. To
provide an appreciation of the increasing richness of the inhabitants of the 2.4 GHz
Industrial-Scientiﬁc-Medical (ISM) band, consider that IEEE 802.11b Wireless Local
Area Networks (WLAN) [57], microane ovens and cordless phones all operate at these
frequencies. Additional factors which further complicate channel modeling include

effects of interference, differences in environment (open space, ofﬁce, residential, public

36

etc.) and the question of what constitutes “typical” background trafﬁc and noise. Of

course, it is possible to create interference scenarios by placing additional interference

sources in simulated settings and hope that the simulator does a good enough job at

modeling interference effects. However, the validity of assumptions made about trafﬁc

patterns and background noise due to interfering sources is unreliable. Since simulations

[58] are often based on simple channel models, the validity and accuracy of simulator

results is doubtﬁrl. The environments are often simplistic and unrealistic in comparison

to what is observed in real-life. To that end, we collect a comprehensive set of residual

bit-error traces representing realistic wireless channels and use them in our experiments

to verify the performance of our developed schemes.

37

3.1.1. Data Collection

We collected error traces simultaneously on ﬁve IEEE 802.11b wireless receivers (or
sniffers). The receivers were located at different places in a lab, while the access point
(AP) was placed in a room across a hallway from the receivers to simulate a realistic
classroom/office setting [Figure 7]. The receivers’ MAC layer device drivers were
modiﬁed to capture corrupted packets. Each experiment comprised of one million

packets with a payload of 1,000 bytes each, i.e., each trace has approximately 1 GB of

21ft

 

1211]

 

 

Figure 7 Topologies used for wireless trace collection.

38

data, which corresponds to approximately 4 and half hours of error trace collection when

packets are transmitted at 500kbps. Note that the error traces were collected in a

university lab environment so it is not possible to precisely account for movement

activity in the hallway. However, even though the receivers were located in the same

room, they were place in different locations, and consequently they encountered very

different channel state or loss pattern. For instance, it was clearly observed that BER of

the traces collected by the receiver 5 in Figure 7, which was placed near by the window

(which was close to a street) and surrounded by metal bookshelves, was considerably

higher than that of other traces collected by other receivers. Therefore, the error traces

can represent the various and realistic channel states of wireless environment.

A wired sender was used to send multicast packets, in which a multicast IP address is

deﬁned, with a predetermined payload on the wireless LAN; multicasting disabled MAC

layer retransmissions. In addition to a packet’s header and payload information, we

logged signal to silence ratio (SSR) for each packet. The sender used four different

transmission rates which are 500, 750, 900 Kbps and 1 Mbps. Note that a packet rate

(packets per second) differs with transmission rates. At the physical layer, the auto rate

selection feature of the AP was disabled and for each experiment the AP was forced to

transmit at a ﬁxed data rate (2, 5.5 or 11 Mbps). For each trace collection experiment, we

39

set a transmission rate and a physical layer data rate, and hence 12 different experiments

were conducted at different times of day, i.e., 60 error traces were collected.

3.1.2. Average Statistics of the Traces

TABLE I provides some statistics of the traces collected for this study. Since the
physical layer robustness decreases with an increase in data rate, the average packet error
rate increases with an increase in the physical layer data rate. In particular, the average
packet error rate increases from approximately 10% at 5.5 Mbps to almost 40% at 11
Mbps. Since the wireless receivers were placed at different locations, the receivers
experienced different packet error rates. The overall minimum and maximum error rates

in TABLE I outline that the receivers under consideration were experiencing both good

and bad link conditions.

40

TABLE 1 Statistics of Traces Used In This Study

 

 

 

 

Phy. data Avg. PER Min. Max. Avg. Min. Max.
rate PER PER SSR SSR SSR
( MbPS) (dB) (dB) (dB)
2 5.97% 0.75% 14.31% 14.75 0 34
5.5 9.79% 0.61% 22.74% 15.27 0 32
11 39.5% 10.99% 77.83% 16.51 0 35

 

 

 

 

 

 

 

 

 

TABLE II Error Statistics For Varying SSR Values At 11 Mbps

 

 

 

 

 

 

SSR Average Packet— BER of all packets BER of
(dB) Error Rate (error-free & corrupted
corrupted) packets
5 0.701 0.0253 0.0361
13 0.6248 0.0157 0.0251
20 0.2166 0.0048 0.0223
26 0.0384 0.0023 0.0591

 

 

 

 

 

The average, minimum and maximum SSR values are also shown in TABLE 1. Note
that the minimum SSR value is zero at all three data rates. From a prior analysis [21] [35],
we know this SSR range is of interest for the protocols considered in this paper.

The relationship between SSR values and the channel error rate is also shown in
TABLE I. It is easily observed ﬁom the second column of TABLE II that packet error
rates increase drastically with a decrease in SSR values. In particular, the packet error
rate increases by approximately 18% as the SSR decrease from 26 dB to 20 dB. Similarly,

there is a packet error rate increase of about 41% between SSRs 13 and 20.

41

3.1.3. BER Behavior at Different SSR Values

Figure 8 uses the 11 Mbps residual channel to show that the BER behavior of the
channel changes considerably with a change in the received packets’ SSR values. (For
brevity, in this section we only show results for the 11 Mbps channels because results for
2 and 5.5 Mbps channels were similar. In the modeling sections, we show results for all
three channels under consideration.) At an SSR of 5 dB, small number of errors in a
corrupted packet are fairly infrequent, as shown by the low normalized ﬁ'equency of
errors in the [1,200] range of Figure 8 (a). This is mainly because at such low SSR values,
most of the received packets are corrupted, and a large number of bits in these packets
are corrupted. As the SSR increases, the skew of the histogram changes and the corrupted
packets have fewer bit-errors. This trend can be observed in Figure 8 (b), (c) and (d),
which show that the frequency of small number of bit-errors increases with SSR. For
instance, at an SSR of 26 dB, almost 25% of corrupted packets have less than ﬁve bit-
errors. Comparison of Figure 8 (a), (b), (c) and ((1) clearly shows that an increase in SSR

decreases the mean number of bit-errors in a corrupted packet.

42

 

 

 

 

 

   

 

 

 

 

 

 

 

 

 

 

 

 

g g
'5, 0.01» 50'06
.9 .9
w .2
E c 0.04
g 8
N 0.005 :3
ﬁ 5 0.02
E E
0
Z Z 0
0 40 80 120 160 200 50 100 150 200
Number of errors Number of errors
(a) SSR=5dB (b) SSR: 13dB
E F I 7 E
a, a 0.2»
.3 0.1' 53,
E E 0.15
.8 8 0 1 b
g. 0.05 g -
g g 0.05
2 0 Z 0
50 100 150 200 50 100 150 200
Number of errors Number of errors
(c) SSR = 20 dB (d) SSR = 26 dB

Figure 8 Normalized histograms of the number of bit-errors in a corrupted packet for varying
SSR values; histograms are averaged over all 11 Mbps residual traces, and only number of
errors between [1,200] are shown.

The relationship between SSR values and the channel error rate is also shown in
TABLE II. It is easily observed from the second column of TABLE H that packet error
rates increase drastically with a decrease in SSR values. In particular, the packet error
rate increases by approximately 18% as the SSR decrease from 26 dB to 20 dB.

Similarly, there is a packet error rate increase of about 41% between SSRs 13 and 20.

To avoid repetition, we defer discussion on columns 3 and 4 of TABLE II to a later

43

section.
Based on the results presented so far, we deduce that SSR is a robust and effective
side-information of a wireless link’s condition. The following section leverages this side-

information to accurately estimate and predict the BER of the channel.

3.2. A Markov Model for PER Prediction

We ﬁrst emphasize an important point highlighted by columns three and four of
TABLE 11. Note that for low SSR values (i.e., poor link conditions,) the BER computed
using all (error-free and corrupted) packets is somewhat similar to the BER computed
using only corrupted packets. However, when we compare the BER at higher SSR (20
and 26 dB in TABLE II), the BER computed using all packets is orders of magnitude
different from the BER computed using corrupted packets only.

The BER difference for high SSR value is very stark because: (i) most of the received
packets are error-free; and (ii) the relatively small number of corrupted packets has
relatively-high BER. In general, and based on our extensive study of this issue, it became
evident that the relatively-small number of corrupted packets, at relatively-high

SSR/SNR values, provides good (yet conservative and slightly biased) estimates of BER.

44

Meanwhile, including the relatively large number of error-free packets in the BER
estimates (especially at high SSR/SNR values), provides highly optimistic (very biased)
estimates of BER in actually corrupted packets. A surprising result shown in TABLE II is
that the BER in the corrupted packets is very high for high SSR values. We observed that
while the total number of corrupted packet at high SSR values is quite small, the
corrupted packets contained a large number of bit-errors. We believe that this
phenomenon occurs because the corrupted packets at high SSR values are mostly due to
packet collisions, and therefore these packets contain are highly corrupted.

High SSR values are quite important because over real-life wireless channels, many
packets are received with high SSR values. (For instance, as shown in TABLE I, we
observed maximum SSR values of 34, 32 and 35 dB for the 2, 5.5 and 11 Mbps channels,
respectively.) Therefore, we propose that at the ﬁrst tier, an accurate predictor should
solely focus on packet-error prediction. In turn, BER prediction should only be done for

packets which are predicted to be corrupted by the packet-error model.

3.2.1. Correlation of the Packet-Error Process

To determine an accurate ﬁrst-tier packet-error model, we ﬁrst analyze the correlation

coefﬁcient of the packet-error process. Let X" denote a discrete binary packet-error

45

random process. Thus for a givenn , X" 6 {0,1} is a binary random variable with

X" = 0 and X" =1 respectively representing that packet n is error-free and

corrupted.

For each physical layer data rate, we treat the packet-error sequences obtained from
the traces as realizations of the packet-error process for that data rate. Then the random
process’ autocorrelation ﬁmction is [39]

R[q]=E{x0x,,}, (13)
where E{.} is the sample expected value computed using the process realizations.
Using the autocorrelation, we compute the sample correlation coefficient of a packet-

CITOI' pI'OCCSSI

[ ]_E{X0Xﬂ}-E{XO}E{X'I} (14)
p 77 — Uxoc’x,’ .

 

where 0' X represents the sample standard deviation of random variable X . This
correlation coefficient provides a normalized measure of the amount of correlation
present in the data. Markov chains are generally used to model processes with low-

correlation values, whereas highly correlated processes are typically modeled using

heavy-tailed models [48].

46

0.1

 

0.08 ~

0.06 -

 

 

 

Correlation coefﬁcient

12345678910

Lag

Figure 9 Sample autocorrelation coefficient of an 11 Mbps trace.
Figure 9 shows the correlation coefﬁcient of a sample 11 Mbps residual trace.
(Correlation decays for other traces were similar and are therefore skipped.) Clearly, the
packet-error process’ correlation shows an exponentially decaying trend. The correlation
function drops to and stays at an insigniﬁcant value at lags of more than three. Thus we

deduce that a 3rd order Markov chain can be used to model the packet-error process.

3.2.2. The 3rd order Packet-Error Markov Model

To predict packet errors, we deﬁne a discrete-time Markov chain such that each state

of the chain corresponds to one of the 23 = 8 possible combinations of three binary

symbols; each binary symbol represents an error-free or corrupted packet. Transition

47

probabilities of the 3'd order Markov chain are computed by sliding a 3-bit window over
the wireless traces and by observing the ﬁ'equency of a binary pattern 5 =[x1x2 ...xk]
followed by another bit-pattem )2 =[y1y2 yk] , for all patterns gt; and X .

The 3rd order Markov chain model’s transition probabilities are computed from the
packet-errors observed in the traces of this study. Since the BER behavior changes
drastically with respect to the physical layer data rate, for each data rate (2, 5.5 and 11
Mbps), we train a different 3rd order Markov model. After model training, given a
packet’s checksum side-information (pass/fail), we use the Markov chain to predict
whether the next packet will be error-free or corrupted. The prediction process is
conducted as follows. From any given state of the 3rd order Markov chain, the process
can transit either to a state with an upcoming error-free packet or to a state with an
upcoming corrupted packet. There are two transition probabilities associated with these
two possible transitions, say p and] — p. To predict the checksum of the next packet,
we treat these probabilities as a Bernoulli random variable, with probability of success
p corresponding to the probability that the next packet will be error-free, and the

probability of failure 1— p corresponding to the probability that the next packet will be

corrupted. Furthermore, from the checksum of the next packet, we know whether or not

our prediction was correct. If the prediction was correct then the predicted state of the

48

Markov chain is used for the subsequent prediction. Otherwise, the Markov chain’s

current state is changed to the opposite of what was predicted.

As explained earlier, BER estimation is only invoked for corrupted packets. In the

following section, we explain the BER estimation using SSR values.

49

 

 

o 0

SE 38: \e Kmm hmuowhmun.
” 3336 N no:

 

85. L
2: u 35¢.

an Emma

.m. .m.
o o o o

owlﬁ
o n Exam

 

 

an :Jawusegfea __,

Tier3 : BER

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

academies" .5287 1.65.3 a 2 3&3 83m ion £6.28? 1:2me
.> H . .
ov
32m u 32 u 22mm. _ _ n 1.535 ............ Sm u a n $5. _ L n 33.35
odor u 32 u 22% _ o n 35.3 a of u a u :55 _ o u 3me
o u Em
ER :8: Re «mm VENEER Took I Bio 33‘ 38:10:.“ .N 6 $8M: ink
”Sauagamﬁ
,__. _ Ea 33280” H
H F _ + #9 + in actuate u o
73; 35133— Tmooi 35;
a 2% a 3% m 2% 4 23 m 2%
< a ._. . _. ._. ; 2
,4 1+.

 

 

  

 

 

T ierl : packet — error prediction Tier 2 : SSR prediction prediction

Figure 10 The multi-tier BER prediction model.

50

3.3. Packet-Level Model for SSR Prediction

Once a corrupted packet is predicted by the tier 1 model, at the second tier we use the
SSR of the received packet to predict the SSR of a future packet. For each possible SSR

value (which ranges between 0 and 100 dB), we maintain a discrete conditional

probability distribution of the next SSR values. Thus a conditional probability value Pi]-
yields the probability that if the SSR value of the current packet is i dB then the next

packet will be received at an SSR of j dB. The conditional probability distributions are

computed using corrupt packets observed in the traces.

3.4. BER Prediction using BSC Models

The second tier model predicts the SSR of the next packet using a conditional
probability distribution. In accordance with prior discussions and as shown in TABLE II,
the BER of the channel changes with respect to the channel SSR. In general, we observed
a non-linear relationship between SSR and BER. Therefore, after predicting the next
packet’s SSR, we employ a binary symmetric channel (BSC) model to predict the BER
of the next packet. While we acknowledge that the BSC model is somewhat simplistic

for the present problem, we observed that this simple model can provide quite accurate

51

BER prediction, especially at low physical layer data rates. Hence, we use the crossover
probability of the BSC model corresponding to the predicted SSR as the BER estimate of

the next packet.

3.5. Performance Evaluation of the MTM

We now compare the performance of the proposed MTM model with two leading
predictors, namely the Yule—Walker (Y-W) predictor [39] and frrrite-state Markov Chain
(F SMC) predictor [49],[50].

For the Y-W based BER prediction, we ﬁrst use SSR values of the last k packets to

predict the SSR of the next packet. More speciﬁcally, let S n—k+1 ,S n_k+2,...,S n be

the SSR values of the last k packets. The Y-W predictor predicts the next SSR

value, SM] , using a linear ﬁlter of the form:
k-l
Sn+1 = Z hiSn—i °
.=0

The coefﬁcients of the ﬁlter are computed as:

52

 

'h[o]' 'R[o] R[l] R[2] R[k-1]‘"1'R[1]'
h[1] = R[1] R[o] RD] RUC—z] RP],
.hlr-IL _R[k-1] ~3- Rm Rm- gem-

 

 

 

 

 

where R[.] is the autocorrelation function of (13). We experimented with different
values of k , and obtained the best Y-W prediction performance for k = 5. Once the

SSR is predicted using the above equations, we use the tier 3 BSC models of Figure 10 to

predict the BER of the next packet.

53

 

 

 

 

u 10
o 7" L
561 5‘ —MTM

8 ,J . “Yule-Walker
-- 5- r i ---FSMC

E 1 ﬁrst}! "1'1
3411 ,a-vé 11.]

5.3 m V

e 2 ‘ " '1‘

3

'6

B1 . . .
“5 o 50 100 150

time-window
(a) Physical layer data rate = 2 Mbps

 

 

 

 

 

 

 

 

 

 

 

-3
9 1o.X10 . .
5 . —MTM
g 8 1) I] ,1, (\E ”Yule-Walker
._ * "I i : .....
:3 '1 ﬁg 15' , FS'MC ‘
'o . r" 9
9361111 1 114.
a 1 111
.9. ~ , g!" “In ~ 1
2 4’ t.
o
0)
.0 . J .
‘° 0 50 100 150
time-window

(b) Physical layer data rate = 5.5 Mbps
S—
o 002.
at, 1 —TTM
: *mYule-Walker
.9 .....
“‘5 0.015 FSMC
‘6 1 ,
9 . 1...
0- ‘
a: . , . ”r
B r (J ‘r 11 t 1 1123131,“)! .
m 0 50 100 150

time-window

(0) Physical layer data rate = 11 Mbps
Figurell Absolute error in BER prediction; time-window length 1' = 50 seconds, results

are averaged over all the traces.

54

The FSMC predictor [49] employs a Markov chain model of SNR values to predict
ﬁiture SNR values. The FSMC model of SNR values is designed speciﬁcally for a fading
channel and the model assumes the availability of SNR values for each symbol. The
FSMC partitions SNR values into N disjoint intervals, where each interval represents
an FSMC state. Different state SNR partitioning strategies have been proposed in prior
literature [49],[50]. It has also been shown that under fading conditions, an FSMC model
in state i can only transit to its neighboring states i —l andi +1. After predicting the
next SNR value, BSC models are used to predict the BER of the next packet.

Since on a residual channel we only have packet-level SSR information, the
previously-proposed SNR partitioning algorithms are not directly applicable here.
Moreover, at times we observed large ﬂuctuations in SSR values. These ﬂuctuations
made the constraint of transitioning only to the neighboring states impractical. Therefore,
we modify the original FSMC model [49] such that the model can transit from any
current SSR value to any other SSR value. For fair comparison with the MTM, we place

each SSR value into a different FSMC state.

55

To clearly show the accuracy of a BER predictor without short-term biases, we

compare the predicted and actual BERs in non-overlapping time-windows of length 2'

TABLE 111 Average Absolute Error of BER Predictors.

 

 

 

 

Phy. data rate Three-Tier Yule- Finite-State Markov
(Mbps) Model Walker Chain
2 0.0015 0.0042 0.0028
5.5 0.0044 0.0075 0.0057
11 0.0094 0.0097 0.0101

 

 

 

 

 

 

seconds. In each time-window, we compute the absolute value of the difference between
the predicted and actual BERs. This difference is henceforth referred to as absolute
prediction error. Clearly, smaller prediction error implies higher prediction accuracy.

F igurell compares the performance of the proposed model with Y-W and FSMC
predictors for r = 50 seconds; that is, each point shown in Figurell is the average
prediction error in a 50 second time-window. It can be observed that at 2 and 5.5 Mbps
the proposed model provides consistently better performance than Y-W and FSMC
predictors. At 11 Mbps, the accuracies of all the predictors are comparable. Performance
of the MTM at 11 Mbps is less convincing because the bit-error characteristics of the 11
Mbps channel are quite different from 2 and 5.5 Mbps. Speciﬁcally, prior studies have
shown that 11 Mbps bit-errors exhibit long-range dependence, and therefore multi-scale

models are required to characterize these bit-errors [51]. We are currently incorporating

56

such models in the multi-tier framework to improve BER prediction at 11 Mbps.

In TABLE III, we compare the average accuracies of the predictors under

consideration. For the results of this table, the absolute prediction error was averaged

over the entire trace. It can be clearly seen that average prediction accuracy of the MTM

is consistently higher than both Y-W and F SMC predictors. Thus, irrespective of the

physical layer data rate, on-average the MTM renders higher prediction accuracy than

both Y-W and F SMC predictors.

57

CHAPTER 4
RATE PREDICTION AND TUNING

In this chapter, we develop a rate adaptation architecture using the two main
contributions: channel capacity prediction and optimal rate tuning. Note that the “channel
estimator” in the client and the “rate tuner” in the server shown in Figure I employ the
channel estimation and tuning schemes developed in this chapter, respectively.

For channel capacity prediction, BER is estimated from the crossover probability, 8 ,
of a BSC model that is inferred from the packet SSR values. The residue-error
entropy, Hb(£), for a time-window is then calculated using two methods: (i) H b(6) is
estimated by the average of the instantaneous entropies (entropy of each packet) over the
time-window (CLDSI), or (ii) H b (8) is estimated by the entropy of the average BER
over the time—window (CLDS2 ). These entropy estimates are used as the prediction for
the next time-window. Optimal rate tuning is achieved by finding the rate that provides

the maximum PSNR over the distribution of the channel prediction error process.

58

4.1. Limitations and Optimal Rate Adaptation Period

In this chapter we consider the following simplifying assumptions: (i) the channel
code achieves the capacity (i.e., an ideal channel coder), and a block of packets cannot be
recovered at the client if it is coded with an overestimated rate, i.e., a channel coding rate
that exceeds channel capacity; (ii) A video encoder provides a bitstream having a bitrate
that is exactly the same as the required bitrate, and the bit stream renders PSNR value
[3 8] according to the bitrate. With (i) and (ii), we realize the architecture where for video
rates that cannot be supported by the underlying channel capacity, the video quality
reduces to zero (i.e., zero PSNR value). Note that without the assumptions (i) and (ii)
(i.e., a realistic channel coder or video encoder was used), we should carefully take the
performances of the channel coder and video encoder into consideration in ﬁnding the
optimal rate. Therefore, the above assumptions were made to focus only on the
performance of our proposed scheme.

In addition, the time-window for the experiment is selected such that the variation in
link-quality ﬁom one window to the next is minimized. Allan Variance is a measure
which allows us to methodically determine such an optimal size of the window [40], and

it computed as follows:

59

 

2000

 

8 1500
C
52
(U .
> 1000
C
1:“ 500-
<

 

 

 

’0 1‘0 2'0 3‘0 40 50
time-window (seconds)

Figure 12 Allan Variance as function of time window.

 

2 _ 1 k _ 2
AVAR (1)—2(k_1)§1(C(r),,,1 C(r),,) (15)

where AVAR2(r) is the Allan Variance as a function of averaging time, I ;

m
Cn(=1—iZHb(£,-)) is the average channel capacity of the measurement in time
i=1

window n; and k is the total number of time windows. Note that r in equation (15) is
related to the number of packet, m. However, different transmission rates lead to
different packet rates, and hence the number of packets, m , in a time window differs

with transmission rates. For this reason, we expressed equation (15) in terms of r , with

which we can represent a common time interval for any transmission rate.

60

For all traces that we have considered, the Allan Variance of BER is minimal for a
time-window size of 5 seconds [Figure 12]. Hence, in this study we use the time window

with a size of 5 seconds.

4.2. BER and Channel Capacity Estimation

As shown in TABLE II, the BER of the channel changes with respect to the channel
SSR. In general, we observed a non-linear relationship between SSR and BER. Here, we
employ a BSC model, which utilizes each SSR of a packet to estimate the BER over a
given block of packets. While we acknowledge that the BSC model is somewhat
simplistic for the present problem, we observed that this simple model can provide quite
accurate BER prediction; and more importantly it provides a conservative estimate for
the channel capacity due to the lack of a memory model in the channel.

We partitioned the collected traces into training and test data. Next, with training data
we deﬁne bins over the entire SSR range and determine the average BER for each bin.
Note that we regard the average BER as the crossover probability in Binary Symmetric
Channel (BSC) model [21], [33], [35]. Hence, a packet renders a BER estimate according

to its SSR. During the testing phase we use these estimates of BER to determine channel

61

capacity over the time window (or the number of packets, m for a given transmission

rate) that is deﬁned in equation (15). The channel capacities are calculated as follows:

~CLDSI m ~
(2,, =1——ZHb(e,-), (16)
m.=l
~CLDS2 l m ~
C" =l—Hb(—Z£,-),and (17)
’"i=1
~CON 1 m
C" =1-—ZZ,~=1-PER (18)
m

i=1

~

where 8,- represents the channel BER estimate for packet i, Z,- is a binary variable

representing the status of the checksum of packet i (Z: =1 if checksum fails).
Equation (16) is the capacity estimate CLDSI computed based on the average of the

instantaneous per-packet error-process entropy in a block of m packets, equation (17) is

the capacity estimate CLD82 computed based on the error-process entropy using the

average error of packets over the same m -packet block, and equation (18) is the estimate

of channel capacity under CON protocols.

62

4.3. Channel Capacity Prediction

Prior studies have indicated that bit-errors have high temporal correlations, especially
in 802.11b wireless networks [34]-[36]. In Figure 13, the correlation coefﬁcients of a
number of traces are shown and calculated on the basis of the channel capacity process.

The correlation coefﬁcient is computed as

= E[CnCn+l]— Elcn]E[Cn+l]

,0 Jvar[Cn]- var[C,,+1]

’ (19)

where E [.] and var [] are the sample mean and the sample variance ﬁmctions. Figure

 

0.6 - _

0.4 - -

correlaion

0.21 4

 

 

1 2 3 4 5 6 7 8 91011121314
trace number

Figure 13 Temporal correlation in channel capacity for 11 Mbps traces (traces from
1 to 5 collected with transmission rate at 500, 6 to 7 at 750, 7 to 10 at 900, and 11 to
14 at 1024Kbps).

63

13 clearly exhibits the existence of temporal correlation that is non-negligible in all
traces, and it is quite signiﬁcant in most traces. This correlation can be taken advantage
of to predict the channel capacity for the next time-window. We exploit this correlation
by using the channel capacity estimate of the current packet as an estimate for the next

packet’s channel capacity:

A

Cn+1 : C" 9 (20)

A A A

~

which deduces the channel prediction error; en+1 = C" — C n = C n — C n = C n+1 — C n .

Although we have considered other optimum predictors (e.g., Yule-Walker [39]), our
simulation results demonstrate that the above simplistic prediction in conjunction with
the optimal rate tuning (described below) provides signiﬁcant improvement over
conventional protocols. Furthermore, the prediction performance of equation (20) is very
similar to the optimum Yule-Walker predictor [39] [TABLE III]. Consequently, we focus
the remainder of this paper (in the context of the proposed rate prediction architecture) on

the predictor determined by equation (20) due it is minimal complexity.

4.4. Optimal Rate Tuning

We need to adhere to a rate that is strictly below channel capacity to avoid excessive

64

packet drops. Therefore, if the predicted channel capacity is directly employed, there is a
good likelihood that the predicted capacity (as a random variable) may exceed the actual
channel capacity. A natural workaround to this problem is to make the predicted capacity

more conservative by subtracting a small offset A from the channel capacity prediction,

A

Cn—A. Such a strategy can, however, results in considerable long-run bandwidth

wastage, and therefore judicious selection of the A parameter is extremely important.
We propose to ﬁnd the “optima ” value of A (leading in turn to the optimal video rate)
such that the average video peak signal-to-noise ratio (PSNR) over some period of time

(or over a set of blocks of packets) is maximized:

at: N A A
An =argmax §Z<I<Cn >Cn—A»-Q«Cn—A)-T)
A -1

(21)

i A t

and Rn = Cn-An

65

0.035

 

 

 

actual prediction error
0.03 ~ I I I Igaussian distribution p = 0, o = 0.01 ~

 

 

 

0.025
U. 0.021
E

0_ 0.015»

0.01 ~

    

0'005' MSE: 5.4x10'6

 

 

1

49.03 002 -0.01 0 0.01 0.02 0.03
prediction error

Figure 14 The channel prediction error process (e) resembles the probability distribution

of Gaussian.

A

where, C" and C" are the actual and predicted channel capacities, and I (~) , Q(-) and

T respectively represent an Indicator function”, an RD (video quality) ﬁmction and a
transmit rate4. Thus the above objective function assumes a rather simple binary quality-
indicator that forces the estimated PSNR value to zero when the rate exceeds the capacity.
Note that based on the present objective ﬁmction deﬁned in equation (21), the “optimal”
rate can be computed only when the overall statistics and the actual channel capacity

values are available. Since the optimal rate determined using (21) is based on the entire

 

3 1(;)={0 ifé'isnotsatisﬁed

1 otherwise

4 T = p kts m T ’ p kt Size where 2' is time — window (bits per second), and hence R- T

 

r
is the effective transmission rate which is actually transmitted for the underlying channel.

66

trace, it cannot be utilized in real-world applications. We resolve this issue by observing
that the probability distribution of the channel prediction error process, e , is very close to
a Gaussian distribution, N (0, 0'3) , as shown in Figure 14. Furthermore, we take into
account the normal distribution, which is well known to have the highest entropy, and
hence provides the most conservative estimate. Based on the Gaussian assumption for the
prediction error, this leads to an expression that replaces the indicator function with the

normal distribution function. The predicted channel capacity can then be optimally timed

by ﬁnding a rate Rn to maximize PSNR n as

R, = argmax Q(R,, ~T)-Pr{C,, 2 Rn}+Q'(|R,, —C,,|-T)-Pr{C,, < Rn}
Rn(OSR,,SI)

l A
1 1 exp —(Cn—Cn)’-
Rn J27“)? ch'e2
= argmax Q(R,,-T)~l A
R 05R 1
n‘ "5) 1 1 exp —(C,,-Cn)2 C (22)
0J2n'0'e zo-e2 "

 

Cn

 

 

 

 

 

 

 

 

 

R A
n l "(Cn _ Cn )2
A I exp 2 Cn
R - C O \/ 27t0’e 20'e
+ Q0 n n . A
Rn l 2
I l "(Cn - Cn)

I— exp 2 C"
0 s/ 27r0'e 20¢,

67

where Q(-) is the RD (quality) function of the video sequence (as a function of total
number of bits used to code the sequence); Q'(-) is the quality function of the video
sequence for rates above capacity; and [(9 represents the probabilities of rate below
capacity (the ﬁrst term in equation (22)) and of rate exceeding capacity (the second term
in equation (22)) based on the distribution of channel prediction error process. As we
highlighted in the previous subsection, we assume that Q'(-) equal to 0. Thus the
predicted channel capacity can be fully utilized and the video quality can be optimized
when the product of the RD function and the probability distribution of the channel
prediction error process is maximized. In the subsequent section, we use the above

objective function to compute the optimal video rate.

4.5. Performance Evaluation

We now compare the performance of the proposed rate prediction
architecture, ORPACLDS , with ORPACON. For notational convenience, henceforth we
refer to the architectures of equation (16) and equation (17) as ORPACLDSl

and ORPACL D 52 , respectively. For ORPACON , we ﬁrst use checksums for a time-window

to ﬁnd the PER, which is in turn used to estimate the channel capacity for the time-

68

window. As explained earlier, this channel capacity estimate is then used as the predicted

capacity for the next time-window. For ORPACON , the optimal rate adaptation scheme is
also employed in the same way as the ORPACLDS schemes.
To compare the performance of each architecture, experiments in this study were
conducted as follows:
1.Estimate the channel capacities of k time windows in a trace by using BER estimate
of each packet and equation (16), (17), (18), and (20).
2.F ind the optimal rates of k time windows by using equation (22).
3.Calculate the average PSNR values over all time windows of a trace (when the
chosen optimal rate is greater than the actual capacity for a given time window, we

set the PSNR of the time window to zero) as follows:

k a It
Average PSNR :11: Z Q(Rn-T)-I(Cn —Rn > 0).
n=1

4.Repeat 1-3 for different error traces.

As for channel capacity prediction, Figure 15 clearly shows that ORPACL D51 and

ORPACLDS2 perform better than ORPACON. TABLE IV also compares the performance

of channel capacity prediction in terms of MSE for all three architectures. It can be

observed that at any physical data rate ORPACL D 51 provides consistently better

69

performance than ORPACLDS2 and ORPACON. Note that exploiting of PER results in
less accurate prediction than the prediction based on a BER estimate, and because of the
convex property of the entropy frmction, ORPACLDS2 provides more conservative and

hence less accurate measure (in MSE sense) than ORPACLDSl .

From TABLE V, we can observe that accurate prediction alone does not necessarily
imply better overall rate adaptation. Speciﬁcally, if capacity prediction is employed
(directly) without optimal rate selection, over-predicted channel capacities cause
signiﬁcant packet drops, thereby leading to considerable PNSR degradation. Therefore, it
is imperative that any capacity estimation and prediction framework is complemented
with rate tuning.

The excellent performance of the proposed optimal rate tuning scheme can be clearly
seen in TABLE VI and Figure 16. By comparing TABLE VI (optimum rate selection)
and TABLE V (no optimum rate selection), one can observe that the proposed scheme
signiﬁcantly improves the overall performances of ORPACLDSI and ORPACLDS2 as
well asORPACON. For ORPACON, it should be observed that the optimal rate tuning
degrades the overall performance at 11 Mbps. This is because ORPACON predicts the
channel capacity rather conservatively at 11 Mbps due to the fact that large number of

packet drops are often introduced. Hence, the predictions of ORPACON are consistently

70

and considerably lower than the actually available channel capacity. As a result, the

performance of ORPACON at 11 Mbps is slightly degraded.

In TABLE VI, we also compare the overall performances of

ORPACLDSI ,ORPACLDS2 , and ORPACON in terms of the average PSNR at different
physical data rates and transmit rates. It can be observed that at any physical data rate,
the proposed rate adaptation architectures, ORPACL D S1 and ORPACL D 52 , perform
consistently better than ORPACON. However, at 2 and 5.5 Mbps channels, ORPACLDS2
on average performs better than ORPACL D 51 whose prediction performance is better
than ORPACLDS2 . It is true that ORPACLDSI 2 ORPACLDS2 because of the convexity
property of the entropy frmction, H b (c) [41]. Additionally, for 2 and 5.5 Mbps channels,
the variance of channel capacity is considerably smaller than the 11 Mbps channel as
shown in Figure 15. Thus, ORPACL D 51 has more probability of selecting the optimal rate
exceeding the channel capacity than ORPACLDS2 even after applying the rate tuning
scheme. This results in the performance degradation of ORPACL D 51 in the video quality
although ORPACLDS1 provides a better performance in prediction of the channel
capacity.

To further illustrate the performance of proposed architecture, we have thoroughly

tested it using a comprehensive set of RD functions from various CODECS (H.264,

71

Dvix5.l, WMV9, MC-EZBC, and XivD0.9), and a variety of video sequences (Foreman,
Head, Mobile, and Stefan) [3 8].

TABLE VII and TABLE VIII show that the proposed architecture can achieve up to 5
dB improvement in the average PSNR comparing to ORPACON and provides
consistently good performance with all types of CODECS and video sequences.

In summary, higher channel capacity and more accurate channel capacity prediction
are achieved by ORPACLDS than ORPACON at any physical layer data rate; and this
leads to better channel utilization and more optimum video rate selection. Moreover, as
described in Chapter 3, the collected traces can be considered as the representation of
realistic wireless channels, and consequently the trace-driven experiment results

represent the outstanding performance of our proposed scheme in the realistic wireless

environment.

72

P
N

.o
on

channel capacity
O
on
. :I-

I
1% ‘ a" ‘0‘ 5:2, ‘I/(v
' ...~$' °

5 0 995 ‘3 / “‘no'

_ '-.’.\_ . —Actual channel
o :v.’

E “-._! "'ORPACLDS1
2 0.99_ ""ORPACLosz

° ...... ORPACON

 
      

 

 

 

 

 

 

13180 13.82 13734 13186 13.88
time-window

(a) Phy. rate- 2 Mbps & Xmit rate- 750Kbps

1

 

  
  

  

 

 

 

 

 

its
g lk‘s 3‘\
0 0.984. ’-.‘.
8 . 3.
:- l. '1‘
8 :'-..°\,"-"~ .
0 96 o. ....... . —Actual channel -
7'3 ' .. ORPA
E "' cros1
g 0.94. m ORPACLDSZ
..... ORPACON
720 725 730 735
time-window

(b) Phy. rate-5.5 Mbps & Xmit rate- 750 Kbps

9
co

 

570

 

 

—Actua| channel . ..
-.-.ORPA "

CLDS1 '
""ORPACLDSZ
...... ORPA

 

CON

 

 

5'75

550

505
time-window

(c) Phy. rate- 11 Mbps Xmit rate- 1 Mbps
Figure 15 The channel capacity prediction (zoomed in) results.

73

——0
" N ’G—' .----—‘ .~“
‘0‘

 

 

 

 

 

 

Q
0 0.99»
g “‘1“
o 0 98 .O:.::o’
'5 —Actual channel
g 0.97 "'ORPACLDS1
5 "' ORPACLDSZ
0.96 - . . ----- OLRPACO‘N ,
1378 1380 1382 .1384 1386 1388
time-Window

(a) Phys. rate - 2 Mbps & Xmit rate - 750 Kbps

 

 

 

 

 

 

 

{m
.é‘
0 0.95 ~~—_ _ _,o"-------.‘~
g- s.‘ . .‘C-D-g-I—O‘. .
m 0.9; '-'§.— IIIII -|.0’ . ............. . ’
O .0. r A9. ..
.5 0.85 ............. qr—Actual channel 1'
g 08 ""ORPACLDS1
'5 - -.- ORPACLDSZ
0.75) ----- ORPACON
720 725 730 735
time-window

(b) Phys. rate - 5 .5 Mbps & Xmit rate - 750 Kbps

 

 

 

 

 

 

1.
'23; 0.8 :- ‘-sz.vw"=-'-'-'""’"‘°~‘-=r-'==:
(0 ~ co .0"- oooooo a.
30 6 : .. .. "'
o ' 1 :0 . .- A9‘ '5
'5 0 4 o..°"‘°.. : Euro; —Actual channel
g ’°: "'ORPACLDS1
5 0.2 ""ORPACLDS2
...... ORPACON
865 570 575 580 585 590
time-window

(c) Phys. rate - 11 Mbps & Xmit rate - 1 Mbps
Figure 16 (a), (b), and (0) show the optimally allocated channel capacities (or channel

coding rates) based on predictions Figure 15, by ORPACL D S] , ORPACL D 52 and ORPACON .

74

TABLE IV Average MSE Of Channel Capacity Prediction.

 

Phy.

Packet

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

data rate transmit rate ORPACLDSI ORPACLDSZ 011mm:
(Mbps) (Kbps) (Info.Bitsz) (Info.Bitsz) (Info'Blts )
500 0.0004 0.0007 0.0035
750 0.00002 0.00009 0.0002
2 900 0.0002 0.0006 0.0037
1024 0.0002 0.0009 0.0039
Overall avg 0.0002 0.0006 0.0028
500 0.0005 0.0012 0.0053
750 0.0007 0.0021 0.0121
5.5 900 0.0002 0.0006 0.0019
1024 0.0017 0.0043 0.0172
Overall avg 0.0008 0.0020 0.0091
500 0.0014 0.0040 0.0563
750 0.0011 0.0056 0.0168
11 900 0.0033 0.0063 0.1267
1024 0.0056 0.0129 0.2462
Overall avg 0.0029 0.0072 0.1 115

 

75

 

TABLE V Prediction Performance (In Terms Of Overall Video Quality) Before The Optimal

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Rate Tuning.
Pkt Xmit Adm}
Phy Rate Chan. ORPACLDSI ORPACLDSZ ORPACON
(Kbps) (PSNR) (dB) (dB) (‘13)
(dB)
500 33.98 12.54 30.95 33.30
750 35.63 21.45 28.04 28.24
2 900 38.63 21.45 28.04 33.31
1024 39.36 27.33 34.80 37.61
avg 36.90 20.69 30.46 33.12
500 33.97 17.50 29.36 32.15
750 35.36 20.25 30.45 31.26
5.5 900 38.53 21.48 33.51 34.64
1024 39.37 32.15 37.89 37.31
avg 36.81 22.85 32.80 33.84
500 33.98 28.34 32.05 33.25
750 35.44 32.96 34.61 34.88
11 900 37.95 32.96 34.61 34.68
1024 38.95 38.02 37.39 33.56
avg 36.58 33.07 34.66 34.09

 

 

 

 

 

 

 

76

 

TABLE VI Prediction Performance (In Terms Of Overall Video Quality) After The Optimal

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Rate Tuning.
Pkt Xmit Am“
Phy Rate Chan. ORPACLDSI ORPACLDSZ ORPACON
amps) (PSNR) (48) (dB) (‘13)
(dB)

500 33.98 33.55 33.70 33.69

750 35.63 34.83 35.34 35.30

2 900 38.63 37.78 37.56 36.84
1024 39.36 38.60 38.94 38.34

avg 36.90 36.19 36.39 36.04

500 33.97 32.32 33.82 33.74

750 35.36 34.81 34.81 34.46

5.5 900 38.53 37.78 37.56 37.06
1024 39.37 38.83 38.49 37.45

avg 36.81 35.94 36.17 35.68

500 33.98 33.63 33.71 32.61

750 35.44 34.79 34.77 34.55

11 900 37.95 35.35 35.36 32.68
1024 38.95 37.49 36.71 32.80

avg 36.58 35.32 35.14 33.16

 

 

 

 

 

 

 

77

 

TABLE VII Prediction Performance With Foreman Sequence And Various CODECS (pkt
rate = 1 Mbps & phy. rate = 11 Mbps).

 

 

 

 

 

 

 

 

 

 

 

 

TABLE VIII Prediction Performance With H.264 And Various Video Sequences (pkt rate =

1 Mbps & phy. rate = 11 Mbps).

Actual
ORPACLDS1 ORPACLDSZ ORPACON
Codec channel
(dB)
(PSNR) ((13) (dB)

H.264 38.95 37.49 36.71 33.56
Dvix5.l 36.93 36.54 36.36 31.55
WMV9 37.12 36.69 36.50 32.37

MC 38.13 37 . 11 36.87 34.08
XivD0.9 36.52 36.05 35.84 33.22

 

Actual

 

 

 

 

 

 

 

 

 

Sequence channel ORPACLDSI ORPACLDSZ ORPACON
(PSNR) (43) (dB) (dB)
Foreman 38.95 37.49 36.71 33.56
Head 39.23 ' 38.79 38.60 35.66
Mobile 31.92 31.28 30.98 27.30
stefan 31.51 31.18 31.06 27.46

 

78

 

 

CHAPTER 5
RATE ADAPTTION FOR WIRELESS

SCALABLE VIDEO

On the basis of the rate adaptation architecture described in the previous Chapter, in
Chapter 5 we outline the following research topics: i) a Rate Distortion (RD) model for
above-capacity video and ii) Unequal Error Protection (UEP). The proposed topics are
selected to bring Optimal Rate Prediction Architecture under CLDS (ORPACLDS) to a

certain level of completion and to support utilization of ORPACLDS in different types of

operational networks.

5.1. An RD Model for Above-Capacity Video

In Chapter 4, for practical video streaming over wireless LANs ORPACLDS is

developed. We also show that an accurate source and channel coding rate prediction can
be achieved by utilizing the Rate Distortion (RD) of video and the distribution of channel

prediction error process. However, it is important to note that in Chapter 4 we made the

79

following assumptions in ORPACLD S : (i) the channel code achieves the capacity (i.e., an
ideal channel coder), and (ii) a block of packets cannot be recovered at the client if it is
coded with an overestimated rate, i.e., a channel coding rate that exceeds channel
capacity. Thus, to optimally realize ORPACLDS in real world scenarios, we must
incorporate i) an operational rate for a speciﬁc channel code which is not an ideal code

and ii) an RD ﬁrnction for above—capacity video.

5.1.1. Operational Rate

In Chapter 4 we assume that the channel code achieves the capacity (i.e., an ideal
channel coder) and a block of packets for a time window cannot be recovered at the
client if it is coded with an overestimated rate; here an overestimated rate is a channel
coding rate that exceeds channel capacity. However, the optimal rate, which is calculated
based on (22), should be adjusted in conjunction with a speciﬁc channel code, which is

not an ideal code. Therefore, we formulate the operational rate5 as:

1
R0P=1- -H ,Is 3 ,
a (a) a H(£) (23)

 

where a is the actual channel BER. Note that the channel capacity for the considered

 

5 The operational rate in this study is equivalent to the rate which embodies inferior
performance of a practical code to an ideal code.

80

BSC channel can be estimated as C = l—H b (a) [53]. For reliable communication (i.e.

distortion free communication,) the operational rate has to satisfy Rap S C . While it is

theoretically possible to satisfy this constraint, in practice the performance of a code is
inferior to the theoretical predictions because of ﬁnite length and other design limitations.
We capture this performance drop by introducing the parametera . Thus, we use a stricter
constraint Rap +(1—a)H(£) S C , where a hypothetically optimal code can be
represented by a = l .

As explained above, the value of 62 plays a critical role in the rate selection.
Therefore, we fnst deduce a suitable value for this parameter. Analytical deduction of a ,
requires us to consider ﬁnite length analysis of LDPC codes. This is a challenging and
reasonably open area of research. In this study, we circumvent this issue, and provide a
practically workable solution, by empirically evaluating the value of a . Hence, for this
purpose, we conducted a comprehensive set of experiments with LDPC channel code
[52] and test scalable video sequences. For example, we observed that fora = 1.8 , 99%
of the video packets are decoded successfully. Thus, moving forward, we use this
empirically deduced value in our simulation and performance studies. Further, for
notational convenience, we refer to the Rap as the operational channel capacity, C 0” ,

which is the maximum achievable rate for the given channel code and for the underlying

81

channel. Note that to complete equation (22) we develop an empirical model for the

quality distortion ﬁmction, Q'(o) of video sequences for rates exceeding capacity.

5.1.2. The Video Rate-Distortion Model

It is well known that a channel coding rate that exceeds the capacity leads to
unreliable communication and increased distortion. The precise increase in distortion
depends on a number of factors, such as: (i) the speciﬁc video content, (ii) the error
concealment/resilience and robustness of a particular source codec, and most importantly
(iii) the difference between the overestimated rate and the actual channel capacity.
Commercially available video systems are complex and consist of a number of suboparts.
Hence, it is diﬂicult (if not impossible) to deduce a closed form analytical expression that
can precisely capture the impact of the overestimated rate on the video quality.
Consequently, we develop an empirical model, which can in turn be used in equation (22),
to express the video quality for normalized rates6. Note that in this study a video test
sequence (foreman) was encoded by using Scalable Video Coding (SVC) [54] to easily

adapt bitrates rather than re-encoding for different rates. When encoding, the following

 

6 Rate difference between the over-estimated rate and the operational channel capacity,
R — C 01’

 

which normalized by the over-estimated rate,

82

encoding conﬁgurations were used: two-layered spatial scalability and three-layered Fine
Granularity Scalability (FGS); 30 frame per-second (fps) only with I and P frames; and
Group of Pictures (GOP) of 64. Note that the above encoding conﬁgurations were used
to reduce the computational complexity in wireless mobile terminals, on which our study
focuses. In addition, the LDPC code [52] was used as a channel coder.
To deduce the empirical model, we conduct the following measurement procedure:
1. Compute the operational channel capacity of the underlying channel (i.e., the
average capacity of 11 Mbps traces).
2. Extract the test SVC bitstream at the rate of the operational channel capacity and
compute the PSNR of the stream.
3. Encode the extracted bitstream with the LDPC encoder and transmit it to the
underlying channel.
4. Decode the bitstream packets using the LDPC decoder and the SVC decoder
(with error concealment7).
5. Compute the normalized PSNR8 from the successfully decoded packets.

6. Repeat 3-5 with the increased rate and for four different transmission rates.

 

7 A lost ﬁame was replaced with the previous frame.

W]

 

83

00
00

  
   

32~--3 ------------------- é -------- f --------
8 : 5
3 31 --------------------- i --------- ' ——————————
g 5 :
(D 30----——----—- --------- i ----------
n. : :
29- ----------------- f --------- E ----------
28 l l i L
200 400 600 800 1000
rate (kbps)
(a)
' """"""""""""" .‘.‘_’;'5‘0‘(,.;g,;,;
[I ..... 750kbps -
z --- 900kbps
8 -0-1mbps
.0 —Model
a) :
.21 :
E “““““““
‘- G...
g ......................... - as .........
0o 0.65 0:1
normalized rate over the capacity

 

     
 

 

 

 

 

 

 

 

 

 

(b)

Figure 17 The RD (quality) function (Q(-)) of SVC test video sequence for rates below
capacity (a) and the empirical model (Q'(-)) for video rate distortion for rates exceeding the

channel capacity (b).

84

From a comprehensive set of measurements, it can be observed that a normalized
PSNR decreases as a function of normalized rates [Figure 17]. Consequently, we derive
the empirical model to embody the distortion of video quality for rates over the channel
capacity as

f(x)=axb+c, 0950.12 (24)
where a = —1.18x102; b = 2.148; and c = 0.9898. Note thata , b and c were found
by a curve ﬁtting and were speciﬁed for the foreman sequence which was encoded with
SVC [54].

We leverage this empirical video quality distortion model, Q'(-) in equation (22),
to optimally select the source and channel rate, i.e., the channel coding rate [53]. The
performance of ORPACL DS , which fully employs the model, is described in the following

section.

5.1.3. Performance Evaluation

We now evaluate the performance of the proposed video rate-distortion model by

simulating ORPACLDS which employs the proposed model. The simulation procedure

with the video quality functions of the pre-encoded bitstream [Figure 17] is as follows:

85

1; 0p 1. 0p
1. Predict the operational optimal rates, Rn and Rn+1, for randomly chosen two

consecutive time windows of a randomly chosen trace (by using equation (22)).

2. Extract a bitstream based on the rates from 1.

3. Encode the extracted bitstream packets with the LDPC encoder and transmit the

FEC-encoded packets to the underlying channel.

4. Decode the transmitted packets with the LDPC and the SVC decoder (with error

concealment).

5. Repeat 1-4 for different traces with transmission rates.

The simulation results are show in TABLE IX. In TABLE IX, the ﬁrst and the second

columns represent underlying channels (or traces collected at the given physical data

rates and transmit rates), the third column represents time window indices, which were

randomly chosen from a randomly chosen trace, and the predicted (operational optimal)

rates by ORPACLDS ; the forth column represents bitrates and PSNRs of the video source,

which was extracted based on the predicted rates; and PSNRs of video source which was

preserved at a client are shown in the ﬁfth column.

We can observe from TABLE IX, by utilizing the prOposed model in conjunction

with the CLDS protocol, that there is a small amount of PSNR reduction relative to an

error-free decoded bitstream. This distortion is due to burst errors in wireless channel

86

which cause severe corruption on a few packets. However, the distortion is very small,
and the bitstream at the clients preserves most of the original video quality (PSNR) for
randomly selected rate adaptation intervals (or time windows) and for different
transmission rates [TABLE D(]. This implies an accurate rate prediction performance
by ORPACLDS and a viable empirical model for video rate-distortion. Note that since the
test bitstream is 10 second long (300 ﬂames), two consecutive time windows were used
for simulations. Thus, the ﬁrst 150 ﬂames and the rest were extracted according to two
different rates. On the contrary, the video quality was computed ﬂom the entire bitstream.

Based on the accurate performance of the proposed video rate-distortion model, we

extend our analysis on the performance of ORPACLDS to the entire time windows of all

traces by using the following equation:

 

 

P sop atop _
Q(Rn -T)-I(R,, SC?)
k
1
avg PSNR = I Z 12"” _ Cop .. op (25)

"=1 +Q' -"—,,—" 'I(Rn > C3?)

.. Rn .J

atop

where I (-) is an Indication functiong; R n is the rate computed by using equation (22);

C3” is the operational channel capacity; and k is the number of time windows in a trace.

 

0 if C is not satisﬁed
9 1(6) = ,
l otherwrse

87

Additionally, we compared the performance of ORPACL D S with ORPACON to
highlight the eﬂiciency of CLDS protocols in practical rate adaptation applications.

The conventional protocols can only observe the number of packet drops. Thus, for

ORPACON we estimated the channel capacity for a time window using PER as

~CON 1 ,,,
C" =1——ZZ,-=1—PER, (26)

i=1

and used the same simulation procedure as ORPACL D S .

As shown in TABLE X, ORPACLDS provides an average PSNR performance that is
very close to the maximum PSNR for any given channel. Note that the ﬁrst column
represents the best video quality to be possibly achieved for a transmission rate and for
the given physical data rate. These results remark the performance of the proposed video
rate-distortion model, with which ORPACLDS is able to provide an outstanding
performance. It should be also noticed that the average PSNRs of both ORPACLDS and
ORPACLDS for the error trace (or channel), at which is collected the transmit rate at 750
transmit rate and the physical rate at 11 Mbps, have signiﬁcant gains relative to others.
This is due to the fact that almost no rates, which are predicted ﬂom the error trace,

exceed the maximum achievable rate, while keeping close to the maximum rates.

Moreover, it can also be observed that ORPACLDS outperforms ORPACON. This

88

performance difference is due to how the channel is estimated, i.e., BER estimate using
side-information provides more accurate channel state estimate than PER.

TABLE IX Performance of ORPACL D S with an ideal channel code and with a LDPC code
(In Terms Of Overall Vrdeo Quality).

 

 

 

 

 

 

 

 

 

_ Time Wmdow Index Extracted , -
Xmit _ _ - Decoded
Phy (Optimal Rates) Brtstream ~
(Mb ) Rate PSNR(dB) Brtstream
S o o 1 .
p (Kbps) ' p * p _ ~ ~PSNR(dB)
(Rn ,Rn+1) Bltrate(Kbps) .
437,438 29.87
29.20
500 (0774,0744) (379.7)
1703, 1704 30.28
29.56
(0877,0886) (440.3)
227,228 31.57
(0 904 0 901) (677) ”'67
750 ' ’ '
1837, 1838 31.59
31.21
N (0890,0926) (681)
384,385 30.96
(0 638 0 558) (538 9) 3096
900 ' ’ ' '
1014,1015 31.80
31.70
(0849,0779) (732)
422,423 32.10
(0 843 0 818) (8491) ”'05
1024 ' ’ ' '
739,740 32.12
32.02
(0841,0826) (853.5)

 

 

 

 

 

 

 

89

TABLE X Performances of ORPACLDS and ORPACON (In Terms Of Overall Video

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Quality).
Phy 1:: Actual Channel ORPACLD S ORPACON
(Mbps) (Kbps) (dB) ((13) ((13)

29.0564 28.9583 28.7263
29.0133 28.8043 24.3700
500 29.0509 28.8675 24.3804
29.0422 28.6611 27.8340
29.0317 28.8711 23.3747

avg 29.0389 28.8325 25.7371
30.9815 30.7850 30.3802

750 30.9465 30.6392 30.0243
1 1 avg 30.9640 30.7121 30.2022
31.8695 31.4052 27.0839
900 31.9313 31.6742 27.0339

31.8198 31.3434 18.5583
avg 31.8735 31.4743 24.2254

32.1503 31.5112 0.1117

1024 32.4389 31.8527 11.5011
32.4996 32.0516 24.3254
32.5214 32.0582 30.2564
avg 32.4026 31.8684 16.5486

 

 

 

 

 

 

90

 

5.2. Unequal Error Protection

For multimedia streaming applications, it is essential to accurately estimate/predict
the channel capacity to provide QoS as emphasized in previous chapters. However, it is
also important to minimize distortion of video quality when video packets are corrupted
_ due to burst errors by severe interference which often occurs in wireless channels. A
workaround to this problem is to employ an Unequal Error Protection (U EP) scheme.
Under UEP, parts (e. g., Intra-coded ﬂames) of the video bit-stream, which signiﬁcantly
affects video quality, are provided with more protection (i.e. a lower channel coding rate)

and vice versa.

5.2.1. UEP Utilizing an LDPC Code

It is well known that for an LDPC code the size of the packet (or message bits) plays
an important role in the channel coding performance, i.e., a lager packet size has a better
chance of successful decoding than a smaller packet with a given coding rate. As shown
in Figure 18 we can observe that the probability of successful decoding is a ﬁrnction of
both the size of packet and the redundancy parametera which is used in (23) in section

5.1.1.

91

 
    
 

 

 

 

 

 

1 ....... l 1 .L
i .1- 5
08 ________
49 i :
E 1 :
(D 0.6 -------- J: ------- l ............
(D 1 :
8 3,." a
0.4 -------- ' ————— . .
a --- pkt-siZe<1kbits
02 pkt-size<3kbits
' ---pkt-size<8kbits
5 —pkt-size>=8kbits

 

9.4 1.6 1.8 2 2.2 2.4 2.6
alpha
Figure 18 The probability of successful decoding, which is generated with LDPC codes [52]
and SVC bit-stream [54], depends on both the size of packet and the parameter a .
Consequently, when a packet containing I ﬂames is encoded with a which leads to
the probability of success decoding close to 1 (i.e., a = 2.1 for a packet size of 8 Kbits
or more), it is highly likely that the packet is successfully decoded. Note that in the
previous subsection we apply a =1.8 for every video (I- and P- ﬂame) packets where
almost every packet can be successﬁrlly decoded. However, channel impairments (e.g.,

burst errors) can be introduced in any wireless channel, and hence, there is always a

chance of a sever packet corruption (or unsuccessful decoding). This results in enormous

92

distortion of video quality in case of I-ﬂame packet loss (rather than P-ﬂame packet). To
that end, we differentiate a for I-frame packet ﬂom that for P-ﬂame packet.
After a for each I-ﬂame packet is chosen, a for each P-ﬂame packets can be

simply calculated as

n k
aZLi - 2a,! 'L{
“P = i=1 N i=1 (27)
Z Li

i=k+l

 

where a =1.8;n represents the total number of video packets; k represents the number

of I-frame packets; or] represents the redundancy parameter for I-ﬂame packets;
a P represents the redundancy parameter for P-ﬂame packets; Li represents the length of
ith packet; L; represent the length of ith packet containing an I-ﬂame; and L?
represent the length of ith packet containing a P-ﬂames. Note that the total number of

redundant bits for a given time-window can be expressed as

n k n
Maggy, =H(£)-Za,-I -L{ +ap-H(a)- Z L?” (28)

i=1 i=1 i=k+l

at 0P
where a -H(£) =1— R , and up is used for FEC encoding of P-ﬂame packets.

93

5.2.2. Performance Evaluation

We now evaluate the performance of the proposed UEP scheme by simulating

ORPACLDS which employs the proposed model. The simulation procedure with the video

bit-stream [Figure 17 (a)] is as follows:

1.

2.

at 0p
Predict the operationally optimal rate, Rn for a given time window.

Extract a SVC bitstream based on the rates ﬂom 1.

Choose a, and encode each I-ﬂame packets with the LDPC encoder.

Calculate ap and proceed FEC encoding for P-ﬂame packets by using equation

(27).

Transmit the F EC-encoded packets over the underlying channel and decode the

transmitted packets with the LDPC and the SVC decoder (with error

concealment)

Compute the normalized PSNR ﬂom the SVC decoded packets

Repeat 1-6 with an increased channel BER.

The excellent performance of the proposed UEP scheme can be clearly seen in Figure

20. We can observe ﬂom Figure 20, by utilizing the proposed UEP scheme in

conjunction with the CLDS protocol, that there is a small amount of PSNR reduction

94

 

0.98 -------------
0.96 -------------
0.94 -------------
0.92 -------------

0.9 -------------
0.88 —————————————

0.86 -------------

0.34 . — Quality Distortion w/ P-frame packet loss ,
--- Quality Distortion w/ I-frame packet loss I

 

normalized PSNR

 

_.A--_..L_..--_l__-_.L.—_--L__-_J-_-_

 

 

 

 

 

0.82

0 5 10 15 7 20
number of packet loss

Figure 19 The performance comparison in video qualities with the proposed UEP scheme and

without UEP (when only I-frame packets are lost, i.e., the worst case).

relative to an UEP-ﬂee decoded bitstream (worst case occurs when I—ﬂame packets are
lost) as the number of ﬂame loss increases. This PSNR reduction difference is due to the
number of successfully decoded I-ﬂame packets. In other words, the proposed UEP
scheme protects I-ﬂame packets more securely so as to minimize the amount of video

quality distortion.

95

CHAPTER 6
SYNDROME PARTIAL DECODING

In general, a client within an ad-hoc network may receive video content after being
transmitted over multiple wireless hops. Due to the cascading effect of noise and
interference, the overall channel capacity of a multi-hop wireless path drops
progressively over each hop, and hence, without optimal rate adaptation, the video
quality is expected to degrade signiﬁcantly at any client located at a far-edge of an ad-
hoc network. To overcome this limitation, Decoding and Forwarding (DF), which fully
decodes codewords at each intermediate node, can be employed to provide the best video
quality. However, complexity and memory usage for DF are signiﬁcantly high.
Consequently, we propose Syndrome-based Partial Decoding (SPD). In the SPD
ﬂamework an intermediate node partially decodes a codeword and relays the packet
along with its syndromes if the packet is corrupted. We exhibit the efficacy of the
proposed scheme by simulations using actual IEEE 802.11b wireless traces. The trace-

driven simulations show that the proposed SPD ﬂamework, which reduces the overall

96

processing requirements of intermediate nodes, provides reasonably high goodput"),
when compared to simple forwarding, and less complexity and memory requirements,

when compared to DF.

6.1. Limitations

Some studies (e.g., [59]-[60]) have focused on wireless multimedia communication
over an ad-hoc network which consist of multiple wireless hops ﬂom a server to a client.
It is well known that each wireless channel between intermediate nodes suffers ﬂom
impairment due to interference, fading, and multi-path effects. Hence, when a multi-hop
wireless network is employed, it can be easily observed that channel capacity decreases
over each hop, and hence, the End-to-End (E2E) channel capacity can become very low.
This leads to signiﬁcant degradation of goodput and video quality for rate adaptation
applications.

An ideal workaround to this problem is to employ Decoding and Forwarding (DF).
Under DF, channel coding is employed over every hop of the end-to-end path. Hence, an

intermediate node decodes the transmitted packets (or codewords) to suppress noise on

 

1° In this study the goodput is deﬁned as the application level throughput, i.e. the number
of information bits per total number of bits forwarded by the network ﬂom a certain
source address to a certain destination.

97

the channel between two intermediate nodes, and re-encodes the packets, potentially with

a different codebook, for transmission towards the destination [61]. Thus, DF can achieve

optimal performance with regard to network capacity. However, complexity and memory

usage for DF are signiﬁcantly high. Moreover, intermediate nodes, which may be

participating by only forwarding the content toward a receiver further-down a multi-hop

chain, do not have much incentive to perform full decoding—encoding of the channel-

coded wireless video content. For such nodes, we need to minimize their burden in terms

of the operation they need to perform toward the delivery of the video content to the frnal

receiver.

The above discussion motivates the usage of partial processing ﬁamework for rate-

adaptive wireless video, which reduces the overall processing requirements of

intermediate nodes. In particular, in this paper, we propose Syndrome-based Partial

Decoding (SPD) architecture. The proposed architecture employs CLDS protocols,

which among other things, accurately estimate the channel capacity using simple Binary

Symmetric Channel (BSC) [56] model. The two main contributions of this paper are:

l) A syndrome-based partial decoding scheme. Under this scheme, each intermediate

node only computes the syndromes of the received packet (i.e., partial decoding) and

relays the syndromes along with the packet (if the packet is corrupted) to the next hop.

98

The client then fully decodes the packet by using the syndromes which are appended to
the packet.

2) An optimal packet size selection scheme. As explained later, SPD utilizes channel
Packet Error Rate (PER) as an error parameter for its operation; and hence SPD goodput
heavily depends on the size of packets. Thus, we derive the optimal packet size selection
scheme to maximize the goodput in the proposed architecture.

The proposed SPD architecture is tested using a comprehensive set of wireless
residual error traces collected at 2, 5.5 and 11 Mbps physical data rates of an operational
802.1 lb network. We compare the performance of SPD with DF, E2E decoding (END)
and automatic repeat request (ARQ). We show that SPD outperforms END decoding and

ARQ and provides relatively good goodput compared to that by DF.

6.2. Syndrome-based Partial Decoding

In this chapter we develop the rate adaptation architecture over an ad-hoc network
using the two main contributions of this paper: SPD and optimal packet size selection.
SPD is achieved by i) only computing a syndrome for a packet at each intermediate node;

ii) forwarding codeword for the syndrome of the packet (only when the packet is

99

corrupted at a hop) to the next hop; and iii) fully decoding the packet by using its
syndromes at a client. SPD embodies BER as well as PER; i.e., the goodput is a function
of both BER and PER. Thus, the optimal goodput is achieved by ﬁnding an optimal

packet size that provides the best goodput in the proposed architecture.

6.2.1. Architecture for Rate-Adaptation

The architecture of a multimedia streaming application depends heavily on a network
over which it operates. Therefore, in this section, we deﬁne the proposed architecture for
rate adaptation. Additionally, we outline the assumptions under which the proposed
architecture is to be evaluated.

The proposed architecture consists of a server, intermediate nodes and a client that
receives packets over a multi-hop wireless network. In the given architecture, a server,
intermediate nodes and a client are designed to support source and channel rate
adaptation. Figure 20 illustrates such architecture. The intermediate nodes and client
supports CLDS protocols that leverage residue-error-process and side information, which

can be relayed to an FEC decoder (for partial decoding at each intermediate nodes or full

100

decoding at a client)[37], to estimate the current channel capacity” for a block of packets

(or a rate adaptation period). The current channel capacity, which is estimated by the

channel estimator with the entropy of the residue error process, is then transmitted to the

server as feedback for rate adaptation. Using the feedback, the rate tuner at the server

predicts the optimal source and channel rates for the next block of multimedia packets to

be transmitted [56].

For this study, we focus on the performance of SPD, and hence we assume the system

employs generic (arguably ideal) channel and source coding schemes. In particular, we

consider the following simplifying assumptions: (1) the channel code achieves the

capacity (i.e., an ideal channel coder), and a block of packets can be successfully

decoded at the client if a source and channel coding rate does not exceed channel

capacity, i.e., R .<_ C [25]; (ii) A video encoder provides a bit-stream having a bitrate that

is exactly the same as the required bitrate, and the bit-stream renders Peak-to-Peak Signal

to Noise Ratio (PSNR) value [3 8] according to the bitrate. With (i) and (ii), we realize the

architecture where for video rates that cannot be supported by the underlying channel

 

m
11 Here, the channel capacity is estimated as C" = l—J-ZH b (83,-) where 8,. represents
i=1
1. m
the BER estimate for packeti; —ZH b (83-) is the instantaneous per-packet error-process
m .
1=l

entropy in a block of m packets, and n is the time-window index [56].

101

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

nméusﬁ as... NwLmNIG+GHNwtm 95%: m.
a
0.. UN w
99 15585545: .98.. _ 39153 .988 L m .9. ..._ _ .m
u o in S
‘ w u u m
.BuEamm . . .65... 2mm P
.0520 - e 2
1‘ ‘ 802 1%” 802 L _+=- m 0
‘11 8 1
.888 mm 9388.25 mm. 2.385525 w. .. M
0m... + + ..m
« 822:5 62-2.3.2 .885 m
5000.5 6
1' m N _ H WNW.» All ‘1 82> m
888 m. m. m om“.
. 89> 9.02 noon. ._. 3.. 58% 82> m
Em=o .oEow m
H

 

capacity, the video quality reduces to zero (i.e., zero PSNR value). Note that without
the assumptions (i) and (ii) (i.e., a realistic channel coder or video encoder was used), we
should carefully take the performances of the channel coder and video encoder into
consideration in ﬁnding the rate. Therefore, the above assumptions were made to focus

only on the performance of the proposed SPD scheme.

6.2.2. SPD Overview

Each intermediate node computes the syndromes of packet
( Syndromerxl = Check _matrix,,,<, -Packet17;<n ) ﬂom the server or the previous
intermediate node and forwards the packet with the packet’s syndrome (including the
redundancy of the syndrome; i.e., codeword for the syndrome) only if the packet is
corrupted. Note that when a packet is not corrupted (the syndromes consist of all 0s) only
the packet is relayed to the next hop, Meanwhile, when a packet is corrupted each node
adds its syndrome (including syndrome redundancy) to the packet in an embedded
manner; and hence, this process is repeated until a client receives the packet. A client
then uses the syndromes, which are coded and appended to the packet, to decode the

packet. For example, under the scenario shown in Figure 20, when the packet is

103

corrupted at the ﬁrst hop, then a client receives the packet and two different syndromes

appended to the packet as shown in Figure 20 and computes the syndrome for the last

hop. Note that a syndrome is used as the information needed to successfully decode the

packet which is corrupted in a hop (or a channel link).

It should be noted that a server uses the maximum channel BER

(ramax = max(£1,£2,a3)) to select the rates rather than E2E BER (81525 2 emax) [Figure

20] since each syndrome can be used to correct errors occurred at each hop. This leads

SPD to suppress noise on each channel between a server and a client and to provide the

goodput performance close to DF.

Using Figure 20 as an illustrative example, the packet received at a client is then

decoded along with its syndromes as follows:

i) Decode the codewords for the syndromes that is forwarded ﬂom the previous

nodes

ii) Decode the codeword for the message with each of the decoded codewords for the

syndromes

Note that the above procedure speciﬁes the (1 bit error correcting) Hamming code

and the similar procedure can be applied to Reed Solomon (RS) code or turbo code, for

which the syndrome polynomial can be employed.

104

6.2.3. Goodput Evaluation

In this section, we develop the expression for the goodput of proposed SPD and other
schemes such as DF, End node decoding, and ARQ. As described in the previous
subsection, for SPD the total number of bits transmitted to multi-hop wireless network

increases as PER and the number of hops increase. Therefore, the goodput for SPD,

GooaputSPD can be expressed as

 

 

 

 

 

 

 

 

k k
GoodputSPD = n = - h _
’0’“ (k+r)-h+Z(PER,- ~(kSi +41. )h) (29)
i=1
h
L _
_ k
h
(k + r) + Z(PER,- -(kSi + r31.»
i=1
_ k
_ . h .
[h k H(amax))+ZPERi (H r H(gm,x)]
l-H(8max) i=1 1_H(£max)
_ k
— h
k-[1+—q—(£mL)—J+ZPER,- ~r-[1+ H(3max) ]
1_H(£max) i=1 1_H(£max)

105

k k

’ h — h .
[k+ZPER,.r].Am [k+ZPERi-f—H(—gﬂﬁ]-Amax

 

i=1 i=1 1- H(£max)

l

h
[l+§PER,-l—_I%9(—:§:—)]-Am

 

where k is the number of information (original source) bits; r is the number of
redundant bits for k; n (= k + r) is the total number of bits in a packet length; ks is the

number of syndrome (bits) of the packet; rs is the number of redundant bits for the

syndrome; h represents the number of hops ﬂom a server to a client; and

(k+r)-h+Zh:(PER,- -(ksi +rsi )h)

Note that ”total 2 H h which

_ M
“m— 1-H(gmax)'

 

normalizes the total number of bits by deﬁning per-hop transmission cost; the length of

ks,- -H(8max) _ r-H(£m3x_) __ r~H(£max)
C " C ‘1—H(gm,x)'

 

ks is the same as that of r; and rsl. =

DF corrects error bits (full decoding) and re-encoding at every intermediate node; and

hence, the goodput for DF, Goodput D F , can be expressed as

= E (30)

”total "

 

GoodputDF =

106

k k k 1

[k+LH_(£nlalL)_) k.[1+___H(£max)] k'Amax Amax
1-H(£max) 1-H(£max)

 

 

where ntoml = 51—h]: normalizing the total number of bits by deﬁning per-hop
transmission cost.
End node decoding, which we deﬁned in this study, employs CLDS protocols but

does not incorporate a partial or ﬁrll decoding at any intermediate node and uses the E2E
channel BER, e E2 E , for ﬁnding a source and channel coding rate. Thus the goodput for

end node decoding, Goodput END , can be expressed as

 

 

 

 

 

k k
GOOCbutEND = =
"total (1‘ + r)
k _ k
(k+ k'H(5E2E)) k-(H H(£EzE) J (31)
1-H (£1325) 1-H(513215)
k 1

 

1‘ 'AEZE A19215:

 

where EEZE=£1*£2*...*£,, and AE2E=1+ 1106525) . Note that BER for two
1-H(€1525)
cascaded BSCs is £1 *82 = 81 + £2 — 281 62.
For ARQ, a packet is re-transmitted until a deﬁned time-out by a server if it is

corrupted during its transmission ﬂom a server to a client. Therefore, the goodput for

ARQ can be expressed as

107

GoodputARQ =1— PEREZE (32)

where 191518,;2 E = PER1*PER2 *...*PER,,.

6.2.4. Optimization of SPD

It is well known that PER changes with respect to a size of packet for a given channel
BER, and GoodputSpD is a function of BER and PER. Therefore, we capture this aspect
to ﬁnd the optimal packet size ( pkt _ size), which maximizes GoodputSPD [41] as

*

pkt _ size = arg max GoodputSpD

 

pkt_size
(33)
1
=argmax , h .
pkt_szze 1+ZPERi- H<£max) ‘Amax
i=1 1—H(£max)

It requires to model the PER (as a ftmction of packet sizes) to ﬁrlly utilize equation (33).
Consequently, we develop an empirical PER model to express the PER for packet sizes.
To deduce the empirical model, we conduct the following measurement procedure:

i) Calculate BER of the underlying channel (i.e., the BER of a collected trace).

ii) Calculate PER with respect to a packet size.

iii) Repeat i) and ii) with different traces.

108

 

 
   

 

 

 

----~BER=0.004898 .\

0.2» -°--BER=0.007923 "
_.-. BER=0.006375 :: ’

0.15 I

a:

LU

0- 0.1-
0.05

0 .

 

 

0 0.5 ' 1 1.5 2 2.5
packet Size (bits) x104
Figure 21 The empirical PER model.
From a comprehensive set of measurements, we observed that PER linearly increases

as a packet size increases [Figure 21]. Consequently, we derive the empirical model to

embody the PER for packet sizes as

PER = ax + b (34)
where a =1.071x10—5 , b = 0.01597 , and x is a packet size. Note thata and b were

found by a linear ﬁtting.
We leverage this empirical PER model to optimally select the packet size which

maximizes GoodputSpD. The performance of SPD, which fully employs this model, is

described in the following section.

109

6.3. Performance Evaluation

We now compare the performance of the proposed SPD with DF, End node decoding,
and ARQ in terms of goodput and the total number of bits transmitted in an ad-hoc
network. To compare the performance of each scheme, experiments in this study are
conducted as follows:

1. Calculate the BER and PER of each channel (or hop) for a rate adaptation period.
Note that we use 2 Mbps traces for this simulation and treat each trace as the
channel between two nodes in the network.

2. Find a source and channel coding rate, for a rate adaptation period. Note that we
assume to use an ideal channel code; and hence, we use the capacity as the rate.

3. Calculate the goodput for SPD, DF, End node decoding, and ARQ by using
equations (29), (30), (31), and (32).

4. Find the optimal packet size by using equation (33) which incorporates the

empirical PER model [see (34)] and the corresponding goodput.

110

 

 

 

 

 

 

 

0-95i#;=;:'.‘.;;;;_- ———————————— 5 ------------ .3
0.9 5" ’ “1.1.-2'40...- - - -.._T.-..-.T= :TSEPEE aqua-4

.‘oi . "”001 i i

S 0'85 ------ 1“; _______ f.‘i5;.—. ------- : ____________ _:
o. : .~.\ : ‘0'... : :
.8 0 8 —————— i _______ ,‘;—-i- __________ {nit-“3;; ______ 1:
CO) 075 “ ": ---------- TOR-’~ --------- : ........ . :31:
. — DF E .‘o~.‘ E E

07" ---SPD ------- E— __________ . ~E-‘;:;""“"E
0.65' """END ------- E. ____________ E ______ T f-‘ng:
-l-1ARQ i E E

# of hops

Figure 22 Performance comparison among four different schemes in terms of goodput. For

this experiment, a packet size of 8000 bits is used.

5. Calculate the PSNR using the rate-distortion (RD) function of a scalable video

coding (SVC) sequence, Q(-) [Figure 17 (a)][54], for the given goodput as

PSNR = Q(GoodputSPD -T) where T represents a transmission (Xmit) rate in
bits-per-second (bps).

As for goodput, Figure 22 shows that SPD outperforms End node decoding and ARQ

and provides the performance, which is a little inferior to DF over hops. However, it

should be noticed that although DF provides the best performance in terms of goodput,

the complexity and memory usage at each intermediate node are signiﬁcantly high since

111

each intermediate node proceeds full decoding and re-encoding of packets. Additionally,
the result shown in Figure 22 does not employ the optimal packet size selection scheme.
The excellent performance of the proposed optimal packet size selection scheme for
SPD can be clearly seen in TABLE XI. The ﬁrst column of TABLE XI represents the
number of hops; the second, third, and fourth column represent the optimal packet size,
goodput, and PSNR achieved by SPD under the actual channel (i.e., the collected traces
used for simulations in this study); the ﬁfth column represents the optimal packet size
estimated by equations (33) and (34); and the sixth and seventh column respectively
represent the goodput and PSNR with the optimally estimated packet size under actual
channel. As shown in TABLE XI, the optimal packet size selection scheme estimates
very accurate packet size that maximizes both goodput and PSNR. In fact, they are very

TABLE XI The performance of SPD with optimal packet size selection scheme in terms of
goodput (Xmit rate=500Kbps)

 

 

 

 

 

 

 

 

 

 

 

 

 

hop Actual Optimization
timal timal
Op . PSNR Op . PSNR
pkt_srze Goodput pkt_srze Goodput
. (dB) . (dB)
(brts) (brts)
2 28555 0.9126 30.38 26700 0.9124 30.38
3 18806 0.8999 30.34 19100 0.8960 30.32
4 14487 0.8760 30.26 14456 0.8759 30.25
5 13754 0.8669 30.23 12649 0.8646 30.21

 

112

 

close to the optimum.

113

CHAPTER 7
CONCLUSION

In this thesis, we developed an optimal rate prediction architecture using CLDS
protocols, ORPACLDS to support QoS for wireless scalable video. ORPACLDS
combines the following schemes: i) a BER prediction which leads to an accurate channel
capacity estimation and prediction; ii) an optimal source and channel coding rate
prediction and tuning; iii) Unequal Error Protection (UEP); and iv) Syndrome Partial
Decoding (SPD).

As for BER prediction over wireless networks, a multi-tier model (MTM) is
developed. The MTM relies on the premise that packet-error prediction should be
performed before and in isolation ﬂom BER prediction. For predicted error-ﬂee packets,
BER prediction is not required. The MTM achieved packet-error prediction using a 3rd
order Markov chain model. For predicted packet-errors, the MTM uses a second-tier
model of Signal to Silence Ratio (SSR) values to predict the next packet’s SSR. These
predicted SSR values are in turn used in a third tier for BER prediction. We showed that

the MTM renders higher prediction accuracy than existing Yule-Walker and ﬁnite-state

114

Markov chain predictors. Consequently, it is revealed that packet level side information
can be utilized to accurately estimate/predict channel conditions.

We observe that accurate channel prediction alone does not necessarily imply better
overall rate adaptation. Speciﬁcally, if capacity prediction is employed (directly) without
optimal rate selection, over-predicted channel capacities cause signiﬁcant packet drops.
To that end, on the basis of the BER prediction we develop an optimal rate tuning
scheme which leverages both the probability distribution of channel capacity prediction
error process and an RD (quality) function of a video sequence. Further, a Rate
Distortion (RD) model for above-capacity video and an operational rate for a speciﬁc
channel code are deduced to employ the optimal rate tuning scheme in a practical
wireless environment.

Our experimental results showed that ORPACL D S provides higher prediction
accuracy and better multimedia quality than ORPACO N (optimal rate prediction
architecture under conventional protocols) at any physical layer data rate. another
interesting observation is that the proposed optimal rate tuning scheme performs
considerably well on traces with very low temporal correlation, thus highlighting that the
proposed scheme can efﬁciently cater for bad channel predictions, i.e., the proposed

scheme provides good performance even for relatively memoryless and isolated errors.

115

When burst error occurs over wireless networks, it is highly likely that video quality
faces severe degradation. To resolve this problem we develop an UEP scheme which
utilizes characteristics of a LDPC code. It is shown that the proposed UEP scheme
reduces the degradation of video quality. It should be noticed that the impact of such a
scheme is potentially quite signiﬁcant, not only on the overall video rate selection, but
rather on the overall coding strategy of the video encoder (in terms of rate allocation
among different ﬂame types).

Finally, we proposed the new partial processing framework for multi-hop rate—
adaptive wireless video, i.e., SPD. From our experiments, SPD, which reduces the
overall processing requirements of intermediate nodes, provides reasonably high goodput,
when compared to End node decoding and Automatic Repeat request (ARQ). It is also
observed that SPD provides less goodput than Decoding and Forwarding (DF). However,
the performance difference between DF and SPD is relatively small if we take
complexity and memory requirements into consideration. From this study, we exhibit the

applicability of ORPACL D S in multi-hop wireless networks by introducing SPD.
The current research shows that ORPACLD S provides the outstanding performance as
a rate-adaptive video streaming application over wireless LAN s. However, it should be

noted that in order to employ ORPACLDS as a real-time application the followings are

116

required: a real-time Rate Distortion (RD) model for both above-capacity video and

below-capacity video. As described in section 5.1.2, the precise increase in distortion

depends on a number of factors, such as the speciﬁc video content, the error

concealment/resilience and robustness of a particular source codec, and etc. Thus, it is

very difficult (if not impossible) to deduce an RD model which accurately estimates a

PSNR of any video bit-stream in real-time. Additionally, if SPD incorporates entropy

coding (e. g. Hoffman or run-length coding) for syndromes at each intermediate node, the

goodput can be further improved. These are the possible future topics to possibly utilize

the optimal rate prediction architecture as a real-time rate-adaptive video streaming

application over any kind of wireless LANS.

117

APPENDIX

List of papers published or submitted

Journal Papers

Y. Cho, S. S. Karande, K. Misra, H. Radha, J. Y00, and J. Hong, “0n Channel
Capacity Estimation and Prediction for Rate Adaptive Wireless Video,” accepted for
publications in IEEE Transactions on Multimedia.

S. Karande, U. Parrika, K. Misra, H. Radha, Y. Cho, J. Y00, and J. Hong, “Side-
Information Aware Non-Homogenous Markov Models F or The 802.11b MC-to-
MC Channel,” submitted to IEEE Communications (under review), 2008.

Y. Cho, H. Radha, and J. Yoo, “Channel Capacity Prediction for Rate Adaptive
Wireless SV ,” submission to European Association for Signal Processing
(EURASIP) on Wireless Communications and Networking.

Y. Cho, H. Radha, and J. Seo, “Multi-hop Rate Adaptive Wireless Scalable Video
Using Syndrome-based Partial Decoding, ” submission to Electronics and
Telecommunications Research Institute Journal.

Conference Paper

Y. Cho, H. Radha, J. Y00, and J. Hong, “A Rate-Distortion Empirical Model for Rate
Adaptive Wireless Scalable Video,” Conference on Information Sciences and
Systems (CISS), Princeton University, March 2008.

Y. Cho, S. A. Khayam, S. S. Karande, H. Radha, J. Kim, and J. Hong, “A Multi-Tier
Model for BER Prediction over Wireless Residual Channels,” Conference on
Information Sciences and Systems (CISS), John Hopkins University, March 2007.

S. Karande, S. Khayam, Y. Cho, K. Misra, H. Radha, J. Kim, and J. Hong, “0n
Channel State Inference and Prediction using Observable Variables in 802.11b
Networks,” IEEE International Conference on Communications (ICC), June 2006.

118

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[3]

[9]

REFERENCES

S. A. Khayam, S. S. Karande, H. Radha and D. Loguinov, “Performance Analysis
and Modeling of Errors and Losses over 802.1 lb LAN S for High-BitIate Real-Time
Multimedia,” Signal Processing: Image Communication, vol. 18, Aug 2003.

S. A. Khayam, S. Karande, M. U. Ilyas, and H. Radha, “Header detection to improve
multimedia quality over wireless networks,” IEEE Trans. On Multimedia, to appear.

S. A. Khayam, S. Karande, M. U. Ilyas, and H. Radha, “Improving wireless
multimedia quality using header detection with priors,” IEEE ICC, 2005.

S. A. Khayam, M. U. Ilyas, K. Porsch, S. Karande, and H. Radha, “A statistical
receiver-based scheme for improved multimedia throughput over wireless networks,”
IEEE ICC, 2004.

H. L. Van Trees, Detection, Estimation, and Modulation Theory, Part I, Wiley: New
York, 2001 .

J. G. Kemeny and J. L. Snell, Finite Markov Chains, Springer-Verlag: New York,
1976.

R. Riedi, M. Crouse, V. Ribeiro, and R. Baraniuk, “A multiﬂactal wavelet model
with application to network trafﬁc,” IEEE Trans. on Info. Theory, (45)3, pp.
992-1018, 1999.

S. A. Khayam and H. Radha, “Linear-complexity models for wireless MAC-to-MAC
channels,” ACM WINET, (11)5, pp. 543-555, 2005.

S. A. Khayam and H. Radha, “Markov and multiﬂactal wavelet models for wireless
MAC-to-MAC channels,” Elsevier Performance Evaluation Journal, to appear.

119

[10]S. A. Khayam and H. Radha, “Constant-complexity models for wireless channels,”
IEEE Infocom, April 2005.

[11]A. Servetti, J .C. De Martin, "Link-Level Unequal Error Detection for Speech
Transmission over 802.11b Networks," Proc. Special Workshop in Maui (SWIM),
Maui, Hawaii, January 2004.

[12]E. Masala, M. Bottero, J .C. De Martin, “Link-Level Partial Checksum for Real-Time
Video Transmission over 802.11 Wireless Networks”, Proceedings of 14th
International Packet Video Workshop (PVW), Irvine, CA, December 2004.

[13]E. Masala, M. Bottero, J.C.] De Martin, “MAC-Level Partial Checksum for H.264
Video Transmission over 802.11 Ad-hoc Wireless Networks” Proceedings of IEEE
61 st Semiannual Vehicular Technology Conference (VT C), May 2005.

[14]E. Masala, A. Servetti, J .C. De Martin, “Standard Compatible Error Correction for
Multimedia Transmissions over 802.11 WLAN”, Proc. of IEEE International
Conference on Multimedia & Expo (ICME), July 2005.

[15]S. Yi, Y. Shan, S. Kalyanaraman and B. Azirni-Sadjadi, "Header Error Protection for
Multimedia Data Transmission in Wireless Ad-hoc Networks", ..

[16]R. Riemann and K. Winstein, “Improving 802.11 Range with Forward Error
Correction”, MIT-CSAIL-TR—ZOOS-Oll

[17]F . Pauchet, C. Guillemot, M. Kerdranvat, S. Pateux, P. Siohan, “Conditions for SVC
bit error resilience testing” Joint Video Team (JVT) 17th Meeting: Nice, FR, 14-21
October, 2005

[18]W. J iang, “A Bit Error Correction without Redundant Data: a Mac layer technique
for 802.11 Networks”, WinMee 2006.

[19]S. Karande and H. Radha, “Hybrid Erasure-Error Protocols for Wireless Video,”
IEEE Transactions on Multimedia, Feb. 2007.

120

[20]S. Karande, U. Parrikar, K. Misra and H. Radha, “On Modeling of 802.1 lb Residue
Errors,” Conference on Information Sciences & Systems (CISS), March 2006.

[21]S. Karande, U. Parrikar, K. Misra and H. Radha, "Utilizing Signal to Silence Ratio
indications for improved Video Communication in presence of 802.11b Residue
Errors ", IEEE International Conference on Multimedia & Expo (ICME), July 2006.

[22]Utpal Parrikar, “On SNR Aware Analysis and Modeling of 802.11b Link-level
Residue Errors”, Master’s Thesis, Electrical and Computer Engineering Department,
Michigan State University.

[23]S. Karande, K. Misra and H. Radha, “CLIX: Network Coding and Cross-layer
Information Exchange of Wireless Video”, IEEE ICIP 2006.

[24]S. Karande, K. Misra, U. Parrikar and H. Radha, “Utility Of Signal To Silence
Ratio’s As A Predictive Tool For Enhancing Cross-layer Multimedia Protocols”
under review, IEEE Transactions on Multimedia.

[25]S. A. Vanstone and P. C. van Oorschot, “An Introduction to Error Correcting Codes
with Applications,” Kluwer Academic Publishers.

[26]R. H. Morelos-Zaragoza, “The Art of Error Correcting Coding,” J ohn-Wiley & Sons
Ltd.

[27]L. Rizzo, “Effective Erasure Codes for Reliable Computer Communication
Protocols”, Computer Communication Review, April 1997.

[28]W.E. Ryan, “An introduction to LDPC codes,” in CRC Handbook for Coding and
Signal Processing for Recoding Systems (B. Vasic, ed.), CRC Press, 2004.

[29]X. Hu, E. Eleftheriou, D. M. Arnold, "Regular and Irregular Progressive Edge-
Growth Tanner Graphs", IEEE Transactions on Information Theory, vol. 51. no. 1,
pp. 386-398, 2003.

121

[30]T. M. Cover and J. A Thomas, “Elements of Information Theory,” Wiley Series in
Telecommunications.

[31]L. Larzon, M. Degermark, and S. Pink, “UDP Lite for Real Time Multimedia
Applications,” IEEE International Conference of Communications (ICC), Vancouver,
June 1999.

[32]A. Singh, A. Konrad, A. D. Joseph, “Performance Evaluation of UDP Lite for
Cellular Video,” NOSSDAV, 2001.

[33]S. Karande, K. Misra, and H. Radha, "Survival Of The F ittest: An Active Queue
Management Technique For Noisy Packet F lows," Advances in Multimedia, vol.
2007, Article ID 64695, 2007.

[34]S. A. Khayam, “Wireless Channel Modeling and Malware Detection using Statistical
and Information Theoretic Measures,” Ph.D dissertation, Dec 2006.

[35]Y. Cho, S. A. Khayam, S. S. Karande, H. Radha, J. Kim, and J. Hong, “A Multi-Tier
Model for BER Prediction over Wireless Residual Channels,” CISS 2007, March
2007. '

[36]D. Aguayo, J. Bicket, S. Biswas, G. Judd, R. Morris, “Link-level Measurements ﬁ‘om
an 802.11b Mesh Network”, SIGCOMM 2004, Aug 2004.

[37]S. Lin and DJ. Costello, Jr., "Error Control Coding: Fundamentals and
Applications", Englewood Cliffs, NJ: Prentice Hall, 1983.

[38]P. Lambert, w. D. Neve, P. E. Neve, and 1. Moerman, “Rate-Distortion Performance
of H.264/A VC Compared to State-oﬁthe-Art Video Codecs,” IEEE Transactions on

Circuits and Systems for Video Technology, Jan 2006.

[39]A. Leon-Garcia, “Probability and Random Processes for Electrical Engineers,”
Addison-Wesley, 2nd ed., 1994.

122

[40]W. Allan, “Time and Frequency (T ime-Domain) Characterization, Estimation, and
Prediction of Precision Clocks and Oscillators,” IEEE Transactions on Ultrasonics,
F erroelectrics, and Frequency Control, UFFC-34, 647-654, 1987.

[41]S. Boyd and L. Vandenberghe, “Convex Optimization,” Cambridge, 2006.

[42]T Wiegand, H Schwarz, A Joch, F Kossentini, G.J. Sullivan, “Rate-constrained
coder control and comparison of video coding standards, ” IEEE Transactions on
Circuits and Systems for Video Technology, vol. 13, p688-703, 2003.

[43]IEEE 802 Working Groups, http://grouper.ieee.org/groups/802/11/.

[44]BGPP Ofﬁcial Website, http://www.3gpp.org/.

[45]S. Karande and H. Radha, “Does relay of corrupted packets increase capacity?,”
IEEE WCNC, Mar. 2005.

[46]F. Long, K-L. Lo, W-C. Siu, and F. Long, “Effect of channel quality estimation error
on the performance of interactive mobile video system,” ISIMP, May 2001.

[47]S. Ray, M. Médard, L. Zheng, and J. Abounadi, “0n the sublinear behavior of
MIMO channel capacity at low SNR, ” ISITA, Oct. 2004.

[48]R. Adler, R. Feldrnan, and M. Taqqu (Eds), A Practical Guide to Heavy Tails:
Statistical Techniques and Applications, Birkhauser, 1st ed., Oct. 1998.

[49]H. S. Wang and N. Maoyeri, “F inite-state Markov channel - A useful model for
radio communication,” IEEE Trans. Veh. Tech., (44)l, 163-171, Feb. 1995.

[50]Q. Zhang and S. A. Kassarn, “Finite-state Markov model for rayleigh fading
channels,” IEEE Trans. Comm, (41)] 1, 1688-1692, Nov. 1999.

[51]S. A. Khayam and H. Radha, “On long-range dependence in wireless residual
channels,” CISS, Mar. 2005.

123

[52]Software for Low Density Parity Check Codes,
“http://www.cs.toronto.edu/~radford/ftp/LDPC-2006—02-08/index.htm1,” last
accessed on Nov 10.

[53]R. W. Yeung, “A First Course in Information Theory,” Kluwer Academic/Plenum
Publishers.

[54]ITU-T Rec. H.264 | ISO/IEC 14496-10 AVC, “Advanced Video Coding for Generic
Audiovisual Services,” version 8.9, 2007.

[55]Y. Cho, S. S. Karande, K. Misra, H. Radha, J. Yoo, and J. Hong, “0n Channel
Capacity Estimation and Prediction for Rate Adaptive Wireless Video s,” to be
appeared in IEEE Transactions on Multimedia.

[56]Y. Cho, H. Radha, J. Y, and J. Hong, “A Rate-Distortion Empirical Model for Rate
Adaptive Wireless Scalable Video,” CISS 2008, March 2008.

[57]“ANSI/IEEE Std 802, Part 11: Wireless LAN Medium Access Control (MAC) and
Physical Layer (PHY) Speciﬁcations,” 1999.

[58]“ANSI/IEEE Std 802, Part 15.4: Low Rate Wireless Personal Area Networks,” Sep 8,
2006. '

[59]Y. C. Hu and D. B. Johnson, “Design and demonstration of live audio and video over
multihop wireless ad-hoc networks,” in: Proceedings of the MILCOM2002 (2002).

[60]C.M. Cordeiro, H. Gossain, and DP. Agrawal, “Multicast over Wireless Mobile Ad-
hoc Networks: Present and Future Directions, ” IEEE Network, vol. 17, no. 1, 2003,
pp. 52-59.

[61]J. N. Laneman and G. W. Womell, “An Eﬁ'icient Protocol for Realizing Distributed
Spatial Diversity in Wireless Ad—Hoc Networks,” in ARL F edLab Symposium on
Advanced Telecommunications and Information Distribution (ATIRP-2001), College
Park, MD, March 2001.

124

  
  

     
    

        

VERSITV LIB

”'il'lliiiiiliiiil]11111111111“

063 0259