CAPACITYASSURANCEINHOSTILENETWORKS
By
JianLi
ADISSERTATION
Submittedto
MichiganStateUniversity
inpartialentoftherequirements
forthedegreeof
ElectricalEngineering{DoctorofPhilosophy
2015
ABSTRACT
CAPACITYASSURANCEINHOSTILENETWORKS
By
JianLi
Linearnetworkcodingprovidesanewcommunicationdiagramtotlyincreasethe
networkcapacitybyallowingtherelaynodestoencodetheincomingmessages.However,
thiscommunicationdiagramisfragiletocommunicationerrorsandpollutionattacks.How
tocombaterrorswhilemaintainingthenetworkisachallengingresearchproblem.
Inthisdissertation,westudyhowtocombattheattacksinbothxednetworkcodingand
randomnetworkcoding.
Fornetworkcoding,weprovideanovelmethodologytocharacterizelinearnetwork
codingthrougherrorcontrolcoding.Weproposetomapeachlinearnetworkcodingto
anerrorcontrolcoding.Underthismapping,thesetwocodesareessentiallyidenticalin
algebraicaspects.Meanwhile,weproposeanovelmethodologytocharacterizealinear
networkcodingthroughaseriesofcascadedlinearerrorcontrolcodes,andtodevelopnetwork
codingschemesthatcancombatnodecompromisingattacks.
Forrandomnetworkcoding,weproposeanewerror-detectionanderror-correction(EDEC)
schemetodetectandremovemaliciousattacks.TheproposedEDECschemecanmaintain
throughputunchangedwhenmoderatenetworkpollutionexistswithonlyaslightincrease
incomputationaloverhead.ThenweproposeanimprovedLEDECschemebyintegrating
thelow-densityparitycheck(LDPC)decoding.Ourtheoreticalanalysis,performanceevalu-
ationandsimulationresultsusingns-2simulatordemonstratethattheLEDECschemecan
guaranteeahighthroughputevenforheavilypollutednetworkenvironment.
Distributedstorageisanaturalapplicationofnetworkcoding.Itplaysacrucialrolein
thecurrentcloudcomputingframeworkinthatitcanprovideadesignbetweensecu-
ritymanagementandstorage.Regeneratingcodebasedapproachattracteduniqueattention
becauseitcanachievetheminimumstorageregeneration(MSR)pointandminimumband-
widthregeneration(MBR)pointfordistributedstorage.Sincethen,Reed-Solomoncode
basedregeneratingcodes(RS-MSRcodeandRS-MBRcode)weredeveloped.Theycanalso
maintaintheMDS(maximumdistanceseparable)propertyincodereconstruction.However,
inthehostilenetworkwherethestoragenodescanbecompromisedandthepacketscanbe
tamperedwith,thestoragecapacityofthenetworkcanbetlyted.Inthis
dissertation,weproposeaHermitiancodebasedminimumstorageregenerating(H-MSR)
codeandaHermitiancodebasedminimumbandwidthregenerating(H-MBR)code.We
provethattheycanachievethetheoreticalMSRboundandMBRboundrespectively.We
thenproposedataregenerationandreconstructionalgorithmsfortheH-MSRcodeandthe
H-MBRcodeinbotherror-freenetworksandhostilenetworks.Theoreticalevaluationshows
thatourproposedschemescandetecttheerroneousdecodingsandcorrectmoreerrorsin
thehostilenetworkthantheRS-MSR/RS-MBRcodewiththesamecoderaterespectively.
InspiredbythenovelconstructionofHermitiancodebasedregeneratingcodes,anatural
questionishowtoconstructoptimalregeneratingcodesbasedonthelayeredstructurelike
Hermitiancodeindistributedstorage.ComparedtotheHermitianbasedcode,thesecodes
havesimplerstructuresandareeasiertounderstandandimplement.Weproposetwooptimal
constructionsofMSRcodesthroughrate-matchinginhostilenetworks:2-layerrate-matched
MSRcodeand
m
-layerrate-matchedMSRcode.Forthe2-layercode,wecanachievethe
optimalstorageforgivensystemrequirements.Ourcomprehensiveanalysisshows
thatourcodecandetectandcorrectmaliciousnodeswithhigherstoragecompared
totheRS-MSRcode.Thenweproposethe
m
-layercodebyextendingthe2-layercodeand
achievetheoptimalerrorcorrectionbymatchingthecoderateofeachlayer'sMSR
code.Wealsodemonstratethattheoptimizedparametercanachievethemaximumstorage
capacityunderthesameconstraint.ComparedtotheRS-MSRcode,ourcodecanachieve
muchhighererrorcorrection.Theoptimized
m
-layercodealsohasbettererror
correctioncapabilitythantheH-MSRcode.
Copyrightby
JIANLI
2015
ThisdissertationisdedicatedtomywifeZhang,Ying.
v
ACKNOWLEDGMENTS
DuringmyPh.D.study,Dr.JianRenhasbeenanexcellentadvisorandmentor.Iwould
liketothankDr.JianRenforbringingmeintotheacademicarea.Henotonlyteaches
mesolidknowledgeincybersecurityarea,butalsohelpsmeestablishthemethodologyof
doingseriousandmeaningfulresearch.Withouthissupportthisdissertationwouldnothave
happened.
Ialsowouldliketoexpressgratitudetotheprofessorsinmycommittee:Dr.Subir
Biswas,Dr.FathiSalem,andDr.RichardEnbodyfromdepartmentofComputerScience.
Iwouldnothaveachievedcurrentgoalswithouttheirhelpfuladvice.
IwanttothankDr.TongtongLiforthoseenlighteningdiscussions.Shehastaughtme
somuchandgreatlyexpandedmyresearchvisions.
AndIcannotimaginewhatitwouldbewithoutthegreatsupportfrommyfamily,my
labmatesandmyfriends.Iloveyouall.
vi
TABLEOFCONTENTS
LISTOFTABLES
...................................
xi
LISTOFFIGURES
...................................
xii
KEYTOABBREVIATIONS
..............................
xv
LISTOFALGORITHMS
................................
xvi
CHAPTER1INTRODUCTION
...........................
1
1.1CombatingPollutionAttacksinNetworkCoding...............1
1.1.1BriefReviewofNetworkCoding.....................1
1.1.2SecurityProblemsofNetworkCoding..................2
1.1.3ExistingworkonCombatingPollutionAttacksinNetworkCoding..3
1.1.4SummaryoftheLimitationsofexistingworkonCombatingPollu-
tionAttacksinNetworkCoding.....................4
1.2DistributedStorageinHostileNetworks....................5
1.2.1BriefReviewofCurrentAlgorithmsforDistributedStorage.....5
1.2.2ExistingWorkonDistributedStorage..................7
1.2.3ExistingWorkonDistributedStorageinHostileNetworks......8
1.2.4LimitationsofExistingWorkonDistributedStorageinHostileNetworks9
1.2.5withExistingWorkonSecureNetworkCommunication..9
1.3ProposedResearchDirections..........................9
1.3.1DirectionsforCombatingPollutionAttacksinNetworkCoding...9
1.3.2DirectionsforDistributedStorageinHostileNetworks........10
1.4Overviewofthedissertation...........................11
1.4.1MajorContributions...........................11
1.4.2Structure.................................13
CHAPTER2PRELIMINARY
............................
15
2.1NetworkCoding..................................15
2.2ErrorControlCoding...............................16
2.2.1ErrorDetection..............................17
2.2.2ErrorCorrection.............................18
2.2.3SomePropertiesofErrorControlCodes................19
2.3RegeneratingCode................................20
2.4HermitianCode..................................21
CHAPTER3COMBATINGPOLLUTIONATTACKSFORFIXEDNETWORKS
.
24
3.1CharacterizationofLinearNetworkCodingforPollutionAttacks......24
3.1.1ModelsandAssumptions.........................24
3.1.2AnIllustrativeExample.........................25
vii
3.1.3RelationshipbetweenNetworkCodingandErrorControlCodingin
Point-to-PointCommunication.....................27
3.1.3.1The.........................27
3.1.3.2TheNecessity..........................31
3.1.3.3ApplicationinCombatingpollutionattacks.........33
3.1.4MulticastCase..............................34
3.1.4.1The.........................34
3.1.4.2TheNecessity..........................35
3.2ACascadedErrorControlCodingApproach..................35
3.2.1ModelsandAssumptions.........................35
3.2.2AnIllustrativeExample.........................36
3.2.3CharacterizationofNetworkCodingusingCascadedErrorControl
CodinginPoint-to-PointCommunication................38
3.2.3.1The.........................38
3.2.3.2TheNecessity..........................41
3.2.3.3ApplicationinCombatingpollutionattacks.........42
3.2.4MulticastCase..............................43
CHAPTER4COMBATINGPOLLUTIONATTACKSFORRANDOMNETWORKS45
4.1System/AdversarialModelsandAssumptions.................45
4.2ProposedEDECScheme.............................46
4.2.1EDECScheme..............................47
4.2.1.1LimitationsofErrorControlCode..............47
4.2.1.2MoErrorControlCode.................47
4.2.1.3PerformanceofMoErrorControlCode........49
4.2.1.4AlgorithmsforEDECScheme.................50
4.2.2Simulationinns-2.............................53
4.2.2.1SimulationPlatform......................54
4.2.2.2NodesDesign..........................56
4.2.2.3SimulationResults.......................57
4.3LDPCDecodingandLEDECScheme......................59
4.3.1LDPCCode................................59
4.3.2DecodingofLDPCCode.........................60
4.3.3RelationshipBetweenLinearNetworkCodeandLDPCCode.....61
4.3.4LEDECSchemeUsingBPA.......................62
4.3.5TheoreticalAnalysis...........................62
4.3.6PerformanceAnalysisandSimulation..................64
4.3.6.1NodesDesign..........................64
4.3.6.2SimulationResults.......................65
CHAPTER5DISTRIBUTEDSTORAGEINHOSTILENETWORKS|HER-
MITIANCODEBASEDREGENERATINGCODESAPPROACH
..
71
5.1System/AdversarialModelsandAssumptions.................71
5.2AnIllustrativeExample.............................72
viii
5.2.1RSCodeinDistributedStorage.....................72
5.2.2HermitianCodeinDistributedStorage.................75
5.2.3Inspirationfromthisexample......................77
5.3HermitianCodeBasedMSRRegeneratingCode(H-MSRCode).......77
5.3.1EncodingH-MSRCode..........................77
5.3.2RegenerationoftheH-MSRCodeintheError-freeNetwork.....83
5.3.3RegenerationoftheH-MSRCodeintheHostileNetwork.......85
5.3.3.1DetectionMode.........................86
5.3.3.2RecoveryMode.........................89
5.3.4ReconstructionoftheH-MSRCodeintheError-freeNetwork....89
5.3.5ReconstructionoftheH-MSRCodeintheHostileNetwork......91
5.3.5.1DetectionMode.........................91
5.3.5.2RecoveryMode.........................100
5.3.6RecoverMatrices
S
l;t
;T
l;t
from
q
2
StorageNodes...........100
5.4HermitianCodeBasedMBRRegeneratingCode(H-MBRCode).......102
5.4.1EncodingH-MBRCode.........................102
5.4.2RegenerationoftheH-MBRCodeintheError-freeNetwork.....105
5.4.3RegenerationoftheH-MBRCodeintheHostileNetwork.......106
5.4.3.1DetectionMode.........................107
5.4.3.2RecoveryMode.........................108
5.4.4ReconstructionoftheH-MBRcodeintheError-freeNetwork....108
5.4.5ReconstructionoftheH-MBRcodeintheHostileNetwork......110
5.4.5.1DetectionMode.........................110
5.4.5.2RecoveryMode.........................112
5.4.6RecoverMatrices
M

l
;t
from
q
2
StorageNodes.............113
5.5PerformanceAnalysis...............................114
5.5.1ScalableErrorCorrection........................114
5.5.1.1Errorcorrectionfordataregeneration............114
5.5.1.2Errorcorrectionfordatareconstruction...........115
5.5.2ErrorCorrectionCapability.......................115
5.5.3ComplexityDiscussion..........................117
5.5.3.1H-MSRregeneration......................118
5.5.3.2H-MSRreconstruction.....................118
CHAPTER6DISTRIBUTEDSTORAGEINHOSTILENETWORKS|OPTI-
MALCONSTRUCTIONOFREGENERATINGCODESTHROUGH
RATE-MATCHINGAPPROACH
...................
119
6.1System/AdversarialModelsandAssumptions.................119
6.2ComponentCodesofRate-matchedMSRCode................120
6.2.1FullRateCode..............................120
6.2.1.1Encoding............................120
6.2.1.2Regeneration..........................121
6.2.1.3Reconstruction.........................121
6.2.2FractionalRateCode...........................123
ix
6.2.2.1Encoding............................123
6.2.2.2Regeneration..........................124
6.2.2.3Reconstruction.........................125
6.32-LayerRate-matchedMSRCode........................125
6.3.1RateMatching..............................126
6.3.2Encoding.................................126
6.3.3Regeneration...............................127
6.3.4ParametersOptimization.........................128
6.3.5Reconstruction..............................129
6.3.5.1OptimizedParameters.....................130
6.3.6PerformanceEvaluation.........................131
6.4
m
-LayerRate-matchedMSRCode.......................132
6.4.1RateMatchingandParametersOptimization.............133
6.4.1.1Optimizationfor
m
=3....................134
6.4.1.2EvaluationoftheOptimizationfor
m
=3..........135
6.4.1.3GeneralOptimizationResult.................136
6.4.1.4EvaluationoftheOptimization................139
6.4.2PracticalConsiderationoftheOptimization..............141
6.4.2.1EvaluationoftheOptimalErrorCorrection...142
6.4.3Encoding.................................142
6.4.4Regeneration...............................143
6.4.5Reconstruction..............................144
6.4.5.1OptimizedParameters.....................145
CHAPTER7CONCLUSIONS
............................
146
BIBLIOGRAPHY
....................................
148
x
LISTOFTABLES
Table2.1
q
3
rationalpointsoftheHermitiancurve..................23
Table4.1Fourcasesofdecodedcodewordsinmoerrorcontrolcode......49
xi
LISTOFFIGURES
Figure1.1Asimpleexampleofnetworkcoding....................2
Figure1.2Diagramofdistributedstorage........................5
Figure2.1Illustrationofregeneratingcode.......................20
Figure3.1Exampleforillustratingthemainidea...................26
Figure3.2Theprocessesoftransferringnetworkcodeintobipartitegraph.....26
Figure3.3ThecorrespondingbipartitegraphofFigure3.1..............26
Figure3.4Equivalenceofthreekindsofnodesinnetworkcoding...........27
Figure3.5Anexampleofpoint-to-pointnetworkcoding...............29
Figure3.6ThecorrespondingbipartitegraphofFigure3.5..............29
Figure3.7Implementthe(7
;
4)Hammingcodeinnetworkcoding..........34
Figure3.8TransferthenetworkcodingschemeinFigure3.1intoa3-levelcascaded
codingbyadding2virtualnodes.......................37
Figure3.9Thecorrespondingbipartitegraphsof3cascadedlevelsinFigure3.8..37
Figure3.10Transferincomingedgesofnodeshavingmultipleincomingedgesby
addingvirtualnodes.............................38
Figure3.11Partitionanetworkcodeintoseverallevels.................39
Figure3.12ThecorrespondingcascadedbipartitegraphofFigure3.5.........40
Figure3.13Implementa2levelcascadederrorcontrolcodeinnetworkcoding....44
Figure3.14ThecorrespondingcascadedbipartitegraphofFigure3.13........44
Figure4.1Applyerrorcontrolcodesinlinearnetworkcoding.............46
Figure4.2Limitationsoferrorcontrolcodes......................47
Figure4.3TheencodingprocessofmoerrorcontrolcodeinEDECscheme..48
Figure4.4ThedecodingprocessofmoerrorcontrolcodeinEDECscheme..49
xii
Figure4.5PerformanceofmoerrorcontrolcodeinEDECschemewhen
k
=650
Figure4.6PerformanceofmoerrorcontrolcodeinEDECschemewhen
k
=451
Figure4.7Simulationscenario..............................55
Figure4.89timeslotstoavoidpacketscollisions...................55
Figure4.9ThroughputcomparisonbetweenEDECschemeandtheerror-detection
schemesbasedonthenumberofbitcorruptedineachsymbol|for
smallnumberoferrors............................58
Figure4.10ThroughputcomparisonbetweenEDECschemeandtheerror-detection
schemesbasedonthenumberofbitcorruptedineachsymbol|forlarge
numberoferrors...............................59
Figure4.11AnillustrativeexampleofparitycheckmatrixandTannergraph....60
Figure4.12MainideaoftheLEDECscheme......................63
Figure4.13FlowchartoftheLEDECalgorithmimplementedinthesinknodes...66
Figure4.14Anexampleoftheparitycheckmatrixinnetworkcoding.........66
Figure4.15Performancecomparisonforsmallnumberofmaliciousnodes.......67
Figure4.16Performancecomparisonformediumnumberofmaliciousnodes.....68
Figure4.17Performancecomparisonforlargenumberofmaliciousnodes.......69
Figure4.18Performancecomparisonbasedonmediumnumberofmaliciousnodes
(randomnumberoferrors)..........................70
Figure5.1Anexampleillustrationofmatrix
S
.....................78
Figure5.2Illustrationofstoringthecodewordmatricesindistributedstoragenodes81
Figure5.3Anexampleillustrationofmatrix
M
....................103
Figure5.4ComparisonoferrorcorrectioncapabilitybetweentheH-MSRcodeand
theRS-MSRcode...............................117
Figure6.1Thenumberoffractional/fullratecodeblocksfort
P
det
.....131
Figure6.2ratiosbetweenthe2-layerrate-matchedMSRcodeandthe
RS-MSRcodefort
P
det
.......................132
xiii
Figure6.3Comparisonoftheerrorcorrectioncapabilitybetween
m
-layerrate-
matchedMSRcodefor
m
=3andRS-MSRcode.............136
Figure6.4Comparisonoferrorcorrectioncapabilitybetweenthe
m
-layerrate
matchedMSRcodeandtheH-MSRcode..................139
Figure6.5Theoptimalerrorcorrectionofthe
m
-layerrate-matchedMSR
codeundertmfor2

m

16...................140
Figure6.6Theoptimalerrorcorrectionfor2

m

16..........142
Figure6.7Latticeofreceivedhelpsymbolsforregeneration..............143
xiv
KEYTOABBREVIATIONS
AODVAdhocOn-DemandDistanceVector
BECBinaryErasureChannel
BPABeliefPropagationAlgorithm
CRCCyclicRedundancyCheck
DCDataCollector
EDECErrorDetectionandErrorCorrection
GFGaloisField
H-MBRHermitianCodeBasedMinimumBandwidthRegeneration
H-MSRHermitianCodeBasedMinimumStorageRegeneration
LDPCLowDensityParityCheck
LEDECLDPCBasedErrorDetectionandErrorCorrection
MACMediaAccessControl
MBRMinimumBandwidthRegeneration
MSRMinimumStorageRegeneration
MDSMaximumDistanceSeparable
P2PPeertoPeer
RSReed-Solomon
RS-MBRReed-SolomonCodeBasedMinimumBandwidthRegeneration
RS-MSRReed-SolomonCodeBasedMinimumStorageRegeneration
xv
LISTOFALGORITHMS
Algorithm4.1EDECAlgorithmforSourceNodes.................
52
Algorithm4.2EDECAlgorithmforRelayNodes..................
53
Algorithm4.3EDECAlgorithmforSinkNodes..................
53
Algorithm4.4BPADecodingAlgorithmforBEC.................
61
Algorithm5.1EncodingH-MSRCode........................
81
Algorithm5.2
z
0
RegeneratesSymbolsoftheFailedNode
z
............
85
Algorithm5.3(DetectionMode)
z
0
RegeneratesSymbolsoftheFailedNode
z
.
87
Algorithm5.4(RecoveryMode)
z
0
RegeneratesSymbolsoftheFailedNode
z
..
90
Algorithm5.5DCReconstructstheOriginalFile.................
91
Algorithm5.6(Detectionmode)DCReconstructstheOriginalFile.......
92
Algorithm5.7(RecoveryMode)DCReconstructstheOriginalFile........
100
Algorithm5.8EncodingH-MBRCode.......................
104
Algorithm5.9
z
0
RegeneratesSymbolsoftheFailedNode
z
............
106
Algorithm5.10(DetectionMode)
z
0
RegeneratesSymbolsoftheFailedNode
z
.
107
Algorithm5.11(RecoveryMode)
z
0
RegeneratesSymbolsoftheFailedNode
z
..
109
Algorithm5.12DCReconstructstheOriginalFile.................
109
Algorithm5.13(DetectionMode)DCReconstructstheOriginalFile.......
111
Algorithm5.14(RecoveryMode)DCReconstructstheOriginalFile........
113
Algorithm6.1
z
0
RegeneratesSymbolsoftheFailedNode
z
............
121
Algorithm6.2Regenerationforthe2-layerRate-matchedMSRCode......
127
Algorithm6.3Reconstructionforthe2-layerRate-matchedMSRCode.....
130
Algorithm6.4Regenerationforthe
m
-layerRate-matchedMSRCode......
144
Algorithm6.5Reconstructionforthe
m
-layerRate-matchedMSRCode.....
145
xvi
CHAPTER1
INTRODUCTION
Inthisdissertation,wehavedoneresearchesonensuringthenetworkcapacityintwohot
researchareas:networkcodinganddistributedstorage.
1.1CombatingPollutionAttacksinNetworkCoding
Networkcodingisanewcommunicationdiagramthatisdesignedtoimprovethethroughput
androbustnessinnetworkenvironment.Thecorenotationofnetworkcodingisthatit
allowstheparticipatingnodestoencodeincomingpacketsatintermediatenetworknodesin
awaythatwhenasinkreceivesthepackets,itcanrecovertheoriginalmessage.Network
codingprovidesatradebetweenmaximummulticastwrateindirectednetworksand
computationalcomplexity.However,inthecontextofnetworkcoding,allparticipating
nodesmustencodetheincomingpacketsaccordingtoacodingalgorithm.Ifapacket
fromanintermediaterelaynodeiscorruptedorbeingtampered,theentirecommunications
maybedisrupted.Onemainpurposeofthisdissertationistodevelopschemesthatcan
combatnetworkpollutionandmaliciousattacksfromthenetworknodesbasedonerror
controlcoding.
1.1.1BriefReviewofNetworkCoding
Networkcodingwasintroducedintheseminalpaperby[1].[2]formulatedthemulticast
probleminnetworkcodingasthewfromthesourcetoeachreceivingnode.They
provedthatlinearcodingisttoachievetheoptimum.Thisworkmadethenetwork
codingsimplerandmorepractical.[3]haveshownthatlinearcodesarettoachieve
themulticastcapacitybycodingonalargeenough[4]haveshownthatusingofrandom
1
Figure1.1Asimpleexampleofnetworkcoding
linearnetworkcodingisamorepracticalwaytodesignlinearcodes.[5]haveappliedthe
principlesofrandomnetworkcodingtothecontextofpeer-to-peer(P2P)contentdistribu-
tion,andhaveshownthatledownloadingtimescanbereduced.Sinceithasbeenproved
thatlinearnetworkcodesarettoachievethemulticastcapacity,wewillfocusour
discussiononlinearnetworkcodinginthisdissertation.
ThemainideaofnetworkcodingcanbeillustratedthroughFigure1.1.Assumethe
capacityofalltheedgesis
C
,thecapacityofthisnetworkis2
C
accordingtothew
min-cuttheorem.Onlybyencodingtheincomingpacketsymbols
x
1
;x
2
atnode3,this
networkcanachievethemaximumcapacity.
1.1.2SecurityProblemsofNetworkCoding
Fornetworkcodingtoachievetheexpectedbs,alltheparticipatingnodesinthe
networkshouldbefreeofnetworkpollutionandmaliciousattacks.Supposeunderthelinear
networkcoding,thesinknodereceives
m
packetsymbols
y
1
;:::;y
m
.Itdecodestheoriginal
messagesymbols
x
1
;:::;x
l
bysolvingasetoflinearequations:
Bx
=
2
6
6
6
6
6
6
6
4

11

12
:::
1
l

21

22
:::
2
l
.
.
.
.
.
.
.
.
.
.
.
.

m
1

m
2
:::
ml
3
7
7
7
7
7
7
7
5
2
6
6
6
6
6
6
6
4
x
1
x
2
.
.
.
x
l
3
7
7
7
7
7
7
7
5
=
2
6
6
6
6
6
6
6
4
y
1
y
2
.
.
.
y
m
3
7
7
7
7
7
7
7
5
:
(1.1)
2
Ifalltherelaynodesencodecorrectlyand
m

l
,wecandecodeallthemessagesymbols
successfully.However,ifthereareadversariesinthenetworkthatcanmodifythecontents
ofthemessagesandsendthemtothesucceedingrelaynodes,theequationsabovemaynot
besolvedsuccessfully.Sothecommunicationfails.Inaddition,foralargescalenetwork,a
smallerroroccurringatthebeginningofrelaymaytomostofthemessagesinthe
end.Thiscancauseatwasteofnetworkresourcesandsometimescanevenruin
thewholenetworkcommunication.Thiskindofattackiscalled
pollutionattack
.
1.1.3ExistingworkonCombatingPollutionAttacksinNetworkCoding
Existingworkonpollutioneliminationcanlargelybedividedintoerror-detectionbased
schemesanderror-correctionbasedschemes.Forerror-detectionbasedschemes,theerrors
arenormallydetectedattheintermediateforwardnodes,whileforerror-correctionbased
schemes,theerrorsaregenerallycorrectedatthesinknode.Whiletheerror-correction
basedschemesseemtobemoreappealing,thecomplexityforencodinganddecodingis
relativelyhigh.Italsocomeswitharelativelyhighercomputationaloverhead.In[6{8],
classicnetworkerrorcorrectioncodeswerestudiedfollowingtheworkof[9].Theexistence
andconstructionofMDSnetworkerrorcorrectioncodeswerealsostudiedin[10{19].In[20],
theauthorsproposedtousenetworkerrorcorrectioncodetolocatethemaliciousattackers.
Thedecodingofnetworkerrorcorrectioncodewasstudiedin[21].Anothertypeoferror-
correctionbasedschemesuserankmetriccodestocorrecttheerrorsinthesinknodes:[22{27].
In[28],Jaggi
etal
.developedatwo-partrate-regionfortheircodesbasedonBECchannel
codes.[29{31]studiedthetheoreticalnetworkcapacityunderpollutionattacks.
Theerror-detectionbasedschemesareattractiveinsomenetworkscenarioswherethenet-
worktopologyisunknown.In[32],Zhen
etal
.proposedaprobabilistickeypre-distribution
andmessageauthenticationcodesbasedschemeagainstpollutionattacks.Theirscheme
istintheXORnetworkcodingenvironments.Krohn
etal
.proposedtousehomo-
morphichashfunctions[33]toguaranteethecorrectnessofnetworkw.Themainideais
3
thateachintermediatenodewillcheckthecorrectnessofthepackets.Ifapacketfailsatan
intermediatenodevitwillbediscarded.Thisapproachcanreducethecommu-
nicationoverheadandcanbeusedinrandomnetworkcoding.However,thecomputational
complexityisstillveryhigh.Whenthenetworkscaleislarge,computingtoomanyhash
valuesalsocreateshighdelay.Similarly,[34]usedthecryptographicideatocaptureand
discardthecorruptedpackets.Othererror-detectionbasedschemeswerestudiedin[35{43].
Toaddressthecomputationallimitations,[44]developedasimpleerror-detectionbased
Null
Key
scheme.Themainideaistopartitionthe
n
-dimensionallinearspaceover
GF
(
q
n
)into
twoorthogonalsubspacesofdimension
k
(symbolsubspace)and
n

k
(nullkeyspace).
Comparingtothehomomorphichashfunction,theNullKeyschemeismuchmoreet
andhasvirtuallynomessagedelay.Foralltheseschemesabove,allcorruptedpacketswill
bediscarded.Inpacketizednetworks,alargemessageisdividedintosmallpackets.Ifa
maliciousnodecancorruptonefragment(packet)inthewholemessage,accordingtothe
approachesdescribedintheerror-detectionbasedschemes,thisfragmentwillbediscarded.
Inthisway,thenettransmissioncanbeclosetozero.
Therearealsootherapproachestoimprovetheerrorresilienceinnetworkcoding[45{
53],includingdesigningsecureprotocols,combiningnetworkcodeswithothercodesand
implementingsecurenetworkcodesinspscenarios.
1.1.4SummaryoftheLimitationsofexistingworkonCombatingPollution
AttacksinNetworkCoding
Error-correctionbasedschemesfornetworks

Highcomplexityforencodinganddecoding.

Highcomputationaloverhead.
Error-detectionbasedschemesforrandomnetworks:

Highcomputationaloverheadandcommunicationdelayforsomeschemes.
4
Figure1.2Diagramofdistributedstorage

Lowtransmissionciencyifamaliciousnodecontinuestocorruptonemessagepacket.
1.2DistributedStorageinHostileNetworks
Distributedstorageisanon-demandnetworkdatastorageandaccessparadigm.Thedis-
tributeddatastoragearchitecturemodel(inFigure1.2)distributesthedatabasetomultiple
serversinmanylocationsacrosstheparticipatingnetworkinthestoragecloud.Underthis
model,protecteddataisdistributedonserversinmanylocationsacrosstheparticipating
network.EachlocationisdirectlypluggedintotheInternet.Thesedistributedserversmay
evenbeuntrustedandunreliable.Ifanattackismadeonthedatainonelocation,ortryto
jamthecommunications,onlyasmallamountofbackedupdataisimpacted.Inaddition,
sincethepotentialstorageisdispersedtomanylocations,accesstodatadoesnotcome
underthesamebandwidthconstraintssinceeachlocationhasitsownpipetotheInternet.
Thedecentralizationalsoshrinksthefootprintfordataattacks,sinceabreachofonedata
centerdoesnotexposeallbackedupdatatotheattacker.
1.2.1BriefReviewofCurrentAlgorithmsforDistributedStorage
Toensureaccessibilityofremotelystoreddataatanytime,atypicalsolutionistostore
thedataacrossmultipleserversorclouds,ofteninareplicatedfashion.Datareplication
notonlylacksyindatarecovery,butalsorequiressecuredatamanagementforthe
storeddata.
5
Itiswellknownthatsecuritydatamanagementisgenerallyverycostlyandveryhardto
defendagainstcompromisingattacks.Distributeddatastorageprovidesanelegant
betweenthecostlysecuredatamanagementtaskandthecheapstoragemedia.Themain
ideaisinsteadofstoringtheentiredatainoneserver,wecansplitthedatainto
n
data
components.Theoriginaldatacanberecoveredonlywhentherequired(threshold)number
ofcomponents,say
k
,arecollected.Theoriginaldataisinformationtheoreticallysecure
foranyonewhocanaccesseitheranindividualcomponentormultiplecomponentswhen
thenumberofcomponentscombinedislessthanthethreshold
k
.Inthiscase,whenthe
individualcomponentsarestoreddistributivelyacrossmultiplecloudstorageservers,each
cloudstorageserveronlyneedstoassuredataintegrityanddataavailability.Thecostlydata
encryptionandsecurekeymanagementmaynolongerbeneededanymore.Thedistributed
cloudstoragecanalsoincreasedataavailabilitywhilereducingnetworkcongestionthat
leadstoincreasedresiliency.Apopularapproachistoemployan(
n;k
)maximumdistance
separable(MDS)codesuchasanReed-Solomon(RS)code[54,55].ForRScode,thedatais
storedin
n
storagenodesinthenetwork.Thedatacollector(DC)canreconstructthedata
byconnectingtoany
k
healthynodes.
WhileRScodeworksperfectinreconstructingthedata,itlacksscalabilityinrepairing
orregeneratingafailednode.Todealwiththisissue,theconceptofregeneratingcode
wasintroducedin[56].Themainideaoftheregeneratingcodeistoallowareplacement
nodetoconnecttosomeindividualnodesdirectlyandregenerateasubstituteofthefailed
node,insteadofrecoveringtheoriginaldatathenregeneratingthefailedcomponent.
Inthisway,therecoveryproblemofthedistributedstoragecanbeviewedasamulticasting
problemwhichcanbesolvedusingnetworkcoding.Networkcodingbecomesthebaseofthe
regeneratingcode.
ComparedtotheRScode,regeneratingcodeachievesanoptimalbetweenband-
widthandstoragewithintheminimumstorageregeneration(MSR)andtheminimumband-
widthregeneration(MBR)points.RScodebasedMSR(RS-MSR)codeandMBR(RS-
6
MBR)code[57]havebeenexplicitlyconstructed.However,theexistingresearcheitherhas
noerrordetectioncapability,orhastheerrorcorrectioncapabilitylimitedbytheRScode.
Moreover,theschemeswitherrorcorrectioncapabilityareunabletodeterminewhetherthe
errorcorrectionissuccessful.
1.2.2ExistingWorkonDistributedStorage
Whenastoragenodeinthedistributedstoragenetworkthatemployingtheconventional
(
n;k
)RScode(suchasOceanStore[54]andTotalRecall[55])fails,thereplacementnode
connectsto
k
nodesanddownloadsthewholetorecoverthesymbolsstoredinthefailed
node.Thisapproachisawasteofbandwidthbecausethewholehastobedownloaded
torecoverafractionofit.Toovercomethisdrawback,Dimakis
etal
.[56]introducedthe
conceptionof
f
n;k;d;;;B
g
regeneratingcodebasedonthenetworkcoding.Inthecon-
textofregeneratingcode,thecontentsstoredinafailednodecanberegeneratedbythe
replacementnodethroughdownloading

helpsymbolsfrom
d
helpernodes.Theband-
widthconsumptionforthefailednoderegenerationcouldbefarlessthanthewholeA
datacollector(DC)canreconstructtheoriginalestoredinthenetworkbydownloading

symbolsfromeachofthe
k
storagenodes.In[56],theauthorsprovedthatthereisa
betweenbandwidth

andpernodestorage

.Theytwooptimalpoints:minimum
storageregeneration(MSR)andminimumbandwidthregeneration(MBR)points.Cur-
rentlytherearemanyliteraturesfocusingontheoptimalregeneratingcodesdesign:[58{69].
In[70,71]theimplementationoftheregeneratingcodewerestudied.
Theregeneratingcodecanbedividedintofunctionalregenerationandexactregeneration.
Inthefunctionalregeneration,thereplacementnoderegeneratesanewcomponentthatcan
functionallyreplacethefailedcomponentinsteadofbeingthesameastheoriginalstored
component.[72]formulatedthedataregenerationasamulticastnetworkcodingproblemand
constructedfunctionalregeneratingcodes.[73]implementedarandomlinearregenerating
codesfordistributedstoragesystems.[74]provedthatbyallowingdataexchangeamongthe
7
replacementnodes,abetterbetweenrepairbandwidth

andpernodestorage

can
beachieved.Intheexactregeneration,thereplacementnoderegeneratestheexactsymbols
ofafailednode.[75]proposedtoreducetheregenerationbandwidththroughalgebraic
alignment.[76]providedacodestructureforexactregenerationusinginterferencealignment
technique.[57]presentedoptimalexactconstructionsofMBRcodesandMSRcodesunder
product-matrixframework.Thisistheworkthatallowsindependentselectionofthe
nodesnumber
n
inthenetwork.
1.2.3ExistingWorkonDistributedStorageinHostileNetworks
NoneoftheseworksinSection1.2.2consideredcoderegenerationundernodecorruption
oradversarialmanipulationattacksinhostilenetworks.Infact,alltheseschemeswillfail
inbothregenerationandreconstructioniftherearenodesinthestoragecloudsendingout
incorrectresponsestotheregenerationandreconstructionrequests.
In[77],theByzantinefaulttoleranceofregeneratingcodeswerestudied.In[78],the
authorsdiscussedtheamountofinformationthatcanbesafelystoredagainstpassiveeaves-
droppingandactiveadversarialattacksbasedontheregenerationstructure.In[79],the
authorsproposedtoaddCRCcodesintheregeneratingcodetochecktheintegrityofthe
datainhostilenetworks.Unfortunately,theCRCcheckscanalsobemanipulatedbythe
maliciousnodes,resultinginthefailureoftheregenerationandreconstruction.In[80],the
authorsproposedtoadddataintegrityprotectionindistributedstorage.In[81],theauthors
proposedanerasure-codeddistributedstoragebasedonthresholdcryptography.In[82],the
authorsanalyzedthevcostforboththeclientreadandwriteoperationinwork-
loadswithidleperiods.In[83],theauthorsanalyzedtheerrorresilienceoftheRScode
basedregeneratingcodeinthenetworkwitherrorsanderasures.Theyprovidedthethe-
oreticalerrorcorrectioncapability.TheirresultisanextensionoftheMDScodetothe
regeneratingcodeandtheirschemeisunabletodeterminewhethertheerrorsinthenetwork
aresuccessfullycorrected.
8
1.2.4LimitationsofExistingWorkonDistributedStorageinHostileNetworks
ExistingWorkonDistributedStorageinHostileNetworkshasthefollowinglimitations:

Theerrordetection/correctioncapabilityisIfthereareafewerrors,therewill
beawasteofbandwidth.Iftherearetoomanyerrors,theerrorcorrectingprocesswill
failwithoutbeingdetected.

TheerrorcorrectioncapabilityislimitedbytheerrorcorrectioncapabilityoftheMDS
codes.
1.2.5withExistingWorkonSecureNetworkCommunication
Itisworthwhiletopointoutthatalthoughtherearestrongconnectionsbetweenregenerating
codeindistributedstorageandgeneralnetworkcommunicationofwhichsecurityproblems
havebeenwellstudied,ourproposedH-MSR/H-MBRcodesaretfromthesesecurity
studiesofnetworkcommunicatione.g.[84{86]inbothprinciplesandscopes.First,unlike
allthestudiesabove,theniceerrorcorrectioncapabilityoftheproposedH-MSR/H-MBR
codesisduetotheunderlyingHermitiancode[87].Second,theregeneratingcodesstudiedin
thisworkandthegeneralnetworkcommunicationarefundamentallytinthatbesides
theoveralldatareconstructiontheregeneratingcodesalsoemphasizetherepairofcorrupted
codecomponents(regeneration),whilegeneralnetworkcommunicationonlyfocusesondata
reconstruction.Noneoftheresearchesofgeneralsecurenetworkcommunicationstudiesthe
regenerationproblem.Thescopeofthisworkistfromthatofthoseresearches.
1.3ProposedResearchDirections
1.3.1DirectionsforCombatingPollutionAttacksinNetworkCoding
Fornetworks,insteadofdesigningthenetworkcodeswitherrorcorrectioncapability,
wefocusonthecharacterizingofthenetworkcodingsothatnetworkcodingcanbeviewed
9
fromtheperspectiveoferrorcontrolcoding.Inparticular,wewillanalyzetherelationship
betweenthenetworkcodingandtheerrorcontrolcodinginbothunicastandmulticastcases.
Wethealgebraicaspectsforthesetwocasesareessentiallyidentical.
Afterwehaveproventhateachnetworkcodingcanbetransferredintoanerrorcontrol
codeinabipartitegraphbyignoringthestructureoftheunderlyingerrorcontrolcoding,
wethentransferanetworkcodingschemeintoaseriesofcascadederrorcontrolcodes
byexploringtheinnerstructureofnetworkcoding.Thismappingenablesustoidentify
theminimumnumberofindependenterrorpatterninthecorrespondingnetworkleveland
identifythemaliciousnetworknodes.
Forrandomnetworks,weproposeanewschemethatcombineserror-detectionanderror-
correcting(EDEC)tocombatnetworkpollutionattacks.Originalmessagesymbolsare
encodedusingan(
n;k
)codethensentoutinpackets.Whenanintermediatenodedetects
anerror,insteadofdiscardingthepacket,theintermediatenodewillcontinuetoforwardit.
Aslongastheerrorsarewithinthedecodingcapability,thesinknodeswillbeabletorecover
thecorruptedpacket.InourLDPCbasedEDEC(LEDEC)scheme,wetreatthepacketsas
LDPCcodesatsinknodesandusethebeliefpropagationalgorithm(BPA)todecodethe
LDPCcode.Inthiscase,theLEDECschemecanmaintainthethroughputunchangedfor
moderatenetworkpollution.Itcanalsoguaranteeahighnetworkthroughputevenfora
heavilypollutednetworkenvironment,whilethethroughputbecomesverylowforallerror-
detectionbasedschemesinthiscase.Intheanalyses,wemainlyfocusonthethroughput
impactbroughtbytstrategies(discardvs.keep)towardscorruptedpackets.
1.3.2DirectionsforDistributedStorageinHostileNetworks
Fordistributedstorage,weproposeHermitiancodebasedregeneratingcodes:H-MSRcode
andH-MBRcode.WeconstructtheH-MSRcodebycombiningtheHermitiancodeand
regeneratingcodeattheMSRpoint,thenweconstructtheH-MBRcodebycombiningthe
HermitiancodeandregeneratingcodeattheMBRpoint.Thenweprovethatthesecodes
10
canachievethetheoreticalMSRboundandMBRboundrespectively.Wealsoproposedata
regenerationandreconstructionalgorithmsfortheH-MSRcodeandtheH-MBRcodein
botherror-freenetworksandhostilenetworks.
Moreover,inspiredbytheniceperformanceofHermitiancodebasedregeneratingcodes,
westepforwardtofurtherconstructoptimalregeneratingcodeswhichhavesimilarlayered
structurelikeHermitiancodeindistributedstorage.Weproposeasimpleoptimal
constructionof2-layerrate-matchedMSRcode.Weconductboththeoreticalanalysisand
performanceevaluationtoshowthatthiscodecanachievetheoptimalstorage.
Thenweproposeanoptimalconstructionof
m
-layerrate-matchedMSRcode.The
m
-layer
codecanachievetheoptimalerrorcorrection.
1.4Overviewofthedissertation
1.4.1MajorContributions
Themajorcontributionsofthisdissertationareasfollows:

Combatingpollutionattacksinnetworkcoding.
1.
Weprovideacomprehensiveanalysisandtheoreticalresultsontherelationships
betweenthenetworkcodingandtheerrorcontrolcoding.
2.
Weproposeamethodologytodesigntnetworkcodingschemesthatcan
combatnetworkerrorsandnetworkpollution.
3.
Wedevelopamethodologytomapeachnetworkcodingintoaseriesofcascaded
errorcontrolcodes.
4.
Weprovideanovelapproachtodesigntnetworkcodingschemesthatcan
combatpollutionattacksandlocatethemaliciousnodesbyutilizingtheinner
structureofthenetworkcode.
11

Combatingpollutionattacksinrandomnetworkcoding.
1.
OurproposedEDECschemeandtheLEDECschemecanmaintainthethroughput
unimpactedfornetworkenvironmentwithmoderatemaliciousattackswithonly
aslightincreaseincomputationaloverhead.
2.
Theproposedschemecanguaranteeahighthroughputevenforaheavilypolluted
networkenvironment.
3.
WeprovidecomprehensivethroughputanalysisoftheproposedEDECscheme
andtheLEDECscheme.
4.
Weconductextensivesimulationsusingns-2toevaluatetheperformanceof
theproposedschemesandcompareourschemeswiththeerror-detectionbased
schemes.

Distributedstorageinhostilenetworks|Hermitiancodebasedregeneratingcodes
1.
Theoreticalevaluationshowsthatourproposedschemescandetecttheerroneous
decodingswhileotherexistingworkcannot.
2.
Ourproposedschemescancorrectmoreerrorsinthehostilenetworkthanthe
RS-MSR/RS-MBRcodeswiththesamecoderates.
3.
OuranalysisalsodemonstratesthattheproposedH-MSR/H-MBRcodeshave
lowercomplexitythantheRS-MSR/RS-MBRcodesinbothcodesregeneration
andcodesreconstruction.

Distributedstorageinhostilenetworks|Optimalconstructionofregeneratingcodes
throughrate-matching
1.
Ourproposedoptimalconstructionof2-layerrate-matchedMSRcodecanachieve
theoptimalstorage,whichishigherthantheRS-MSRcodeproposed
in[83].
12
2.
Ourproposedoptimalconstructionof
m
-layerrate-matchedMSRcodecanachieve
theoptimalerrorcorrection,whichishigherthanthecodeproposed
in[83]andtheH-MSRcodeproposedin[88].Furthermore,the
m
-layeredcode
iseasiertounderstandandhasmoreythantheH-MSRcode.
1.4.2Structure
Thedissertationisstructuredasfollows.
Chapter2
introducesthepreliminaryforthisdissertation.Somebasicconceptsand
propertiesofnetworkcoding,errorcontrolcoding,regeneratingcodeandhermitiancodeare
presented.
Chapter3
ismainlyaboutcombatingpollutionattacksfornetworks.The
sectionstudiestherelationshipbetweennetworkcodinganderrorcontrolcoding.Thesecond
sectioncharacterizesnetworkcodingusingcascadederrorcontrolcoding.
Chapter4
ismainlyaboutcombatingpollutionattacksforrandomnetworks.The
system/adversarialmodelsandassumptionsarepresentedinthesection.Theproposed
EDECschemeandperformanceanalysisaredescribedinthesecondsection.Thethird
sectionpresentstheLDPCdecodingandanalysisoftheLEDECscheme.
Chapter5
ismainlyabouttheHermitiancodebasedregeneratingcodesindistributed
storageinhostilenetworks.Afterthesystem/adversarialmodelsandassumptionsarepre-
sented,ourproposedH-MSRcodeisdescribedandanalyzedinthesecondsection.The
proposedH-MBRcodeisdescribedandanalyzedinthethirdsection.Performanceanalysis
isconductedinthefourthsection.
Chapter6
ismainlyabouttheoptimalconstructionofregeneratingcodethroughrate-
matching.Thesectionpresentsthesystem/adversarialmodelsandassumptions.The
secondsectionproposestwocomponentcodesfortherate-matchedMSRcodes.Thethird
sectionproposesandanalyzesthe2-layerrate-matchedMSRcode.Thenthefourthsection
proposesandanalyzesthem-layerrate-matchedMSRcode.
13
Chapter7
summarizesthedissertation.
14
CHAPTER2
PRELIMINARY
Inthischapter,wewillpresentsomebasicconceptsandpropertiesofnetworkcoding,error
controlcoding,regeneratingcodeandhermitiancode.
2.1NetworkCoding
Inthisdissertation,weadoptthenotationsof[3].Anetworkisequivalenttoadirected
graph
G
=(
V;E
),where
V
representsthesetofverticescorrespondingtothenetworknodes
(sourcenodes,relaynodesandsinknodes)and
E
representsallthedirectededgesbetween
verticescorrespondingtothecommunicationlink.Thestartvertex
v
ofanedge
e
iscalled
thetailof
e
andwrittenas
v
=
tail
(
e
),whiletheendvertex
u
ofanedge
e
iscalledthe
headofof
e
andwrittenas
u
=
head
(
e
).Wethecapacityofanedgeasthenumber
ofbitsthatcanbetransmittedthroughtheedgeinonetimeunit.Sothecapacityshould
benon-negativeintegers.Inthisdissertation,wenormalizethecapacityofoneedgeto1.
Ifachannelbetweentwonodeshascapacity
C
largerthan1,wemodelthischannelas
C
multipleedgeseachwithcapacity1.Weassumethenetworkisdelay-free[3],thatisallthe
edgesinthegraphhavezerodelay.Andthenetworkisacyclic,thatisalltheverticesinthe
graphcanbeorganizedinanancestralordering.
Forasourcenode
u
,thereisasetofdiscreterandomprocessestobesent.Eachofthe
randomprocesscanberepresentedbyabinaryvectoroflength
m
,thatiseverysymbol
sequenceoftherandomprocessisfromthe
F
2
m
.Wewritethesetofthepro-
cessesas
X
(
u
)=
f
X
(
u;
1)
;X
(
u;
2)
;

;X
(
u;
(
u
))
g
,inwhich

(
u
)isthenumberofrandom
processesinnode
u
.Sincewenormalizethecapacityofeachedgeto1,itisreasonableto
normalizetherateoftherandomprocess
X
(
u;i
)(1

i


(
u
))to1.
15
Wecanwritealink
e
between
r
1
and
r
2
as
e
=(
r
1
;r
2
).Therandomprocess
Y
(
e
)onthe
link
e
isthefunctionofallthe
Y
(
e
0
)fromlinks
e
0
(suchthat
head
(
e
0
)=
r
1
)andtherandom
processes
X
(
r
1
)fromnode
r
1
.In
F
2
m
linearnetworkcoding[2],
Y
(
e
)canbewrittenas:
Y
(
e
)=

(
r
1
)
X
l
=1

l;e
X
(
v;l
)+
X
e
0
:
head
(
e
0
)=
r
1

e
0
;e
Y
(
e
0
)
;
(2.1)
inwhichtheencodingcots

l;e
;
e
0
;e
2F
2
m
.
Forasinknode
v
,thereisalsoasetofdiscreterandomprocessestobeobserved.We
writethesetoftheprocessesas
Z
(
v
)=
f
Z
(
v;
1)
;Z
(
v;
2)
;

;Z
(
v;
(
v
))
g
,inwhich

(
v
)is
thenumberofrandomprocessesobservedinnode
v
.Inlinearnetworkcoding,
Z
(
v;j
)can
bewrittenas:
Z
(
v;j
)=
X
e
0
:
head
(
e
0
)=
v

e
0
;j
Y
(
e
0
)
;
(2.2)
inwhichtheencodingcots

e
0
;j
2F
2
m
.
Aconnectionbetweenasourcenode
u
andasinknode
v
canbewrittenas
C
=
(
u;v;
X
(
u
)).Fromtheassumptionsanddeductionsabove,therateofthisconnection
R
(
C
)
isequalto
jX
(
u
)
j
,where
j
x
j
isthecardinalityoftheset
x
.Aslongaswecanretrieve
X
(
u
)
from
Z
(
v
),wesaythatthisconnectionispossible.Becauseweapplythelinearencoding
inthenetwork,wecanthesystemtransfermatrix
M
betweeninput
x
andoutput
z
.
Ifwewrite
x
=(
X
(
u;
1)
;X
(
u;
2)
;

;X
(
u;
(
u
)))and
z
=(
Z
(
v;
1)
;Z
(
v;
2)
;

;Z
(
v;
(
v
))),
wehave
z
=
x
M
.
Inthisdissertation,sinceweareonlyconcernedaboutthisrelationshipineachsingle
encodingperiod,wesimplywritetherandomprocesses
X
(
u;i
)
;Y
(
e
)
;Z
(
v;j
)asrandom
numbers
x
u;i
;y
e
;z
v;j
.
2.2ErrorControlCoding
Inthissection,wepresentthepreliminaryforerrordetection,whichisthebaseoftheNull
KeyschemeandtheproposedEDECscheme.Errorcorrection,whichisalsothebaseofthe
16
proposedEDECscheme,isalsopresented.
2.2.1ErrorDetection
Supposetheoriginalmessagesymbolsareinthe
k
-dimensionallinearspaceover
GF
(2
k
).
Afterweencodethesymbolsusingageneratingmatrix
G
k

n
froman(
n;k
)blockcode,the
encodedcodewordswillformalinearsubspaceover
GF
(2
n
)ofdimension
k
.Sotherewill
beanother
n

k
dimensionalsubspaceoverthe
n
dimensionalspace,whichisorthogonal
tothecodewordssubspace.Ifwedenoteavalidcodewordby
c
andthebasesforthe
n

k
dimensionalsubspaceby
h
1
;:::;
h
n

k
,wehave
<
c
;
h
i
>
=0
;
1

i

n

k
,where
<

;

>
representstheinnerproduct.
Let
H
(
n

k
)

n
=[
h
1
;:::;
h
n

k
]
T
,then
H
formstheparity-checkmatrixofthecodewords
andwehave
c

H
T
=
0
:
(2.3)
Suppose
r
=
c
+
e
isareceivedcodeword,where
e
isan
n
-tupleerrorgeneratedbya
maliciousnode.Forthereceivedword
r
,accordingtoequation(2.3),
c
isorthogonalto
H
,
therefore,wehave:
r

H
T
=(
c
+
e
)

H
T
=
c

H
T
+
e

H
T
=
e

H
T
:
(2.4)
Forareceivedword
r
,therearetwopossibilities:(i)
e
isacodewordgeneratedby
G
but
tfromtheoriginalcodeword.Inthiscase,though
r
containserror,however,because
r

H
T
=0,theerrorisundetectableusingconventionalerrorcontrolcodingtechniques;(ii)
e
containsanonzeroprojectiontotheorthogonalparitychecksubspace,then
r

H
T
6
=0.
Inthiscase,wecandetectthatthereceivedwordcontainserror.
Innetworkcoding,suppose
c
1
;:::;
c
i
arevalidcodewords,
c
=
P
j
c
j

b
j
isalinear
combinationofthecodewords
c
1
;:::;
c
i
,where
b
j
isthenetworkencodingcotsand
17
hasthevalue0or1.Itcanbeeasilyvthat
c

H
T
=
X
j
c
j

H
T

b
j
=
0
:
(2.5)
Equation(2.5)isthetheoreticalfoundationforerrorcontrolcodingtobeusedinnetwork
coding.Therowsof
H
arecalled
NullKeys
in[44].Bycheckingpacketsymbolsatevery
node,thereisahighprobabilitythattheNullKeyschemecandetectthepollutedpackets
afterafewhopsoftransmission.However,the`check-and-dump'strategymayresultina
verylowcommunicationundercontinuousnetworkpollutionandpacketcorruption
attacks.
2.2.2ErrorCorrection
Equation(2.4)iscalledthe
syndrome
oferrorpattern
e
,denotedas
s
.Itisclearthat
r
is
acodewordifandonlyif
s
=0.Thetaskofmaximumlikelihooddecodingistothe
minimumweighterrorpattern
e
suchthat
r

H
T
=
e

H
T
.Inthiscase,thereceived
r
is
correctedto
r
+
e
=
c
.
Inlinearnetworkcoding,althoughthepacketsymbolsarenottheoriginalonessentfrom
sourcenodes,wecanstillperformerrorcorrectionusingequation(2.4).
Suppose
r
1
;:::;
r
i
arethereceivedcodewordsfrom
i
incomingedges,
e
istheerrorvector
addedtothenetworkcoding
r
=
P
j
r
j

b
j
.Iftheerroriswithinthecorrectioncapabilityof
the(
n;k
)code,thesyndromewillstillbe
r

H
T
=
e

H
T
+
P
j
r
j

H
T

b
j
=
e

H
T
+
0
=
e

H
T
.
Thenwecancorrecttheerrorusingsyndromedecoding.
IntheproposedEDECscheme,thecorruptedpacketsdetectedattheintermediatenodes
willnotbedumped.Boththeintactandcorruptedpacketswillbegatheredbythesink
nodes.Thesinknodescancorrectthecorruptedpacketsandhaveahighercommunication
thantheerror-detectionbasedschemes.
18
2.2.3SomePropertiesofErrorControlCodes
Theerrordetectionandcorrectioncapabilitiesaredecidedbythe(
n;k
)codestructure.We
canadoptappropriateerrorcontrolcodesaccordingtothepollutionlevelsofthenetwork.
Inthissection,somepropertiesoferrorcontrolcodeswillbepresented.
Theorem2.1.
(Singletonbound[89])
Foran
(
n;k
)
blockcodewiththeminimumdistance
d
,thefollowingrelationshipholds:
k
+
d

n
+1
.
Theminimumdistance
d
isastheminimumhammingdistanceforanytwodistinct
codewords
x
and
y
of
C
:
d
min
=
min
f
d
(
x
;
y
)
j8
x
;
y
2
C
g
,where
d
(
x
;
y
)isthenumberof
positionsatwhichthecorrespondingbitsaretbetween
x
and
y
.Foran(
n;k
)block
codewithminimumdistance
d
,ifwedeletethe
d

1bitsofeverycodewords,allthe
codewordsarestilldistinct.Sothereareatmost2
n

(
d

1)
codewords.Thetotalnumberof
originalmessagesymbolsisatmost2
k
anditcannotbebiggerthanthenumberofpossible
codewords:2
k

2
n

(
d

1)
.
Theorem2.2.
([89])
Foran
(
n;k
)
blockcodewiththeminimumdistance
d
,itcandetect
allthe
d

1
orlesserrors,oritcancorrectallthe
j
d

1
2
k
orlesserrors.
Accordingtotheofminimumhammingdistance,allcodewordswithinthe
distance
d

1orshorterofavalidcodewordareinvalid.Soallthe
d

1orlesserrors
canbedetected.Suppose
x
and
y
aretwovalidcodewordswiththeminimumhamming
distanceand
z
isacorruptedversionofcodeword
x
with
t
errors.Thatis
d
(
x
;
z
)=
t
.Ifwe
wanttocorrect
z
to
x
,wemusthave
d
(
y
;
z
)
>t
.Byusingthetriangleinequity,wehave
d
(
y
;
z
)

d
(
x
;
y
)

d
(
x
;
z
)=
d

t
.Wecanensure
d
(
y
;
z
)
>t
bymaking
d

t>t
.Thatis
2
t<d
,
t
max
=
j
d

1
2
k
.
Theorem2.3.
([89])
Alinearcodeiscapableofcorrecting

orfewererrorsandsimulta-
neouslydetecting
˝
(
˝>
)
orfewererrorsifitsminimumdistance
d
min


+
˝
+1
.
19
Figure2.1Illustrationofregeneratingcode
Proof.
Since
˝>
,wehave
d
min


+
˝
+1
>
2

+1,
<
j
d

1
2
k
.Sowecancorrect

or
fewererrors.Suppose
x
and
y
aretwovalidcodewordswiththeminimumhammingdistance
d
min
and
z
isacorruptedversionofcodeword
x
with
˝
errors.Inordertoavoidthewrong
correction,wemustmakesurethat
d
(
y
;
z
)
>
.Accordingtothetriangleinequity,wehave
d
(
y
;
z
)

d
(
x
;
y
)

d
(
x
;
z
)=
d
min

˝
.Wecanensure
d
(
y
;
z
)
>
bymaking
d
min

˝>
.
Thatis
d
min

˝
+

+1.
2.3RegeneratingCode
Regeneratingcodeintroducedin[56]isalinearcodeover
GF
q
withasetofparameters
f
n;k;d;;;B
g
.Aofsize
B
isstoredin
n
storagenodes,eachofwhichstores

symbols.Areplacementnodecanregeneratethecontentsofafailednodebydownloading

symbolsfromeachof
d
randomlyselectedstoragenodes.Sothetotalbandwidthneededto
regenerateafailednodeis

=

.Thedatacollector(DC)canreconstructthewholeby
downloading

symbolsfromeachof
k

d
randomlyselectedstoragenodes.Anillustration
ofregeneratingcodeisshowninFigure2.1.
20
In[56],thefollowingtheoreticalboundwasderived:
B

k

1
X
i
=0
min
f
;
(
d

i
)

g
:
(2.6)
Fromequation(2.6),abetweentheregenerationbandwidth

andthestoragere-
quirement

wasderived.

and

cannotbedecreasedatthesametime.Therearetwo
specialcases:minimumstorageregeneration(MSR)pointinwhichthestorageparameter

isminimized;
(

MSR
;
MSR
)=

B
k
;
Bd
k
(
d

k
+1)

;
(2.7)
andminimumbandwidthregeneration(MBR)pointinwhichthebandwidth

isminimized:
(

MBR
;
MBR
)=

2
Bd
2
kd

k
2
+
k
;
2
Bd
2
kd

k
2
+
k

:
(2.8)
2.4HermitianCode
AHermitiancurve
H
(
q
)over
GF
(
q
2
)incoordinatesisby:
H
(
q
):
y
q
+
y
=
x
q
+1
:
(2.9)
Thegenusof
H
(
q
)is
%
=(
q
2

q
)
=
2andthereare
q
3
pointsthatsatisfyequation(2.9),denoted
as
P
0
;
0
;

;P
0
;q

1
;

;P
q
2

1
;
0
;

;P
q
2

1
;q

1
(SeeTable2.1),where

0
;
1
;

;
q

1
are
the
q
solutionsto
y
q
+
y
=0and
˚
isaprimitiveelementin
GF
(
q
2
).
L
(
mQ
)isas:
L
(
mQ
)=
f
f
0
(
x
)+
yf
1
(
x
)+

+
y
q

1
f
q

1
(
x
)
j
deg
f
j
(
x
)
<
(
j
)
;j
=0
;
1
;

;q

1
g
;
(2.10)
where

(
j
)=max
f
t
j
tq
+
j
(
q
+1)

m
g
+1
;
(2.11)
for
m

q
2

1.AcodewordoftheHermitiancode[87]
H
m
isas(
%
(
P
0
;
0
)
;

;%
(
P
0
;q

1
)
;

;%
(
P
q
2

1
;
0
)
;

;%
(
P
q
2

1
;q

1
)),where
%
2
L
(
mQ
).Thedimensionofthemessagebe-
foreencodingcanbecalculatedasdim(
H
m
)=
P
j
=
q

1
j
=0
(deg
f
j
(
x
)+1).Asoft-decisionlist
21
decodingalgorithmforHermitiancodeswasproposedin[90].In[87],anovelapproach
fordecodingHermitiancodeswithbursterrorswasproposed.Somegoodpropertiesof
Hermitiancodeswerestudiedin[91].
22
Table2.1
q
^3rationalpointsoftheHermitiancurve
P
0
;
0
=(0
;
0
)
P
1
;
0
=(1
;˚
+

0
)

P
q
2

1
;
0
=(
˚
q
2

2
;˚
(
q
2

2)(
q
+1)+1
+

0
)
P
0
;
1
=(0
;
1
)
P
1
;
1
=(1
;˚
+

1
)

P
q
2

1
;
1
=(
˚
q
2

2
;˚
(
q
2

2)(
q
+1)+1
+

1
)
.
.
.
.
.
.
.
.
.
.
.
.
P
0
;q

1
=(0
;
q

1
)
P
1
;q

1
=(1
;˚
+

q

1
)

P
q
2

1
;q

1
=(
˚
q
2

2
;˚
(
q
2

2)(
q
+1)+1
+

q

1
)
23
CHAPTER3
COMBATINGPOLLUTIONATTACKSFORFIXEDNETWORKS
Inthischapter,Wethatnetworkcodinganderrorcontrolcodingareessentiallyidentical
inalgebraicaspects.Wewillprovideanovelmethodologytocharacterizelinearnetwork
codingthrougherrorcontrolcodingfornetworkcoding.Ourmainideaistorepresent
eachlinearnetworkcodingwithanerrorcontrolcoding.Wewillprovidecomprehensive
theoreticalanalysisontherelationshipsbetweenlinearnetworkcodinganderrorcontrol
codinginbothunicastandmulticastscenarios.
Meanwhile,ourresearchprovidesanewapproachtounderstandnetworkcodingschemes
andanovelmethodologytodevelopnetworkcodingschemesthatcancombatnodecompro-
misingattacksandlocatethemaliciousnodes.Wewillcharacterizealinearnetworkcoding
throughaseriesofcascadedlinearerrorcontrolcodes.Thisrepresentationenablesusto
determinetheindependentsourceoferrorsinthecascadednetworklevel.Itcouldleadto
asuccessfuldecodingoftheoriginalmessageandcouldhelplocatingthemaliciousnetwork
nodes.Wewillprovidecomprehensivetheoreticalanalysisonnetworkcodinginbothunicast
andmulticastscenarios.
3.1CharacterizationofLinearNetworkCodingforPollutionAt-
tacks
3.1.1ModelsandAssumptions
Inthissection,ourmainideaistocharacterizeandclassifynetworkcodingaccordingtothe
underlyingerrorcontrolcoding.Weonlyneedtolimitourconsiderationtolinearnetwork
codesin
F
2
,whichmakesthecorrespondingerrorcontrolcodessimplebinaryblockcodes.
Inthiscasetheencodingcotscanonlybe0or1andtheadditionoperationequals
24
exclusiveor.
3.1.2AnIllustrativeExample
Inthissection,wewillillustrateourmainideausingtheclassicexample[1]showninFig-
ure3.1.Inthisexample,sourcenode1multicaststwosymbols
x
1
;
1
;x
1
;
2
tosinknodes6and
7.Byencodingatnode4,bothnodes6and7canretrievethetwosymbolssuccessfully.To
explainourmainidea,wewillonlyfocusonthecommunicationbetweennode1andnode
6(theshadedareainFigure3.1).Theanalysisissimilartothecommunicationbetween
node1andnode7.Inthiscommunication,symbol
x
1
;
1
ispasseddirectlythroughthepath
e
1

e
5
.Sowecanmergetheedges
e
1
and
e
5
togetherinFigure3.2(a):node1send
x
1
;
1
directlytonode6intheequivalentbipartitegraph.Meanwhile,inFigure3.2(b),
x
1
;
1
and
x
1
;
2
arepassedseparatelytonode4through
e
1

e
3
and
e
2

e
4
,then
x
1
;
1
+
x
1
;
2
ispassed
through
e
6

e
8
afterbeingencodedatnode4.Sowecanmergetheedges
e
1
;e
3
together,
e
2
;e
4
togetherand
e
6
;e
8
togetherinthestepofFigure3.2(b):node1sends
x
1
;
1
;x
1
;
2
directlytonode4andnode4sends
x
1
;
1
+
x
1
;
2
directlytonode6.Inthesecondstep,wecan
ignorenode4andputtheoperation
x
1
;
1
+
x
1
;
2
innode6:node1sends
x
1
;
1
;x
1
;
2
directly
tonode6andnode6addsthetwosymbolstogetherintheequivalentbipartitegraph.
UsingtheprocessesshowninFigure3.2,wetransferthisnetworkcodingproblemintoa
bipartitegraphshowninFigure3.3.Inthiswaywecangettheexplicitrelationshipbetween
symbolsofnode1andsymbolsofnode6.Ifweview
x
1
;
1
;x
1
;
2
asoriginalmessageand
z
1
;
1
;z
1
;
2
asthecodewordinanerrorcontrolcode,wecanviewthisnetworkcodeasa(2
;
2)
errorcontrolcodewiththegeneratormatrix
2
6
4
11
01
3
7
5
:
Althoughinthisexample,thereisnoredundancyinthe(2
;
2)errorcontrolcodeand
thiscodecannotdetectorcorrecterrors,itisttoshowthatnetworkcodecanbe
characterizedusingerrorcontrolcode.
Intheexamplesbelow,wewillshowthatnetworkcodeswithredundanciescanbetrans-
25
Figure3.1Exampleforillustratingthemainidea
Figure3.2Theprocessesoftransferringnetworkcodeintobipartitegraph
Figure3.3ThecorrespondingbipartitegraphofFigure3.1
26
Figure3.4Equivalenceofthreekindsofnodesinnetworkcoding
ferredintoerrorcontrolcodes.Meanwhile,theredundanciesofnetworkcodescanbeadded
accordingtotheerrorcontrolcodes.Andwecancharacterizenetworkcodingusingerror
controlcoding.
3.1.3RelationshipbetweenNetworkCodingandErrorControlCodinginPoint-
to-PointCommunication
Inthispart,wewillformallystatetherelationshipbetweennetworkcodinganderrorcon-
trolcodinginthepoint-to-pointcommunication.Theciencyisstudiedthenthe
necessity.
3.1.3.1The
Theorem3.1.
Everynetworkcodeschemecanberepresentedbyanerrorcontrolcode.
Proof.
Allthenodesinnetworkcodingcanbecategorizedintothreetypes:simpleforward,
multicastandcodethenforward(showninFigure3.4).Sowecantransferthegraph
representingthenetworkcodingusingthreeoperationsaccordingly:
1.
Simpleforward
:Thesenodesdonotencodetheincomingsymbols.Theysimply
27
forwardwhatevermessagetheyhavereceived.Inthissituation,wecanreplacethe
nodeswithdirectlinks.
2.
Multicast
:Likesimpleforward,thesenodesdonotencodetheincomingsymbols
either.Theysimplymulticastthemessagethattheyhavereceived.Inthissituation,
wecanreplacethenodeswithmultipledirectlinks.
3.
Codethenforward
:Thesenodesproducethelinearcombinationoftheincoming
symbols
x
1
;x
2
;

;x
k
.Accordingtotheencodingcots,
m
outof
k
received
symbolswillbeaddedtogethertoformthenewsymbol
x
l
1
+
x
l
2
+

+
x
l
m
tobe
forwarded.Wecanviewthiskindofnodeas
m
parallelnodes
v
l
1
;v
l
2
;

;v
l
m
,eachof
whichhasonlyoneinput.Thesymbols
x
l
1
;x
l
2
;

;x
l
m
willbedirectlyforwardedto
thesinknode
u
.Andthesinknodewillcompletetheadditionoperation.Thereforewe
cantransfercodethenforwardnodesbysplittingmultipleinputsintomultiplesimple
forwardnodes.Thenwecanfurthersimplifythemultiplesimpleforwardnodesasin
'1)'.
Becausethenetworkcodingislinear,thethreeoperationsarecommutativeandhavethe
superpositionproperty.Wecanalwaysperformtheoperationstoalloftheintermediate
nodesinthenetworkandreplacethenodeswithsimplelinks.Atlastwecangetabipartite
graphconsistingofonlysymbolsinthesourcenodeandencodedsymbolsinthesinknode,
whichcanberepresentedusinganerrorcontrolcodecorrespondingtothebipartitegraph.
TakethenetworkcodeinFigure3.5asanexample.Thesourcenode1transmitsthree
symbols
x
1
;
1
;x
1
;
2
;x
1
;
3
tosinknode4inthisnetworkcode.Andsinknode4canreceive
6encodedsymbols,whichindicatesthatthereareredundanciesinthisnetworkcoding.
FollowingtheoperationsmentionedintheproofofTheorem3.1,wecangetthecorresponding
bipartitegraphshowninFigure3.6,whichindicatesthisisa(6
;
3)errorcontrolcode.The
28
Figure3.5Anexampleofpoint-to-pointnetworkcoding
Figure3.6ThecorrespondingbipartitegraphofFigure3.5
generatormatrixis:
G
=
2
6
6
6
6
4
100110
010101
001011
3
7
7
7
7
5
:
(3.1)
Thiscodehasaminimumhammingdistance3.Soitcandetectandcorrect1biterror.
Thecanalsobevalidatedbythesystemtransfermatrix
M
.Toshowthis,
wewillstillusethenetworktopologyshowninFigure3.5,butwithtencoding
29
cots.Thesymbolsoneachedgecanbewrittenas:
y
e
1
=

1
;e
1
x
1
;
1
+

2
;e
1
x
1
;
2
+

3
;e
1
x
1
;
3
y
e
2
=

1
;e
2
x
1
;
1
+

2
;e
2
x
1
;
2
+

3
;e
2
x
1
;
3
y
e
3
=

1
;e
3
x
1
;
1
+

2
;e
3
x
1
;
2
+

3
;e
3
x
1
;
3
y
e
4
=

1
;e
4
x
1
;
1
+

2
;e
4
x
1
;
2
+

3
;e
4
x
1
;
3
y
e
5
=

e
3
;e
5
y
e
3
+

e
4
;e
5
y
e
4
y
e
6
=

e
1
;e
6
y
e
1
+

e
2
;e
6
y
e
2
+

e
5
;e
6
y
e
5
(3.2)
y
e
7
=

e
1
;e
7
y
e
1
+

e
2
;e
7
y
e
2
+

e
5
;e
7
y
e
5
y
e
8
=

e
1
;e
8
y
e
1
+

e
2
;e
8
y
e
2
+

e
5
;e
8
y
e
5
y
e
9
=

e
1
;e
9
y
e
1
+

e
2
;e
9
y
e
2
+

e
5
;e
9
y
e
5
y
e
10
=

e
3
;e
10
y
e
3
+

e
4
;e
10
y
e
4
y
e
11
=

e
3
;e
11
y
e
3
+

e
4
;e
11
y
e
4
:
Thesymbolsatthesinknodecanbewrittenas:
z
4
;j
=
i
=11
X
i
=6

e
i
;j
y
e
i
;
(1

j

6)
:
(3.3)
matrices
A;B
asin[3]:
A
=
2
6
6
6
6
4

1
;e
1

1
;e
2

1
;e
3

1
;e
4

2
;e
1

2
;e
2

2
;e
3

2
;e
4

3
;e
1

3
;e
2

3
;e
3

3
;e
4
3
7
7
7
7
5
;
(3.4)
B
=
2
6
6
6
6
4

1
;e
1


1
;e
6
.
.
.
.
.
.
.
.
.

6
;e
1


6
;e
6
3
7
7
7
7
5
:
(3.5)
30
Wealsoamatrix
=
2
6
6
6
6
6
6
6
4

1
;
6

1
;
7

1
;
8

1
;
9
00

2
;
6

2
;
7

2
;
8

2
;
9
00

5
;
6

3
;
5

5
;
7

3
;
5

5
;
8

3
;
5

5
;
9

3
;
5

3
;
10

3
;
11

5
;
6

4
;
5

5
;
7

4
;
5

5
;
8

4
;
5

5
;
9

4
;
5

4
;
10

4
;
11
3
7
7
7
7
7
7
7
5
;
(3.6)
here

e
i
;e
j
iswrittenas

i;j
forshort.Thenthesystemmatrixis:
M
=
A


B
T
:
(3.7)
Inthisexample,thesizesofmatrices
A;B
are3

4and6

6.Thusthesizeoftransfer
matrixis3

6.Theoriginalsymbols
x
1
;
1
;x
1
;
1
;x
1
;
3
canbeseenasanoriginalmessageof
length3.Andthereceivedsymbolsatsinknodecanbeseenasacodewordoflength6.It
isappropriatethatweidentifythe3

6transfermatrix
M
withthegeneratormatrix
G
of
a(6
;
3)errorcontrolcode.HenceTheorem3.1isvfromtheperspectiveoftransfer
matrix.
3.1.3.2TheNecessity
Wehaveprovedthatanynetworkcodecanbeviewedasanerrorcontrolcode,nowwewill
considerthereverseproblem.Forapoint-to-pointcommunication,anetworkcodeisfeasible
onlyifitcansuccessfullydeliverallthedesiredsymbolsfromthesourcenodetothesink
node.Theorem3.2showsthecriterionofafeasiblenetworkcode.
Theorem3.2.
Foralinearnetworkwithsourcenode
u
,sinknode
v
andadesiredconnection
C
=(
u;v;
X
(
u
))
,thepoint-to-pointconnection
C
ispossibleifandonlyifthedeterminant
ofthe
R
(
C
)

R
(
C
)
transfermatrix
M
isnonzero.
Theproofofthistheoremcanbefoundin[3].
SincetherearenoredundanciesinthenetworkcodeinTheorem3.2,thedimensionof
symbolsinsourcenodeis
R
(
C
)=
jX
(
u
))
j
,andthedimensionofreceivedsymbolsinsink
nodeisalsoexactly
R
(
C
).Therefore,thesizeofthetransfermatrix
M
is
R
(
C
)

R
(
C
).
31
Theorem3.3.
Foralinearnetworkwithsourcenode
u
,sinknode
v
andadesiredconnection
C
=(
u;v;
X
(
u
))
,A(n,k)errorcontrolcodewiththe
k

n
generatormatrix
G
canbeseen
asafeasiblenetworkcodeinthepoint-to-pointconnection
C
ifwehavetherelationship:
k

R
(
C
)
.
Proof.
Ifthetheoremistruefor
k
=
R
(
C
),itwillalsoworkfor
k>R
(
C
).Itisstraightfor-
wardthatanetworkcodecansuccessfullycompletethepoint-to-pointconnectionofwhich
therateislowerthanthecode'smaximumcapacity.Soweonlyneedtoprovethecasewhen
k
=
R
(
C
).
Fromthestatementsinpreliminarysection,wehave
k

n
fora(
n;k
)errorcontrolcode.
Foramessagesequences(
x
1
;x
2
;

;x
k
),wehaveencodeitasfollows:
(
z
1
;z
2
;

;z
n
)=(
x
1
;x
2
;

;x
k
)
G:
(3.8)
Because
k

n
,wecanchoose
k
independentcolumns(
l
1
;l
2
;

;l
k
)from
G
toformanew
matrix
G
0
,whichhastherelationship
(
z
l
1
;z
l
2
;

;z
l
k
)=(
x
1
;x
2
;

;x
k
)
G
0
:
(3.9)
Inthiscase,
G
0
isa
k

k
fullrankmatrixwithnonzerodeterminant.Ifweview(
x
1
;x
2
;

;x
k
)
assymbolsatsourcenode
u
withtherate
R
(
C
)=
k
and(
z
l
1
;z
l
2
;

;z
l
k
)assymbolsre-
ceivedatsinknode
v
,then
G
0
isthetransfermatrixofthenetworkcode.Accordingto
Theorem3.2,thispoint-to-pointconnection
C
withrate
R
(
C
)=
k
ispossible.Sointhis
case,the(
n;k
)errorcontrolcodecanbeseenasafeasiblenetworkcode.
Inthecasethatthesizeoftransfermatrixislargerthan
R
(
C
)

R
(
C
),therepresented
errorcontrolcodewillhaveredundancieswhichcanbeusedtocontrolerrors.However,
basedonouranalysis,wecanaddredundanciesappropriatelysothatthenetworkcodeis
capableofdetectingandcorrectingerrors.Thiscanbedoneintwosteps:
32
1.
Accordingtothecommunicationchannelandthedesignrequirements(numberoferrors
todetectorcorrect,biterrorrate,etc.),determineanappropriatecoderate
k=n
and
thetypeoftheerrorcontrolcode(Hammingcode,Cycliccode,etc).
2.
Accordingtothesourcerate
R
(
C
),chooseproper
k
suchthat
k

R
(
C
)and
n
,and
derivethecorrespondinggeneratormatrix
G
.Thenapplythegeneratormatrix
G
as
thesystemtransfermatrixtothenetworkcoding.
3.1.3.3ApplicationinCombatingpollutionattacks
Forexample,inalinearnetworkshowninFigure3.7,thesourcenodeisgoingtosend4
symbols
x
1
;x
2
;

;x
4
tosinknode.AccordingtoTheorem3.3,wecanapplythe(7
;
4)
Hammingcodewithgeneratormatrix
G
=
2
6
6
6
6
6
6
6
4
1000110
0100101
0010011
0101010
3
7
7
7
7
7
7
7
5
:
(3.10)
ThecorrespondingnetworkcodeisalsoshowninFigure3.7.Becausetheminimumdistance
ofthecodeis3,thisnetworkcodecancorrect1biterror.Supposethesourcenodesends4
symbols(1
;
0
;
1
;
0),theexpectedreceivedsymbolswillbe(1
;
0
;
1
;
0
;
0
;
0
;
0).However,because
themaliciousnode
M
changesthesymbol,thereceivedsymbolswillbe
s
=(1
;
0
;
1
;
0
;
1
;
0
;
0).
Sinknodecandecodethereceivedsymbolusingthesyndrome-decodingmethod.Theparity-
checkmatrixinsinknodeis
H
=
2
6
6
6
6
4
1101100
1011010
0111001
3
7
7
7
7
5
:
(3.11)
Withthesyndromeofthereceivedsymbolscalculatedas
s

H
T
=(0
;
0
;
1),thesinknodecan
theerrorpattern(0
;
0
;
0
;
0
;
1
;
0
;
0)andcorrecttheerroneoussymbol.Fromthelocation
33
Figure3.7Implementthe(7
;
4)Hammingcodeinnetworkcoding
oftheerroneoussymbol,thesinknodecanalsothemaliciousnodefromwhich
x
1
+
x
2
istransmitted.
Fromthisexample,wecanseethatwithproperdesignoferrorcontrolcode,wecanmake
thecorrespondingnetworkcodecapableofdetectingandcorrectingerrorsthenthe
maliciousnodes.
3.1.4MulticastCase
Inprevioussection,westudytherelationshipbetweennetworkcodinganderrorcontrol
codinginpoint-to-pointcommunicationcase.Wecanderivesimilarresultsforthemulticast
case,wherethenetworkconsistsofonesourcenode
u
andseveralsinknodes
v
1
;v
2
;

;v
N
.
Thenetworkcodeformulticastingisfeasibleifandonlyifallthesinknodescanreceiveall
symbols
X
(
u
)sentfromsourcenode
u
.
3.1.4.1The
Theorem3.1stillholdsinmulticastcasebecausewedonotspecifywhetherthecommuni-
cationisunicastormulticastintheproofofthetheorem,whichindicatestheproofofthe
theoremisindependentofthetypeofthecommunications.
34
3.1.4.2TheNecessity
Themulticastproblemcanbedividedinto
N
unicastproblems:
C
=(
C
1
;C
2
;

;C
N
)=((
u;v
1
;
X
(
u
))
;
(
u;v
2
;
X
(
u
))
;

;
(
u;v
N
;
X
(
u
)))
:
(3.12)
Ifwewriteallthereceivedsymbolstogether:
z
=(
z
1
;z
2
;

;z
N
)=(
Z
(
v
1
;
1)
;

;Z
(
v
1
;
(
v
1
))
;

;Z
(
v
N
;
(
v
N
)))
;
(3.13)
wecanobtainthesystemtransferequationforthewholenetwork:
z
=
x
M
,inwhich
M
is
amatrixdas
M
=
jX
(
u
)
j
i
=
N
X
i
=1

(
v
i
)
:
(3.14)
Itisobviousthat
M
=

M
1
j
M
2
jj
M
N

;
(3.15)
inwhich
M
1
;M
2
;

;M
N
arethesystemtransfermatrixesforeachunicast
C
1
;C
2
;

;C
N
.
Theorem3.4.
Foralinearnetworkwithsourcenode
u
,sinknode
v
1
;v
2
;

;v
N
anddesired
connections
C
i
=(
u;v
i
;
X
(
u
))(1

i

N
)
,Aconcatenationof
N
errorcontrolcodeswith
the
k

n
i
generatormatrix
G
i
canbeseenasafeasiblenetworkcodeinthemulticastproblem
C
=(
C
1
;C
2
;

;C
N
)
ifwehavetherelationship:
k

R
(
C
i
)
.
Proof.
ThistheoremisanaturalextensionofTheorem3.3.If
k

R
(
C
i
),allthegenerator
matrix
G
i
canbeseenasfeasiblenetworkcodesinunicastproblem
C
i
.Sotheconcatenation
of
G
i
canbeseenasafeasiblenetworkcodeinmulticastproblem
C
.
3.2ACascadedErrorControlCodingApproach
3.2.1ModelsandAssumptions
Ifarelaynode
r
iscompromised,thesymbolstransmittedoneachedge
e
suchthat
head
(
e
)=
r
willbemoThenodesafternode
r
willbepollutedbecauseofthenetworkencoding.
35
Eventuallythesinknodewillreceivemoreerroneoussymbolsthanthoseoriginallybrought
bythemaliciousnode.Inthissection,wetrytoexploretheinnerstructureofthenetwork
codetocorrecttheerrorsandlocatethemaliciousnode.Wewillpartitionthenetworkinto
severalcascadedlevelsandexploretheinnerstructureofthenetworkcode,thuswemustbe
abletocorrectlyaccesstheoutputsofalltherelaynodes.Torealizethis,weaddaspecial
monitornodeinthenetwork.Thisnodecancollecttheoutputencodedmessagesfromall
therelaynodesandcanneverbecompromised.
3.2.2AnIllustrativeExample
Letusexaminetheclassicexample[1]againshowninFigure3.1.Byencodingatnode
4,bothnodes6and7canretrievethetwosymbols
x
1
;
1
;x
1
;
2
successfully.InSection3.1,
wemergedtheintermediatenodesandpathsandtransferredthethenetworkcodeintoa
bipartitegraph.Whileinthissection,wetrytoexplorethenetworkcodetoexhibitthe
innerstructureofthenetworkcode.Toexplainourmainidea,wewillonlyfocusonthe
communicationbetweennode1andnode6(theshadedareainFigure3.1).Theanalysisis
similartothecommunicationbetweennode1andnode7.Inthiscommunication,symbol
x
1
;
1
ispassedtonode2,node4,node5andnode6throughonehop,twohops,threehops
andtwohopsrespectively,andsymbol
x
1
;
2
ispassedtonode3,node4,node5andnode6
throughonehop,twohops,threehopsandfourhopsrespectively.AsshowninFigure3.8,if
weaddtwovirtualnodes
v
1
and
v
2
onedge
e
5
,wecanmake
x
1
;
1
passedtonode6through
fourhops,thusturnalloftheintermediatenodesinto3cascadedlevels.Eachofthelevel
canbeseenasasinglenetworkcode,sowecanrepresenteachlevelusingthebipartite
graphshowninFigure3.9accordingto[92].Inthisway,weexploretheinnerstructure
oftheoriginalnetworkcode,whichisdeterminedbythenetworktopology.Theoriginal
networkcodecanbeviewedas3cascadederrorcontrolcodeswiththegeneratormatrices
36
Figure3.8TransferthenetworkcodingschemeinFigure3.1intoa3-levelcascadedcoding
byadding2virtualnodes.
Figure3.9Thecorrespondingbipartitegraphsof3cascadedlevelsinFigure3.8
2
6
4
110
001
3
7
5
;
2
6
6
6
6
4
10
01
01
3
7
7
7
7
5
;
2
6
4
10
01
3
7
5
:
Althoughinthisexample,thereisnoredundancyinthethreeerrorcontrolcodes,the
correspondingnetworkcodecannotdetectorcorrecterrors,itisttoshowthat
networkcodecanbeexpandedtocascadederrorcontrolcodes.
IntherestofSection3.2,wewillshowthatnetworkcodescanbetransferredintocascaded
errorcontrolcodes.Inthisway,wecancharacterizeanddesignnetworkcodesbasedonthe
underlyingcascadederrorcontrolcodesforerrordetection/correctionandmaliciousnodes
locating.
37
Figure3.10Transferincomingedgesofnodeshavingmultipleincomingedgesbyadding
virtualnodes
3.2.3CharacterizationofNetworkCodingusingCascadedErrorControlCod-
inginPoint-to-PointCommunication
Herewewillformallystatetherelationshipbetweennetworkcodingandcascadederror
controlcodinginthepoint-to-pointcommunication.Theciencyisstudiedthenthe
necessity.
3.2.3.1The
Theorem3.5.
Everynetworkcodeschemecanbeexpandedtoaseriesofcascadederror
controlcodes.
Proof.
Toprovethis,wewillshowthatthenetworkcodecanbepartitionedintoseveral
cascadedlevelsofonehopnetworkcodes.Foreachofthenodesthathavemultipleincoming
edgesinthenetwork,weaddsomevirtualnodesontheseedgesasshowninFigure3.10.For
eachoftheincomingedges,theremaybeseveralpathsthroughwhichmessagesarepassed
fromthesourcenodetonode
u
includingtheedge.Amongallthepaths,wethelongest
oneandcalculateitsnumberofhops.Aftercalculatingthehopvalues
h
1
;:::;h
m
forallthe
incomingedges,wechoosethemaximumvalue
h
max
.Foreachoftheincomingedge
i
,we
add
h
max

h
i
virtualnodesonit,makingallthepathsfromsourcetonode
u
havethesame
countofhops.Thevirtualnodessimplyforwardthemessagespassedonthecorresponding
edges.
38
Figure3.11Partitionanetworkcodeintoseverallevels
AftertheoperationinFigure3.10isperformedinallthenodeshavingmultipleincoming
edges,sinceallthepathsfromsourcenodetothesametherelaynodehavethesamehop
countsandthesinknodeitselfmusthavemultipleincomingedges,everypathfromthesource
nodetothesinknodehasthesamenumberofhops,thusthesamenumberofintermediate
nodes,includingtherelaynodesandthevirtualnodes.Wecanputthenodeshavingthe
samehopcountstogetherasalevelasshowninFigure3.11.Everysinglelevelcanbeviewed
asonehopnetworkcodedeterminedbytheconnectionsfromnodesofthepreviouslevel.So
everynetworkcodecanbepartitionedintoseveralcascadedlevelsofonehopnetworkcode.
AccordingtoTheorem3.1,theseonehopnetworkcodescanberepresentedbyerror
controlcodes.Sothecascadednetworkcodescanberepresentedbyconcatenatingthe
correspondingerrorcontrolcodestogether.Wecanexpandanynetworkcodetoaseriesof
cascadederrorcontrolcodes.
TakingthenetworkcodeinFigure3.5asanexample.Thesourcenode1transmitsthree
symbols
x
1
;
1
;x
1
;
2
;x
1
;
3
tosinknode4inthisnetworkcode.Andsinknode4canreceive
6encodedsymbols,whichindicatesthatthereareredundanciesinthisnetworkcoding.In
Section3.1,weanalyzethesamecodeandtransferitintoa(6
;
3)errorcontrolcodewhich
cancorrect1error.Herewewillshowthiscodecanbetransferredintoaseriesofcascaded
errorcontrolcodes.FollowingtheoperationsmentionedintheproofofTheorem3.5,we
cangetthecorrespondingcascadednetworkcodesandcascadederrorcontrolcodesshown
39
Figure3.12ThecorrespondingcascadedbipartitegraphofFigure3.5
inFigure3.12.Nodes
v
1
and
v
2
areaddedasvirtualnodestopartitiontheoriginalnetwork
code.Thelevelerrorcontrolcodeisa(5
;
3)codeandthesecondlevelcodeisa(6
;
5)
code.
Ifanerroroccursonedge
e
1
,node2willreceivewrong
x
1
;
1
.Theerrorwillpropagateto
thesucceedingnodes,thustherewillbetwoerroneous
x
1
;
1
and
x
1
;
1
+
x
1
;
3
inthesinknode,
whichisbeyondtheerrorcorrectioncapabilityofthe(6
;
3)errorcontrolcode.Theerrors
cannotbedealtusingthetransformingmethodsinSection3.1.However,ifthemonitor
nodecancollecttheoutputsymbolsofthelevel(5
;
3)code,itcancorrecttheerroneous
symbol
x
1
;
1
innode2.Sotheerrorpropagationiseliminatedfromthebeginning.By
exploringtheinnerstructureofthenetworkcode,wecanmakebetteruseoftheredundancy
inthenetwork.
Ifnode3isanmaliciousnodeandsendoutcorruptedmessages,therewillbe3errors
intheoutputofboththelevelerrorcontrolcodeandthesecondlevel.Theerroris
beyondthecapabilityofthecascadederrorcontrolcodes,sowecannotcorrecterrorsor
40
locatethemaliciousnode.Wewillshowthatwecandesignnetworkcodescorresponding
topropercascadederrorcontrolcodestocorrecterrorsandlocatemaliciousnodesinthe
sectionsbelow.
3.2.3.2TheNecessity
Wehaveprovedthatanynetworkcodecanbeviewedasaseriesofcascadederrorcontrol
codes,nowwewillconsiderthereverseproblem.Forapoint-to-pointcommunication,a
networkcodeisfeasibleonlyifitcansuccessfullydeliverallthedesiredsymbolsfromthe
sourcenodetothesinknode.
Theorem3.6.
Foralinearnetworkandadesiredconnection
C
=(
u;v;
X
(
u
))
,Aseriesof
cascadederrorcontrolcodeswithparameters
(
n
1
;n
0
)
;
(
n
2
;n
1
)
;:::;
(
n
m
;n
m

1
)
,canbeseen
asafeasiblenetworkcodeintheconnection
C
ifwehavetherelationship:
n
0

R
(
C
)
.
Proof.
Supposetheoriginalmessageis
x
=(
x
1
;:::;x
k
),theoutputencodedmessagefor
eachlevelofthecascadederrorcontrolcodesis
y
i
=(
y
i;
1
;:::;y
i;n
i
)(1

i

m
)andthe
generatormatrixforeachofthecascadederrorcontrolcodesis
G
i
(1

i

m
)ofthesize
n
i

1

n
i
.
y
i
foreachlevelcanbewrittenas:
y
1
=
x

G
1
;
y
2
=
y
1

G
2
;:::;
y
m
=
y
m

1

G
m
:
(3.16)
Sotheentireencodingequationforthecascadederrorcontrolcodescanbewrittenas
y
m
=
x

G
1

G
2

G
m
=
x

G:
(3.17)
Ifweviewthecascadederrorcontrolcodesasanerrorcontrolcodewiththegenera-
tormatrix
G
ofthesize
n
0

n
m
,theparameterforthecodeis(
n
m
;n
0
).Accordingto
Theorem3.3,if
n
0

R
(
C
),thenetworkcodeisfeasible.
Basedontheanalysis,byimplementingtheerrorcontrolcodeforeachlevelofthe
cascadederrorcontrolcodes,wecanaddappropriateredundanciesintothenetworkcodeto
controlerrorsandlocatemaliciousnodes.Thiscanbedoneintwosteps:
41
1.
Accordingtothenetworktopology,determinethenumberoflevelsofthecascaded
codes.Accordingtothedesignrequirements(numberoferrorstodetectorcorrect,
numberofmaliciousnodestolocate),determineanappropriatecoderateandthetype
oftheerrorcontrolcodeforeachlevel.
2.
Accordingtothesourcerate
R
(
C
),chooseaproper
n
0
suchthat
n
0

R
(
C
),andderive
therestofthe
n
i
(1

i

m
)basedonthecoderateforeachleveloftheerrorcontrol
codes.Generatethegeneratormatrices
G
1
;:::;G
m
accordingtothecodetypesand
applythemasthesystemtransfermatricestoeachlevelofthenetworkcodes.
3.2.3.3ApplicationinCombatingpollutionattacks
Theorem3.7.
Suppose
d
i
;d
i
+1
>
2
aretheminimumdistancesof2adjacentlevels(
L
i
;L
i
+1
)
ofthecascadednetworkcode.If
2
d
i
+1
>d
i
+1
,thenerrorsin
L
i
+1
spreadbyasingleerror
in
L
i
isuncorrectablebythe
L
i
+1
'serrorcontrolcode.However,theycanbecorrectedby
the
L
i
'serrorcontrolcode.
Proof.
Accordingto[89],onesymbolinthesourcemessageisrelatedtoatleast
d
min
symbols
intheencodedcodeword.Sooneerrorin
L
i
canbecomeatleast
d
i
errorsin
L
i
+1
.If
2
d
i
+1
>d
i
+1
,theseerrorsarebeyondthecapabilityoftheerrorcontrolcode.However,
thesingleerrorcanbecorrectedat
L
i
because
d
i
>
2.Thentheerrorsin
L
i
+1
canbe
correctedaccordingly.
LetusanalyzethelinearnetworkshowninFigure3.13,thesourcenode1isgoingto
send3symbols
x
1
;x
2
;x
3
tosinknode12.Thisnetworkcanbepartitionedinto2levels.
Nodes2
;
3
;
4
;
5formthelevelandnodes6
;
7
;
8
;
9
;
10
;
11formthesecondlevel.Inorder
togetthebesterrorcontrolcapability,weimplementtwosystematicRScodesinthetwo
levels.Theyare(7
;
3
;
5)codeforlevel1and(11
;
7
;
5)codeforlevel2.Theminimum
distancesofthetwocodesareboth5,thusbothofthemcancorrect2errors.Becausethe
errorsoccurringnexttothesourcenodearemoresensitive.Theymaypropagatetothe
42
subsequentnodescausingmuchmoreerrors.Weputthelowerratecodethathasstronger
errorcontrolcapabilityatthelevel.
Whenthereisnoerrorinthenetwork,wehave(
y
i;
1
;y
i;
2
;y
i;
3
)=(
x
1
;x
2
;x
3
)
;i
=1
;
2
:
Itiseasyforthesinknodetodecodethemessages.Ifnode6isamaliciousnodeandit
sendsouterroneous
y
2
;
1
;y
2
;
2
,themonitornodecancorrectthese2errorsusingthesec-
ondlevelRScodeandoutthismaliciousnodeaccordingtothenetworktopology.If
node2isamaliciousnodeanditsendsouterroneous
y
1
;
1
;y
1
;
2
,theerrorswillpropagate
to
y
2
;
1
;y
2
;
2
;y
2
;
8
;y
2
;
9
;y
2
;
10
;y
2
;
11
,whichpreventsthesecondlevelcodefromcorrectingthe
errors.InthecorrespondingcascadedbipartitegraphFigure3.14,theerrorsaremarked
withgreycolor.Itisclearthat2errorsfromlevel1spreadto6errorsinlevel2.Evenifwe
transferthenetworkcodeintoone(11
;
3
;
9)RScodewhichiscapableofcorrecting4errors
accordingtoSection3.1,theerrorsarestilltoomanytocorrect.However,basedonthefact
thattheerrorsareburstandcorrelated,afterthemonitornodecollectstheoutputsofthe
level,itcancorrectthe2errorsoccurringinnode2usingthelevelRScode,
outthemaliciousnodebasedonthenetworktopologyandcorrectthe6errorsinthesecond
level.OurcascadedRScodecancorrectatmost6errorsbyexploringtheinnerstructure
ofthecodeandismorepowerfulthanregularRScodes.
Withproperdesignofeachlevelofthecascadederrorcontrolcodes,wecanmakethe
correspondingnetworkcodecapableofdetectingandcorrectingerrorsthenlocatingthe
maliciousnodes.
3.2.4MulticastCase
Becauseinpoint-to-pointcommunicationcase,ourproofsfortherelationship(writtenas
R
nc;cec
)betweennetworkcodeandcascadederrorcontrolcodesaresolelydependedon
theproofsfortherelationship(writtenas
R
nc;ec
)betweennetworkcodeanderrorcontrol
codeinSection3.1(Theorem3.1andTheorem3.3)andthiskindofdependencehasno
relationshipwiththespcommunicationcase,wecanprovethat
R
nc;cec
inthemulticast
43
Figure3.13Implementa2levelcascadederrorcontrolcodeinnetworkcoding
Figure3.14ThecorrespondingcascadedbipartitegraphofFigure3.13
caseissimilartothatinthepoint-to-pointcommunicationcase,basedonthefactthatin
Section3.1
R
nc;ec
staysthesameinbothpoint-to-pointandmulticastcases.
44
CHAPTER4
COMBATINGPOLLUTIONATTACKSFORRANDOMNETWORKS
Inthischapter,wewillproposeanewerror-detectionanderror-correction(EDEC)scheme
todetectandremovethemaliciousattacksforrandomnetworkcoding.TheproposedEDEC
schemeissimilarinstructuretotheexistingerrorcontrolbasedschemes.However,itcan
maintainthroughputunchangedwhenmoderatenetworkpollutionexistswithonlyaslight
increaseincomputationaloverhead.ThenweproposeanimprovedLEDECschemeby
integratingthelow-densityparitycheck(LDPC)decoding.Ourtheoreticalanalysisdemon-
stratesthattheLEDECschemecanguaranteeahighthroughputevenforheavilypolluted
networkenvironment.Wealsoprovideextensiveperformanceevaluationandsimulation
resultstovalidateourtheoreticalresultsusingns-2networksimulator.
4.1System/AdversarialModelsandAssumptions
Inthischapter,wewillstudycombatingpollutionattacksfortheencodingforrandom
networks,whereitistodesignterrorcorrectionnetworkcodeswithoutthe
knowledgeofthenetworktopology.Inthiscase,alltheencodingcots

l;e
;
e
0
;e
and

e
0
;j
willbechosenrandomly.
Inthischapter,wewillusesomenotationsofSection2.1.Forasourcenode
u
,thereisasetofsymbols
X
(
u
)=(
x
1
;:::;x
l
)tobesent.Foralink
e
betweenrelaynodes
r
1
and
r
2
,writtenas
e
=(
r
1
;r
2
),thesymbol
y
e
transmittedonitisthefunctionofallthe
y
e
0
suchthat
head
(
e
0
)=
r
1
.And
y
e
canbewrittenas:
y
e
=
X
e
0
:
head
(
e
0
)=
r
1

e
0
;e

y
e
0
=
l
X
i
=1

e;i
x
i
=

e
x
;
(4.1)
where

e
0
;e
isthelocalnetworkencodingcot,

e;i
istheglobalnetworkencoding
45
Figure4.1Applyerrorcontrolcodesinlinearnetworkcoding
cotforsymbol
y
e
and

e
=


e;
1
;
e;
2
;:::;
e;l

isthenetworkencodingvector.For
asinknode
v
,thereisasetofincomingsymbols
y
e
0
(
e
0
:
tail
(
e
0
)=
v
)tobedecoded.
Aswementionedabove,ifadversariescanmodifythecontentsofthepacketsandsend
themtothesucceedingrelaynodes,thecommunicationwillfailandthecapacitywillbe
reduced.Inaddition,foralargescalenetwork,asmallerroroccurringatanintermediate
relaynodemaytomanypacketsatthesinknode.Thiscancauseatwaste
ofnetworkresourcesandsometimescanevenruinthewholenetworkcommunication.
Inthisdissertation,themaliciousnodecanaddrandomerrorstothesymbolsinthe
receivedpacketsthensendthecorruptedpacketsouttopollutethenetwork.Weadoptthis
simpleadversarialmodelbecausewemainlyfocusonthethroughputimpactbroughtby
tstrategies(discardvs.keep)towardscorruptedpacketsinthisresearch.
4.2ProposedEDECScheme
ThebasicideaoftheproposedEDECschemeinFigure4.1isthatthesourcenodesencode
theoriginalmessagesusinganerrorcontrolcodebeforesendingthemout.Theproperties
oftheerrorcontrolcodekeepunchangedduringthelinearnetworkcoding.
AswementionedinSection1.1.3,theerror-detectionbasedschemesmainlyfocuson
detectingthecorruptpackets.Whenacorruptpacketisidenthroughsyndromes,it
willbediscarded.Soifanadversarycontinuestocorruptcertainpackets,thesepacketswill
becontinuouslydroppedandthecommunicationmayneversucceed.Therefore,weneedto
46
Figure4.2Limitationsoferrorcontrolcodes
developtechniquesthatcanimprovethethroughputforthesesituations.
4.2.1EDECScheme
SimilartotheNullKeyscheme,ourapproachalsoutilizestheerrorcontrolcode,butwe
useboththeerrordetectionanderrorcorrectionproperties.Whenacorruptedpacketis
detected,wedonotdropit.Insteadwecollectthecorruptedpackettothesinknodeto
correcttheerrors.However,thecorruptedpacketwillnotparticipateinnetworkcodingin
thesubsequentrelaynodesonceitisidentobecorrupted.
4.2.1.1LimitationsofErrorControlCode
Alinearerror-correctingcodeencodestheoriginal
k
bitsmessagesymbol
m
toan
n
bits
codeword
c
usingageneratingmatrix
G
k

n
.Sothecoderateis
r
=
k=n
.Supposethe
minimumdistanceis
d
,accordingtotheresultsinthePreliminary,themaximumnumber
oferrorswecancorrectis
j
d

1
2
k
.Ifthenumberoferrorsismorethanthisamount,wemay
correctthecorruptedcodewordintoafalseone,asillustratedinFigure4.2.
4.2.1.2MoErrorControlCode
Theconventionalerrorcontrolcodemayhaveundetecteddecodingerrors.Thisisaninherent
nature.Nomatterhowlowwesetthecoderate,theseundetectederrorsmayexist.The
47
Figure4.3TheencodingprocessofmoerrorcontrolcodeinEDECscheme
decodingerrorscanonlybedetectedusingmechanismsotherthanastand-aloneerror-
correctingcode.
Therefore,weproposetoapplymoerror-controlcodetobothmessagesymbolsand
networkcodingcoetsinequation(4.1).Inthissection,wewillusethemessagesymbol
asanexample.Theoriginalmessagesymbol
m
ismappedtoa
t
bitvalue
h
usinga
homomorphicMACalgorithmlike[33].The
t
bitswillbeappendedto
m
toformanew
k
+
t
bitsmessagesymbolandtogetthecodewordbyencodingthisnewmessagesymbol.So
thecodebecomesan(
n;k
+
t
)code.Byaddingtheextrabits,wecanmitigatelimitations
oftheconventionalerror-controlcode.Figure.4.3illustratesthemoencodingscheme.
Uponasuccessfuldecoding,thedecodedmessagesymbolissplitintotwoparts
m
0
and
h
0
.Thenwecalculatethemappingof
m
0
:
h
00
.If
h
00
doesnotequalto
h
0
,we
candetectadecodingerror.Ourmoisequivalenttochoose2
k
messagesymbols
from2
k
+
t
symbols.Othermessagesymbolsinthe2
k
+
t
symbolspaceareconsideredtobe
illegal.However,thedecodingalgorithmonlyguaranteesthatthedecodedcodewordisin
the
k
+
t
dimensionalsubspace.Soifthecorrectedcodewordbelongstothe2
k
+
t

2
k
illegal
symbolspace,weknowthedecodingcontainserror.Figure4.4illustratesthecorresponding
modecodingscheme.
Theorem4.1.
Supposeadecodingerroroccurs,thewrongcodewordwillbeanycodeword
inthe
2
k
+
t
symbolspace.Sotheprobabilityofdetectinganerroneousdecodingis:
p
=
2
k
+
t

2
k
2
k
+
t
=
2
t

1
2
t
=1

1
2
t
:
(4.2)
48
Figure4.4ThedecodingprocessofmoerrorcontrolcodeinEDECscheme
Table4.1Fourcasesofdecodedcodewordsinmoerrorcontrolcode
Case
kbitsoriginalsymbol
t
bitsmappingvalue
Results
1
Decodedright
Obeymappingrule
Successfully
2
Decodedright
Violatemappingrule
Falsealarm
3
Decodedwrong
Obeymappingrule
Missdetect
4
Decodedwrong
Violatemappingrule
Successfully
Asanexample,when
t
=4
;p
=
15
16
,theprobabilityfor3consecutiveerroneousdecodings
tobedetectedis1


1
16

3
ˇ
0
:
9998.Therefore,weonlyneedtoaddaverysmalloverhead
todetecterroneousdecodings.
4.2.1.3PerformanceofMoErrorControlCode
Inthissection,wewillselectacycliccodewith
n
=15,
t
=4todemonstrateourproposed
scheme.Weaddsomeerrorstotheencodedsymbols,thendecodethesesymbolsas
describedinFigure4.4.Weevaluatetheperformancebycheckingthenumbersofdecoded
codewordsinfourtcases.TheresultsaresummarizedinTable4.1.
Codewith
k
=6Inthissimulation,weusea(15
;
10)codefortheevaluation.Fromthe
results(seeFigure4.5)wecansee:(i)Thiscode(withminimumhammingdistance4)can
detectandcorrectallthe1biterrorandpartofthetwobitserrors.(ii)Wecansuccessfully
detectmostofthedecodingerrorswhenthenumberoferrorsismorethan2.Infact,
49
Figure4.5PerformanceofmoerrorcontrolcodeinEDECschemewhen
k
=6
thedetectionprobabilityislargerthan0.8mostofthetimeexceptfor14errors,inwhich
theprobabilityisabout0.6.(iii)Falsealarmcannotbedistinguishedfromthesuccessful
detection.Infact,thefalsealarmisalsocausedbytheerrorsbeyondthecorrectingability.
Theonlyerenceisthatthe
t
bitsappendixpartofthesymbolisdecodedwrong.However,
thefalsealarmisneglectableaccordingtotheresults.
Codewith
k
=4Inthissimulation(seeFigure4.6),weusea(15
;
8)codetodothe
evaluation.Theonlyisthatthiscodeisabletocorrectmoreerrorsbecausethe
coderateisrelativelylower.Fromtheresultsofthetwotcoderates,wecanseethat
adding4extracheckbitsisenoughtodetecterroneouscorrection.
4.2.1.4AlgorithmsforEDECScheme
TheproposedEDECschemeisdividedintotwophases:initializationphaseandtransmission
phase.Theinitializationphaseisfornullkeyandsecurityparameterdistributionwhiledata
symbolsaretransmittedthroughnetworkcodinginthetransmissionphase.
50
Figure4.6PerformanceofmoerrorcontrolcodeinEDECschemewhen
k
=4
InitializationPhase
Ininitializationphase,thesourcenodewilldistributetherow
vectorsoftheparitycheckmatrixcorrespondingto
G
inAlgorithm4.1(nullkeys)toall
therelaynodessimilarto[44]usinghomomorphichashes.Unlikenormallinearnetwork
codinginwhichthenetworkencodingvectorswillbeattachedtothestartortheendof
thepackets,weproposetoinserttheencodingvectorstoapredeterminedsecretlocationin
thepackets.Thesourcenodewillsendthelocationinformationtoallthesinknodesduring
initializationphasethroughasecuretransmissionprotocolsuchasTLS[93].Thiswill
preventthemaliciousnodesfromcorruptingtheencodingvector,whichisessentialforthe
datadecoding.Moreover,thesourcenodewillalsosendtheencodingmatrix
G
c
fornetwork
encodingvectorsand
G
fordatasymbolstoallthesinknodes.Oncetheinitializationphase
isdone,thesourcenodescanmulticastanynumberofpacketstosinknodes.Theoverhead
oftheinitializationphaseisnegligible.
51
TransmissionPhase
Inthetransmissionphase,thesourcenodes,relaynodesandsink
nodeswillperformtheproposedEDECschemeaccordingtoAlgorithm4.1,4.2and4.3.
Algorithm4.1
EDECAlgorithmforSourceNodes
for
packeti
do
==
Encodenetworkencodingvector

i
inequation(4.1)usingthemoderror-control
code(Figure.4.3)
h
c
 
map(

i
)
u
c
 
(

i
j
h
c
)
Encodednetworkencodingvector
 
u
c

G
c
for
everysymbol
m
ofthepacket
do
==
Encode
m
usingthemoderror-controlcode(Figure.4.3)
h
 
map(
m
)
u
 
(
m
j
h
)
Encodedsymbol
 
u

G
endfor
Sendouttheencodedencodingvectorandsymbols
endfor
Inalgorithm4.1,thesourcenodewillencodethenetworkencodingvector

i
usingthe
moerror-controlcodewithamuchlongerappendixandlowercoderate,comparedto
theencodingofdatasymbols.Thiscanimprovetheerrorresistanceanddetectionproba-
bilityforerroneousdecodingstoguaranteethecorrectnessofencodingvectorsusedfordata
decoding.Sincethereisonlyoneencodingvectorineachpacket,theoverheadbroughtby
thishighersecuritylevelisnegligible.
Algorithm4.2presentstheEDECalgorithmforrelaynodes.Sincethenullkeys(rowvec-
torsoftheparitycheckmatrixcorrespondingto
G
inalgorithm4.1)arealreadydistributed
ininitializationphase,therelaynodescancheckwhetherapacketisintact.
Algorithm4.3presentstheEDECalgorithmforsinknodes.Sincethesinknodehas
alreadyreceivedtheencodingmatrix
G
c
and
G
inAlgorithm4.1ininitializationphase,it
canperformtheerror-controlcodedecodinganddetectionforerroneousdecoding.Thenit
canderivetheoriginaldatasymbolsthroughdecodingofnetworkcoding.
52
Algorithm4.2
EDECAlgorithmforRelayNodes
if
everysymbolinthereceivedpacketisintact
then
if
thepacketisindependent
then
Savethepacket
if
x
(apredeterminednumber)packetsarecollected
then
repeat
Generate
x
randomly,linearlycombinedpacketsusingthesavedpackets(network
coding)
until
the
x
newpacketsareindependent
Sendoutthe
x
packets
endif
endif
else
if
thepacketisindependentfromallthepreviouspackets
then
Markthepacketascorruptedandsenditout
endif
endif
Algorithm4.3
EDECAlgorithmforSinkNodes
Apacketisreceived
Decodethenetworkencodingvectorandeverysymbolinthepacketusingthedecoding
algorithmforthemoerror-controlcode(Figure.4.4)
if
thenetworkencodingvectorandallsymbolsaredecodedcorrectly
then
if
thepacketisindependent
then
Savethepacket
if
l
(inequation(1.1))independentpacketsaresaved
then
Solvethenetworkcodingequations
endif
endif
endif
4.2.2Simulationinns-2
Inthissection,thesimulationplatformforEDECschemeinns-2[94]ispresented.Then
wewillcomparetheEDECschemeandtheerror-detectionbasedschemes.Inthesimulation,
weimplementtheNullKeyschemetorepresenttheerror-detectionbasedschemes.
53
4.2.2.1SimulationPlatform
ns-2isadiscreteeventsimulatorthatprovidescomprehensivesupportforsimulationof
networkprotocols.ItisidealforthesimulationofEDECscheme.Thescenarioissetasa
gridnetworkwithonesourcenode,anumberofrelaynodesandsinknodes.Allthenodesare
setaswirelessnodesusingwirelessphysicallayer,802.11MACprotocolandAODVrouting
protocol.ThewirelesschannelissettoTwoRayGround.Thenodestransmitpacketsusing
broadcasting.Onceanodereceivesapacket,itwillstartthecorrespondingoperations
dependingonitstype(source,relay,malicious,sink)andthepacketcontent.
Figure4.7showsthetopologyofthesimulatednetwork.Thesourcenodeislocated
atthelowerleftcornerand19sinknodeslieattheupperright.Therestnodesareall
intermediatenodesthatcanrelaypackets.Inthesimulation,werandomlypickanumber
ofintermediatenodesasmaliciousnodestoperformpollutionattacks.Thesenodescanadd
certainerrorstoreceivedpacketsbeforesendingthepacketsouttopollutethenetwork.We
canchangethenumberofmaliciousnodestoevaluateperformanceofthealgorithmsunder
tnetworkconditions.Asanexample,inFigure4.7,werandomlypick50nodesoutof
209intermediatenodestobemaliciousnodes.Themaliciousratioisabout50
=
209=24%.
Therestoftheintermediatenodesactasrelaynodes.Afterreceivingapacket,theywill
conductthepollutiondetection.Intheerror-detectionbasedschemes,ifthepacketis
corrupted,itwillbedropped.WhileintheEDECscheme,wewillforwardthesepackets.
However,thesepacketswillnotparticipateinnetworkcoding.Thenodesbehaviorswillbe
detailedinthenextsection.
Becausethepacketsaretransmittedthroughbroadcasting,althoughtheMACprotocolis
IEEE802.11,wewillstillhavepacketscollisionsthatwilleventuallytthesimulation
results.Inthisdissertation,weonlyfocusonnetworklayerprotocols.Thusafterconsidering
thetransmissionrangeofthesinglenode,adjacentnodesareassignedttimeslots
(seeFigure4.8)toavoidpacketscollisions.Thereare9timeslotsintotalandtheduration
54
Figure4.7Simulationscenario
Figure4.89timeslotstoavoidpacketscollisions
ofeachtimeslotis100ms.Thenodesareallowedtosendpacketsonlyiftheyareintheir
ownslots.Ifnot,theywillhavetowaituntiltheirnextslots.InFigure4.8,wegivean
examplefornodesthatbelongtotimeslot2tosimultaneouslytransmitwithoutpackets
collisions.
55
4.2.2.2NodesDesign
Aftersettingupthesimulationplatform,wecanfairlyevaluatethealgorithmswithoutcon-
sideringotherfacts.Fourtypesofnodesaredesignedaccordingtothealgorithmsdescribed
above.
SourceNode
Inthesimulation,thesourcenodewillmulticasta352-symbolmessage,
whichisfragmentedinto32packetsof11symbols.Eachsymbolhasthesize
k
=512
bits.Inthewholenetworktherewillbe32linearlyindependentpackets.Thatis
l
=32in
equation(1.1).Afterinitializingthenetwork,thesourcenodewillencodeeachdatasymbol
usingthemoerror-controlcodepresentedinFigure.4.3with
t
=16.Theencoding
vector

i
willalsobeencodedusingthemoerror-controlcodewith
t
=32.According
toTheorem4.1,theprobabilityofdetectinganerroneousdecodingfortheencodingvector
isabout1

2

10
,whichmeansoncethedecodedencodingvectorpassthevin
Figure.4.4,wecanviewtheencodingvectorasintact.Thenthesourcenodewillinsert
encodednetworkencodingvectorintothepredeterminedlocationineachpacketandsend
outthepacket.
RelayNodes
RelaynodeswillperformEDECschemeaccordingtoalgorithm4.2.Because
thenetworkiscollisionfreeandallthetransmittedpacketscanbereceived,eachpacketonly
needstobetransmittedonce.Soifanewlyreceivedvalidpacketislinearlydependentof
previoustransmittedpackets,itwillbediscarded.Sincethereare32linearlyindependent
packetsintotal,ifarelaynodedoesnottransmitpacketsuntilallthe32packetshavebeen
received,thetimedelaywillbehuge.Moreover,arelaynodemayneverbeabletocollect
allthe32validpacketsduetomaliciousattacks.Tousenetworkcodingtlywhile
minimizingthetimedelay,relaynodeswillperformnetworkencodingoncetheycollect4
independentvalidpackets.
56
MaliciousNodes
Similartotherelaynodes,maliciousnodesonlysendoutindependent
packets.However,weassumethatthemaliciousnodeswillnotperformnetworkencoding
inthiscase.Theyonlypollutepacketsandsendoutcorruptedpackets.
SinkNodes
Sinknodeswilldecodingboththenetworkcodingandtheunderlingmo
error-controlcodeaccordingtoalgorithm4.3.Aftertheoriginalsymbolsaresuccessfully
retrieved,allthepacketsreceivedafterwardswillbeignored.
4.2.2.3SimulationResults
Weconductedsimulationsundertpercentagesofthemaliciousrelaynodes.Tomake
theresultsmoreclear,wethenumberofbitsthatthemaliciousnodescancorrupt
foreachsymbol.Thenwemakethisnumberrandomaccordingtoouradversarymodel.
SmallNumberofErrors
Whenthenumberofbitsthatmaliciousnodescancorrupt
foreachsymboliswithinthecapabilityoferrorcontrolcodes,thethroughputcomparison
betweentheEDECschemeandtheerror-detectionbasedschemesisshowninFigure4.9.
Inthewecanseethat:(i)Whenthepercentageofmaliciousnodesislessthan
10%,theperformanceofthetwoschemesarealmostthesame.(ii)Withtheincreasingof
themaliciousnodes,theperformanceoferror-detectionbasedschemesdegradetly.
WhilethethroughoutoftheEDECschemeremainsunchanged.(iii)Whenthepercentage
ofthemaliciousnodesislargerthan65%,theerror-detectionbasedschemesdonotworkat
allbecausetoomanycorruptedpacketshavebeendumped.However,thethroughputforthe
EDECschemestillremainsunchangedbecausetheEDECschemecansuccessfullyrecover
allofthemessagesymbolsfromthecorruptedpackets.Thisscenariowillremaintrueas
longasthecorruptedpacketsymbolsarewithinthecapabilityoftheerrorcontrolcodes.In
thiscase,theEDECschemesurpassestheerror-detectionbasedschemesinthroughput.
57
Figure4.9ThroughputcomparisonbetweenEDECschemeandtheerror-detectionschemes
basedonthenumberofbitcorruptedineachsymbol|forsmallnumberoferrors
LargeNumberofErrors
Whenthenumberofbitscorruptedineachsymbolofthere-
ceivedpacketsisbeyondthecapabilityoftheerrorcontrolcodes,thethroughputcomparison
betweentheEDECschemeandtheerror-detectionbasedschemesisshowninFigure4.10.
Fromtheresultswecanseethattheperformanceofthetwoschemesarealmostthesame.
ThisisbecausethecorruptedpacketsthatcannotberecoveredbytheEDECschemehave
alreadybeendumpedbytheerror-detectionbasedschemes.
RandomNumberofErrors
Whenthemaliciousnodesaddsrandomerrorstothesym-
bolsinthereceivedpackets,theperformanceoftheEDECschemeiscomparablewith
theerror-detectionbasedschemes.Thisisbecausethatwhilesomeofthesymbolsinthe
corruptedpacketscanbecorrected,butsomearebeyondthedecodingcapabilityofthe
errorcontrolcode,whichmakesthepacketunusablewithresultsimilartothepacketbeing
dumpedintheerror-detectionbasedschemes.
58
Figure4.10ThroughputcomparisonbetweenEDECschemeandtheerror-detection
schemesbasedonthenumberofbitcorruptedineachsymbol|forlargenumberoferrors
4.3LDPCDecodingandLEDECScheme
IntheEDECscheme,onlylinearlyindependentpacketsparticipateinthenetworkdecoding
atthesinknodes.Corruptedorlinearlydependentpacketswillnotbeused.Inthissec-
tion,wewillexploreutilizingthesepacketstorecovermoremessagesymbolsusingLDPC
decoding.
4.3.1LDPCCode
Lowdensityparitycheck(LDPC)linearblockcodewasintroducedbyGallagerin
1962[95].OneoftheimportantcharacteristicofLDPCcodeisitssparseparitycheck
matrix.Byusingiterativedecoding,LDPCcodecanachieveerror-correctionperformance
closetoShannonbounds[96].TheadvantagesofLDPCcodewerediscussedin[97,98].
SomenewclassesofasymptoticallygoodLDPCcodeswerestudiedin[99{101].Andsome
decodingalgorithmsofLDPCcodeswerepresentedin[102{105].
LDPCcodescanbecategorizedastheregularLDPCcode,ofwhichtheparitycheck
59
Figure4.11AnillustrativeexampleofparitycheckmatrixandTannergraph
matrix
H
hasednumberof1'spercolumnandperrow,andtheirregularLDPCcode[106],
ofwhichtheparitycheckmatrixmayhavetnumberof1'sineachcolumnandeach
row.Inthisdissertation,wewillformulatethenetworkcodingtotheirregularLDPCcode.
4.3.2DecodingofLDPCCode
Theiterativedecodingalgorithm,knownasbeliefpropagationalgorithm(BPA),isgenerally
usedtodecodetheLDPCcode.TheBPAisasoft-decisionalgorithmstudiedin[107{109].
Forabinaryerasurechannel(BEC),thebitsinthecodewordsarereceivedas0's,1'sor
x
's(erasures).TheBPAcanbedescribedovertheTannergraph[110],whichisabipartite
graph.InaTannergraph,therearetwotypesofnodes:thesymbolnodes(corresponding
tothereceivedbits),andthechecknodes(correspondingtotherowsoftheparitycheck
matrix).AnillustrativeexampleoftheparitycheckmatrixanditsTannergraphisshown
inFigure4.11.Intheparitycheckmatrix,everyrowrepresentsaparitycheckequation.
Thesymbolnodes,whichcorrespondtothebitsequalto1'sinarowoftheparitycheck
matrix,areconnectedtothechecknodewhichcorrespondstothesamerow.Thesenodes
andedgesintheTannergraphexpresstheparitycheckequationofthatrow.InFigure4.11,
node
h
1representstherowoftheparitycheckmatrix.Andthethesecondand
thirdelementsoftherowinparitycheckmatrixare1's,sosymbolnodes
d
1,
d
2and
d
3
areconnectedto
h
1intheTannergraph.
Thedecodingalgorithmcanbedescribedthroughthefollowingalgorithm:
60
Algorithm4.4
BPADecodingAlgorithmforBEC
while
Therearechecknodesconnectedtoonlyoneunknownsymbolnode
do
for
Eachofthesechecknodes
do
Theunknownsymbolnode
 
xor(Alloftheothersymbolnodesconnectedtothe
checknode)
endfor
endwhile
if
Alltheunknownsymbolnodesarerecovered
then
Decodesuccessfully
endif
4.3.3RelationshipBetweenLinearNetworkCodeandLDPCCode
Inlinearnetworkcoding,packetsarelinearlycombinedattheintermediatenodes.The
packetsthatarereceivedatthesinknodessatisfyequation(1.1).Inthenetworkcode
decodingpartoftheEDECalgorithm,onlyindependentvalidpacketsareused.However,
thereisalsohelpfulinformationinthelinearlydependentpacketsorcorruptedpackets.If
wecanexploitandusethesepackets,wecanimprovethesystemperformance.Denotethe
receivedencodingvectoras
a
i
=(
a
1
;i
;

;a
l;i
)
T
,where1

i

m
and
m
isthenumberof
receivedpackets.Thenthegenerationmatrixoftheblockcodecanbeas:
G
=[
a
1

a
l
;
a
l
+1

a
m
]=[
P
1
;P
2
]
;
(4.3)
wherethematrix
P
1
canbemadeasa
l

l
fullrankmatrixthroughcolumnexchangeafter
l
independentpacketsarereceived,and
P
2
isa
l

(
m

l
)matrix.
Asanexample,supposethereisonlyonebit
x
i
ineveryoriginalpacketinthesource
node(1

i

l
).
x
=(
x
1
;:::;x
l
).Inthiscase,thereisalsoonlyonebit
y
j
inevery
receivedpacketinthesinknodes(1

j

m
).Denoteallthe
m
receivedpacketsasavector
y
.Wehavethefollowingencodingequation:
y
=
x

G
:
(4.4)
Thecorrespondingparitycheckmatrix
H
canbederivedasfollows.
61
thegeneratingmatrixas
G
=
P

1
1

[
I
l
;P

1
1
P
2
]
;
(4.5)
andtheparity-checkmatrixas
H
=[(
P

1
1
P
2
)
T
;I
m

l
]
:
(4.6)
Wecanverifythecorrectnessof
H
byverifyingthefollowequation:
G

H
T
=
P

1
1

[
I
l
;P

1
1
P
2
]

[(
P

1
1
P
2
)
T
;I
m

l
]
T
=
0
:
(4.7)
Afterderivingthecorrespondingparitycheckmatrix
H
,wecandecodethelinearnetwork
codeusingtheBPAalgorithm.ThelinearnetworkcodecanbeviewedasaratelessLDPC
code,andhasthepropertyoferrorcontrolcodes.
AlthoughlinearnetworkcodescanbeseenasratelessLDPCcodes,theBPAalgorithm
cannotbeusedtodecodeanetworkcodeifthenetworkcodeisderivedafteranormal
errorcontrolencode,becausewecannottheincorrectdecodingswhichcanbeviewed
aserasures.However,forthemoerrorcontrolcodesintheEDECscheme,wecan
determinetheerroneousdecodingsandmarkthecorrespondingbitsaserasures.Therefore,
wecandecodethelinearnetworkcodeusingtheBPA.
4.3.4LEDECSchemeUsingBPA
IntheLEDECscheme,weusethelinearlydependentpacketsandthecorruptedpacketsand
decodethelinearnetworkcodeusingBPAalgorithm.Figure4.12illustratesthismainidea
ofthedecodingalgorithm.
4.3.5TheoreticalAnalysis
Whenthenumberoferrorsispartiallybeyondthedecodingcapabilityoftheerrorcontrol
code,theLEDECschemecangetadditionalbfromdecodingoftheLDPCcodes.
62
Figure4.12MainideaoftheLEDECscheme
Hereweusea(31
;
15)cycliccodewithgenerationpolynomial
x
16
+
x
14
+
x
10
+
x
9
+
x
8
+
x
7
+
x
5
+
x
4
+
x
3
+
x
2
+
x
+1asanillustrativeexample.Itiseasytocalculatethat
thereareonly17515entriesforthe4-biterrorsinthesyndrometable,whilethenumber
forallthe4-biterrorsis

31
4

=31465.Itmeansonly17515distinct4-biterrorscanbe
successfullycorrected.Inthissituation,4-biterrorsareconsideredtobepartiallybeyond
thedecodingcapabilityofthe(31
;
15)code.Thesuccessfulerrorcorrectionprobabilityis
about17515
=
31465=0
:
5567.Becauseweusethemoerror-controlcode,whichcan
detecttheerroneousdecodings,thefailederrorcorrectionscanbeseenaserasureswith
erasureprobability
P
e
=1

0
:
5567=0
:
4433.
Considertheworstcaseinwhichalmostallthepacketsarecorruptedbythemalicious
nodes.Theerror-detectionbasedschemesdonotworkatallbecauseallofthepackets
aredumped.TheEDECschemedoesnotworkeitherbecausewitherasureprobability
P
e
=0
:
4433therewillnotbeenoughcorrectablepacketstosolvethenetworkcodingequa-
tion(1.1).
FortheLEDECscheme,let

d
denotetheprobabilitythatanedgefromachecknode
isconnectedtoasymbolnodeofdegree
d
,and
ˆ
d
denotetheprobabilitythatanedge
fromasymbolnodeisconnectedtoachecknodeofdegree
d
intheTannergraphofthe
correspondingLDPCcode.ThegeneratingfunctionsforanLDPCcodeisas:

(
x
)=
63
P
d

d
x
d

1
,
ˆ
(
x
)=
P
d
ˆ
d
x
d

1
.Accordingto[111],themaximalfractionoferasuresthat
arandomLDPCcodewithgivengeneratingfunctionscancorrectisboundedby
P
max
=
min
f
x

(1

ˆ
(1

x
))
g
(0
<x<
1)withprobabilityatleast1
O
(
l

3
=
4
),where
l
isthelength
ofthecode.ForthethroughputoftheLEDECscheme,wehavethefollowingtheorem:
Theorem4.2.
ThethroughputoftheLEDECschemeis
F
=
b
N

P
max
c
X
i
=0

N
i

P
i
e
(1

P
e
)
N

i
;
where
P
max
=min
f
x

(1

ˆ
(1

x
))
g
(0
<x<
1)
,
P
e
istheerasureprobability,
N
isthenumber
ofpacketsasinknodereceivedand

istheorfunction.
Proof.
Supposeasinknodereceives
N
packetsandtheerasuresinthepacketsareindepen-
dent,thedistributionofthenumberoferasures
i
inevery
N
receivedpacketsymbolsisa
binomialdistributionwithPr(
i
)=

N
i

P
i
e
(1

P
e
)
N

i
;
0

i

N
astheprobabilitymass
function(PMF).
Theproposedschemecancombatallerasuresupto
N

P
max
withprobabilityatleast1

O
(
N

3
=
4
),whichiscloseto1.Thusthethroughputcanbewrittenas
F
=
P
b
N

P
max
c
i
=0
Pr(
i
).
4.3.6PerformanceAnalysisandSimulation
Inthissection,weprovidesimulationresultsoftheLEDECschemeonthesimulationplat-
formpresentedinSection4.2.2.AllthesettingsandparameterarethesameasSection4.2.2.
4.3.6.1NodesDesign
FortheLEDECscheme,thesourcenode,relaynodesandmaliciousnodesarethesameas
thoseinSection4.2.2.However,thedecodingprocessinthesinknodesist.Allpack-
etsreceivedwillbeused,buttheBPAdecodingwillnotstartuntilthesinknodescollectall
64
the
l
=32independentpackets.Afterreceiving
l
=32independentpackets,theoretically,
wecanusetheBPAalgorithmtodecodewheneveranewpacketarrives.However,thereis
aindeterminingwhentostarttheBPAalgorithm.Whenitisusedtoofrequently,
itmayresultinahighcomputationaloverhead.Ontheotherside,ifwedonotstartthe
BPAdecodinguntilwehavecollectedalargenumberofpackets,thecommunicationdelay
maybetoohigh.Tobalancethesetwoissues,thesinknodeswilltriggertheBPA
decodinguponreceivingofevery10newpackets.Thisprocesswillcontinueuntilallthe
messagesymbolshavebeensuccessfullydecoded.WhiletheBPAdecodingismorepowerful
thanthelinear-equation-solvinghard-decisiondecodingmethoddescribedinSection4.2.2,
thecomputationaloverheadoftheBPAschemeisrelativelyhigher.Tooptimizetheadvan-
tagesofthetwoalgorithms,inourscheme,whenasinknodereceives32independentand
intactpackets,wewilldirectlysolvetheequationsanddecodethepacketsusingthescheme
describedinSection4.2.2.ThewchartoftheLEDECalgorithmthatisimplementedin
thesinknodesisshowninFigure4.13.
4.3.6.2SimulationResults
SameasinSection4.2.2.3,thesimulationsinthissectionarecarriedoutunderdt
percentageofmaliciousrelaynodes.Andthenumberofbitsthatthemaliciousnodescan
corruptineachsymbolisthensettoberandom.Oneexampleoftheparitycheck
matrixgeneratedinthelinearnetworkcodingisshowninFigure4.14.Inthisexample,the
sinknodereceives90packetsanddecodesthelinearnetworkcodeusingtheBPAalgorithm.
Thesizeofthematrixis58

90.Inthewhitesquaresrepresent0andblacksquares
represent1.Wecanseethatthismatrixisasparsematrix.
1.
SmallNumberandLargeNumberofErrors
Remark1.
Whenthenumberofbitsthatthemaliciousnodescancorruptineach
symboliseitherwithinorentirelybeyondthecapabilityoftheerrorcontrolcodeand
65
Figure4.13FlowchartoftheLEDECalgorithmimplementedinthesinknodes
Figure4.14Anexampleoftheparitycheckmatrixinnetworkcoding
66
Figure4.15Performancecomparisonforsmallnumberofmaliciousnodes
thenetworkcode,theperformanceoftheLEDECschemeisthesameasthatofthe
EDECschemeshowninFigure4.9andFigure4.10.
2.
MediumNumberofErrors
Whenthenumberoferrorsispartiallybeyondthecapabilityoferrorcontrolcode
andnetworkcode,theperformanceoftheLEDECschemeisshowninFigure4.15,
Figure4.16andFigure4.17.
Remark2.
Whenthepercentageofthemaliciousnodesislessthan20%,theperfor-
manceoftheLEDECschemeisslightlybetterthantheEDECandtheerror-detection
basedschemes.Thisisbecausethesinknodecansuccessfullydecodethecorrupted
packetsusingonlytheintactpackets.
Remark3.
Whenthepercentageofmaliciousnodesisbetween20%and60%,the
performanceoftheLEDECschemeisabout15%betterthantheEDECandtheerror-
detectionbasedschemes.Thisisbecausethesinknodescanrecoverextrainformation
fromthecorruptedpackets.
67
Figure4.16Performancecomparisonformediumnumberofmaliciousnodes
Whenthepercentageofmaliciousnodesbecomesmorethan65%,theaveragegenerat-
ingfunctionsfortherandomparity-checkmatrixare:

(
x
)=0
:
001
x
18
+0
:
0013
x
17
+
0
:
0003
x
16
+0
:
0008
x
15
+0
:
0048
x
14
+0
:
0098
x
13
+0
:
0116
x
12
+0
:
028
x
11
+0
:
0404
x
10
+
0
:
0369
x
9
+0
:
0644
x
8
+0
:
129
x
7
+0
:
0902
x
6
+0
:
076
x
5
+0
:
1318
x
4
+0
:
2119
x
3
+0
:
1364
x
2
+
0
:
0255
x
,
ˆ
(
x
)=0
:
8397
x
9
+0
:
1575
x
7
+0
:
0027
x
5
;P
max
=min
f
x

(1

ˆ
(1

x
))
gˇ
0
:
3812.
Inthisworstcase,everysinknodewillreceiveabout
N
=90packetsaccordingto
thetopology,andatmostabout
N

P
max
=0
:
3812

90
ˇ
34erasuresinevery90
packetsymbolscanberecovered.Theparametersforthebinomialdistributionis
N
=
90
;P
e
=0
:
44.Sothethroughputcanbecalculatedas
F
(34)=
P
K
=34
K
=0
Pr
(
K
)
ˇ
0
:
1.
Thisresultisveryclosetooursimulation,whichissummarizedintheRemark4.
Remark4.
Whenpercentageofthemaliciousnodesbecomesmorethan65%,the
error-detectionbasedschemesandtheEDECschemedonotworkbecausethenumber
ofthecorruptedpacketshasexceededthedecodingcapacityofthenetworkcodes.How-
ever,theLEDECschemecanstillmaintainathroughputaround8%duetothepartial
informationavailablefromthecorruptedpackets.
68
Figure4.17Performancecomparisonforlargenumberofmaliciousnodes
3.
RandomNumberofErrorsHerewestudythecasewhenthemaliciousnodesadds
randomnumberoferrorstothesymbolsinthereceivedpackets.Thenumberoferrors
mayvaryfromsmallnumberoferrorstolargenumberoferrors.
Remark5.
Forrandomerrors,althoughsomesymbolsinthecorruptedmessagecannot
becorrected,theLEDECschemecanstillrecoversymbolsusingtheLDPCdecoding
fromthecorrectablesymbolsincorruptedpackets.FromFigure.4.18wecanseethat
whilethepercentageofmaliciousnodesisbetween30%and60%,theperformanceof
theLEDECschemeisabout4%betterthantheEDECandtheerror-detectionbased
schemesonaverage.
69
Figure4.18Performancecomparisonbasedonmediumnumberofmaliciousnodes(random
numberoferrors)
70
CHAPTER5
DISTRIBUTEDSTORAGEINHOSTILENETWORKS|HERMITIAN
CODEBASEDREGENERATINGCODESAPPROACH
Inthischapter,wewillproposeHermitiancodebasedregeneratingcodes:H-MSRcode
andH-MBRcode.Theoreticalevaluationshowsthatourproposedschemescandetectthe
erroneousdecodingsandcorrectmoreerrorsinthehostilenetworkthantheRS-MSRcode
andtheRS-MBRcodewiththesamecoderaterespectively.WewillconstructtheH-MSR
codebycombiningtheHermitiancodeandregeneratingcodeattheMSRpoint,thenwe
willconstructtheH-MBRcodebycombiningtheHermitiancodeandregeneratingcodeat
theMBRpoint.WewillprovethatthesecodescanachievethetheoreticalMSRbound
andMBRboundrespectively.Wewillalsoproposedataregenerationandreconstruction
algorithmsfortheH-MSRcodeandtheH-MBRcodeinbotherror-freenetworksandhostile
networks.ThenwewillcomparetheirperformancewithRScodebasedregeneratingcodes.
5.1System/AdversarialModelsandAssumptions
Inthischapter,weassumethereisasecureserverthatisresponsibleforencodingand
distributingthedatatostoragenodes.Replacementnodeswillalsobeinitializedbythe
secureserver.DCandthesecureservercanbeimplementedinthesamecomputerandcan
neverbecompromised.WealsoassumethatDCkeepstheencodingmatrixasasecretand
eachstoragenodeonlyknowsitsownencodingvector.
Weassumesomestoragenodescanbecorruptedduetohardwarefailureorcommuni-
cationerrors,and/orbecompromisedbymalicioususers.Asaresult,uponrequest,these
nodesmaysendoutincorrectresponsetodisruptthedataregenerationandreconstruction.
Formalicioususers,theycantakefullcontrolof
˝
(
˝

n
)storagenodesandcolludeto
performattacks.
71
Wewillreferthesesymbolsas
bogus
symbolswithoutmakingdistinctionbetweenthe
corruptedsymbolsandcompromisedsymbols.Wewillalsousecorruptednodes,malicious
nodesandcompromisednodesinterchangeablywithoutmakinganydistinction.
5.2AnIllustrativeExample
Inthissection,wewillshowanexampleindistributedstorageusingpureRScodeand
Hermitiancodetoshowthestartingpointofthisresearch:theHermitiancodecancorrect
moreerrorsthantheRScodeunderthesamecoderate.
5.2.1RSCodeinDistributedStorage
Supposewehavedata
m
=(1
;
1
;
1
;
0
;
0
;
1
;
1
;
1
;
1
;
1
;
0
;
1
;
1
;
0
;
0
;
0
;
1
;
1
;
0
;
0
;
1
;
0
;
0
;
0
;
0
;
0
;
1
;
0
;
0
;
1
;
0
;
1
;
0
;
0
;
0
;
0
;
1
;
1
;
0
;
1
;
0
;
1
;
0
;
1
;
0
;
0
;
0
;
0
;
1
;
1
;
0
;
1
;
0
;
0
;
0
;
0
;
1
;
1
;
1
;
1
;
1
;
1
;
1
;
0
;
1
;
1
;
0
;
1
;
0
;
1
;
1
;
0
;
0
;
0
;
1
;
1
;
0
;
0
;
0
;
1
;
0
;
0
;
0
;
0
;
1
;
0
;
0
;
1
;
1
;
1
;
0
;
1
;
1
;
1
;
1
;
0
;
0
;
1
;
0
;
1
;
1
;
0
;
1
;
1
;
0
;
1
;
0
;
1
;
1
;
0
;
0
;
1
;
0
;
1
;
1
;
0
;
0
;
0
;
1
;
1
;
1
;
1
;
1
;
0
;
1
;
0
;
0
;
1
;
0
;
0
;
1
;
0
;
1
;
1
;
0
;
0
;
0
;
0
;
1
;
1
;
0
;
0
;
0
;
0
;
0
;
1
;
1
;
0
;
0
;
0
;
1
;
0
;
0
;
1
;
0
;
0
;
0
;
0
;
0
;
1
;
0
;
0
;
0
;
0
;
0
;
0
;
1
;
1
;
1
;
0
;
0
;
0
;
0
;
1
;
1
;
1
;
0
;
0
;
0
;
1
;
1
;
1
;
1
;
0
;
1
;
1
;
0
;
0
;
1
;
0
;
0
;
0)
;
(5.1)
tobedistributivelystoredinthedistributestoragenetwork.Ifweview
m
ascomposedof
elementsfrom
F
2
6
,then
m
canberepresentedas32symbols,eachsymbolcanberepresented
using6bits:(1
;
1
;
1
;
0
;
0
;
1),(1
;
1
;
1
;
1
;
0
;
1),
:::
,(0
;
0
;
1
;
0
;
0
;
0).Let
F
2
6
begeneratedthrough
72
expansionwiththeprimitivepolynomial
f
(
x
)=
x
6
+
x
+1over
F
2
byaddingtheroot

of
f
(
x
)to
F
2
6
.Inthisway,wehave
GF
(2
6
)=
f
0
;
1
;;
2
;
3
;
4
;
5
;
6
=

+1
;
7
=

2
+
;
8
=

3
+

2
;
9
=

4
+

3
;
10
=

5
+

4
;
11
=

5
+

+1
;
12
=

2
+1
;
13
=

3
+
;
14
=

4
+

2
;
15
=

5
+

3
;
16
=

4
+

+1
;
17
=

5
+

2
+
;
18
=

3
+

2
+

1
+1
;
19
=

4
+

3
+

2
+
;
20
=

5
+

4
+

3
+

2
;
21
=

5
+

4
+

3
+

+1
;
22
=

5
+

4
+

2
+1
;
23
=

5
+

3
+1
;
24
=

4
+1
;
25
=

5
+
;
26
=

2
+

+1
;
27
=

3
+

2
+
;
28
=

4
+

3
+

2
;
29
=

5
+

4
+

3
;
30
=

5
+

4
+

+1
;
31
=

5
+

2
+
1
;
32
=

3
+1
;
33
=

4
+
;
34
=

5
+

2
;
35
=

3
+

+1
;
36
=

4
+

2
+
;
37
=

5
+

3
+

2
;
38
=

4
+

3
+

+1
;
39
=

5
+

4
+

2
+
;
40
=

5
+

3
+

2
+

+
1
;
41
=

4
+

3
+

2
+1
;
42
=

5
+

4
+

3
+
;
43
=

5
+

4
+

2
+

+1
;
44
=

5
+

3
+

2
+1
;
45
=

4
+

3
+1
;
46
=

5
+

4
+
;
47
=

5
+

2
+

+1
;
48
=

3
+

2
+1
;
49
=

4
+

3
+
;
50
=

5
+

4
+

2
;
51
=

5
+

3
+

+1
;
52
=

4
+

2
+1
;
53
=

5
+

3
+
;
54
=

4
+

2
+

+1
;
55
=

5
+

3
+

2
+1
;
56
=

4
+

3
+

2
+

+1
;
57
=

5
+

4
+

3
+

2
+
;
58
=

5
+

4
+

3
+

2
+

+1
;
59
=

5
+

4
+

3
+

2
+1
;
60
=

5
+

4
+

3
+1
;
61
=

5
+

4
+1
;
62
=

5
+1
g
,and
m
canbe
representedas
m
=(

5
+

4
+

3
+1
;
5
+

4
+

3
+

2
+1
;
5
+

+1
;
3
;
3
+1
;
4
;
5
+

4
+

2
+1
;
4
;
5
+

4
+

2
;
3
+

2
+

+1
;
5
+

4
+

3
+

+1
;
4
+

2
+
;
3
+

2
;
4
;
5
+

2
+

+1
;
4
+

3
+

2
+
;
4
+

2
+
;
5
+

4
+

2
+1
;
5
+

2
+1
;
5
+

+1
;
5
+

4
+

3
+
;
4
+
;
5
+

4
;
5
+

4
;
4
+

3
;
5
+

2
;
2
;
+1
;
5
+1
;
5
+

4
+1
;
5
+

4
+

3
+

+1
;
3
).

g
(
x
)=

5
+

4
+

3
+1+(

5
+

4
+

3
+

2
+1)
x
+(

5
+

+1)
x
2
+

3
x
3
+(

3
+1)
x
4
+

4
x
5
+(

5
+

4
+

2
+1)
x
6
+

4
x
7
+(

5
+

4
+

2
)
x
8
+(

3
+

2
+

+1)
x
9
+(

5
+

4
+

3
+

+1)
x
10
+(

4
+

2
+

)
x
11
+(

3
+

2
)
x
12
+

4
x
13
+(

5
+

2
+

+1)
x
14
+(

4
+

3
+

2
+

)
x
15
+(

4
+

2
+

)
x
16
+(

5
+

4
+

2
+1)
x
17
+(

5
+

2
+1)
x
18
+(

5
+

+1)
x
19
+(

5
+

4
+

3
+

)
x
20
+(

4
+

)
x
21
+(

5
+

4
)
x
22
+(

5
+

4
)
x
23
+(

4
+

3
)
x
24
+(

5
+

2
)
x
25
+

2
x
26
+(

+1)
x
27
+(

5
+1)
x
28
+(

5
+

4
+1)
x
29
+(

5
+

4
+

3
+

+1)
x
30
+

3
x
31
.
73
ThenusingReed-Solomoncode,wecanencode
m
to
c
asfollows:
c
=(
g
(0)
;g
(1)
;g
(

)
;:::;g
(

62
))
=(

5
+

4
+

3
+1
;
5
+

4
+

2
+1
;
3
+

2
+1
;
2
+
;
5
+

4
+

2
+

+1
;

5
+

2
+
;
2
;
3
+

+1
;
5
+

2
;
5
+

3
+

2
+1
;
3
+

2
+
;

5
+

4
+

2
+1
;
4
+

2
+

+1
;
4
+

2
+

+1
;
4
+

3
+

+1
;
0
;
2
+
;

4
+

2
+
;
3
+1
;
5
+1
;
5
+

+1
;
4
+

3
+

2
+

+1
;
3
;
2
+

+1
;

5
+

4
+

3
;
4
+

+1
;
5
+

3
+

2
;
4
+

3
+

2
+

+1
;
3
+

2
+1
;

2
+

+1
;
5
+

2
+
;
4
+

3
+1
;
4
+

3
+

2
+

+1
;
3
+

2
+
;

5
+

3
+

+1
;
1
;
4
+

+1
;
5
+

3
+

+1
;
5
+

4
+

3
+

2
+

+1
;

3
+

2
+
;
5
+1
;
2
+
;
4
+
;
5
+

4
+

+1
;
4
+

3
+

2
;
5
+

+1
;

5
+

4
+

2
+1
;
5
+

4
+

+1
;
5
+

4
+

3
+

2
+

+1
;
4
+

3
+

2
+
;

5
+

3
+

2
+

+1
;
5
+

4
+

+1
;
5
+

3
+

2
;
5
+

4
+

3
+

2
;
2
;

5
+

4
+

2
;
5
+
;
1
;
5
+

4
+

3
+

2
;
3
+1
;
2
;
3
+

2
+
;
5
+

2
+
;

5
+

4
+

3
+

2
)
:
(5.2)
Thiscodehasparameter(64
;
32
;
33),whichmeansthecodecancorrect32erasures,or16
errors.Thecoderatiois32
=
64=1
=
2.Ifwesplitthecode
c
into8groupssothateachgroup
contains8symbols,say
f
g
(0)
;g
(1)
;g
(

)
;g
(

2
)
;g
(

3
)
;g
(

4
)
;g
(

5
)
;g
(

6
)
g
;
f
g
(

7
)
;g
(

8
)
;g
(

9
)
;
g
(

10
)
;g
(

11
)
;g
(

12
)
;g
(

13
)
;g
(

14
)
g
;
f
g
(

15
)
;g
(

16
)
;g
(

17
)
;g
(

18
)
;g
(

19
)
;g
(

20
)
;g
(

21
)
;
g
(

22
)
g
;
f
g
(

23
)
;g
(

24
)
;g
(

25
)
;g
(

26
)
;g
(

27
)
;g
(

28
)
;g
(

29
)
;g
(

30
)
g
;
f
g
(

31
)
;g
(

32
)
;g
(

33
)
;
74
g
(

34
)
;g
(

35
)
;g
(

36
)
;g
(

37
)
;g
(

38
)
g
;
f
g
(

39
)
;g
(

40
)
;g
(

41
)
;g
(

42
)
;g
(

43
)
;g
(

44
)
;
g
(

45
)
;g
(

46
)
g
;
f
g
(

47
)
;g
(

48
)
;g
(

49
)
;g
(

50
)
;g
(

51
)
;g
(

52
)
;g
(

53
)
;g
(

54
)
g
;
f
g
(

55
)
;
g
(

56
)
;g
(

57
)
;g
(

58
)
;g
(

59
)
;g
(

60
)
;g
(

61
)
;g
(

62
)
g
,thenusingLagrangeinterpolation,we
canrecovertheentiredatafromany4groupsifalltheindividualpieceareavailablewithout
corrupted.However,whenmorethan2groupsarecorrupted,themessageisnolongerrecov-
erableevenifReed-Solomonerror-decodingalgorithmisused.Inotherwords,thecorruption
levelcannotbehigherthan2
=
8=1
=
4.
5.2.2HermitianCodeinDistributedStorage
Inourpreliminaryresearch,wehavedevelopedadecodingalgorithmforHermitiancode,
whichisdesignedonthecurve
y
4
+
y
=
x
5
overthed
GF
(2
4
).Ourdecoding
algorithmcancorrecterasuresaswellaserrors,however,itcancorrectmoreerrorsthan
theReed-Solomoncodeintheaforementionedscenario,whilemaintainingtheexistingcode
ratio.WewillexplainHermitiancodeusingthenotationintroducedintheprevious
example.
Let
G
j
=

(
y
j
f
j
)(
R
0
)
;
(
y
j
f
j
)(
R
1
)
;

;
(
y
j
f
j
)(
R
q
3

1
)

;
(5.3)
where
k
(
j
)=max
f
t
j
4
t
+5
j

32
g
+1.
R
i
runsthrough(
;
5
y
0
+

j
),for

2
GF
(
q
2
)and

j
=0
;

;q

1arethesolutionstotheequation
x
q
+
x
=0.

H
m
=
G
0
G
1
G
2
G
q

1
;
(5.4)
thenwecanprovethattheparameteroftheabovecodeis(64
;
32).However,wecan
correctmorethan(
n

k
)
=
2=(64

32)
=
2=16errorsduetothespecialstructureofthe
code.
Inthefollowing,wewillpresentaschemethatcancorrect24errors.Sincethesolutions
to
y
4
+
y
=0are

0
=0
;
1
=1
;
2
=

5
;
3
=

10
,and(1
;
)isinthecurve,the
R
i
'scan
75
berepresentedthroughthefollowing
P
i;j
's:
P
0
;j
=(0
;
j
)
;P
i;j
=(

i

1
;
(
i

1)(
q
+1)+1
+

j
)
;i
=1
;
2
;

;
15
;j
=0
;
1
;
2
;
3
:
(5.5)
Considerareceivedvector
u
=(0
;
0
;
0
;
0
;;
2
;
4
;
5
;
7
;
9
;
8
;
6
;
0
;

;
0
;
5
;
10
;
4
;;

11
;
13
;;
8
;
6
;
5
;
10
;
7
;
14
;
2
;
3
;
1
;
0
;
0
;
0
;
0),where

4
=

+1.Thedecodingwill
break
u
intofourReed-Solomoncode:
r
3
=(0
;
4
;
3
;
0
;

;
0
;
0
;
2
;
5
;
2
;
0)
;
r
2
=(0
;
14
;
12
;
0
;

;
0
;
4
;
12
;
12
;
4
;
0)
;
r
1
=(0
;
5
;
13
;
0
;

;
0
;;
7
;
2
;
5
;
0)
;
r
0
=(0
;
10
;
11
;
0
;

;
0
;
12
;
12
;
3
;
4
;
0)
:
(5.6)
Decode
r
3
inusingthedecodingalgorithmofReed-Solomoncodes[89],wetheerror
vector
e
3
=(0
;
4
;
3
;
0
;

;
0
;
0
;
2
;
5
;
2
;
0)(5.7)
for
r
3
,Therefore,theerrorlocationsare
E
2
;E
3
;E
13
;E
14
;E
15
.
Replacethelocations
E
2
;E
3
;E
13
;E
14
;E
15
in
r
2
witherasures,markedas\

",weget
thevector:(0
;

;

;
0
;

;
0
;;

;

;

;
0).
Decodethisvectorandwetheerrorvector
e
2
=(0
;
14
;
12
;
0
;

;
0
;
4
;
12
;
12
;
4
;
0)(5.8)
withanewerrorlocation
E
12
.
Decode
r
1
and
r
0
,Wecantheerrorvectors
e
1
=(0
;
5
;
13
;
0
;

;
0
;;
7
;
2
;
5
;
0)
;
(5.9)
e
0
=(0
;
10
;
11
;
0
;

;
0
;
12
;
12
;
3
;
4
;
0)
:
(5.10)
76
Nowwecanreconstructtheentireerrorvectoras:
e
=(0
;
0
;
0
;
0
;;
2
;
4
;
5
;
7
;
9
;
8
;
6
;
0
;

;
0
;
5
;
10
;
4
;;

11
;
13
;;
8
;
6
;
5
;
10
;
7
;
14
;
2
;
3
;
1
;
0
;
0
;
0
;
0)
:
(5.11)
Thereforethetransmittedcodewordis
u
=(0
;
0
;
0
;

;
0
;
0)
:
(5.12)
Forthiscodescheme,ifwerepresentthebitsusingsymbolsover
GF
(2
4
),theentire
messagecanberepresentedusing64symbols.Ifwesplitthe64symbolsintogroupssothat
eachgroupcontains8symbols,thenwhennomorethan3groupsarecorrupted,wecan
thecorruptedgroupswhiletheentiremessage.Therefore,wehavethefollowing
claim.
Claim1.
TheerrorcorrectionratioforHermitiancodeis
24
=
64=3
=
8
,whichishigher
thantheReed-Solomoncodeerrorcorrectionratio
1
=
4
forthecodingratio
1
=
2
.
5.2.3Inspirationfromthisexample
Fromthisexample,wethattheHermitiancodecancorrectmoreerrorsthantheRS
codeunderthesamecoderate.However,directlyapplyingHermitiancodeintodistributed
storageisanaiveapproachlikedirectlyapplyingtheRScode.Thusweproposetocombine
theadvantagesoftheHermitiancodeandtheregeneratingcodefordistritbutedstoragein
thefollowingsections.
5.3HermitianCodeBasedMSRRegeneratingCode(H-MSRCode)
5.3.1EncodingH-MSRCode
Inthissection,wewillanalyzetheH-MSRcodebasedontheMSRpointwith
d
=2
k

2=
2

.ThecodebasedontheMSRpointwith
d>
2
k

2canbederivedthesamewaythrough
77
Figure5.1Anexampleillustrationofmatrix
S
truncatingoperations.
Let

0
;

;
q

1
beastrictlydecreasingintegersequencesatisfying0
<
i


(
i
)
;
0

i

q

1,where

i
istheparameter

fortheunderlyingregeneratingcode.Theleast
commonmultipleof

0
;

;
q

1
is
A
.Supposethedatacontains
B
=
A
P
q

1
i
=0
(

i
+1)
messagesymbolsfromthe
GF
(
q
2
).Inpractice,ifthesizeoftheactualdata
islargerthan
B
symbols,wecanfragmentitintoblocksofsize
B
andprocesseachblock
individually.
Wearrangethe
B
symbolsintotwomatrices
S;T
asbelow:
S
=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4
S
0
S
1
.
.
.
S
q

1
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
;T
=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4
T
0
T
1
.
.
.
T
q

1
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
;
(5.13)
where
S
i
=[
S
i;
1
;S
i;
2
;

;S

i
]
;
T
i
=[
T
i;
1
;T
i;
2
;

;T

i
]
:
(5.14)
78
S
i;j
;
0

i

q

1
;
1

j


i
,isasymmetricmatrixofsize

i


i
withtheupper-
triangularentriesdbydatasymbols.Thus
S
i;j
contains

i
(

i
+1)
=
2symbols.The
numberofcolumnsofeachsubmatrix
S
i
;
0

i

q

1,isthesame:

i


i
=
A
.Thesize
ofmatrix
S
is(
P
q

1
i
=0

i
)

A
.Soitcontains
P
q

1
i
=0
(

i
(

i
+1)
=
2)

i
=(
A
P
q

1
i
=0
(

i
+1))
=
2
datasymbols.Figure5.1showsanexampleofmatrix
S
for
q
=4
;
0
=6
;
1
=5
;
2
=
4
;
3
=3.In5.1,thesubmatrix
S
i;j
isrepresentedbythesquareinthecorresponding
positionwiththesizerepresentingthesizeofthesubmatrix.
T
i;j
(0

i

q

1
;
1

j


i
)isconstructedthesameas
S
i;j
.So
T
hasthesame
structureas
S
andcontainstheother(
A

P
q

1
i
=0
(

i
+1))
=
2datasymbols.
1.
ForaHermitiancode
H
m
over
GF
(
q
2
)
,weencodematrix
M
dim(
H
m
)

A
=
[
M
1
;M
2

;M
A
]
byencodingeachcolumn
M
i
;i
=1
;
2
;

;A
,individuallyusing
H
m
.The
codewordmatrixisdas
H
m
(
M
)=[
H
m
(
M
1
)
;
H
m
(
M
2
)
;

;
H
m
(
M
A
)]
;
(5.15)
where
H
m
(
M
i
)
hasthefollowingform(
%
2
L
(
mQ
)
):
[
%
(
P
0
;
0
)
;

;%
(
P
0
;q

1
)
;

;%
(
P
q
2

1
;
0
)
;

;%
(
P
q
2

1
;q

1
)]
T
;
(5.16)
andtheelementsof
M
i
areviewedasthecoofthepolynomials
f
0
(
x
)
;

;f
q

1
(
x
)
in
%
when
M
i
isencoded.
Let

i
=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4
100

0
111

1
1
˚˚
2

˚

i

1
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
1
˚
q
2

2
(
˚
q
2

2
)
2

(
˚
q
2

2
)

i

1
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
(5.17)
79
beaVandermondematrix,where
˚
istheprimitiveelementin
GF
(
q
2
)mentionedinsec-
tion2.4and0

i

q

1.

=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4

0
0

0
0

1

0
.
.
.
.
.
.
.
.
.
.
.
.
00


q
2

1
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
(5.18)
tobeadiagonalmatrixcomprisedof
q
2
elements,where

i
;
0

i

q
2

1,ischosenusing
thefollowingtwocriteria:(i)

i
6
=

j
;
8
i
6
=
j;
0

i;j

q
2

1.(ii)Any
d
i
=2

i
rowsof
thematrix
i
;


i
],0

i

q

1,arelinearlyindependent.
Wealso

i
=

i
I
(5.19)
tobea
q

q
diagonalmatrixfor0

i

q
2

1,whereIisthe
q

q
identicalmatrix.And
=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4

0
0

0
0
1

0
.
.
.
.
.
.
.
.
.
.
.
.
00


q
2

1
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
(5.20)
isa
q
3

q
3
diagonalmatrixformedby
q
2
diagonalsubmatrices
0
;

;

q
2

1
.
Fordistributedstorage,weencodeeachpairofmatrices(
S;T
)usingAlgorithm5.1.We
willnamethisencodingschemeas
Hermitian-MSRcodeencoding
,or
H-MSRcodeencoding
.
ForH-MSRcodingencoding,wehavethefollowingtheorem.
80
Algorithm5.1
EncodingH-MSRCode
Step1:
Encodethedatamatrices
S;T
inequation(5.13)usingaHermitiancode
H
m
over
GF
(
q
2
)withparameters

(
j
)(0

j

q

1)and
m
(
m

q
2

1).Denote
thegenerated
q
3

A
codewordmatricesas
H
m
(
S
)and
H
m
(
T
).
Step2:
Computethe
q
3

A
codewordmatrix
Y
=
H
m
(
S
)+
H
m
(
T
).
Step3:
Divide
Y
into
q
2
submatrices
Y
0
;

;Y
q
2

1
ofsize
q

A
andstoreeachsubmatrix
inastoragenodeasshowninFigure.5.2.
Figure5.2Illustrationofstoringthecodewordmatricesindistributedstoragenodes
Theorem5.1.
TheH-MSRcodeencodingdescribedinAlgorithm5.1canachievetheMSR
pointindistributedstorage.
Proof.
Westudythestructureofthecodewordmatrix
H
m
(
S
).Sinceeverycolumn
ofthematrixisanindependentHermitiancodeword,wecandecodethecolumn
h
=
[
h
0
;
0
;

;h
0
;q

1
;

;h
q
2

1
;
0
;

;h
q
2

1
;q

1
]
T
asanexamplewithoutlossofgenerality.We
arrangethe
q
3
rationalpointsoftheHermitiancurvefollowingtheorderinTable2.1.Inthe
table,wecanthatforeach
i;i
=0
;
1
;

;q
2

1,therationalpoints
P
i;
0
;P
i;
1
;

;P
i;q

1
allhavethesamecoordinate.
Suppose
%
2
L
(
mQ
):
%
(
P
i;l
)=
f
0
(
P
i;l
)+
y
(
P
i;l
)
f
1
(
P
i;l
)+

+(
y
(
P
i;l
))
q

1
f
q

1
(
P
i;l
)
;
0

i

q
2

1,0

l

q

1
;
deg
f
j
(
x
)=

j

1for0

j

q

1.Since
P
i;
0
;P
i;
1
;

;P
i;q

1
allhavethesamecoordinateand
f
j
(
P
i;l
)isonlyappliedtothecoordinateof
P
i;l
,
wehave
f
j
(
P
i;l
)=
f
j
(
˚
s
i
)
;s
0
=

;s
i
=
i

1
;
for
i

1
;˚

=0,whichdoesnotdepend
81
on
l
.Therefore,wecanderive
q
2
setsofequationsfor0

i

q
2

1:
8
>
>
>
>
>
>
>
>
>
>
>
>
>
>
<
>
>
>
>
>
>
>
>
>
>
>
>
>
>
:
f
0
(
˚
s
i
)+
y
(
P
i;
0
)
f
1
(
˚
s
i
)+

+(
y
(
P
i;
0
))
q

1
f
q

1
(
˚
s
i
)=
h
i;
0
f
0
(
˚
s
i
)+
y
(
P
i;
1
)
f
1
(
˚
s
i
)+

+(
y
(
P
i;
1
))
q

1
f
q

1
(
˚
s
i
)=
h
i;
1
...................................................................
f
0
(
˚
s
i
)+
y
(
P
i;q

1
)
f
1
(
˚
s
i
)+

+(
y
(
P
i;q

1
))
q

1
f
q

1
(
˚
s
i
)=
h
i;q

1
:
(5.21)
IfwestorethecodewordmatrixinstoragenodesaccordingtoFigure.5.2,eachsetofthe
equationscorrespondstoastoragenode.Aswementionedabove,thesetofequationsin
equation(5.21)canbederivedinstoragenode
i
.
Sincethecotmatrix
B
i
isaVandermondematrix:
B
i
=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4
1
y
(
P
i;
0
)

y
(
P
i;
0
)
q

1
1
y
(
P
i;
1
)

y
(
P
i;
1
)
q

1
.
.
.
.
.
.
.
.
.
.
.
.
1
y
(
P
i;q

1
)

y
(
P
i;q

1
)
q

1
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
:
(5.22)
wecansolve
u
i
=[
f
0
(
˚
s
i
)
;f
1
(
˚
s
i
)
;

;f
q

1
(
˚
s
i
)]
T
from
u
i
=
B

1
i
h
i
;
(5.23)
where
h
i
=[
h
i;
0
;h
i;
1
;

;h
i;q

1
]
T
.
Fromallthe
q
2
storagenodes,wecangetvectors
F
i
=[
f
i
(0)
;f
i
(1)
;

;f
i
(
˚
q
2

2
)]
T
;
i
=0
;

;q

1,whichcanbeviewedasextendedReed-Solomoncodes.
Nowconsiderallthecolumnsof
H
m
(
S
),wecangetthefollowingequation:

i
S
i;j
=
F
i;j
;
(5.24)
82
where
F
i;j
=[
F
(1)
i
;

;
F
(

i
)
i
],0

i

q

1,1

j


i
,and
F
(
l
)
i
correspondstothe
l
th
columnofthesubmatrix
S
i;j
.
Nextwewillconsiderthestructureofthecodewordmatrix
H
m
(
T
).Becausetheencoding
processfor
H
m
(
T
)isthesameasthatof
H
m
(
S
),for
H
m
(
T
),wecanderive

i
T
i;j
=
E
i;j
;
(5.25)
where
E
i
=[
e
i
(0)
;e
i
(1)
;

;e
i
(
˚
q
2

2
)]
T
,
E
i;j
=[
E
(1)
i
;

;
E
(

i
)
i
],0

i

q

1,1

j


i
,and
E
(
l
)
i
correspondstothe
l
th
columnofthesubmatrix
T
i;j
.
Thirdly,wewillstudytheoptimalityofthecodeinthesenseoftheMSRpoint.For

i
S
i;j
+
i
T
i;j
;
0

i

q

1
;
1

j


i
,since
S
i;j
;T
i;j
aresymmetricandsatisfy
therequirementsforMSRpointaccordingto[57]withparameters
d
=2

i
;k
=

i
+1
;
=

i
;
=1
;B
=

i

(

i
+1).Byencoding
S;T
using
H
m
(
S
)+
H
m
(
T
)anddistributing
Y
0
;

;Y
q
2

1
into
q
2
storagenodes,eachrowofthematrix
i
S
i;j
+
i
T
i;j
;
0

i

q

1
;
1

j


i
,canbederivedinacorrespondingstoragenode.Because
i
S
i;j
+
i
T
i;j
achievestheMSRpoint,datarelatedtomatrices
S
i;j
;T
i;j
;
0

i

q

1
;
1

j


i
,can
beregeneratedattheMSRpoint.Therefore,Algorithm5.1canachievetheMSRpoint.
5.3.2RegenerationoftheH-MSRCodeintheError-freeNetwork
Inthissection,wewilldiscusstheregenerationfortheH-MSRcodeintheerror-freenetwork.
Let
v
i
=[
e
0
(
˚
(
s
i
))
;e
1
(
˚
(
s
i
))
;

;e
q

1
(
˚
(
s
i
))]
T
,then
u
i
+
i
v
i
=
B

1
i
y
i
=[
f
0
(
˚
s
i
)+

i
e
0
(
˚
s
i
)
;

;f
q

1
(
˚
s
i
)+

i
e
q

1
(
˚
s
i
)]
T
;
(5.26)
foreverycolumn
y
i
of
Y
i
.
Themainideaoftheregenerationalgorithmsistoregenerate
f
l
(
˚
s
i
)+

i
e
l
(
˚
s
i
),0

l

q

1,bydownloadinghelpsymbolsfrom
d
l
=2

l
nodes,where
d
l
representstheregeneration
parameter
d
for
f
l
(
˚
s
i
)+

i
e
l
(
˚
s
i
)intheH-MSRcoderegeneration.
83
Supposenode
z
fails,wedeviseAlgorithm5.2inthenetworktoregeneratetheexact
H-MSRcodesymbolsofnode
z
inareplacementnode
z
0
.Forconvenience,wesuppose
d
q
=2

q
=0and
V
i;j;l
=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4

i;l

i
+1
;l
.
.
.

j;l
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
;
(5.27)
where

t;l
;i

t

j
,isthe
t
th
rowof
l
;

l
].Eachnode
i
,0

i

q
2

1,onlystoresits
ownencodingvector

i;l
,0

l

q

1.
First,replacementnode
z
0
willsendrequeststohelpernodesforregeneration:
z
0
sends
theinteger
j
to
d
j

d
j
+1
helpernodes,towhich
z
0
hasnotsentrequestsbefore,forevery
j
from
q

1to0indescendingorder.
Uponreceivingtherequestinteger
j
,helpernode
i
willcalculateandsendthehelp
symbolsasfollows:node
i
willcalculate
e
Y
i
=
B

1
i
Y
i
toremovethecotmatrix
B
i
fromthecodewordmatrix.Sincethe(
l
+1)
th
rowof
e
Y
i
correspondstothesymbols
relatedto
f
l
(
˚
s
i
)+

i
e
l
(
˚
s
i
),for0

l

j
,node
i
willdividethe(
l
+1)
th
rowof
e
Y
i
into

l
rowvectorsofthesize1


l
:[
~y
i;l;
1
;
~y
i;l;
2
;

;
~y
i;l
l
].Thenforevery0

l

j
and1

t


l
,node
i
willcalculatethehelpsymbol~
p
i;l;t
=
~y
i;l;t

T
z;l
,where

z;l
isthe
z
th
rowoftheencodingmatrix
l
inequation(6.2).Atlast,node
i
willsendoutall
thecalculatedsymbols~
p
i;l;t
.Here
j
indicatesthat
z
0
isrequestingsymbols~
p
i;l;t
,0

l

j
and1

t


l
,calculatedby[
f
0
(
˚
s
i
)+

i
e
0
(
˚
s
i
)
;

;f
j
(
˚
s
i
)+

i
e
j
(
˚
s
i
)]
T
Since
d
l
1
>d
l
2
for
l
1
<l
2
,foreconsideration,only
d
q

1
helpernodesneed
tosendoutsymbols~
p
i;l;t
,0

l

q

1and1

t


l
,calculatedby[
f
0
(
˚
s
i
)+

i
e
0
(
˚
s
i
)
;f
1
(
˚
s
i
)+

i
e
1
(
˚
s
i
)
;

;f
q

1
(
˚
s
i
)+

i
e
q

1
(
˚
s
i
)]
T
.Then
d
j

d
j
+1
nodesonly
needtosendoutsymbols~
p
i;l;t
,0

l

j
and1

t


l
,calculatedby[
f
0
(
˚
s
i
)+
84

i
e
0
(
˚
s
i
)
;f
1
(
˚
s
i
)+

i
e
1
(
˚
s
i
)
;

;f
j
(
˚
s
i
)+

i
e
j
(
˚
s
i
)]
T
for0

j

q

2.Inthisway,
thetotalnumberofhelpernodesthatsendoutsymbols~
p
i;l;t
,1

t


l
,calculatedby
f
l
(
˚
s
i
)+

i
e
l
(
˚
s
i
)is
d
q

1
+
P
q

2
j
=
l
(
d
j

d
j
+1
)=
d
l
.
Whenthereplacementnode
z
0
receivesalltherequestedsymbols,itcanregeneratethe
symbolsstoredinthefailednode
z
usingthefollowingalgorithm:
Algorithm5.2
z
0
RegeneratesSymbolsoftheFailedNode
z
Step1:
Forevery0

l

q

1and1

t


l
,calculatetheregeneratedsymbols
relatedtothehelpsymbols~
p
i;l;t
from
d
l
helpernodes.Withoutlossofgenerality,
weassume0

i

d
l

1:
Step1.1:
Let
p
=[~
p
0
;l;t
;
~
p
1
;l;t
;

;
~
p
d
l

1
;l;t
]
T
,solvetheequation:
V
0
;d
l

1
;l
x
=
p
.
Step1.2:
Since
x
=
2
4
S
l;t
T
l;t
3
5

T
z;l
and
S
l;t
;T
l;t
aresymmetric,wecancalculate
x
T
=[

z;l
S
l;t
;
z;l
T
l;t
].
Step1.3:
Compute
~y
z;l;t
=

z;l
S
l;t
+

z

z;l
T
l;t
=

z;l
2
4
S
l;t
T
l;t
3
5
.
Step2:
Let
e
Y
z
bea
q

A
matrixwiththe
l
th
rowas[
~y
z;l;
1
;

;
~y
z;l
l
]
;
0

l

q

1.
Step3:
Calculatetheregeneratedsymbolsofthefailednode
z
:
Y
z
0
=
Y
z
=
B
z
e
Y
z
.
FromAlgorithm5.2,wecanderivetheequivalentstorageparametersforeachsymbol
blockofsize
B
j
=
A
(

j
+1):
d
=2

j
,
k
=

j
+1,

=
A
,

=

j
;
0

j

q

1and
equation(2.7)oftheMSRpointholdsfortheseparameters.Theorem5.1guaranteesthat
Algorithm5.2canachievetheMSRpointfordataregenerationoftheH-MSRcode.
5.3.3RegenerationoftheH-MSRCodeintheHostileNetwork
Inhostilenetwork,Algorithm5.2maynotbeabletoregeneratethefailednodedueto
possiblebogussymbolsreceivedfromtheresponses.Infact,evenifthereplacementnode
85
z
0
canderivethesymbolmatrix
Y
z
0
usingAlgorithm5.2,itcannotverifythecorrectnessof
theresult.
Therearetwomodesforthehelpernodestoregeneratethecontentsofafailedstorage
nodeinhostilenetwork.Onemodeisthedetectionmode,inwhichnoerrorhasbeenfound
inthesymbolsreceivedfromthehelpernodes.Onceerrorsaredetected,therecoverymode
willbeusedtocorrecttheerrorsandlocatethemaliciousnodes.
5.3.3.1DetectionMode
Inthedetectionmode,thereplacementnode
z
0
willsendrequestsinthewaysimilartothat
oftheerror-freenetworkinSection5.3.2.Theonlyisthatwhen
j
=
q

1,
z
0
sendsrequeststo
d
q

1

d
q
+1nodesinsteadof
d
q

1

d
q
nodes.Helpernodeswillstilluse
thewaysimilartothatoftheerror-freenetworkinSection5.3.2tosendthehelpsymbols.
TheregenerationalgorithmisdescribedinAlgorithm5.3withthedetectionprobability
characterizedinTheorem5.2.
Lemma1.
Suppose
e
0
;

;e
d
l
arethe
d
l
+1
errors
e
0
;l;t
;

;e
d
l
;l;t
inAlgorithm5.3,
^
x
1
=
V

1
0
;d
l

1
;l

[
e
0
;

;e
d
l

1
]
T
and
^
x
2
=
V

1
1
;d
l
;l

[
e
1
;

;e
d
l
]
T
.Whenthenumberofmalicious
nodesinthe
d
l
+1
helpernodesofAlgorithm5.3islessthan
d
l
+1
,theprobabilitythat
^
x
1
=
^
x
2
isatmost
1
=q
2
.
Proof.
Since
V
0
;d
l

1
;l
and
V
1
;d
l
;l
arefullrankmatrices,wecangettheircorresponding
inversematrices.
^
x
1
=
^
x
2
isequivalentto
V
0
;d
l

1
;l

^
x
1
=
V
0
;d
l

1
;l

^
x
2
.
First,wehave
V
0
;d
l

1
;l

^
x
1
=[
e
0
;e
1
;

;e
d
l

1
]
T
:
(5.28)
86
Algorithm5.3
[DetectionMode]
z
0
RegeneratesSymbolsoftheFailedNode
z
inHostile
Network
Step1:
Forevery0

l

q

1and1

t


l
,wecancalculatetheregeneratedsymbols
whicharerelatedtothehelpsymbols~
p
0
i;l;t
from
d
l
helpernodes.~
p
0
i;l;t
=~
p
i;l;t
+
e
i;l;t
istheresponsefromthe
i
th
helpernode.If~
p
i;l;t
hasbeenmobythemalicious
node
i
,wehave
e
i;l;t
2
GF
(
q
2
)
nf
0
g
.Otherwisewehave
e
i;l;t
=0.Todetect
whetherthereareerrors,wewillcalculatesymbolsfromtwosetsofhelpernodes
thencomparetheresults.(Withoutlossofgenerality,weassume0

i

d
l
.)
Step1.1:
Let
p
1
0
=[~
p
0
0
;l;t
;
~
p
0
1
;l;t
;

;
~
p
0
d
l

1
;l;t
]
T
,wherethesymbolsarecollected
fromnode0tonode
d
l

1,solvetheequation
V
0
;d
l

1
;l
x
1
=
p
1
0
.
Step1.2:
Let
p
2
0
=[~
p
0
1
;l;t
;
~
p
0
2
;l;t
;

;
~
p
0
d
l
;l;t
]
T
,wherethesymbolsarecollectedfrom
node1tonode
d
l
,solvetheequation
V
1
;d
l
;l
x
2
=
p
2
0
.
Step1.3:
Compare
x
1
with
x
2
.Iftheyarethesame,compute
~y
z;l;t
=

z;l
S
l;t
+

z

z;l
T
l;t
asdescribedinAlgorithm5.2.Otherwise,errorsaredetectedinthehelp
symbols.Exitthealgorithmandswitchtorecoveryregenerationmode.
Step2:
Noerrorhasbeendetectedforthecalculatingoftheregenerationsofar.Let
e
Y
z
be
a
q

A
matrixwiththe
l
th
rowas[
~y
z;l;
1
;

;
~y
z;l
l
]
;
0

l

q

1.
Step3:
Calculatetheregeneratedsymbolsofthefailednode
z
:
Y
z
0
=
Y
z
=
B
z
e
Y
z
.
Suppose
V

1
1
;d
l
;l
=[

0
;
1
;

;
d
l

1
],thenwehave:

r;l


s
=
8
>
>
>
>
<
>
>
>
>
:
1
;r
=
s
+1
0
;r
6
=
s
+1
;
1

r

d
l
;
0

s

d
l

1
:
(5.29)
V
0
;d
l

1
;l

^
x
2
=
V
0
;d
l

1
;l

V

1
1
;d
l
;l

[
e
1
;e
2
;

;e
d
l
]
T
=
V
0
;d
l

1
;l

[

0
;
1
;

;
d
l

1
]

[
e
1
;e
2
;

;e
d
l
]
T
(5.30)
=[
x
2
;
0
;e
1
;

;e
d
l

1
]
T
:
Tocalculate
x
2
;
0
,wederivetheexpressionof

0
;l
.Because

1
;l
;
2
;l
;

;
d
l
;l
are
linearlyindependent,theycanbeviewedasasetofbasesofthe
d
l
dimensionallinearspace.
87
Sowehave

0
;l
=
r
=
d
l
X
r
=1

r


r;l
;
(5.31)
where

r
6
=0
;r
=1
;

;d
l
,becauseany
d
l
vectorsoutof

0
;l
;
1
;l
;

;
d
l
;l
arelinearly
independent.Then
x
2
;
0
=
0
@
r
=
d
l
X
r
=1

r


r;l
1
A
[

0
;
1
;

;
d
l

1
][
e
1
;e
2
;

;e
d
l
]
T
=
r
=
d
l
X
r
=1

r

e
r
:
(5.32)
If
e
0
=
r
=
d
l
X
r
=1

r

e
r
;
(5.33)
then
V
0
;d
l

1
;l

^
x
1
=
V
0
;d
l

1
;l

^
x
2
and
^
x
1
=
^
x
2
.
Whenonlyoneelementof
e
0
;e
1
;

;e
d
l
isnonzero,since

1
;

;
d
l
areallnonzero,
equation(5.33)willneverhold.Inthiscase,theprobabilityis0.Whentherearemorethan
onenonzeroelements,itmeanstherearemorethanonemaliciousnodes.Ifthenumberof
maliciousnodesislessthan
d
l
+1,theywillnotbeabletocolludetosolvethecots

r
in(5.31).Theprobabilitythatequation(5.33)holdswillbe1
=q
2
.
Theorem5.2.
Whenthenumberofmaliciousnodesinthe
d
l
+1
helpernodesofAlgo-
rithm5.3islessthan
d
l
+1
,theprobabilityforthebogussymbolssentfromthemalicious
nodestobedetectedisatleast
1

1
=q
2
.
Proof.
Since
V
0
;d
l

1
;l
and
V
1
;d
l
;l
arefullrankmatrices,
x
1
canbecalculatedby(Forcon-
venience,use
e
i
torepresent
e
i;l;t
):
x
1
=
V

1
0
;d
l

1
;l

"
~
p
0
;l;t
+
e
0
;

;
~
p
d
l

1
;l;t
+
e
d
l

1
#
T
=
x
+
V

1
0
;d
l

1
;l

[
e
0
;e
1
;

;e
d
l

1
]
T
=
x
+
^
x
1
:
(5.34)
88
x
2
canbecalculatedthesameway:
x
2
=
x
+
V

1
1
;d
l
;l

[
e
1
;e
2
;

;e
d
l
]
T
=
x
+
^
x
2
:
(5.35)
If
^
x
1
=
^
x
2
,Algorithm5.3willfailtodetecttheerrors.Sowewillfocusontherelationship
between
^
x
1
and
^
x
2
.AccordingtoLemma1,whenthenumberofmaliciousnodesinthe
d
l
+1helpernodesislessthan
d
l
+1,theprobabilitythat
^
x
1
=
^
x
2
isatmost1
=q
2
.Sothe
probabilitythat
x
1
6
=
x
2
,equivalentlythedetectionprobability,isatleast1

1
=q
2
.
5.3.3.2RecoveryMode
Oncethereplacementnode
z
0
detectserrorsusingAlgorithm5.3,itwillsendinteger
j
=
q

1
toalltheother
q
2

1nodesinthenetworkrequestinghelpsymbols.Helpernode
i
willsend
helpsymbolssimilartoSection5.3.2.
z
0
canregeneratesymbolsusingAlgorithm5.4.
5.3.4ReconstructionoftheH-MSRCodeintheError-freeNetwork
HerewewilldiscussthereconstructionoftheH-MSRcodeintheerror-freenetwork.The
mainideaofthereconstructionalgorithmsistoreconstruct
f
l
(
˚
s
i
)+

i
e
l
(
˚
s
i
),0

l

q

1,
bydownloadinghelpsymbolsfrom
k
l
=

l
+1nodes,where
k
l
isusedtorepresentthe
reconstructionparameter
k
for
f
l
(
˚
s
i
)+

i
e
l
(
˚
s
i
)intheH-MSRcodereconstruction.We
deviseAlgorithm5.5inthenetworkforthedatacollectorDCtoreconstructtheoriginal
Forconvenience,wesuppose

q
=0.
First,DCwillsendrequeststothestoragenodesforreconstruction:DCsendsinteger
j
to
k
j

k
j
+1
helpernodes,towhichDChasnotsentrequestsbefore,forevery
j
from
q

1
to0indescendingorder.
Uponreceivingtherequestinteger
j
,node
i
willcalculateandsendsymbolsasfollows:
node
i
willcalculate
~
Y
i
=
B

1
i
Y
i
toremovethecotmatrix
B
i
fromthecodeword
matrix.Sincethe(
l
+1)
th
rowof
~
Y
i
correspondstothesymbolsrelatedto
f
l
(
˚
s
i
)+

i
e
l
(
˚
s
i
),
for0

l

j
,node
i
willsendoutthe(
l
+1)
th
rowof
~
Y
i
:
~y
i;l
.Here
j
indicatesthatDC
89
Algorithm5.4
[RecoveryMode]
z
0
RegeneratesSymbolsoftheFailedNode
z
inHostile
Network
Step1:
Forevery
q

1

l

0indescendingorderand1

t


l
inascendingorder,
wecanregeneratethesymbolswhentheerrorsinthereceivedhelpsymbols~
p
0
i;l;t
from
q
2

1helpernodescanbecorrected.Withoutlossofgenerality,weassume
0

i

q
2

2.
Step1.1:
Let
p
0
=[~
p
0
0
;l;t
;
~
p
0
1
;l;t
;

;
~
p
0
q
2

2
;l;t
]
T
.Since
V
0
;q
2

2
;l

x
=
p
0
,
p
0
canbe
viewedasanMDScodewithparameters(
q
2

1
;d
l
;q
2

d
l
).
Step1.2:
Substitute~
p
0
i;l;t
in
p
0
withthesymbol

representinganerasureifnode
i
hasbeendetectedtobecorruptedinthepreviousloops(previousvaluesof
l;t
).
Step1.3:
Ifthenumberoferasuresin
p
0
islargerthanmin
f
q
2

d
l

1
;
b
(
q
2

d
q

1

1)
=
2
cg
,thenthenumberoferrorshaveexceededtheerrorcorrectioncapability.So
herewewillthedecodingfailureandexitthealgorithm.
Step1.4:
Sincethenumberoferrorsiswithintheerrorcorrectioncapabilityof
theMDScode,decode
p
0
to
p
0
cw
andsolve
x
.
Step1.5:
Ifthe
i
th
positionsymbolsof
p
0
cw
and
p
0
aredit,marknode
i
as
corrupted.
Step1.6:
Compute
~y
z;l;t
=

z;l

S
l;t
+

z


z;l

T
l;t
asdescribedinAlgorithm5.2.
Step2:
Let
e
Y
z
bea
q

A
matrixwiththe
l
th
rowas[
~y
z;l;
1
;

;
~y
z;l
l
]
;
0

l

q

1.
Step3:
Calculatetheregeneratedsymbolsofthefailednode
z
:
Y
z
0
=
Y
z
=
B
z
e
Y
z
.
isrequestingsymbolsof
~y
i;l
,0

l

j
,calculatedby[
f
0
(
˚
s
i
)+

i
e
0
(
˚
s
i
)
;

;f
j
(
˚
s
i
)+

i
e
j
(
˚
s
i
)]
T
.
Since
k
l
1
>k
l
2
for
l
1
<l
2
,forconsideration,only
k
q

1
helpernodesneed
tosendoutsymbolsof
~y
i;l
,0

l

q

1,calculatedby[
f
0
(
˚
s
i
)+

i
e
0
(
˚
s
i
)
;f
1
(
˚
s
i
)+

i
e
1
(
˚
s
i
)
;

;f
q

1
(
˚
s
i
)+

i
e
q

1
(
˚
s
i
)]
T
.Then
k
j

k
j
+1
nodesonlyneedtosendout
symbolsof
~y
i;l
,0

l

j
,calculatedby[
f
0
(
˚
s
i
)+

i
e
0
(
˚
s
i
)
;f
1
(
˚
s
i
)+

i
e
1
(
˚
s
i
)
;

;f
j
(
˚
s
i
)+

i
e
j
(
˚
s
i
)]
T
for0

j

q

2.Inthisway,thetotalnumberofhelpernodesthatsendout
symbolsof
~y
i;l
calculatedby
f
l
(
˚
s
i
)+

i
e
l
(
˚
s
i
)is
k
q

1
+
P
q

2
j
=
l
(
k
j

k
j
+1
)=
k
l
.
WhenDCreceivesalltherequestedsymbols,itcanreconstructtheoriginalusingthe
90
followingalgorithm:
Algorithm5.5
DCReconstructstheOriginalFile
Step1:
Forevery0

l

q

1,dividetheresponsesymbolvector
~y
i;l
fromthe
i
th
node
into

l
equalrowvectors:[
~y
i;l;
1
;
~y
i;l;
2
;

;
~y
i;l
l
],0

i

k
l

1.
Step2:
Forevery0

l

q

1and1

t


l
,DCreconstructsthematricesrelatedto
theoriginalle:
Step2.1:
Let
R
=[
~y
T
0
;l;t
;
~y
T
1
;l;t
;

;
~y
T
k
l

1
;l;t
]
T
,wehavetheequation:
V
0
;k
l

1
;l

2
4
S
l;t
T
l;t
3
5
=
R
accordingtotheencodingalgorithm.
Step2.2:
DCreconstructs
S
l;t
;T
l;t
usingtechniquessimilarto[57].
Step3:
DCreconstructstheoriginalefromallthematrices
S
l;t
;T
l;t
,0

l

q

1and
1

t


l
.
5.3.5ReconstructionoftheH-MSRCodeintheHostileNetwork
Similartotheregenerationalgorithms,thereconstructionalgorithmsinerror-freenetwork
donotworkinhostilenetwork.Evenifthedatacollectercancalculatethesymbolmatrices
S;T
usingAlgorithm5.5,itcannotverifywhethertheresultiscorrectornot.Therearetwo
modesfortheoriginaltobereconstructedinhostilenetwork.Onemodeisthedetection
mode,inwhichnoerrorhasbeenfoundinthesymbolsreceivedfromthestoragenodes.
Onceerrorsaredetectedinthedetectionmode,therecoverymodewillbeusedtocorrect
theerrorsandlocatethemaliciousnodes.
5.3.5.1DetectionMode
Inthedetectionmode,DCwillsendrequestsinthewaysimilartothatfortheerror-free
networkinSection5.3.4.Theonlyceisthatwhen
j
=
q

1,DCwillsendrequeststo
k
q

1

k
q
+1nodesinsteadof
k
q

1

k
q
nodes.Storagenodeswillstillusethewaysimilarto
91
thatfortheerror-freenetworkinSection5.3.4tosendsymbols.Thereconstructionalgorithm
isdescribedinAlgorithm5.6withthedetectionprobabilitydescribedinTheorem5.3.
Algorithm5.6
[Detectionmode]DCReconstructstheOriginalFileinHostileNetwork
Step1:
Forevery0

l

q

1,wecandividethesymbolvector
~y
0
i;l
into

l
equal
rowvectors:[
~y
0
i;l;
1
;
~y
0
i;l;
2
;

;
~y
0
i;l
l
].
~y
0
i;l
=
~y
i;l
+
e
i;l
istheresponsefrom
the
i
th
storagenode.If
~y
i;l
hasbeenmobythemaliciousnode
i
,wehave
e
i;l
2
(
GF
(
q
2
))
A
nf
0
g
.Todetectwhetherthereareerrors,wewillreconstructthe
originalfromtwosetsofstoragenodesthencomparetheresults.(Withoutloss
ofgenerality,weassume0

i

k
l
.)
Step2:
Forevery0

l

q

1and1

t


l
,DCcanreconstructthematricesrelated
totheoriginal
Step2.1:
Let
R
0
=[
~y
0
T
0
;l;t
;
~y
0
T
1
;l;t
;

;
~y
0
T
k
l
;l;t
]
T
.
Step2.2:
Let
R
1
0
=[
~y
0
T
0
;l;t
;
~y
0
T
1
;l;t
;

;
~y
0
T

l
;l;t
]
T
,whicharethesymbolscollected
fromnode0tonode
k
l

1=

l
,thenwehave
V
0

l
;l

2
4
S
1
T
1
3
5
=
R
1
0
.Solve
S
1
;T
1
usingthemethodsametoalgorithm5.5.
Step2.3:
Let
R
2
0
=[
~y
0
T
0
;l;t
;

;
~y
0
T

l

1
;l;t
;
~y
0
T

l
+1
;l;t
]
T
,whicharethesymbolscol-
lectedfromnode0tonode
k
l
=

l
+1exceptnode

l
,and
DC
2
=
2
6
6
6
6
6
6
4

0
;l
.
.
.


l

1
;l


l
+1
;l
3
7
7
7
7
7
7
5
,then
wehave
DC
2

2
4
S
2
T
2
3
5
=
R
2
0
.Solve
S
2
;T
2
usingthemethodsametoalgorithm5.5.
Step2.4:
Compare[
S
1
;T
1
]with[
S
2
;T
2
].Iftheyarethesame,let[
S
l;t
;T
l;t
]=
[
S
1
;T
1
].Otherwise,errorsaredetectedinthereceivedsymbols.Exitthealgorithm
andswitchtorecoveryreconstructionmode.
Step3:
Noerrorhasbeendetectedforthecalculatingofthereconstructionsofar.SoDC
canreconstructtheoriginalfromallthematrices
S
l;t
;T
l;t
,0

l

q

1and
1

t


l
.
92
Theorem5.3.
Whenthenumberofmaliciousnodesinthe
k
l
+1
nodesofAlgorithm5.6
islessthan
k
l
+1
,theprobabilityforthebogussymbolssentfromthemaliciousnodestobe
detectedisatleast
1

(1
=q
2
)
2(

l

2)
.
Proof.
Wearrangethisproofasfollows.Wewillstudytherequirementsfor
S
1
=
S
2
;T
1
=
T
2
inAlgorithm5.6whichwillleadtothefailureoftheAlgorithmwhenthere
arebogussymbols.Thenwewillstudythecorrespondingfailureprobabilitiesdependingon
tvaluesof

i
ofthematrixinsection5.3.1.
Forconveniencewewrite
e
i;l;t
as
e
i
intheproof.
e
i
2
[
GF
(
q
2
)]

l
for0

i


l
+1.We
alsowrite
DC
=
DC
;

DC


DC
],where
DC
=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4

0

1
.
.
.

k
l

1
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
and

i
represents

i;l
which
isthe
i
th
rowoftheencodingmatrix
l
insection5.3.1.
Step1.Derivetherequirements
For
R
1
0
=
R
1
+
W
1
inAlgorithm5.6,wehave:

DC
1
S
1

T
DC
1
+
DC
1

DC
1
T
1

T
DC
1
=
R
1

T
DC
1
+
W
1

T
DC
1
;
(5.36)
where
DC
1
=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4

0

1
.
.
.


l
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
,
W
1
=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4
e
0
e
1
.
.
.
e

l
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
.Suppose
C
1
=
DC
1
S
1

T
DC
1
;D
1
=
DC
1
T
1

T
DC
1
,
wecanwriteequation(5.36)as:
C
1
+
DC
1
D
1
=
R
1

T
DC
1
+
W
1

T
DC
1
=
^
R
1
+
^
W
1
:
(5.37)
93
Itiseasytoseethat
C
1
and
D
1
aresymmetric,sowehave
8
>
>
>
>
<
>
>
>
>
:
C
1
;i;j
+

i

D
1
;i;j
=
^
R
1
;i;j
+
^
W
1
;i;j
C
1
;i;j
+

j

D
1
;i;j
=
^
R
1
;j;i
+
^
W
1
;j;i
;
(5.38)
where
C
1
;i;j
;D
1
;i;j
;
^
R
1
;i;j
;
^
W
1
;i;j
aretheelementsinthe
i
th
row,
j
th
columnof
C
1
;D
1
;
^
R
1
;
^
W
1
respectively.Solveequation(5.38)forallthe
i;j
(
i
6
=
j;
0

i


l
;
0

j


l

1),wecan
getthecorresponding
C
1
;i;j
;D
1
;i;j
.Becausethestructureof
C
1
and
D
1
arethesame,wewill
onlyfocuson
C
1
(correspondingto
S
1
)intheproof.Thecalculationfor
D
1
(corresponding
to
T
1
)isthesame.

DC
1
S
1

T
DC
1
=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4

0

1
.
.
.


l
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5

S
1

[

T
0
;
T
1
;

;
T

l
]=
C
1
:
(5.39)
Sotheelementsofthe
i
th
rowof
C
1
(excepttheelementinthediagonalposition)canbe
writtenas:

i

S
1

[

T
0
;

;
T
i

1
;
T
i
+1

;
T

l
]=[
C
1
;i;
0
;

;C
1
;i;i

1
;C
1
;i;i
+1
;

;C
1

l
]
:
(5.40)
94
Let=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4

0

1
.
.
.


l

1
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
,thenisan

l


l
fullrankmatrix,andwecanderive
S
1
from


S
1
=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4
[
C
1
;
0
;
1
;C
1
;
0
;
2
;

;C
1
;
0

l
][

T
1
;
T
2
;

;
T

l
]

1
[
C
1
;
1
;
0
;C
1
;
1
;
2
;

;C
1
;
1

l
][

T
0
;
T
2
;

;
T

l
]

1

[
C
1

l

1
;
0
;C
1

l

1
;
1
;

;C
1

l

1

l
][

T
0
;
T
1
;

;
T

l
]

1
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
:
(5.41)
For
R
2
0
=
R
2
+
W
2
inAlgorithm5.6,wecanget

S
2
thesameway.If

S
1
=

S
2
,
Algorithm5.6willfailtodetecttheerrors.Thiswillhappenifalltherowsof

S
1
and


S
2
arethesame.Sowewillfocusonthe
i
th
rowof

S
1
and

S
2
.
Step2.Calculatethefailureprobabilities
Dependingonthevaluesof

i
,wediscuss
twocases:
(a)Ifnoneofthe

i
(0

i


l
)equalsto0,wecansolve
C
1
;i;j
inequation(5.38):
C
1
;i;j
=

j

^
R
1
;i;j


i

^
R
1
;j;i

i


j
+
e
i


T
j

i

e
j


T
i

j
=
N
1
;i;j
+
Q
1
;i;j
:
(5.42)
Inequation(5.42),
N
1
;i;j
representstheoriginalsolutionwithouterrors,while
Q
1
;i;j
repre-
95
sentstheimpactoftheerrors.Sothe
i
th
rowof

S
1
canbewrittenas:
[
C
1
;i;
0
;

;C
1
;i;i

1
;C
1
;i;i
+1
;

;C
1

l
]


1
1
;i
=[
N
1
;i;
0
;

;N
1
;i;i

1
;N
1
;i;i
+1
;

;N
1

l
]


1
1
;i
+[
Q
1
;i;
0
;

;Q
1
;i;i

1
;Q
1
;i;i
+1
;

;Q
1

l
]


1
1
;i
(5.43)
=
˘
i
+

1
;i
;
where
1
;i
=[

T
0
;

;
T
i

1
;
T
i
+1
;

;
T

l
].
˘
i
correspondstothepartindependentofthe
errors.

1
;i
istheerrorpartandcanbefurtherexpandedas:

1
;i
=
"
e
i


T
0

i
;

;
e
i


T
i

1

i
;
e
i


T
i
+1

i
;

;
e
i


T

l

i
#


1
1
;i

"
e
0


T
i

0
;

;
e
i

1


T
i

i

1
;
e
i
+1


T
i

i
+1
;

;
e

l


T
i


l
#


1
1
;i
:
(5.44)
Thepartofequation(5.44)canbereducedasfollows:
"
e
i


T
0

i
;

;
e
i


T
i

1

i
;
e
i


T
i
+1

i
;

;
e
i


T

l

i
#


1
1
;i
=
e
i

i

h

T
0
;

;
T
i

1
;
T
i
+1
;

;
T

l
i


1
1
;i
(5.45)
=
e
i

i
:
Sowehave:

1
;i
=
e
i

i

"
e
0


T
i

0
;

;
e
i

1


T
i

i

1
;
e
i
+1


T
i

i
+1
;

;
e

l


T
i


l
#


1
1
;i
=
e
i

i

ˆ
1
;i
:
(5.46)
For
R
2
0
=
R
2
+
W
2
inAlgorithm5.6where
W
2
=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4
e
0
.
.
.
e

l

1
e

l
+1
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
,wecanderive
C
2
;i;j
,then
96


S
2
thesameway.The
i
th
rowof

S
2
canbewrittenas:
˘
i
+

2
;i
=
˘
i
+
e
i

i

ˆ
2
;i
;
(5.47)
where
ˆ
2
;i
=
"
e
0


T
i

0
;

;
e
i

1


T
i

i

1
;
e
i
+1


T
i

i
+1
;

;
e

l

1


T
i


l

1
;
e

l
+1


T
i


l
+1
#


1
2
;i
,
2
;i
=[

T
0
;

;
T
i

1
;
T
i
+1
,

;
T

l

1
;
T

l
+1
].
Because
1
;i
isafullrankmatrix,
ˆ
1
;i
=
ˆ
2
;i
isequivalentto
ˆ
1
;i


1
;i
=
ˆ
2
;i


1
;i
.
SimilartotheproofofLemma1,suppose

1
2
;i
=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4

0
.
.
.


l

1


l
+1
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
,wehave

s


T
r
=
8
>
>
>
>
<
>
>
>
>
:
1
r
=
s
0
r
6
=
s
.So
ˆ
1
;i


1
;i
=
"

;
e
i

1


T
i

i

1
;
e
i
+1


T
i

i
+1
;

;
e

l

1


T
i


l

1
;
e

l


T
i


l
#
;
(5.48)
ˆ
2
;i


1
;i
=
"

;
e
i

1


T
i

i

1
;
e
i
+1


T
i

i
+1
;

;
e

l

1


T
i


l

1
;x
2

l
#
:
(5.49)
Because

T
0
;

;
T
i

1
;
T
i
+1
;

;
T

l

1
;
T

l
+1
arelinearlyindependent,theycanbeviewed
asasetofbasesofthe

l
dimensionallinearspace.Sowehave

T

l
=
r
=

l
+1
X
r
=0
;r
6
=

l

r


T
r
:
(5.50)
Thus
x
2

l
=
"

;
e
i

1


T
i

i

1
;
e
i
+1


T
i

i
+1
;

;
e

l

1


T
i


l

1
;
e

l
+1


T
i


l
+1
#


1
2
;i

0
@
r
=

l
+1
X
r
=0
;r
6
=

l

r


T
r
1
A
=
0
@
r
=

l
+1
X
r
=0
;r
6
=

l

r

e
r


T
i

r
1
A
:
(5.51)
If
e

l


T
i


l
=
r
=

l
+1
X
r
=0
;r
6
=

l

r

e
r


T
i

r
(0

i


l

1)
;
(5.52)
97
ˆ
1
;i
and
ˆ
2
;i
willbeequal,soare

S
1
and

S
2
.Therefore,Algorithm5.6willfail.
Fortheerror
e
i
(0

i


l
+1),thefollowingequationholds:
e
i

[

T
0
;
T
1
;

;
T

l

1
]=[^
e
i;
0
;
^
e
i;
1
;

;
^
e

l

1
]=
^
e
i
:
(5.53)
Because[

T
0
;
T
1
;

;
T

l

1
]isafullrankmatrix,thereisaone-to-onemappingbetween
e
i
and
^
e
i
.Equation(5.52)canbewrittenas:
^
e

l
;i


l
=
r
=

l
+1
X
r
=0
;r
6
=

l

r

^
e
r;i

r
(0

i


l

1)
:
(5.54)
Whenthenumberofmaliciousnodesinthe
k
l
+1nodesislessthan
k
l
+1,themalicious
nodescancolludetosatisfyequation(5.54)foratmostoneparticular
i
.Sotheprobability
thatequation(5.54)holdsis1
=q
2
foratleast

l

1outof

l
i
0
s
between0and

l

1.
Ifweconsiderequation(5.54)forallthe
i
0
s
simultaneously,theprobabilitywillbeatmost
(1
=q
2
)

l

1
.Aswehavementionedabove,theprobabilityfor
T
1
=
T
2
willbeatmost
(1
=q
2
)

l

1
.Inthiscase,thedetectionprobabilityisatleast1

(1
=q
2
)
2(

l

1)
.
(b)Ifoneofthe

i
(0

i


l
)equalsto0,wecanassume

0
=0withoutlossof
generality.When
i
=0,thesolutionforequation(5.38)is:
C
1
;
0
;j
=
^
R
1
;
0
;j
+
e
0


T
j
=
N
1
;
0
;j
+
Q
1
;
0
;j
:
(5.55)
Similartoequations(5.43),(5.44)and(5.45),wehave

1
;
0
=
e
0
.For
R
2
0
=
R
2
+
W
2
,itis
easytoseethat

2
;
0
=
e
0
.Sotherowsof

S
1
and

S
2
arethesamenomatterwhat
theerrorvector
e
0
is.
When
i>
0
;j
=0,thesolutionforequation(5.38)is:
C
1
;i;
0
=
^
R
1
;i;
0
+
0


T
0
+
e
0


T
i
=
N
1
;i;
0
+
Q
1
;i;
0
;
(5.56)
where
0
isazerorowvector.When
i>
0
;j>
0,thesolutionhasthesameexpressionas
equation(5.42).Inthiscase,forthe
i
th
(
i>
0)rowof

S
1
,equation(5.44)canbewritten
98
as:

1
;i
=
"
0
;

;
e
i


T
i

1

i
;
e
i


T
i
+1

i
;

;
e
i


T

l

i
#


1
1
;i

"

e
0


T
i
;

;
e
i

1


T
i

i

1
;
e
i
+1


T
i

i
+1
;

;
e

l


T
i


l
#


1
1
;i
:
(5.57)
Thepartofequation(5.57)canbedividedintotwoparts:
"
e
i


T
0

i
;

;
e
i


T
i

1

i
;
e
i


T
i
+1

i
;

;
e
i


T

l

i
#


1
1
;i

"
e
i


T
0

i
;
0
;

;
0
#


1
1
;i
=
e
i

i

e
i

i

[

T
0
;
0
;

;
0
]


1
1
;i
:
(5.58)
Soequation(5.57)canbefurtherwrittenas:

1
;i
=
e
i

i

"
e
i


T
0

i

e
0


T
i
;

;
e
i

1


T
i

i

1
;
e
i
+1


T
i

i
+1
;

;
e

l


T
i


l
#


1
1
;i
=
e
i

i

ˆ
1
;i
:
(5.59)
Byemployingthesamederivationincase(a),for1

i


l

1,
ˆ
1
;i
and
ˆ
2
;i
willbe
equalif
e

l


T
i


l
=
r
=

l
+1
X
r
=1
;r
6
=

l

r

e
r


T
i

r


0

e
0


T
i
+

0

e
i


T
0

i
;
(5.60)
^
e

l
;i


l
=
r
=

l
+1
X
r
=1
;r
6
=

l

r

^
e
r;i

r


0

^
e
0
;i
+

0

^
e
i;
0

i
:
(5.61)
Whenthenumberofmaliciousnodesinthe
k
l
+1nodesislessthan
k
l
+1,forthesamereason
asincase(a),theprobabilitythatequation(5.61)holdsis1
=q
2
foratleast

l

2outof

l

1
i
0
s
between1and

l

1.Ifweconsiderequation(5.61)forallthe
i
0
s
simultaneously,the
probabilitywillbeatmost(1
=q
2
)

l

2
.Heretheprobabilityfor
T
1
=
T
2
willbe(1
=q
2
)

l

2
.
Inthiscase,thedetectionprobabilityis1

(1
=q
2
)
2(

l

2)
.
Combiningbothcases,thedetectionprobabilityisatleast1

(1
=q
2
)
2(

l

2)
.
99
5.3.5.2RecoveryMode
OnceDCdetectserrorsusingAlgorithm5.6,itwillsendinteger
j
=
q

1toallthe
q
2
nodesinthenetworkrequestingsymbols.Storagenodeswillstillusethewaysimilartothat
oftheerror-freenetworkinSection5.3.4tosendsymbols.Thereconstructproceduresare
describedinAlgorithm5.7.
Algorithm5.7
[RecoveryMode]DCReconstructstheOriginalFileinHostileNetwork
Step1:
Forevery0

l

q

1,wedividethesymbolvector
~y
0
i;l
into

l
equalrowvectors:
[
~y
0
i;l;
1
;
~y
0
i;l;
2
;

;
~y
0
i;l
l
].(Withoutlossofgenerality,weassume0

i

q
2

1.)
Step2:
Forevery
q

1

l

0indescendingorderand1

t


l
inascendingorder,
DCcanreconstructthematricesrelatedtotheoriginalwhentheerrorsinthe
receivedsymbolvectors
~y
0
i;l;t
from
q
2
storagenodescanbecorrected:
Step2.1:
Let
R
0
=[
~y
0
T
0
;l;t
;
~y
0
T
1
;l;t
;

;
~y
0
T
q
2

1
;l;t
]
T
.
Step2.2:
Ifthenumberofcorruptednodesdetectedislargerthanmin
f
q
2

k
l
;
b
(
q
2

k
q

1
)
=
2
cg
,thenthenumberoferrorshaveexceededtheerrorcorrection
capability.Wewillthedecodingfailureandexitthealgorithm.
Step2.3:
Sincethenumberoferrorsiswithintheerrorcorrectioncapabilityof
theH-MSRcode,substitute
~y
0
i;l;t
in
R
0
withthesymbol

representinganerasure
vectorifnode
i
hasbeendetectedtobecorruptedinthepreviousloops(previous
valuesof
l;t
).
Step2.4:
Solve
S
l;t
;T
l;t
usingthemethoddescribedinsection5.3.6.Ifsymbols
fromnode
i
aredetectedtobeerroneousduringthecalculation,marknode
i
as
corrupted.
Step3:
DCreconstructstheoriginalfromallthematrices
S
l;t
;T
l;t
,0

l

q

1and
1

t


l
.
5.3.6RecoverMatrices
S
l;t
;T
l;t
from
q
^2
StorageNodes
Whentherearebogussymbols~
p
0
i;l;t
sentbythecorruptednodesforcertain
l;t
,wecan
recoverthematrices
S
l;t
;T
l;t
asfollows:
100
For
R
0
inAlgorithm5.7,wehave
DC

2
6
6
6
6
4
S
0
T
0
3
7
7
7
7
5
=
R
0
,and

DC
S
0

T
DC
+
DC

DC
T
0

T
DC
=
R
0

T
DC
;
(5.62)
where
DC
=
DC
;

DC


DC
],
DC
=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4

0

1
.
.
.

q
2

1
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
and

i
represents

i;l
whichisthe
i
th
rowoftheencodingmatrix
l
intheproofofTheorem5.1.
Let
C
=
DC
S
0

T
DC
,
D
=
DC
T
0

T
DC
,and
^
R
0
=
R
0

T
DC
,then
C
+
DC
D
=
^
R
0
:
(5.63)
Since
C;D
arebothsymmetric,wecansolvethenon-diagonalelementsofthemasfollows:
8
>
>
>
>
<
>
>
>
>
:
C
i;j
+

i

D
i;j
=
^
R
0
i;j
C
i;j
+

j

D
i;j
=
^
R
0
j;i
:
(5.64)
Becausematrices
C
and
D
havethesamestructure,hereweonlyfocuson
C
(corresponding
to
S
0
).Itisstraightforwardtoseethatifnode
i
ismaliciousandthereareerrorsinthe
i
th
rowof
R
0
,therewillbeerrorsinthe
i
th
rowof
^
R
0
.Furthermore,therewillbeerrorsinthe
i
th
rowand
i
th
columnof
C
.
S
0

T
DC
=
^
S
0
,wehave

DC
^
S
0
=
C:
(5.65)
Herewecanvieweachcolumnof
C
asa(
q
2

1
;
l
;q
2


l
)MDScodebecause
DC
isa
Vandermondematrix.Thelengthofthecodeis
q
2

1sincethediagonalelementsof
C
is
101
unknown.Supposenode
j
isuncorrupted.Ifthenumberoferasures
˙
(correspondingtothe
previouslydetectedcorruptednodes)andthenumberofthecorruptednodes
˝
thathave
notbeendetectedsatisfy:
˙
+2
˝
+1

q
2


l
;
(5.66)
thenthe
j
th
columnof
C
canberecoveredandtheerrorlocations(correspondingtothe
corruptednodes)canbepinpointed.Thenon-diagonalelementsof
C
canberecovered.So
DCcanreconstruct
S
l;t
usingthemethodsimilarto[57].For
T
l;t
,therecoveringprocessis
similar.
5.4HermitianCodeBasedMBRRegeneratingCode(H-MBR
Code)
5.4.1EncodingH-MBRCode
Inthissection,wewillanalyzetheH-MBRcodebasedontheMBRpointwith

=1.
Accordingtoequation(2.8),wehave
d
=

.
Let

0
;

;
q

1
beastrictlydecreasingintegersequencesatisfying0
<
i


(
i
)
;
0

i

q

1.Theleastcommonmultipleof

0
;

;
q

1
is
A
.Let
k
0
;

;k
q

1
beainteger
sequencesatisfying0
<k
i


i
;
0

i

q

1.Supposethedatacontains
B
=
A

P
q

1
i
=0
(
k
i
(2

i

k
i
+1)
=
(2

i
))messagesymbolsfromthe
GF
(
q
2
).Inpractice,if
thesizeoftheactualdataislargerthan
B
symbols,wecanfragmentitintoblocksofsize
B
andprocesseachblockindividually.
102
Figure5.3Anexampleillustrationofmatrix
M
Wearrangethe
B
symbolsintomatrix
M
asbelow:
M
=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4
M
0
M
1
.
.
.
M
q

1
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
;
(5.67)
where
M
i
=[
M
i;
1
;M
i;
2
;

;M

i
](5.68)
and
M
i;j
=
2
6
6
6
6
4
S
i;j
T
i;j
T
T
i;j
0
:
3
7
7
7
7
5
(5.69)
S
i;j
;
0

i

q

1
;
1

j


i
isasymmetricmatrixofsize
k
i

k
i
withtheupper-
triangularentriesbydatasymbols.
T
i;j
isa
k
i

(

i

k
i
)matrix.Thus
M
i;j
contains
k
i
(2

i

k
i
+1)
=
2symbols,
M
i
contains
A

k
i
(2

i

k
i
+1)
=
(2

i
)symbolsand
M
contains
B
symbols.Figure5.3showsanexampleofmatrix
M
for
q
=4
;
0
=6
;
1
=5
;
2
=4
;
3
=3.
103
In5.3,thesubmatrix
M
i;j
isrepresentedbythesquareinthecorrespondingposition
withthesizerepresentingthesizeofthesubmatrix.
Fordistributedstorage,weencode
M
usingAlgorithm5.8:
Algorithm5.8
EncodingH-MBRCode
Step1:
Firstweencodethedatamatrices
M
aboveusingaHermitiancode
H
m
over
GF
(
q
2
)withparameters

(
j
)(0

j

q

1)and
m
(
m

q
2

1).The
q
3

A
codewordmatrixcanbewrittenas
Y
=
H
m
(
M
).
Step2:
Thenwedividethecodewordmatrix
Y
into
q
2
submatrices
Y
0
;

;Y
q
2

1
ofthe
size
q

A
andstoreonesubmatrixineachofthe
q
2
storagenodesasshownin
Figure.5.2.
Thenwehavethefollowingtheorem:
Theorem5.4.
ByprocessingthedatasymbolsusingAlgorithm5.8,wecanachievetheMBR
pointindistributedstorage.
Proof.
SimilartotheproofofTheorem5.1,wecangetthefollowingequationconsidering
allthecolumnsof
H
m
(
M
):

i

M
i;j
=
G
i;j
;
(5.70)
where
G
i;j
=[
G
(1)
i
;

;
G
(

i
)
i
],0

i

q

1,1

j


i
.
G
(
l
)
i
correspondstothe
l
th
columnofthesubmatrix
M
i;j
andeachelementof
G
i
=[
g
i
(0)
;g
i
(1)
;

;g
i
(
˚
q
2

2
)]
T
canbe
derivedfromadistinctstoragenode.
i
isinequation(6.2).
NextwewillstudytheoptimalityofthecodeinthesenseoftheMBRpoint.For

i

M
i;j
;
0

i

q

1
;
1

j


i
,
M
i;j
issymmetricandtherequirements
forMBRpointaccordingto[57]withparameters
d
=

i
;k
=
k
i
;
=

i
;
=1
;B
=
k
i
(2

i

k
i
+1)
=
2.Byencoding
M
using
H
m
(
M
)anddistributing
Y
0
;

;Y
q
2

1
into
q
2
storagenodes,eachrowofthematrix
i

M
i;j
;
0

i

q

1
;
1

j


i
,canbederived
inacorrespondingstoragenode.Because
i

M
i;j
achievestheMBRpoint,datarelated
104
tomatrices
M
i;j
;
0

i

q

1
;
1

j


i
,canberegeneratedattheMBRpoint.
Therefore,Algorithm5.8canachievetheMBRpoint.
5.4.2RegenerationoftheH-MBRCodeintheError-freeNetwork
Inthissection,wewilldiscusstheregenerationfortheH-MBRcodeintheerror-freenetwork.
Let
w
i
=[
g
0
(
˚
(
s
i
))
;g
1
(
˚
(
s
i
))
;

;g
q

1
(
˚
(
s
i
))]
T
,then
w
i
=
B

1
i

y
i
=[
g
0
(
˚
s
i
)
;

;g
q

1
(
˚
s
i
)]
T
;
foreverycolumn
y
i
of
Y
i
.
ThemainideaoftheregenerationalgorithmsissimilartothatoftheH-MSRcode:
regenerate
g
l
(
˚
(
s
i
))
;
0

l

q

1,bydownloadinghelpsymbolsfrom
d
l
=

l
nodes,where
d
l
istheregenerationparameter
d
for
g
l
(
˚
(
s
i
))intheH-MBRcoderegeneration.
Supposenode
z
fails,weuseAlgorithm5.9toregeneratetheexactH-MBRcodesymbols
ofnode
z
.Forconvenience,wesuppose
d
q
=

q
=0and
W
i;j;l
=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4

i;l

i
+1
;l
.
.
.

j;l
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
;
(5.71)
where

t;l
;i

t

j
,isthe
t
th
rowof
l
.
SimilartotheH-MSRcode,replacementnode
z
0
willsendrequeststohelpernodesin
thewaysametothatinSection5.3.2.Uponreceivingtherequestinteger
j
,helpernode
i
willcalculateandsendthehelpsymbolssimilartothatofSection5.3.2.
Whenthereplacementnode
z
0
receivesalltherequestedsymbols,itcanregeneratethe
symbolsstoredinthefailednode
z
usingthefollowingalgorithm:
ForAlgorithm5.9wecanderivetheequivalentstorageparametersforeachsymbolblock
ofsize
B
j
=
Ak
j
(2

j

k
j
+1)
=
(2

j
):
d
=

j
;k
=
k
j
;
=
A;
=

j
;
0

j

q

1and
105
Algorithm5.9
z
0
RegeneratesSymbolsoftheFailedNode
z
Step1:
Forevery0

l

q

1and1

t


l
,wecancalculatetheregeneratedsymbols
whicharerelatedtothehelpsymbols~
p
i;l;t
from
d
l
helpernodes:(Withoutlossof
generality,weassume0

i

d
l

1.)
Step1.1:
Let
p
=[~
p
0
;l;t
;
~
p
1
;l;t
;

;
~
p
d
l

1
;l;t
]
T
,solvetheequation:
W
0
;d
l

1
;l

x
=
p
.
Step1.2:
Since
x
=
M
l;t


T
z;l
and
M
l;t
issymmetric,wecancalculate
~y
z;l;t
=
x
T
=

z;l

M
l;t
.
Step2:
Let
e
Y
z
bea
q

A
matrixwiththe
l
th
rowas[
~y
z;l;
1
;

;
~y
z;l
l
]
;
0

l

q

1.
Step3:
Calculatetheregeneratedsymbolsofthefailednode
z
:
Y
z
0
=
Y
z
=
B
z

e
Y
z
.
equation(2.8)oftheMBRpointholdsfortheseparameters.Theorem5.4guaranteesthat
Algorithm5.9canachievetheMBRpointfordataregenerationoftheH-MBRcode.
5.4.3RegenerationoftheH-MBRCodeintheHostileNetwork
Inhostilenetwork,Algorithm5.9maybeunabletoregeneratethefailednodeduetothe
possiblebogussymbolsreceivedfromtheresponses.Infact,evenifthereplacementnode
z
0
canderivethesymbolmatrix
Y
z
0
usingAlgorithm5.9,itcannotverifythecorrectnessof
theresult.
SimilartotheH-MSRcode,therearetwomodesforthehelpernodestoregeneratethe
H-MBRcodeofafailedstoragenodeinhostilenetwork.Onemodeisthedetectionmode,
inwhichnoerrorhasbeenfoundinthesymbolsreceivedfromthehelpernodes.Onceerrors
aredetected,therecoverymodewillbeusedtocorrecttheerrorsandlocatethemalicious
nodes.
106
5.4.3.1DetectionMode
Inthedetectionmode,thereplacementnode
z
0
willsendrequestsinthewaysimilartothat
oftheerror-freenetworkinSection5.4.2.Theonlyisthatwhen
j
=
q

1,
z
0
sendsrequeststo
d
q

1

d
q
+1nodesinsteadof
d
q

1

d
q
nodes.Helpernodeswillstilluse
thewaysimilartothatoftheerror-freenetworkinSection5.4.2tosendthehelpsymbols.
TheregenerationalgorithmisdescribedinAlgorithm5.10withthedetectionprobability
characterizedinTheorem5.5.
Algorithm5.10
[DetectionMode]
z
0
RegeneratesSymbolsoftheFailedNode
z
inHostile
Network
Step1:
Forevery0

l

q

1and1

t


l
,wecancalculatetheregeneratedsymbols
whicharerelatedtothehelpsymbols~
p
0
i;l;t
from
d
l
helpernodes.~
p
0
i;l;t
=~
p
i;l;t
+
e
i;l;t
istheresponsefromthe
i
th
helpernode.If~
p
i;l;t
hasbeenmobythemalicious
node
i
,wehave
e
i;l;t
2
GF
(
q
2
)
nf
0
g
.Todetectwhetherthereareerrors,wewill
calculatesymbolsfromtwosetsofhelpernodesthencomparetheresults.(Without
lossofgenerality,weassume0

i

d
l
.)
Step1.1:
Let
p
1
0
=[~
p
0
0
;l;t
;
~
p
0
1
;l;t
;

;
~
p
0
d
l

1
;l;t
]
T
,wherethesymbolsarecollected
fromnode0tonode
d
l

1,solvetheequation
W
0
;d
l

1
;l

x
1
=
p
1
0
.
Step1.2:
Let
p
2
0
=[~
p
0
1
;l;t
;
~
p
0
2
;l;t
;

;
~
p
0
d
l
;l;t
]
T
,wherethesymbolsarecollectedfrom
node1tonode
d
l
,solvetheequation
W
1
;d
l
;l

x
2
=
p
2
0
.
Step1.3:
If
x
1
=
x
2
,compute
~y
z;l;t
=

z;l

M
l;t
asdescribedinAlgorithm5.9.
Otherwise,errorsaredetectedinthehelpsymbols.Exitthealgorithmandswitch
torecoveryregenerationmode.
Step2:
Noerrorhasbeendetectedforthecalculatingoftheregenerationsofar.Let
e
Y
z
be
a
q

A
matrixwiththe
l
th
rowas[
~y
z;l;
1
;

;
~y
z;l
l
]
;
0

l

q

1.
Step3:
Calculatetheregeneratedsymbolsofthefailednode
z
:
Y
z
0
=
Y
z
=
B
z

e
Y
z
.
Theorem5.5.
Whenthenumberofmaliciousnodesinthe
d
l
+1
helpernodesofAlgo-
rithm5.10islessthan
d
l
+1
,theprobabilityforthebogussymbolssentfromthemalicious
nodestobedetectedisatleast
1

1
=q
2
.
107
Proof.
SimilartotheproofofTheorem5.2,wecanwrite
x
1
=
x
+
W

1
0
;d
l

1
;l

[
e
0
;

;e
d
l

1
]
T
=
x
+
^
x
1
;
(5.72)
x
2
=
x
+
W

1
1
;d
l
;l

[
e
1
;

;e
d
l
]
T
=
x
+
^
x
2
:
(5.73)
Since
W
0
;d
l

1
;l
;
W
1
;d
l
;l
arefullrankmatriceslikethematrices
V
0
;d
l

1
;l
;
V
1
;d
l
;l
inthe
proofofLemma1andany
d
l
vectorsoutof

0
;l
;
1
;l
;

;
d
l
;l
arelinearlyindependent,the
restofthisproofissimilartothatofLemma1.Whenthenumberofmaliciousnodesinthe
d
l
+1helpernodesislessthan
d
l
+1,theprobabilityfor
^
x
1
=
^
x
2
isatmost1
=q
2
.Therefore,
thedetectionprobabilityisatleast1

1
=q
2
.
5.4.3.2RecoveryMode
Oncethereplacementnode
z
0
detectserrorsusingAlgorithm5.10,itwillsendinteger
j
=
q

1
toalltheother
q
2

1nodesinthenetworkrequestinghelpsymbols.Helpernodeswillstill
usethewaysimilartothatoftheerror-freenetworkinSection5.4.2tosendthehelpsymbols.
z
0
canregeneratesymbolsusingAlgorithm5.11.
5.4.4ReconstructionoftheH-MBRcodeintheError-freeNetwork
Inthissection,wewilldiscussthereconstructionoftheH-MBRcodeintheerror-free
network.ThemainideaofthereconstructionalgorithmsissimilartothatoftheH-MSR
code:reconstruct
g
l
(
˚
(
s
i
))
;
0

l

q

1,bydownloadinghelpsymbolsfrom
k
l
nodes,
where
k
l
representsthereconstructionparameter
k
for
g
l
(
˚
(
s
i
))intheH-MBRcode.We
useAlgorithm5.12inthenetworkforthedatacollectorDCtoreconstructtheoriginal
Forconvenience,wesuppose
k
q
=0.
SimilartotheH-MSRcodedescribedinSection5.3.4,DCwillsendrequeststostorage
nodes.Uponreceivingtherequestinteger
j
,node
i
willcalculateandsendsymbols.When
DCreceivesalltherequestedsymbols,itcanreconstructtheoriginalusingthefollowing
algorithm:
108
Algorithm5.11
[RecoveryMode]
z
0
RegeneratesSymbolsoftheFailedNode
z
inHostile
Network
Step1:
Forevery
q

1

l

0indescendingorderand1

t


l
inascendingorder,
wecanregeneratethesymbolswhentheerrorsinthereceivedhelpsymbols~
p
0
i;l;t
from
q
2

1helpernodescanbecorrected.Withoutlossofgenerality,weassume
0

i

q
2

2.
Step1.1:
Let
p
0
=[~
p
0
0
;l;t
;
~
p
0
1
;l;t
;

;
~
p
0
q
2

2
;l;t
]
T
.Since
W
0
;q
2

2
;l

x
=
p
0
,
p
0
can
beviewedasanMDScodewithparameters(
q
2

1
;d
l
;q
2

d
l
).
Step1.2:
Substitute~
p
0
i;l;t
in
p
0
withthesymbol

representinganerasureifnode
i
hasbeendetectedtobecorruptedinthepreviousloops(previousvaluesof
l;t
).
Step1.3:
Ifthenumberoferasuresin
p
0
islargerthanmin
f
q
2

d
l

1
;
b
(
q
2

d
q

1

1)
=
2
cg
,thenthenumberoferrorshaveexceededtheerrorcorrectioncapability.We
willthedecodingfailureandexitthealgorithm.
Step1.4:
Sincethenumberoferrorsiswithintheerrorcorrectioncapabilityof
theMDScode,decode
p
0
to
p
0
cw
andsolve
x
.
Step1.5:
Ifthe
i
th
positionsymbolsof
p
0
cw
and
p
0
aredit,marknode
i
as
corrupted.
Step1.6:
Compute
~y
z;l;t
=

z;l

M
l;t
asdescribedinAlgorithm5.9.
Step2:
Let
e
Y
z
bea
q

A
matrixwiththe
l
th
rowas[
~y
z;l;
1
;

;
~y
z;l
l
]
;
0

l

q

1.
Step3:
Calculatetheregeneratedsymbolsofthefailednode
z
:
Y
z
0
=
Y
z
=
B
z

e
Y
z
.
Algorithm5.12
DCReconstructstheOriginalFile
Step1:
Forevery0

l

q

1,dividethesymbolvector
~y
i;l
into

l
equalrowvectors:
[
~y
i;l;
1
;
~y
i;l;
2
,

;
~y
i;l
l
].(
~y
i;l
istheresponsefromthe
i
th
nodeandweassume
0

i

k
l

1withoutlossofgenerality.)
Step2:
Forevery0

l

q

1and1

t


l
,DCreconstructsthematricesrelatedto
theoriginal
Step2.1:
Let
R
=[
~y
T
0
;l;t
;
~y
T
1
;l;t
;

;
~y
T
k
l

1
;l;t
]
T
,wehavetheequation:
W
0
;k
l

1
;l

M
l;t
=
R
accordingtotheencodingalgorithm.
Step2.2:
DCreconstructs
M
l;t
usingtechniquessimilartothatof[57].
Step3:
DCreconstructstheoriginalfromallthematrices
M
l;t
,0

l

q

1and
1

t


l
.
109
5.4.5ReconstructionoftheH-MBRcodeintheHostileNetwork
SimilartotheH-MSRcode,thereconstructionalgorithmsforH-MBRcodeinerror-free
networkdonotworkinhostilenetwork.Evenifthedatacollectercancalculatethesymbol
matrices
M
usingAlgorithm5.12,itcannotverifywhethertheresultiscorrectornot.There
aretwomodesfortheoriginaltobereconstructedinhostilenetwork.Onemodeisthe
detectionmode,inwhichnoerrorhasbeenfoundinthesymbolsreceivedfromthestorage
nodes.Onceerrorsaredetectedinthedetectionmode,therecoverymodewillbeusedto
correcttheerrorsandlocatethemaliciousnodes.
5.4.5.1DetectionMode
Inthedetectionmode,DCwillsendrequestsinthewaysimilartothatoftheerror-free
networkinSection5.4.4.Theonlyisthatwhen
j
=
q

1,DCwillsendrequests
to
k
q

1

k
q
+1nodesinsteadof
k
q

1

k
q
nodes.Storagenodeswillsendsymbolssimilar
tothatoftheerror-freenetworkinSection5.4.4.Thereconstructionalgorithmisdescribed
inAlgorithm5.13withthedetectionprobabilitydescribedinTheorem5.6.
Theorem5.6.
Whenthenumberofmaliciousnodesinthe
k
l
+1
nodesofAlgorithm5.13
islessthan
k
l
+1
,theprobabilityforthebogussymbolssentfromthemaliciousnodestobe
detectedisatleast
1

1
=q
2

l
.
Proof.
Forconvenience,wewrite
e
i;l;t
as
e
i
intheproof.
e
i
2
[
GF
(
q
2
)]

l
for0

i

k
l
.
InAlgorithm5.13,
R
1
0
=
R
1
+
Q
1
where
Q
1
=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4
e
0
e
1
.
.
.
e
k
l

1
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
.Let
W
0
;k
l

1
;l
=
DC
1
;

DC
1
],
110
Algorithm5.13
[DetectionMode]DCReconstructstheOriginalFileinHostileNetwork
Step1:
Forevery0

l

q

1,wecandividethesymbolvector
~y
0
i;l
into

l
equal
rowvectors:[
~y
0
i;l;
1
;
~y
0
i;l;
2
;

;
~y
0
i;l
l
].
~y
0
i;l
=
~y
i;l
+
e
i;l
istheresponsefrom
the
i
th
storagenode.If
~y
i;l
hasbeenmobythemaliciousnode
i
,wehave
e
i;l
2
(
GF
(
q
2
))
A
nf
0
g
.Todetectwhetherthereareerrors,wewillreconstructthe
originalfromtwosetsofstoragenodesthencomparetheresults.(Withoutloss
ofgenerality,weassume0

i

k
l
.)
Step2:
Forevery0

l

q

1and1

t


l
,DCcanreconstructthematricesrelated
totheoriginal
Step2.1:
Let
R
0
=[
~y
0
T
0
;l;t
;
~y
0
T
1
;l;t
;

;
~y
0
T
k
l
;l;t
]
T
.
Step2.2:
Let
R
1
0
=[
~y
0
T
0
;l;t
;

;
~y
0
T
k
l

1
;l;t
]
T
,whicharethesymbolscollectedfrom
node0tonode
k
l

1,thenwehave
W
0
;k
l

1
;l

M
1
=
R
1
0
.Solve
M
1
usingthe
methodsametoalgorithm5.12.
Step2.3:
Let
R
2
0
=[
~y
0
T
1
;l;t
;

;
~y
0
T
k
l
;l;t
]
T
,whicharethesymbolscollectedfrom
node1tonode
k
l
,thenwehave
W
1
;k
l
;l

M
2
=
R
2
0
.Solve
M
2
usingthemethod
sametoalgorithm5.12.
Step2.4:
Compare
M
1
with
M
2
.Iftheyarethesame,let
M
l;t
=
M
1
.Otherwise,
errorsaredetectedinthereceivedsymbols.Exitthealgorithmandswitchto
recoveryreconstructionmode.
Step3:
Noerrorhasbeendetectedforthecalculatingofthereconstructionsofar.SoDC
canreconstructtheoriginalefromallthematrices
M
l;t
,0

l

q

1and
1

t


l
.
R
1
=[
R
1
;
1
;R
1
;
2
]and
Q
1
=[
Q
1
;
1
;Q
1
;
2
],where
DC
1
,
R
1
;
1
,
Q
1
;
1
are
k
l

k
l
submatricesand

DC
1
,
R
1
;
2
,
Q
1
;
2
are
k
l

(

l

k
l
)submatrices.
Accordingtoequation(5.69),wehave
W
0
;k
l

1
;l

M
1
=
DC
1
S
1
+
DC
1
T
T
1
;

DC
1
T
1
]
=[
R
1
;
1
+
Q
1
;
1
;R
1
;
2
+
Q
1
;
2
]
:
(5.74)
111
Since
DC
1
isasubmatrixofaVandermondematrix,itisafullrankmatrix.Sowehave
T
1
=

1
DC
1
R
1
;
2
+

1
DC
1
Q
1
;
2
=
T
+
^
T
1
;
(5.75)
S
1
=

1
DC
1
(
R
1
;
1
+
Q
1
;
1


DC
1
T
T
1
)
=

1
DC
1
(
R
1
;
1


DC
1
T
T
)+

1
DC
1
(
Q
1
;
1


DC
1
^
T
T
1
)
=
S
+

1
DC
1
(
Q
1
;
1


DC
1
^
T
T
1
)=
S
+
^
S
1
:
(5.76)
For
R
2
0
=
R
2
+
Q
2
inAlgorithm5.13,Let
R
2
=[
R
2
;
1
;R
2
;
2
],
Q
2
=[
Q
2
;
1
;Q
2
;
2
]and
W
1
;k
l
;l
=
DC
2
;

DC
2
],where
R
2
;
1
,
Q
2
;
1
,
DC
2
are
k
l

k
l
submatricesand
R
2
;
2
,
Q
2
;
2
,

DC
2
are
k
l

(

l

k
l
)submatrices.Similarly,wehave
T
2
=

1
DC
2
R
2
;
2
+

1
DC
2
Q
2
;
2
=
T
+
^
T
2
;
(5.77)
S
2
=
S
+

1
DC
2
(
Q
2
;
1


DC
2
^
T
T
2
)=
S
+
^
S
2
:
(5.78)
If
^
T
1
=
^
T
2
and
^
S
1
=
^
S
2
,Algorithm5.13willfailtodetectthebogussymbols.Sowewill
focuson
^
T
1
;
^
T
2
and
^
S
1
;
^
S
2
.
Suppose
1
;j
=[
e
0
;

;e
k
l

1
]
T
;

2
;j
=[
e
1
;

;e
k
l
]
T
arethe
j
th
;
1

j


l

k
l
;
columnsof
Q
1
;
2
and
Q
2
;
2
respectively,where
e
i
2
GF
(
q
2
).Since
DC
1
and
DC
2
are
Vandermondematricesandhavethesamerelationshipasthatbetween
V
0
;d
l

1
;l
and
V
1
;d
l
;l
,
similarastheproofofLemma1,wecanprovethatwhenthenumberofmaliciousnodesin
the
k
l
+1nodesislessthan
k
l
+1,theprobabilityof

1
DC
1

1
;j
=

1
DC
2

2
;j
isatmost
1
=q
2
.Thustheprobabilityfor
^
T
1
=
^
T
2
isatmost1
=q
2(

l

k
l
)
.Throughthesameprocedure,
wecanderivethattheprobabilityof
^
S
1
=
^
S
2
isatmost1
=q
2
k
l
.Theprobabilityforboth
^
S
1
=
^
S
2
and
^
T
1
=
^
T
2
isatmost1
=q
2

l
.Sothedetectionprobabilityisatleast1

1
=q
2

l
.
5.4.5.2RecoveryMode
OnceDCdetectserrorsusingAlgorithm5.13,itwillsendinteger
j
=
q

1toallthe
q
2
nodesinthenetworkrequestingsymbols.Storagenode
i
willusethewaysimilartothat
112
oftheerror-freenetworkinSection5.4.4tosendsymbols.Thereconstructproceduresare
describedinAlgorithm5.14.
Algorithm5.14
[RecoveryMode]DCReconstructstheOriginalFileinHostileNetwork
Step1:
Forevery0

l

q

1,dividethesymbolvector
~y
0
i;l
into

l
equalrowvectors:
[
~y
0
i;l;
1
;
~y
0
i;l;
2
;

;
~y
0
i;l
l
].(Withoutlossofgenerality,weassume0

i

q
2

1.)
Step2:
Forevery
q

1

l

0indescendingorderand1

t


l
inascendingorder,
DCreconstructsthematricesrelatedtotheoriginalwhentheerrorsinthe
receivedsymbolvectors
~y
0
i;l;t
from
q
2
storagenodescanbecorrected:
Step2.1:
Let
R
0
=[
~y
0
T
0
;l;t
;
~y
0
T
1
;l;t
;

;
~y
0
T
q
2

1
;l;t
]
T
.
Step2.2:
Ifthenumberofcorruptednodesdetectedislargerthanmin
f
q
2

k
l
;
b
(
q
2

k
q

1
)
=
2
cg
,thenthenumberoferrorshaveexceededtheerrorcorrection
capability.Soherewewillthedecodingfailureandexitthealgorithm.
Step2.3:
Sincethenumberoferrorsiswithintheerrorcorrectioncapabilityof
theH-MBRcode,substitute
~y
0
i;l;t
in
R
0
withthesymbol

representinganerasure
vectorifnode
i
hasbeendetectedtobecorruptedinthepreviousloops(previous
valuesof
l;t
).
Step2.4:
Solve
M
l;t
usingthemethodinsection5.4.6.Ifsymbolsfromnode
i
are
detectedtobeerroneousduringthecalculation,marknode
i
ascorrupted.
Step3:
DCreconstructstheoriginalfromallthematrices
M
l;t
,0

l

q

1and
1

t


l
.
5.4.6RecoverMatrices
M

l
;t
from
q
^2
StorageNodes
Whentherearebogussymbols~
p
0
i;l;t
sentbythecorruptednodesforcertain
l;t
,wecan
recoverthematrices
M

l
;t
asfollows:
For
R
0
inAlgorithm5.14,wehave
DC

M
0
=
R
0
,where
DC
=
W
0
;q
2

1
;l
=
DC
;

DC
],
R
0
=[
R
0
1
;R
0
2
].
DC
,
R
0
1
are
q
2

k
l
submatricesand
DC
,
R
0
2
are
q
2

(

l

k
l
)submatrices.
Accordingtoequation(5.69),wehave

DC

M
0
=
DC
S
0
+
DC
T
0
T
;

DC
T
0
]=[
R
0
1
;R
0
2
]
:
(5.79)
113
For
R
0
2
=
DC
T
0
,wecanvieweachcolumnof
R
0
2
asa(
q
2
;k
l
;q
2

k
l
+1)MDScode
because
DC
isaVandermondematrix.Ifthenumberoferasures
˙
(correspondingtothe
previouslydetectedcorruptednodes)andthenumberofcorruptednodes
˝
thathavenot
beendetectedsatisfy:
˙
+2
˝

q
2

k
l
;
(5.80)
thenallthecolumnsof
T
0
canberecoveredandtheerrorlocations(correspondingtothe
corruptednodes)canbepinpointed.After
T
0
hasbeenrecovered,wecanrecover
S
0
following
thesameprocessbecause
DC
S
0
=
R
0
1


DC
T
0
T
.SoDCcanreconstruct
M

l
;t
.
5.5PerformanceAnalysis
Inthissection,weanalyzetheperformanceoftheH-MSRcodeandcompareitwiththe
performanceoftheRS-MSRcode.Wewillanalyzetheirerrorcorrectioncapabilitythen
theircomplexity.
ThecomparisonresultsbetweentheH-MBRcodeandtheRS-MBRcodearethesame
sincetheerrorcorrectioncapabilityandthecomplexityoftheH-MSRcodeandtheH-MBR
codearesimilarwhiletheseperformanceparametersoftheRS-MSRcodeandtheRS-MBR
codearesimilar.
5.5.1ScalableErrorCorrection
5.5.1.1Errorcorrectionfordataregeneration
TheRS-MSRcodein[83]cancorrectupto
˝
errorsbydownloadingsymbolsfrom
d
+2
˝
nodes.However,thenumberoferrorsmayvaryinthesymbolssentbyhelpernodes.When
thereisnoerrororthenumberoferrorsisfarlessthan
˝
,downloadingsymbolsfromextra
nodeswillbeawasteofbandwidth.Whenthenumberoferrorsislargerthan
˝
,thedecoding
processwillfailwithoutbeingdetected.Inthiscase,thesymbolsstoredinthereplacement
114
nodewillbeerroneous.Ifthiserroneousnodebecomesahelpernodelater,theerrorswill
propagatetoothernodes.
TheH-MSRcodecandetecttheerroneousdecodingsusingAlgorithm5.3.Ifnoerror
isdetected,regenerationofH-MSRonlyneedstodownloadsymbolsfromonemorenode
thantheregenerationinerror-freenetwork,whiletheextracostfortheRS-MSRcodeis2
˝
.
Iferrorsaredetectedinthesymbolsreceivedfromthehelpernodes,theH-MSRcodecan
correcttheerrorsusingAlgorithm5.4.Moreover,thealgorithmcandeterminewhetherthe
decodingissuccessful,whiletheRS-MSRcodeisunabletoprovidesuchinformation.
5.5.1.2Errorcorrectionfordatareconstruction
Theevaluationresultissimilartothedataregeneration.TheRS-MSRcodecancorrectup
to
˝
errorswithsupportfrom2
˝
additionalhelpernodes.TheH-MSRcodeismore
Forerrordetection,itonlyrequiressymbolsfromoneadditionalnodeusingAlgorithm5.6.
TheerrorscanthenbecorrectedusingAlgorithm5.7.Thealgorithmcanalsodetermine
whetherthedecodingissuccessful.
5.5.2ErrorCorrectionCapability
FordataregenerationdescribedinAlgorithm5.4,H-MSRcodecanbeviewedas
q
MDScodes
withparameters(
q
2

1
;d
l
;q
2

d
l
),
l
=0
;

;q

1.Since

l


(
l
)and

(
l
)isstrictly
decreasing,wecanchoosethesequence

l
tobestrictlydecreasing.So
d
l
isalsostrictly
decreasing.Forthe
q
MDScodes,theminimumdistanceofthe(
q
2

1
;d
q

1
;q
2

d
q

1
)
codeisthelargest.InAlgorithm5.4,thiscodeisdecodedanditcancorrectupto
˝
q

1
=

(
q
2

d
q

1

1)
=
2

errors,where
b
x
c
istheorfunctionof
x
.Thenthecode
(
q
2

1
;d
l
;q
2

d
l
)
;l
=
q

2
;

;
0,willbedecodedsequentially.The(
q
2

1
;d
l
;q
2

d
l
)
codecancorrectatmost
˝
l
=
˝
q

1
errorswhen
q
2

d
0

1

˝
q

1
.Thus,thetotal
numberserrorsthattheH-MSRcodecancorrectis
˝
H

MSR
=
q

˝
q

1
.Whilethe(
q
3

115
q;
P
q

1
l
=0
d
l
;q
3

q

P
q

1
l
=0
d
l
+1)RS-MSRcodewiththesameratecancorrect
˝
RS

MSR
=
b
(
q
3

q

P
q

1
l
=0
d
l
)
=
2
c
errors.Therefore,wehavethefollowingtheorem.
Theorem5.7.
Fordataregeneration,thenumberoferrorsthattheH-MSRcodeandthe
RS-MSRcodecancorrectsatisfy
˝
H

MSR
>˝
RS

MSR
when
q

3
.
Proof.
For
˝
RS

MSR
,wehave
˝
RS

MSR
=
6
6
6
4
0
@
q
3

q

q

1
X
l
=0
d
l
1
A
=
2
7
7
7
5
(5.81)

j
(
q
3

q

q

d
q

1

q
2
(
q

1))
=
2
k
=

q

(
q
2

d
q

1

1)
=
2

q
(
q

1)
4


q

(
q
2

d
q

1

1)
=
2

q
(
q

1)
4
:
For
˝
H

MSR
,wehave
˝
H

MSR
=
q
b
(
q
2

d
q

1

1)
=
2
c
:
(5.82)
When
q
=3,itiseasytoverifythat
˝
H

MSR
>˝
RS

MSR
.
When
q>
3,Wecanrewriteequation(5.82)as
˝
H

MSR

q

(
q
2

d
q

1

1)
=
2

q=
2
:
(5.83)
Thegapbetween
˝
H

MSR
and
˝
RS

MSR
isatleast
q
(
q

1)
4

q
2
=
q
2

3
q
4
>
0
;q>
3
;
(5.84)
sowehave
˝
H

MSR
>˝
RS

MSR
.
Example1.
Suppose
q
=4
and
m
=37
,theHermitiancurveisdby
y
4
+
y
=
x
5
over
GF
(4
2
)
.Fromthepreviousdiscussion,wehave

(0)=10
;
(1)=9
;
(2)=7
;
(3)=6
.
Choose

0
=6
;
1
=5
;
2
=4
;
3
=3
.So
d
0
=12
;d
1
=10
;d
2
=8
;d
3
=6
.Accordingto
theanalysisabove,wehave
˝
H

MSR
=4

˝
3
=4
b
(15

6)
=
2
c
=16
,whichislargerthan
˝
RS

MSR
=
b
(60

36)
=
2
c
=12
.
116
Figure5.4ComparisonoferrorcorrectioncapabilitybetweentheH-MSRcodeandthe
RS-MSRcode
Wealsoshowthemaximumnumberofmaliciousnodesfromwhichtheerrorscanbe
correctedbytheH-MSRcodeinFigure.5.4.Herewesettheparameter
q
oftheHermitian
codefrom4to16withastepof2.InthetheperformanceoftheRS-MSRcode
withthesamecoderatesastheH-MSRcodeisalsoplotted.Thecomparisonresultfur-
therdemonstratesthatfordataregenerationtheH-MSRcodehasbettererrorcorrection
capabilitythantheRS-MSRcode.
FordatareconstructionAlgorithm5.7,H-MSRcodecanbeviewedas
q
MDScodeswith
parameters(
q
2

1
;k
l

1
;q
2

k
l
+1).Thedecodingforthereconstructionisperformedfrom
thecodewiththelargestminimumdistancetothecodewiththesmallestminimumdistance
asinthedataregenerationcase.Similarly,wecanconcludethatfordatareconstructionthe
H-MSRcodehasbettererrorcorrectioncapabilitythantheRS-MSRcodeunderthesame
coderate.
5.5.3ComplexityDiscussion
ForthecomplexityoftheH-MSRcode,weconsidertwoscenarios.
117
5.5.3.1H-MSRregeneration
FortheH-MSRregeneration,comparedwithRS-MSRcode,theH-MSRcodewillslightly
increasethecomplexityofthehelpernodes.Foreachhelpernode,theextraoperationis
amatrixmultiplicationbetween
B

1
i
and
Y
i
.Thecomplexityis
O
(
q
2
)=
O
((
n
1
=
3
)
2
)=
O
(
n
2
=
3
).Similarto[87],forareplacementnode,fromAlgorithm5.2andAlgorithm5.3,
wecanderivethatthecomplexitytoregeneratesymbolsforRS-MSRis
O
(
n
2
),whilethe
complexityforH-MSRisonly
O
(
n
5
=
3
).Likewise,forAlgorithm5.4,thecomplexityto
recovertheH-MSRcodeis
O
(
n
5
=
3
),and
O
(
n
2
)forRS-MSRcode.
5.5.3.2H-MSRreconstruction
Forthereconstruction,comparedwithRS-MSRcode,theadditionalcomplexityoftheH-
MSRcodeforeachstoragenodeis
O
(
q
2
),whichis
O
(
n
2
=
3
).Thecomputationalcomplexity
forDCtoreconstructthedatais
O
(
n
5
=
3
)fortheH-MSRcodeand
O
(
n
2
)fortheRS-MSR
code.
118
CHAPTER6
DISTRIBUTEDSTORAGEINHOSTILENETWORKS|OPTIMAL
CONSTRUCTIONOFREGENERATINGCODESTHROUGH
RATE-MATCHINGAPPROACH
InspiredbytheniceperformanceofHermitiancodebasedregeneratingcodes,wewillstep
forwardinthischaptertofurtherconstructoptimalregeneratingcodeswhichhavesimilar
layeredstructurelikeHermitiancodeindistributedstorage.ComparedtotheHermitian
basedcode,thesecodeshavesimplerstructureandareeasiertounderstandandimplement.
WewillproposetwooptimalconstructionsofMSRcodesthroughrate-matchinginhostile
networks:2-layerrate-matchedMSRcodeand
m
-layerrate-matchedMSRcode.Forthe
2-layercode,wecanachievetheoptimalstorageforgivensystemrequirements.
Ourcomprehensiveanalysisshowsthatourcodecandetectandcorrectmaliciousnodes
withhigherstoragecomparedtotheRS-MSRcode.Thenwewillproposethe
m
-
layercodebyextendingthe2-layercodeandachievetheoptimalerrorcorrectionciency
bymatchingthecoderateofeachlayer'sMSRcode.Wewillalsodemonstratethatthe
optimizedparametercanachievethemaximumstoragecapacityunderthesameconstraint.
ComparedtotheRS-MSRcode,ourcodecanachievemuchhighererrorcorrection.
Theoptimized
m
-layercodealsohasbettererrorcorrectioncapabilitythantheH-MSRcode.
6.1System/AdversarialModelsandAssumptions
Thesystem/adversarialmodelsandassumptionsinthischapterarethesamewithChapter5.
Weusethenotation
CH
torefertoeitherthefullrateMSRcodeoracodewordofthefull
rateMSRcode.Theexactmeaningcanbediscriminatedclearlyaccordingtothecontext.
119
6.2ComponentCodesofRate-matchedMSRCode
Inthissection,wewillintroducetwotcomponentcodesforrate-matchedMSRcode
ontheMSRpointwith
d
=2
k

2.ThecodebasedontheMSRpointwith
d>
2
k

2can
bederivedthesamewaythroughtruncatingoperations.Intherate-matchedMSRcode,
therearetwotypesofMSRcodeswithtcoderates:fullratecodeandfractionalrate
code.
6.2.1FullRateCode
6.2.1.1Encoding
Thefullratecodeisencodedbasedontheproduct-matrixcodeframeworkin[57].According
toequation(2.7),wehave

H
=
d=
2,

H
=1foroneblockofdatawiththesize
B
H
=
(

+1)

.Thedatawillbearrangedintotwo


symmetricmatrices
S
1
;S
2
,eachofwhich
willcontain
B
H
=
2data.Thecodeword
CH
isas
CH
=
2
6
6
6
6
4
S
1
S
2
3
7
7
7
7
5
=
S
H
;
(6.1)
where
=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4
111
:::
1
1
˚˚
2
:::˚


1
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
1
˚
n

1
(
˚
n

1
)
2
:::
(
˚
n

1
)


1
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
(6.2)
isaVandermondematrixand=diag[

1
;
2
;

;

]suchthat

i
6
=

j
for1

i;j

;i
6
=
j
,where

i
2
GF
q
for1

i


,
˚
isaprimitiveelementin
GF
q
,andany
d
rowsof
120
arelinearindependent.Theneachrow
ch
i
,0

i<n
,ofthecodewordmatrix
CH
will
bestoredinstoragenode
i
,inwhichtheencodingvector

i
isthe
i
th
rowof
6.2.1.2Regeneration
Supposenode
z
fails,thereplacementnode
z
0
willsendregenerationrequeststotherestof
n

1helpernodes.Uponreceivingtheregenerationrequest,helpernode
i
willcalculate
andsendoutthehelpsymbol
p
i
=
ch
i

T
z
,where

z
isthe
z
th
rowof
z
0
willperform
Algorithm6.1toregeneratethecontentsofthefailednode
z
.Forconvenience,we
V
i;j
=
"

T
i
;
T
i
+1

;
T
j
#
T
;
where

t
;i

t

j
,isthe
t
th
rowofand
x
(
j
)
isthevector
containingthe
j
symbolsof
S
H

T
z
.
Algorithm6.1
z
0
RegeneratesSymbolsoftheFailedNode
z
Suppose
p
0
i
=
p
i
+
e
i
istheresponsefromthe
i
th
helpernode.If
p
i
hasbeenmodbythe
maliciousnode
i
,wehave
e
i
2
GF
q
nf
0
g
.Wecansuccessfullyregeneratethesymbolsinnode
z
whentheerrorsinthereceivedhelpsymbols
p
i
0
from
n

1
helpernodescanbecorrected.
Withoutlossofgenerality,weassume
0

i

n

2
.
Step1:
Decode
p
0
to
p
cw
,where
p
0
=[
p
0
0
;p
0
1
;

;p
0
n

2
]
T
canbeviewedasanMDScode
withparameters(
n

1
;d;n

d
)since
V
0
;n

2

x
(
n

1)
=
p
0
.
Step2:
Solve
V
0
;n

2

x
(
n

1)
=
p
cw
andcompute
ch
z
=

z
S
1
+

z

z
S
2
asdescribedin[57].
6.2.1.3Reconstruction
WhenDCneedstoreconstructtheoriginalitwillsendreconstructionrequeststo
n
storagenodes.Uponreceivingtherequest,node
i
willsendoutthesymbolvector
c
i
.
Suppose
c
0
i
=
c
i
+
e
i
istheresponsefromthe
i
th
storagenode.If
c
i
hasbeenmoby
themaliciousnode
i
,wehave
e
i
2
(
GF
q
)

nf
0
g
.
ThenDCwillreconstructtheasfollows:Let
R
0
=[
ch
0
0
T
;
ch
0
1
T
;

;
ch
0
n

1
T
]
T
,we
121
have

DC
2
6
6
6
6
4
S
0
1
S
0
2
3
7
7
7
7
5
=
DC

DC

DC
]
2
6
6
6
6
4
S
0
1
S
0
2
3
7
7
7
7
5
=
V
0
;n

1
2
6
6
6
6
4
S
0
1
S
0
2
3
7
7
7
7
5
=
R
0
;

DC
S
0
1

T
DC
+
DC

DC
S
0
2

T
DC
=
R
0

T
DC
:
(6.3)
Let
C
=
DC
S
0
1

T
DC
,
D
=
DC
S
0
2

T
DC
,and
b
R
0
=
R
0

T
DC
,then
C
+
DC
D
=
b
R
0
:
(6.4)
Since
C;D
arebothsymmetric,wecansolvethenon-diagonalelementsof
C;D
asfollows:
8
>
>
>
>
<
>
>
>
>
:
C
i;j
+

i

D
i;j
=
b
R
0
i;j
C
i;j
+

j

D
i;j
=
b
R
0
j;i
:
(6.5)
Becausematrices
C
and
D
havethesamestructure,hereweonlyfocuson
C
(corresponding
to
S
0
1
).Itisstraightforwardtoseethatifnode
i
ismaliciousandthereareerrorsinthe
i
th
rowof
R
0
,therewillbeerrorsinthe
i
th
rowof
b
R
0
.Furthermore,therewillbeerrorsinthe
i
th
rowand
i
th
columnof
C
.
S
0
1

T
DC
=
b
S
0
1
,wehave
DC
b
S
0
1
=
C:
Herewecanview
eachcolumnof
C
asan(
n

1
;;n


)MDScodebecause
DC
isaVandermondematrix.
Thelengthofthecodeis
n

1sincethediagonalelementsof
C
isunknown.Supposenode
j
isalegitimatenode,wecandecodetheMDScodetorecoverthe
j
th
columnof
C
andlocate
themaliciousnodes.Eventually
C
canberecovered.SoDCcanreconstructs
S
1
usingthe
methodsimilarto[57].For
S
2
,therecoveringprocessissimilar.
122
6.2.2FractionalRateCode
6.2.2.1Encoding
Forthefractionalratecode,wealsohave

L
=
d=
2,

L
=1foroneblockofdatawiththe
size
B
L
=
8
>
>
>
>
<
>
>
>
>
:
xd
(1+
xd
)
=
2
;x
2
(0
;
0
:
5]

(

+1)
=
2+(
x

0
:
5)
d
(1+(
x

0
:
5)
d
)
;x
2
(0
:
5
;
1]
;
(6.6)
where
x
isthematchfactoroftherate-matchedMSRcode.Itiseasytoseethatthefractional
ratecodewillbecomethefullratecodewith
x
=1.Thedata
m
=[
m
1
;m
2
;:::;m
B
L
]
2
GF
B
L
q
willbeprocessedasfollows:
When
x

0
:
5,thedatawillbearrangedintoasymmetricmatrix
S
1
ofthesize


:
S
1
=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4
m
1
m
2
:::m
xd
0
:::
0
m
2
m
xd
+1
:::m
2
xd

1
0
:::
0
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
m
xd
m
2
xd

1
:::m
B
L
=
2
0
:::
0
00
:::
00
:::
0
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
00
:::
00
:::
0
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
:
(6.7)
Thecodeword
CL
isas
CL
=
2
6
6
6
6
4
S
1
0
3
7
7
7
7
5
=
S
L
;
(6.8)
123
where
0
isthe


zeromatrixand
;

;
arethesameasthefullratecode.
When
x>
0
:
5,the

(

+1)
=
2datawillbearrangedintoan


symmetric
matrix
S
1
.Therestofthedata
m

(

+1)
=
2+1
;:::;m
B
L
willbearrangedintoanother


symmetricmatrix
S
2
:
S
2
=
2
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
4
m

(

+1)
=
2+1
:::m

(

+1)
=
2+
xd
0
:::
0
m

(

+1)
=
2+2
:::m

(

+1)
=
2+2
xd

1
0
:::
0
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
m

(

+1)
=
2+
xd
:::m
B
L
=
2
0
:::
0
0
:::
00
:::
0
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
0
:::
00
:::
0
3
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
5
:
(6.9)
Thecodeword
CL
isthesameasequation(6.1)withthesameparameters
;
and

Theneachrow
cl
i
,0

i<n
,ofthecodewordmatrix
CL
willbestoredinstoragenode
i
respectively,inwhichtheencodingvector

i
isthe
i
th
rowof
6.2.2.2Regeneration
Theregenerationforthefractionalratecodeisthesameastheregenerationforthefullrate
codedescribedinSection6.2.1.2withonlyaminorIfwe
x
(
j
)
asthevector
containingthe
j
symbolsof
S
L

T
z
,therewillbeonly
xd
nonzeroelementsinthevector.
Accordingto
V
0
;n

2

x
(
n

1)
=
p
0
,thereceivedsymbolvector
p
0
forthefractionalratecode
in
Step1
ofAlgorithm6.1canbeviewedasan(
n

1
;xd;n

xd
)MDScode.Since
x<
1,
124
wecandetectandcorrectmoreerrorsindataregenerationusingthefractionalratecode
thanusingthefullratecode.
6.2.2.3Reconstruction
Thereconstructionforthefractionalratecodeissimilartothatforthefullratecodede-
scribedinSection6.2.1.3.Let
R
0
=[
cl
0
0
T
;
cl
0
1
T
;

;
cl
0
n

1
T
]
T
.
Whenthematchfactor
x>
0
:
5,reconstructionforthefractionalratecodeisthesame
tothatforthefullratecode.
When
x

0
:
5,equation(6.3)canbewrittenas:

DC
S
0
1
=
R
0
:
(6.10)
Sowecanvieweachcolumnof
R
0
asan(
n;xd;n

xd
+1)MDScode.Afterdecoding
R
0
to
R
cw
,wecanrecoverthedatamatrix
S
1
bysolvingtheequation
DC
S
1
=
R
cw
:
Meanwhile,
ifthe
i
th
rowsof
R
0
and
R
cw
aret,wecanmarknode
i
ascorrupted.
6.32-LayerRate-matchedMSRCode
Inthissection,wewillshowouroptimizationoftherate-matchedMSRcode:2-layer
rate-matchedMSRcode.Inthecodedesign,weutilizetwolayersoftheMSRcode:the
fractionalratecodeforonelayerandthefullratecodefortheother.Thepurposeof
thefractionalratecodeistocorrecttheerroneoussymbolssentbymaliciousnodesand
locatethecorrespondingmaliciousnodes.Thenwecantreattheerrorsinthereceived
symbolsaserasureswhenregeneratingwiththefullratecode.However,theratesofthe
twocodesmustmatchtoachieveanoptimalperformance.Herewemainlyfocusonthe
rate-matchingfordataregeneration.Wecanseeinthelateranalysisthattheperformance
ofdatareconstructioncanalsobeimprovedwiththisdesigncriterion.
125
Themainideaofthisoptimizationis:theerrorcorrectioncapabilitiesofthe
fractionalratecodeandthefullratecodebymakingtheirratematched,thenoptimizing
thedatastoragebyadjustingthenumberofdatablocksoftcodes.
6.3.1RateMatching
Fromtheanalysisabove,weknowthatduringregenerationthefractionalratecodecan
correctupto
b
(
n

xd

1)
=
2
c
errors,whicharemorethan
b
(
n

d

1)
=
2
c
errorsthatthe
fullratecodecancorrect.Inthe2-layerrate-matchedMSRcodedesign,ourgoalistomatch
thefractionalratecodewiththefullratecode.Themaintaskforthefractionalratecode
istodetectandcorrecterrors,whilethemaintaskforthefullratecodeistomaintainthe
storage.Soifthefractionalratecodecanlocateallthemaliciousnodes,thefull
ratecodecansimplytreatthesymbolssentfromthesemaliciousnodesaserasures,which
requirestheminimumredundancyforthefullratecode.Thefullratecodecancorrectup
to
n

d

1erasures.Thuswehavethefollowingoptimalrate-matchingequation:
b
(
n

xd

1)
=
2
c
=
n

d

1
;
(6.11)
fromwhichwecanderivethematchfactor
x
.
6.3.2Encoding
Toencodeawithsize
B
F
usingthe2-layerrate-matchedMSRcode,thewill
bedividedinto

H
blocksofdatawiththesize
B
H
and

L
blocksofdatawiththesize
B
L
,
wheretheparametersshouldsatisfy
B
F
=

H
B
H
+

L
B
L
:
(6.12)
Thenthe

H
blocksofdatawillbeencodedintocodematrices
CH
1
;:::;CH

H
usingthe
fullratecodeandthe

L
blocksofdatawillbeencodedintocodematrices
CL
1
;:::;CL

L
usingthefractionalratecode.Topreventthemaliciousnodesfromcorruptingthefractional
126
ratecodeonly,thesecureserverwillrandomlyconcatenateallthematricestogethertoform
the
n


(

H
+

L
)codewordmatrix:
CM
=[Perm(
CH
1
;:::;CH

H
;CL
1
;:::;CL

L
)]
;
(6.13)
wherePerm(

)istherandommatricespermutationoperation.Thesecureseverwillalso
recordtheorderofthepermutationforfuturecoderegenerationandreconstruction.Then
eachrow
c
i
=[Perm(
ch
1
;i
;:::;
ch

H
;i
;
cl
1
;i
;:::;
cl

L
;i
)],0

i

n

1,ofthecodeword
matrix
CM
willbestoredinstoragenode
i
,where
ch
j
;i
isthe
i
th
rowof
CH
j
,1

j


H
,
and
cl
j
;i
isthe
i
th
rowof
CL
j
,1

j


L
.Theencodingvector

i
forstoragenode
i
isthe
i
th
rowofinequation(6.1).
6.3.3Regeneration
Supposenode
z
fails,thesecurityserverwillinitializeareplacementnode
z
0
withtheorder
informationofthefractionalratecodeandthefullratecodeinthe2-layerrate-matched
MSRcode.Thenthereplacementnode
z
0
willsendregenerationrequeststotherestof
n

1
helpernodes.Uponreceivingtheregenerationrequest,helpernode
i
willcalculateandsend
outthehelpsymbol
p
i
=
c
i

T
z
.
z
0
willperformAlgorithm6.2toregeneratethecontentsof
thefailednode
z
.Aftertheregenerationisished,
z
0
willerasetheorderinformation.So
evenif
z
0
wascompromisedlater,theadversarywouldnotgetthepermutationorderofthe
fractionalratecodeandthefullratecode.
Algorithm6.2
z
0
RegeneratesSymbolsoftheFailedNode
z
forthe2-layerRate-matched
MSRCode
Step1:
Accordingtotheorderinformation,regenerateallthesymbolsrelatedtothe

L
datablocksencodedbythefractionalratecode,usingAlgorithm6.1.Iferrorsare
detectedinthesymbolssentbynode
i
,itwillbemarkedasamaliciousnode.
Step2:
Regenerateallthesymbolsrelatedtothe

H
datablocksencodedbythefullrate
code,usingAlgorithm6.1.Duringtheregeneration,allthesymbolssentfromnodes
markedasmaliciousnodeswillbereplacedbyerasures
N
.
127
ItiseasytoseethatAlgorithm6.2cancorrecterrorsandlocatemaliciousnodeusing
thefractionalratecodewhileachievehighstorageusingthefullratecode.
6.3.4ParametersOptimization
Wehavethefollowingdesignrequirementsforagivendistributedstoragesystemapplying
the2-layerrate-matchedMSRcode:

Themaximumnumberofmaliciousnodes
M
thatthesystemcandetectandlocate
usingthefractionalratecode.Wehave
b
(
n

xd

1)
=
2
c
=
M:
(6.14)

Theprobability
P
det
thatthesystemcandetectallthemaliciousnodes.Thedetection
willbesuccessfulifeachmaliciousnodemoatleastonehelpsymbolcorresponding
tothefractionalratecodeandsendsittothereplacementnode.Supposethemalicious
nodesmodifyeachhelpsymboltobesenttothereplacementnodewithprobability
P
,wehave
(1

(1

P
)

L
)
M

P
det
:
(6.15)
Sothereisabetween

L
and

H
:thenumberofdatablocksencodedbythe
fractionalratecodeandthenumberofdatablocksencodedbythefullratecode.Ifweencode
usingtoomuchfullratecode,wemaynotmeetthedetectionprobability
P
det
requirement.
Iftoomuchfractionalratecodeisused,theredundancymaybetoohigh.
Thestorageisastheratiobetweentheactualsizeofdatatobestored
andthetotalstoragespaceneededbytheencodeddata:

S
=

H
B
H
+

L
B
L
(

H
+

L
)

=
B
F
(

H
+

L
)

:
(6.16)
Thuswecancalculatetheoptimizedparameters
x
,
d
,

H
,

L
bymaximizingequation(6.16)
undertheconstraintsbyequations(6.11),(6.12),(6.14),(6.15).
128
d
and
x
canbedeterminedbyequation(6.11)and(6.14):
d
=
n

M

1
;
(6.17)
x
=(
n

2
M

1)
=
(
n

M

1)
:
(6.18)
Since
B
F
isconstant,tomaximize

S
isequaltominimize

H
+

L
.Sowecanrewritethe
optimizationproblemasfollows:
Minimize

H
+

L
;
subjectto(6
:
12)and(6
:
15)
:
(6.19)
Thisisasimplelinearprogrammingproblem.Herewewillshowtheoptimizationresults
directly:

L
=
log
(1

P
)
(1

P
1
=M
det
)
;
(6.20)

H
=(
B
F


L
B
L
)
=B
H
:
(6.21)
Weassumethatwearestoringlargewhichmeans
B
F
>
L
B
L
.Soanoptimalsolution
forthe2-layerrate-matchedMSRcodecanalwaysbefound.Wehavethefollowingtheorem:
Theorem6.1.
Whenthenumberofblocksofthefractionalratecode

L
equalsto
log
(1

P
)
(1

P
1
=M
det
)
andthenumberofblocksofthefullratecode

H
equalsto
(
B
F


L
B
L
)
=B
H
,the
2-layerrate-matchedMSRcodecanachievetheoptimalstorage
6.3.5Reconstruction
WhenDCneedstoreconstructtheoriginalitwillsendreconstructionrequeststo
n
storagenodes.Uponreceivingtherequest,node
i
willsendoutthesymbolvector
c
i
.
Suppose
c
0
i
=
c
i
+
e
i
istheresponsefromthe
i
th
storagenode.If
c
i
hasbeenmo
bythemaliciousnode
i
,wehave
e
i
2
(
GF
q
)

(

L
+

H
)
nf
0
g
.SinceDChasthepermutation
informationofthefractionalratecodeandthefullratecode,similartotheregenerationof
the2-layerrate-matchedMSRcode,DCwillperformthereconstructionusingAlgorithm6.3.
129
Algorithm6.3
DCReconstructstheOriginalFileforthe2-layerRate-matchedMSRCode
Step1:
Accordingtotheorderinformation,reconstructeachofthe

L
datablocksencoded
bythefractionalratecodeandlocatethemaliciousnodes.
Step2:
Reconstructeachofthedatablocksencodedbythefullratecode.Duringthere-
construction,allthesymbolssentfrommaliciousnodeswillbereplacedbyerasures
N
.
6.3.5.1OptimizedParameters
InSection6.3.4,weoptimizedtheparametersforthedataregeneration,consideringthe
betweenthesuccessfulmaliciousnodedetectionprobabilityandthestorage
ciency.Herewewillshowthatthesameparameterscanguaranteethatthesameconstraints
bedforthedatareconstruction.

Themaximumnumberofmaliciousnodescanbedetectedforthedatareconstruc-
tionisnosmallerthat
M
:if
x>
0
:
5,thenumberis
b
(
n


1)
=
2
c
.Wehave
b
(
n


1)
=
2
cb
(
n

xd

1)
=
2
c
=
M
.If
x

0
:
5,thenumberis
b
(
n

xd
)
=
2
c
.We
have
b
(
n

xd
)
=
2
cb
(
n

xd

1)
=
2
c
=
M

Thesuccessfulmaliciousnodedetectionprobabilityforthedatareconstructionislarger
than
P
det
:theprobabilityis(1

(1

P
)

L
)
M
,sowehave(1

(1

P
)

L
)
M
>
(1

(1

P
)

L
)
M

P
det
.
Althoughtherate-matchingequation(6.11)doesnotapplytothedatareconstruction,
thereconstructionstrategyinAlgorithm6.3canstillbfromthetratesofthe
twocodes.When
x

0
:
5,thefractionalratecodecandetectandcorrect
b
(
n

xd
)
=
2
c
maliciousnodes,whicharemorethan
b
(
n

d=
2

1)
=
2
c
maliciousnodesthefullratecode
candetect.When
x>
0
:
5,thefullratecodeandthefractionalratecodecandetectand
correctthesamenumberofmaliciousnodes:
b
(
n


1)
=
2
c
.
130
Figure6.1Thenumberoffractional/fullratecodeblocksfort
P
det
Fromtheanalysisabovewecanseethatthesameoptimizedparameters,whichareob-
tainedforthedataregeneration,canmaintaintheoptimizedbetweenthemalicious
nodedetectionandstorageforthedatareconstruction.
6.3.6PerformanceEvaluation
Fromtheanalysisabove,weknowthatforadistributedstoragesystemwith
n
storage
nodesoutofwhichatmost
M
nodesaremalicious,the2-layerrate-matchedMSRcodecan
guaranteedetectionandcorrectionofthemaliciousnodesduringthedataregenerationand
reconstructionwiththeprobabilityatleast
P
det
.
Foradistributedstoragesystemwith
n
=30,
M
=11and
P
=0
:
2,supposewehave
awiththesize
B
F
=14000
M
symbolstobestoredinthesystem.Thenumberofthe
fractionalratecodeblocks

L
andthenumberofthefullratecodeblocks

H
fort
detectionprobabilities
P
det
areshowninFigure.6.1.Fromthewecanseethatthe
numberoffractionalratecodeblockswillincreasewhenthedetectionprobabilitybecomes
larger.Accordingly,thenumberoffullratecodeblockswilldecrease.
FortheRS-MSRcodeconstructedin[83],theofthecodewiththesamere-
131
Figure6.2ratiosbetweenthe2-layerrate-matchedMSRcodeandtheRS-MSR
codefort
P
det
generationperformanceasthe2-layerrate-matchedMSRcodeisas

0
S
=

0
(

0
+1)

0
n
=

0
+1
n
=
xd=
2+1
n
:
(6.22)
InFigure.6.2wewillshowtheratios

=

S

0
S
betweenthe2-layerrate-matched
MSRcodeandtheRS-MSRcodeundertdetectionprobabilities
P
det
.Fromthe
wecanseethatthe2-layerrate-matchedMSRcodehashigherthanthe
RS-MSRcode.Whenthesuccessfulmaliciousnodesdetectionprobabilityis0
:
999999,the
ofthe2-layerrate-matchedMSRcodeisabout70%higher.
6.4
m
-LayerRate-matchedMSRCode
Inthissection,wewillshowoursecondoptimizationoftherate-matchedMSRcode:
m
-layer
rate-matchedMSRcode.Inthecodedesign,weextendthedesignconceptofthe2-layer
rate-matchedMSRcode.InsteadofencodingthedatausingtwoMSRcodeswitht
matchfactors,weutilize
m
layersofthefullrateMSRcodeswithtparameter
d
,
writtenas
d
i
forlayer
L
i
,1

i

m
,whichsatisfy
d
i

d
j
;
8
1

i

j

m:
(6.23)
132
Thedatawillbedividedinto
m
partsandeachpartwillbeencodedbyadistinctfullrate
MSRcode.Accordingtotheanalysisabove,thecodewithalowercoderatehasbettererror
correctioncapability.
Thecodewordswillbedecodedlayerbylayerintheorderfromlayer
L
1
tolayer
L
m
.
Thatis,thecodewordsencodedbythefullrateMSRcodewithalower
d
willbedecoded
priortothoseencodedbythefullrateMSRcodewithahigher
d
.Iferrorswerefoundbythe
fullrateMSRcodewithalower
d
,thecorrespondingnodeswouldbemarkedasmalicious.
Thesymbolssentfromthesenodeswouldbetreatedaserasuresinthesubsequentdecoding
ofthefullrateMSRcodeswithhigher
d
's.Thepurposeofthisarrangementistotryto
correctasmanyaserroneoussymbolssentbymaliciousnodesandlocatethecorresponding
maliciousnodesusingthefullrateMSRcodewithalowerrate.However,theratesofthe
m
fullrateMSRcodesmustmatchtoachieveanoptimalperformance.Herewemainly
focusontherate-matchingfordataregeneration.Wecanseeinthelateranalysisthatthe
performanceofdatareconstructioncanalsobeimprovedwiththisdesigncriterion.
Themainideaofthisoptimizationistooptimizetheoverallerrorcorrectioncapability
bymatchingthecoderatesoftfullrateMSRcodes.
6.4.1RateMatchingandParametersOptimization
AccordingtoSection6.2.1.2,thefullrateMSRcode
CH
i
forlayer
L
i
canbeviewedasan
(
n

1
;d
i
;n

d
i
)MDScodefor1

i

m
.Duringtheoptimization,wesetthesummation
ofthe
d
'sofallthelayerstoaconstant
d
:
m
X
i
=1
d
i
=
d:
(6.24)
HerewewillshowtheoptimizationthroughanillustrativeexampleThenwewill
presentthegeneralresult.
133
6.4.1.1Optimizationfor
m
=3
TherearethreelayersoffullrateMSRcodesfor
m
=3:
CH
1
,
CH
2
and
CH
3
.
Thelayercode
CH
1
cancorrect
t
1
errors:
t
1
=
b
(
n

d
1

1)
=
2
c
=(
n

d
1

1

"
1
)
=
2
;
(6.25)
where
"
1
=0or1dependingonwhether(
n

d
1

1)
=
2isevenorodd.
Byregardingthesymbolsfromthe
t
1
nodeswhereerrorsarefoundby
CH
1
aserasures,
thesecondlayercode
CH
2
cancorrect
t
2
errors:
t
2
=
b
(
n

d
2

1

t
1
)
=
2
c
+
t
1
=(
n

d
2

1

t
1

"
2
)
=
2+
t
1
=(2(
n

d
2
)+
n

d
1

2
"
2

"
1

3)
=
4
;
(6.26)
where
"
2
=0or1,withtherestrictionthat
n

d
2

1

t
1
,whichcanbewrittenas:

d
1
+2
d
2

n
+
"
1

1
:
(6.27)
Thethirdlayercode
CH
3
alsotreatthesymbolsfromthe
t
2
nodesaserasures.
CH
3
can
correct
t
3
errors:
t
3
=
b
(
n

d
3

1

t
2
)
=
2
c
+
t
2
=(
n

d
3

1

t
2

"
2
)
=
2+
t
2
(6.28)
=(4(
n

d
3
)+2(
n

d
2
)+
n

d
1

4
"
3

2
"
2

"
1

7)
=
8
;
where
"
3
=0or1,withtherestrictionthat
n

d
3

1

t
2
,whichcanbewrittenas:

d
1

2
d
2
+4
d
3

n
+
"
1
+2
"
2

1
:
(6.29)
Accordingtotheanalysisabove,the
d
'softhethreelayerssatisfy:
d
1

d
2

0
;
(6.30)
d
2

d
3

0
:
(6.31)
134
Andwecanrewriteequation(6.24)as:
d
1
+
d
2
+
d
3

d;
(6.32)

d
1

d
2

d
3

d:
(6.33)
Tomaximizetheerrorcorrectioncapabilityofthe
m
-layerrate-matchedMSRcodefor
m
=3,wehavetomaximize
t
3
,thenumberoferrorsthatthethirdlayercode
CH
3
can
correct,since
t
3
hasincludedallthemaliciousnodesfromwhicherrorsarefoundbythe
codesoftwolayers.Withalltheconstraintslistedabove,theoptimizationproblemcan
writtenas:
Maximize
t
3
in(6
:
28)
;
subjectto(6
:
27)
;
(6
:
29)
;
(6
:
30)
;
(6
:
31)
;
(6
:
32)
;
(6
:
33)
:
(6.34)
Nowwehavechangedthisoptimizationproblemintoatypicallinearprogrammingprob-
lem.Afterverifyingthislinearprogrammingproblemhasafeasiblesolution,wesolveit
usingtheSIMPLEXalgorithm[112].When
d
1
=
d
2
=
d
3
=Round(
d=
3)=
e
d
,the
m
-layer
rate-matchedMSRcodecancorrecterrorsfromatmost
e
t
3
=(7
n

7
e
d

4
"
3

2
"
2

"
1

7)
=
8

(7
n

7
e
d

14)
=
8(worstcase)
(6.35)
maliciousnodes,whereRound(

)istheroundingoperation.
6.4.1.2EvaluationoftheOptimizationfor
m
=3
Similartothestorage

S
inSection6.3,herewecantheerror
correction

C
ofthe
m
-layerrate-matchedMSRcodeastheratiobetweenthe
maximumnumberofmaliciousnodesthatcanbefoundandthetotalnumberofstorage
nodesinthenetwork:

C
=(7
n

7
e
d

14)
=
(8
n
)
:
(6.36)
135
Figure6.3Comparisonoftheerrorcorrectioncapabilitybetween
m
-layerrate-matched
MSRcodefor
m
=3andRS-MSRcode
TheRS-MSRcodewiththesamecoderatecanbeviewedasan(
n

1
;
e
d;n

e
d
)MDScode
whichcancorrecterrorsfromatmost(
n

e
d

1)
=
2maliciousnodes(bestcase).Sotheerror
correction

0
C
is

0
C
=(
n

e
d

1)
=
(2
n
)
:
(6.37)
Thecomparisonoftheerrorcorrectioncapabilitybetween
m
-layerrate-matchedMSRcode
for
m
=3andRS-MSRcodeisshowninFigure.6.3.Inthiscomparison,wesetthenumber
ofstoragenodesinthenetwork
n
=30.Fromthewecanseethatthe
m
-layer
rate-matchedMSRcodefor
m
=3improvestheerrorcorrectionmorethan50%.
6.4.1.3GeneralOptimizationResult
Forthegeneral
m
-layerrate-matchedMSRcode,theoptimizationprocessissimilar.
Thelayercode
CH
1
cancorrect
t
1
errorsasinequation(6.25).Byregardingthe
symbolsfromthe
t
i

1
nodeswhereerrorsarefoundby
CH
i

1
aserasures,the
i
th
layercode
136
cancorrect
t
i
errorsfor2

i

m
:
t
i
=
b
(
n

d
i

1

t
i

1
)
=
2
c
+
t
i

1
=(
n

d
i

1

t
i

1

"
i
)
=
2+
t
i

1
=(
P
i
j
=1
2
j

1
(
n

d
j
)

P
i
j
=1
2
j

1
"
j

2
i
+1)
=
2
i
;
(6.38)
where
"
i
=0or1,withtherestrictionthat
n

d
i

1

t
i

1
,whichcanbewrittenas:

i

1
X
j
=1
2
j

1
d
j
+2
i

1
d
i

n
+
i

1
X
j
=1
2
j

1
"
j

1
:
(6.39)
Similarly,theparameter
d
ofthe
i
th
layerfor2

i

m
mustsatisfy
d
i

1

d
i

0
:
(6.40)
Andequation(6.24)canbewrittenas:
m
X
j
=1
d
j

d;
(6.41)

m
X
j
=1
d
j

d:
(6.42)
Wecanmaximizetheerrorcorrectioncapabilityofthe
m
-layerrate-matchedMSRcode
bymaximizing
t
m
.Withalltheconstrainslistedabove,theoptimizationproblemcanbe
writtenas:
Maximize
t
i
for
i
=
m
in(6
:
38)
;
subjectto(6
:
39)and(6
:
40)for2

i

m;
(6
:
41)
;
(6
:
42)
:
(6.43)
Afterverifyingthatthislinearprogrammingproblemhasafeasiblesolution,wecanuse
theSIMPLEXalgorithmtosolveit.Theoptimizationresultcanbesummarizedasfollows:
Theorem6.2.
Forthe
m
-layerrate-matchedMSRcode,when
d
i
=
Round
(
d=m
)=
e
d
for
1

i

m;
(6.44)
137
itcancorrecterrorsfromatmost
e
t
m
=((2
m

1)(
n

e
d
)

m
X
j
=1
2
j

1
"
j

2
m
+1)
=
2
m

((2
m

1)(
n

e
d
)

2
m
+1
+2)
=
2
m
(
worstcase
)
:
(6.45)
maliciousnodes.
Theerrorcorrectionforthe
m
-layerrate-matchedMSRcodeis

C
=((2
m

1)(
n

e
d
)

2
m
+1
+2)
=
(2
m
n
)
:
(6.46)
Thisisamonotonicallyincreasingfunctionfor
m
,sowehave:
Corollary1.
Theerrorcorrectionofthe
m
-layerrate-matchedMSRcodein-
creaseswithm,whichisthenumberoflayers.
Remark6.
Duringtheoptimization,wesetthecoderateoftherate-matchedMSRcodetoa
constantvalueandmaximizetheerrorcorrectioncapability.Tooptimizingtherate-matched
MSRcode,wecanalsosettheerrorcorrectioncapability
t
i
for
i
=
m
in(6.38)toaconstant
value
t
m
=

t
(6.47)
andmaximizethecoderate.Theproblemcanbewrittenas:
Maximize
P
m
j
=1
d
j
subjectto
(6
:
39)
and
(6
:
40)
for
2

i

m;
(6
:
47)
:
(6.48)
Theoptimizationresultisthesameasthatof(6.43):whenallthe
d
0
i
s
for
1

i

m
arethe
same,thecoderateismaximized.
d
i
,
1

i

m
,thefollowingequation:
d
i

n

2
m

t
+2
m
+1

2
2
m

1
(worstcase)
:
(6.49)
138
Figure6.4Comparisonoferrorcorrectioncapabilitybetweenthe
m
-layerratematched
MSRcodeandtheH-MSRcode
6.4.1.4EvaluationoftheOptimization
Althoughatthebeginningofthissectionweproposetodecodethecodewithalowerrate
inthe
m
-layerrate-matchedMSRcode,equation(6.44)showsthatwecangetthe
optimizederrorcorrectioncapabilitywhenalltheratesofthecodesinthe
m
-layercodeare
equal.However,thisresultisnotinwithourassumptioninequation(6.23).
ComparisonwiththeHermitiancodebasedMSRcodein[88]
TheHermitian
codebasedMSRcode(H-MSRcode)in[88]hasbettererrorcorrectioncapabilitythan
theRS-MSRcode.However,becausethestructureoftheunderlyingHermitiancodeis
predetermined,theerrorcorrectioncapabilitymightnotbeoptimal.In6.4,the
maximumnumberofmaliciousnodesfromwhichtheerrorscanbecorrectedbytheH-MSR
codeisshown.Herewesettheparameter
q
oftheHermitiancode[87]from4to16witha
stepof2.Inthewealsoplottheperformanceofthe
m
-layerrate-matchedMSRcode
withthesamecoderatesastheH-MSRcode.Thecomparisonresultdemonstratesthat
therate-matchedMSRcodehasbettererrorcorrectioncapabilitythantheH-MSRcode.
Moreover,therate-matchedcodeiseasiertounderstandandhasmoreythanthe
H-MSRcode.
139
Figure6.5Theoptimalerrorcorrectionofthe
m
-layerrate-matchedMSRcode
undertmfor2

m

16
Relationshipbetweentheno.oflayersanderrorcorrection
Sincewe
haveseentheadvantageoftherate-matchedMSRcodeovertheRS-MSRcodeinSec-
tion6.4.1.2,herewewillmainlydiscusshowthenumberoflayerscantheerror
correction.Theerrorcorrectionofthe
m
-layerrate-matchedMSRcode
isshownisFigure.6.5,whereweset
n
=30and
d
=50.Wealsoplottheerrorcorrection


0
C
oftheRS-MSRcodewithsamecoderatesforcomparison.Fromthe
wecanseethatwhen
n
and
d
aretheoptimalerrorcorrectionwillincrease
withthenumberoflayers
m
asinCorollary1.
Optimizedstoragecapacity
Moreover,theoptimizationconditioninequation(6.44)
alsoleadstomaximumstoragecapacitybesidestheoptimalerrorcorrectioncapability.We
havethefollowingtheorem:
Theorem6.3.
The
m
-layerrate-matchedMSRcodecanachievethemaximumstorageca-
pacityiftheparameter
d
'sofallthelayersarethesame,undertheconstraintinequa-
tion(6.24).
Proof.
Thecodeofthe
i
th
layercanstoreoneblockofdatawiththesize
B
i
=

i
(

i
+1)=
(
d
i
=
2)(
d
i
=
2+1).Sothe
m
-layercodecanstoredatawiththesize
B
=
P
m
i
=1
(
d
i
=
2)(
d
i
=
2+1).
140
Ourgoalhereistomaximize
B
undertheconstraintinequation(6.24).
WecanuseLagrangemultiplierstothepointofmaximum
B
.Let

L
(
d
1
;:::;d
m
;
)=
m
X
i
=1
(
d
i
=
2)(
d
i
=
2+1)+

(
m
X
i
=1
d
i

d
)
:
(6.50)
Wecanthemaximumvalueof
B
bysettingthepartialderivativesofthisequationto
zero:
@

L
@d
i
=
d
i
+1
2


=0
;
8
1

i

m:
(6.51)
Herewecanseethatwhenalltheparameter
d
'sofallthelayersarethesame,wecanget
themaximumstoragecapacity
B
.Thismaximizationconditioncoincideswiththeoptimiza-
tionconditionforachievingthegoalofthissection:optimizingtheoverallerrorcorrection
capabilityoftherate-matchedMSRcode.
6.4.2PracticalConsiderationoftheOptimization
Sofar,weimplicitlypresumethatthereisonlyonedatablockofthesize
B
i
=

i
(

i
+1)
foreachlayer
i
.Inpracticaldistributedstorage,itistheparameter
d
i
thatisinstead
of
d
,thesummationof
d
i
.However,aslongasweuse
m
layersofMSRcodeswiththesame
parameter
d
=
e
d
,wewillstillgettheoptimalsolutionfor
d
=
m
e
d
.Infact,the
m
-layer
rate-matchedMSRcodeherebecomesasinglefullrateMSRcodewithparameter
d
=
e
d
and
m
datablocks.Andbasedonthedependentdecodingideawedescribeatthebeginningof
Section6.4,wecanachievetheoptimalperformance.
Sowhenthesize
B
F
islargerthanonedatablocksize
e
B
ofthesinglefullrateMSR
codewithparameter
d
=
e
d
,wewilldividetheinto
d
B
F
=
e
B
e
datablocksandencode
themseparately.Ifwedecodethesedatablocksdependently,wecangettheoptimalerror
correction.
141
Figure6.6Theoptimalerrorcorrectionfor2

m

16
6.4.2.1EvaluationoftheOptimalErrorCorrection
Inthepracticalcase,
e
d
inequation(6.46)isSoherewewillstudytherelationshipbe-
tweenthenumberofdependentlydecodingdatablocks
m
andtheerrorcorrection

C
,whichisshowninFigure.6.6.Weset
n
=30and
e
d
=5
;
10.Fromthewecan
seethatalthough

C
willbecomehigherwiththeincreasingofdependentlydecodingdata
blocks
m
,theimprovementwillbenegligiblefor
m

8.Actuallywhen
m
=7the
hasalreadybecome99%oftheupperboundof

C
.
Ontheotherhand,thereexistparallelalgorithmsforfastMDScodedecoding[113].We
candecodeblocksofMDScodewordsparallelinapipelinefashiontoacceleratetheoverall
decodingspeed.Themoreblocksofcodewordswedecodeparallel,thefasterwewill
thewholedecodingprocess.Forlargethatcouldbedividedintoalargeamountofdata
blocks(

blocks),wecangetabetweentheoptimalerrorcorrectionyand
thedecodingspeedbysettingthenumberofdependentlydecodingdatablocks
m
andthe
numberofparalleldecodingdatablocks
ˆ
undertheconstraint

=
mˆ
.
6.4.3Encoding
Fromtheanalysisaboveweknowthattoencodeawithsize
B
F
usingtheoptimal
m
-layer
rate-matchedMSRcodeistoencodetheusingafullrateMSRcodewithpredetermined
142
Figure6.7Latticeofreceivedhelpsymbolsforregeneration
parameter
d
=2

=
e
d
.Firstthewillbedividedinto

blocksofdatawithsize
e
B
,where

=
d
B
F
=
e
B
e
.Thenthe

blocksofdatawillbeencodedintocodematrices
CH
1
;:::;CH

andformthe
n


codewordmatrix:
CM
=[
CH
1
;:::;CH

].Eachrow
c
i
=
[
ch
1
;i
;:::;
ch

;i
],0

i

n

1,ofthecodewordmatrix
CM
willbestoredinstoragenode
i
,where
ch
j
;i
isthe
i
th
rowof
CH
j
,1

j


.Theencodingvector

i
forstoragenode
i
is
the
i
th
rowofinequation(6.1).
6.4.4Regeneration
Supposenode
z
fails,thereplacementnode
z
0
willsendregenerationrequeststotherestof
n

1helpernodes.Uponreceivingtheregenerationrequest,helpernode
i
willcalculate
andsendoutthehelpsymbol
p
i
=
c
i

T
z
.
Aswediscussabove,combiningbothdependentdecodingandparalleldecodingcan
achievethebetweenoptimalerrorcorrectionanddecodingspeed.Al-
thoughall

blocksofdataareencodedwiththesameMSRcode,
z
0
willplacethereceived
helpsymbolsintoa2-dimensionlatticewithsize
m

ˆ
asshowninFigure.6.7.Ineach
143
gridofthelatticethereare
n

1helpsymbolscorrespondingtoonedatablock,received
from
n

1helpernodes.Wecanvieweachrowofthelatticeasrelatedtoalayerofan
m
-layerrate-matchedMSRcodewith
ˆ
blocksofdata,whichwillbedecodedparallel.We
alsovieweachcolumnofthelatticeasrelatedto
m
layersofan
m
-layerrate-matchedMSR
codewithoneblockofdataeachlayer,whichwillbedecodeddependently.
z
0
willperform
Algorithm6.4toregeneratethecontentsofthefailednode
z
.
Algorithm6.4
z
0
RegeneratesSymbolsoftheFailedNode
z
forthe
m
-layerRate-matched
MSRCode
ArrangethereceivedhelpsymbolsaccordingtoFigure.6.7.Repeatthefollowingstepsfrom
Layer
1
toLayer
m
:
Step1:
Foracertaingrid,iferrorsaredetectedinthesymbolssentbynode
i
inprevious
layersofthesamecolumn,replacethesymbolsentfromnode
i
byanerasure
N
.
Step2:
Parallelregenerateallthesymbolsrelatedto
ˆ
datablocksusingthealgorithm
similartoAlgorithm6.1withonlyoneparalleldecodeallthe
ˆ
MDS
codesin
Step1
ofAlgorithm6.1.
6.4.5Reconstruction
WhenDCneedstoreconstructtheoriginalitwillsendreconstructionrequeststo
n
storagenodes.Uponreceivingtherequest,node
i
willsendoutthesymbolvector
c
i
.
Suppose
c
0
i
=
c
i
+
e
i
istheresponsefromthe
i
th
storagenode.If
c
i
hasbeenmo
bythemaliciousnode
i
,wehave
e
i
2
(
GF
q
)

nf
0
g
.Thestrategyofcombiningdependent
decodingandparalleldecodingforreconstructionissimilartothatforregeneration.
DC
will
placethereceivedsymbolsintoa2-dimensionlatticewithsize
m

ˆ
.Theonly
isthatinagridofthelatticethereare
n
symbolvectors
ch
j
0
;
0
;:::;
ch
j
0
;n

1
corresponding
todatablock
j
,receivedfrom
n
storagenodes.DCwillperformthereconstructionusing
Algorithm6.5.
144
Algorithm6.5
DCReconstructstheOriginalFileforthe
m
-layerRate-matchedMSRCode
ArrangethereceivedsymbolssimilartoFigure.6.7.Hereweplacereceivedcodewordmatrix
CH
0
j
intogrid
j
insteadofhelpsymbolsreceivedfromn-1helpnodes.Repeatthefollowing
stepsfromLayer
1
toLayer
m
:
Step1:
Foracertaingrid,iferrorsaredetectedinthesymbolssentbynode
i
inprevious
layersofthesamecolumn,replacesymbolssentfromnode
i
byerasures
N
.
Step2:
Parallelreconstructallthesymbolsofthe
ˆ
datablocksusingthealgorithmsimilar
toSection6.2.1.3withonlyonedparalleldecodealltheMDScodesin
Section6.2.1.3.
6.4.5.1OptimizedParameters
FromSection6.4.1weknowthatforregenerationofanoptimal
m
-layerrate-matchedMSR
code,theparameter
d
'sofallthelayersarethesame,whichimpliestheparameter

'sof
alllayersarealsothesame.Sincetheoptimizationofregenerationisderivedbasedonthe
decodingof(
n

1
;d;n

d
)MDScodesandinreconstructionwehavetodecode(
n

1
;;n


)
MDScodes,iftheparameter

'sofallthelayersarethesame,wecanachievethesame
optimizationresultsforreconstruction.
145
CHAPTER7
CONCLUSIONS
Inthisdissertation,westudythesecureproblemsinnetworkcodinganddistributedstorage.
Weproposeandanalysisschemesforcombatingpollutionattacksinnetworkcodingand
combatingmaliciousattacksindistributedstorage.
Forcombatingpollutionattacksinednetworkcoding,weanalyzetherelationshipbe-
tweentheerrorcontrolcodingandthenetworkcodinginunicastcaseandprovethatthetwo
codesareessentiallycorrelated.Furthermore,weextendthiscorrelationtomulticastcase.
Thisresearchprovidesamethodologytodesigntnetworkcodingschemebasedon
thecommunicationchannelanderrorcontrolcodingschemestocombatthecommunication
errorsandnodecompromisingattacks.
Atthesametime,weanalyzetherelationshipbetweenthecascadederrorcontrolcodes
andthenetworkcodeinunicastcaseandprovethatthetwocodesareessentiallycorrelated.
Thenweextendthiscorrelationtomulticastcase.Thisresearchprovidesanewmethodology
thatcancombatthecommunicationerrorsandnodecompromisingattacksbydesigning
tnetworkcodingschemebasedoncascadederrorcontrolcodesandfullyutilizingthe
innerstructureofnetworkcodes.
Forcombatingpollutionattacksinrandomnetworkcoding,ourpurposeistoguarantee
aminimumthroughputevenforheavilypollutednetworkenvironments.Weintro-
ducedanerrordetectionanderrorcorrection(EDEC)scheme.Byutilizingtheinformation
availableinthecorruptedpackets,thenetworkthroughputcanbemaintainedwithonlya
slightincreaseofthecomputationaloverheadwhenmoderatepollutionattackspresent.To
dealwithnetworkenvironmentwithheavypollution,weintroducedLEDECschemethat
enableschannelinformationbeexploitedandbeliefpropagationalgorithm(BPA)beused
forthepacketsymbolrecovery.Thisschemecanguaranteethethroughputundertheheavy
146
pollution.WeformulatedthethroughputoftheLEDECschemethroughboththeoretical
analysisandcomprehensiveevaluation.Ourextensivesimulationresultsderivedinns-2
platformshowthatthetheoreticalresultsareachievableinpracticalenvironments.
Forcombatingmaliciousattacksindistributedstorage,wedevelopaHermitiancode
basedminimumstorageregeneration(H-MSR)codeandaHermitiancodebasedminimum
bandwidthregeneration(H-MBR)codefordistributedstorage.Duetothestructureof
Hermitiancode,ourproposedcodescantlyimprovetheperformanceoftheregen-
eratingcodeundermaliciousattacks.Inparticular,thesecodescandealwitherrorsbeyond
themaximumdistanceseparable(MDS)code.Ourtheoreticalanalysesdemonstratethat
theH-MSR/H-MBRcodeshavelowercomplexitythantheReed-Solomonbasedminimum
storageregeneration(RS-MSR)codeandtheReed-Solomonbasedminimumbandwidthre-
generation(RS-MBR)codeinbothregenerationandreconstruction.
Wealsodeveloptworate-matchedminimumstorageregeneration(MSR)codesformali-
ciousnodesdetectionandcorrectioninhostilenetworks:2-layerrate-matchedMSRcodeand
m
-layerrate-matchedMSRcode.Weproposetheencoding,regenerationandreconstruction
algorithmsforbothcodes.Forthe2-layerrate-matchedcode,weoptimizetheparameters
forthedataregeneration,consideringthebetweenthemaliciousnodesdetection
probabilityandthestorage.Theoreticalanalysisshowsthatthecodecansuccess-
fullydetectandcorrectmaliciousnodesusingtheoptimizedparameters.Ouranalysisalso
showsthatthecodehashigherstoragecomparedtotheRS-MSRcode(70%higher
forthedetectionprobability0
:
999999).Thenweextendthe2-layercodeto
m
-layercode
andoptimizetheoverallerrorcorrectionbymatchingthecoderateofeachlayer's
MSRcode.Theoreticalanalysisshowsthattheoptimizedparametercouldalsoachievethe
maximumstoragecapacityunderthesameconstraint.Furthermore,analysisshowsthat
comparedtotheRS-MSRcode,ourcodecanimprovetheerrorcorrectionmore
than50%.
147
BIBLIOGRAPHY
148
BIBLIOGRAPHY
[1]
R.Ahlswede,N.Cai,S.-Y.Li,andR.Yeung,\Networkinformationw,"
IEEETrans-
actionsonInformationTheory
,vol.46,pp.1205{1216,July2000.
[2]
S.-Y.Li,R.W.Yeung,andN.Cai,\Linearnetworkcoding,"
IEEETransactionson
InformationTheory
,vol.49,no.2,pp.371{381,2003.
[3]
R.KoetterandM.Medard,\Analgebraicapproachtonetworkcoding,"
IEEE/ACM
TransactionsonNetworking
,vol.11,no.5,pp.782{795,2003.
[4]
T.Ho,B.Leong,R.Koetter,M.Medard,M.andD.Karger,\Byzantine
modetectioninmulticastnetworksusingrandomizednetworkcoding,"in
InternationalSymposiumonInformationTheory(ISIT)2004
,July2004,p.144.
[5]
C.GkantsidisandP.Rodriguez,\Cooperativesecurityfornetworkcodingdistri-
bution,"in
IEEEINFOCOM2006
,Apr.2006,pp.1{13.
[6]
N.CaiandR.W.Yeung,\Networkerrorcorrectionvpartii:Lowerbounds,"
Commu-
nicationsinInformationandSystems
,vol.6,pp.37{54,2006.
[7]
R.W.YeungandN.Cai,\Networkerrorcorrection,parti:Basicconceptsandupper
bounds,"
CommunicationsinInformationandSystems
,vol.6,pp.19{36,2006.
[8]
N.Cai,\Networklocalizederrorcorrection:Fornon-coherentcoding,"in
2011IEEE
InternationalSymposiumonInformationTheoryProceedings(ISIT)
,2011,pp.1146{
1150.
[9]
N.CaiandR.W.Yeung,\Networkcodinganderrorcorrection,"in
Proc.ofIEEE
InformationTheoryWorkshop(ITW2002)
,2002,pp.119{122.
[10]
R.Matsumoto,\Constructionalgorithmfornetworkerror-correctingcodesattaining
thesingletonbound,"
IEICETransactionsonFundamentalsofElectronics,Commu-
nicationsandComputerSciences
,vol.VolumeE90-A,pp.1729{1735,2007.
[11]
S.Yang,C.-K.Ngai,andR.Yeung,\Constructionoflinearnetworkcodesthatachieve
asingletonbound,"in
ISIT2007.IEEEInternationalSymposiumonInforma-
tionTheory,2007
,2007,pp.1576{1580.
[12]
Z.Zhang,\Linearnetworkerrorcorrectioncodesinpacketnetworks,"
IEEETransac-
tionsonInformationTheory
,vol.54,no.1,pp.209{218,2008.
[13]
H.Balli,X.Yan,andZ.Zhang,\Onrandomizedlinearnetworkcodesandtheirerror
correctioncapabilities,"
IEEETransactionsonInformationTheory
,vol.55,no.7,pp.
3148{3160,2009.
149
[14]
Z.Zhang,X.Yan,andH.Balli,\Somekeyproblemsinnetworkerrorcorrection
codingtheory,"in
2007IEEEInformationTheoryWorkshoponInformationTheory
forWirelessNetworks
,2007,pp.1{5.
[15]
Z.Zhang,\Somerecentprogressesinnetworkerrorcorrectioncodingtheory,"in
Net-
Cod2008.FourthWorkshoponNetworkCoding,TheoryandApplications,2008
,2008,
pp.1{5.
[16]
||,\Networkerrorcorrectioncodinginpacketizednetworks,"in
InformationTheory
Workshop,2006.ITW'06Chengdu.IEEE
,2006,pp.433{437.
[17]
H.BalliandZ.Zhang,\Onthelimitingbehaviorofrandomlinearnetworkcodes,"in
NetCod'09.WorkshoponNetworkCoding,Theory,andApplications,2009
,2009,pp.
1{5.
[18]
N.CaiandR.W.Yeung,\Thesingletonboundfornetworkerror-correctingcodes,"in
20064thInternationalSymposiumonTurboCodesRelatedTopics;6thInternational
ITG-ConferenceonSourceandChannelCoding(TURBOCODING)
,2006,pp.1{6.
[19]
S.Yang,R.Yeung,andC.-K.Ngai,codingboundsandcodeconstructions
forcoherentnetworkerrorcorrection,"
IEEETransactionsonInformationTheory
,
vol.57,no.3,pp.1409{1424,2011.
[20]
M.Siavoshani,C.Fragouli,andS.Diggavi,\Onlocatingbyzantineattackers,"in
NetCod2008.FourthWorkshoponNetworkCoding,TheoryandApplications,2008
,
2008,pp.1{6.
[21]
H.BahramgiriandF.Lahouti,\Blocknetworkerrorcontrolcodesandsyndrome-
basedmaximumlikelihooddecoding,"in
ISIT2008.IEEEInternationalSymposium
onInformationTheory,2008.
,2008,pp.807{811.
[22]
R.KoetterandF.Kschischang,\Codingforerrorsanderasuresinrandomnetwork
coding,"
IEEETransactionsonInformationTheory
,vol.54,no.8,pp.3579{3591,
2008.
[23]
D.Silva,F.Kschischang,andR.Koetter,\Arank-metricapproachtoerrorcontrolin
randomnetworkcoding,"in
2007IEEEInformationTheoryWorkshoponInformation
TheoryforWirelessNetworks
,2007,pp.1{5.
[24]
D.SilvaandF.Kschischang,\Adversarialerrorcorrectionfornetworkcoding:Models
andmetrics,"in
200846thAnnualAllertonConferenceonCommunication,Control,
andComputing
,2008,pp.1246{1253.
[25]
M.GadouleauandZ.Yan,\Decodererrorprobabilityofboundeddistancedecoders
forconstant-dimensioncodes,"in
ISIT2009.IEEEInternationalSymposiumonIn-
formationTheory,2009
,2009,pp.2226{2230.
[26]
D.SilvaandF.Kschischang,\Universalsecurenetworkcodingviarank-metriccodes,"
IEEETransactionsonInformationTheory
,vol.57,no.2,pp.1124{1135,2011.
150
[27]
N.Chen,Z.Yan,M.Gadouleau,Y.Wang,andB.Suter,\Rankmetricdecoderarchi-
tecturesforrandomlinearnetworkcodingwitherrorcontrol,"
IEEETransactionson
VeryLargeScaleIntegration(VLSI)Systems
,vol.20,no.2,pp.296{309,2012.
[28]
S.Jaggi,M.Langberg,T.Ho,andM.\Correctionofadversarialerrorsin
networks,"in
Proc.ofInternationalSymposiumonInformationTheory(ISIT2005)
,
2005,pp.1455{1459.
[29]
S.Jaggi,M.Langberg,S.Katti,T.Ho,D.Katabi,M.Medard,andM.\Re-
silientnetworkcodinginthepresenceofbyzantineadversaries,"
IEEETransactions
onInformationTheory
,vol.54,no.6,pp.2596{2603,2008.
[30]
S.Kim,T.Ho,M.andA.Avestimehr,\Networkerrorcorrectionwithunequal
linkcapacities,"
IEEETransactionsonInformationTheory
,vol.57,no.2,pp.1144{
1164,2011.
[31]
J.Kurihara,T.Uyematsu,andR.Matsumoto,\Newparametersoflinearcodesex-
pressingsecurityperformanceofuniversalsecurenetworkcoding,"in
201250thAnnual
AllertonConferenceonCommunication,Control,andComputing(Allerton)
,2012,pp.
533{540.
[32]
Z.Yu,Y.Wei,B.Ramkumar,andY.Guan,\Antschemeforsecuringxor
networkcodingagainstpollutionattacks,"in
IEEEINFOCOM2009
,Apr.2009,pp.
406{414.
[33]
M.Krohn,M.Freedman,andD.Mazieres,vationofratelesserasure
codesfortcontentdistribution,"in
IEEESymposiumonSecurityandPrivacy
2004
,May2004,pp.226{240.
[34]
D.Charles,K.Jain,andK.Lauter,\Signaturesfornetworkcoding,"in
200640th
AnnualConferenceonInformationSciencesandSystems
,2006,pp.857{863.
[35]
M.Kim,M.Medard,M.Medard,andJ.Barros,\Analgebraicwatchdogforwire-
lessnetworkcoding,"in
ISIT2009.IEEEInternationalSymposiumonInformation
Theory,2009
,2009,pp.1159{1163.
[36]
F.OggierandH.Fathi,\Anauthenticationcodeagainstpollutionattacksinnetwork
coding,"
IEEE/ACMTransactionsonNetworking
,vol.19,no.6,pp.1587{1596,2011.
[37]
D.Boneh,D.Freeman,J.Katz,andB.Waters,\Signingalinearsubspace:Signature
schemesfornetworkcoding,"in
12thInternationalConferenceonPracticeandTheory
inPublicKeyCryptography
,2009,pp.68{87.
[38]
Z.Yu,Y.Wei,B.Ramkumar,andY.Guan,\Antsignature-basedscheme
forsecuringnetworkcodingagainstpollutionattacks,"in
INFOCOM2008.The27th
ConferenceonComputerCommunications.IEEE
,2008,pp.{.
151
[39]
F.Zhao,T.Kalker,M.Medard,andK.Han,\Signaturesforcontentdistribution
withnetworkcoding,"in
ISIT2007.IEEEInternationalSymposiumonInformation
Theory,2007
,2007,pp.556{560.
[40]
S.Agrawal,D.Boneh,X.Boyen,andD.M.Freeman,\Preventingpollutionattacks
inmulti-sourcenetworkcoding,"in
13thInternationalConferenceonPracticeand
TheoryinPublicKeyCryptography
,2010,pp.161{176.
[41]
A.LeandA.Markopoulou,\Cooperativedefenseagainstpollutionattacksinnetwork
codingusingspacemac,"
IEEEJournalonSelectedAreasinCommunications
,vol.30,
no.2,pp.442{449,2012.
[42]
Y.XuandK.Sakurai,\Cooperativelysecuringnetworkcodingagainstpollutionat-
tackswithincentivemechanism,"in
Proceedingsofthe6thInternationalConference
onUbiquitousInformationManagementandCommunication
,no.52,2012.
[43]
J.Dong,R.Curtmola,C.Nita-Rotaru,andD.Yau,\Pollutionattacksanddefensesin
wirelessinwnetworkcodingsystems,"in
WirelessNetworkCodingConference
(WiNC),2010IEEE
,2010,pp.1{6.
[44]
E.KehdiandB.Li,\Nullkeys:Limitingmaliciousattacksvianullspaceproperties
ofnetworkcoding,"in
IEEEINFOCOM2009
,Apr.2009,pp.1224{1232.
[45]
M.YangandJ.An,\Combinedfountaincodewithnetworkcodingforerror-tolerant
transmissionnetwork,"in
WiCom'09.5thInternationalConferenceonWirelessCom-
munications,NetworkingandMobileComputing,2009
,2009,pp.1{4.
[46]
Z.LiuandS.Jin,\Aninteractionbetweennetworkcodingandend-hostcoding,"in
WirelessCommunicationsandNetworkingConference(WCNC),2011IEEE
,2011,pp.
885{890.
[47]
H.Yao,T.Ho,andC.Nita-Rotaru,\Keyagreementforwirelessnetworksinthe
presenceofactiveadversaries,"in
2011ConferenceRecordoftheFortyFifthAsilomar
ConferenceonSignals,SystemsandComputers(ASILOMAR)
,2011,pp.792{796.
[48]
M.Kim,M.Medard,andJ.Barros,\Counteractingbyzantineadversarieswithnet-
workcoding:Anoverheadanalysis,"in
MilitaryCommunicationsConference,2008.
MILCOM2008.IEEE
,2008,pp.1{7.
[49]
J.Zhang,K.Letaief,andP.Fan,\Adistributedproductcodingapproachforrobust
networkcoding,"in
ICC'08.IEEEInternationalConferenceonCommunications,
2008
,2008,pp.176{180.
[50]
Q.Wang,S.Jaggi,andS.-Y.Li,\Binaryerrorcorrectingnetworkcodes,"in
Informa-
tionTheoryWorkshop(ITW),2011IEEE
,2011,pp.498{502.
[51]
M.NajeemandC.SivaRamMurthy,\Onenhancingtherandomlinearnetworkcod-
ing,"in
17thIEEEInternationalConferenceonNetworks(ICON),2011
,2011,pp.
246{251.
152
[52]
M.GadouleauandA.Goupil,\Binarycodesforpacketerrorandpacketlosscorrection
instoreandforward,"in
2010InternationalITGConferenceonSourceandChannel
Coding(SCC)
,2010,pp.1{6.
[53]
S.Vyetrenko,A.Khosla,andT.Ho,\Oncombininginformation-theoreticandcryp-
tographicapproachestonetworkcodingsecurityagainstthepollutionattack,"in
2009
ConferenceRecordoftheForty-ThirdAsilomarConferenceonSignals,Systemsand
Computers
,2009,pp.788{792.
[54]
S.Rhea,C.Wells,P.Eaton,D.Geels,B.Zhao,H.Weatherspoon,andJ.Kubiatowicz,
\Maintenance-freeglobaldatastorage,"
IEEEInternetComputing
,vol.5,pp.40{49,
2001.
[55]
R.Bhagwan,K.Tati,Y.-C.Cheng,S.Savage,andG.M.Voelker,\Totalrecall:
Systemsupportforautomatedavailabilitymanagement,"in
roc.Symp.Netw.Syst.
DesignImplementation
,2004,pp.337{350.
[56]
A.Dimakis,P.Godfrey,Y.Wu,M.Wainwright,andK.Ramchandran,\Network
codingfordistributedstoragesystems,"
IEEETransactionsonInformationTheory
,
vol.56,pp.4539{4551,2010.
[57]
K.Rashmi,N.Shah,andP.Kumar,\Optimalexact-regeneratingcodesfordistributed
storageatthemsrandmbrpointsviaaproduct-matrixconstruction,"
IEEETrans-
actionsonInformationTheory
,vol.57,pp.5227{5239,2011.
[58]
D.Cullina,A.G.Dimakis,andT.Ho,\Searchingforminimumstorageregenerating
codes,"
Available:arXiv:0910.2245
,2009.
[59]
N.Shah,K.Rashmi,P.Kumar,andK.Ramchandran,\Explicitcodesminimizing
repairbandwidthfordistributedstorage,"in
InformationTheoryWorkshop(ITW),
2010IEEE
,2010,pp.1{5.
[60]
C.SuhandK.Ramchandran,\Exact-repairmdscodesfordistributedstorageusingin-
terferencealignment,"in
2010IEEEInternationalSymposiumonInformationTheory
Proceedings(ISIT)
,2010,pp.161{165.
[61]
Y.Wu,\Aconstructionofsystematicmdscodeswithminimumrepairbandwidth,"
IEEETransactionsonInformationTheory
,vol.57,no.6,pp.3738{3741,2011.
[62]
D.Papailiopoulos,J.Luo,A.Dimakis,C.Huang,andJ.Li,\Simpleregenerating
codes:Networkcodingforcloudstorage,"in
INFOCOM,2012ProceedingsIEEE
,
2012,pp.2801{2805.
[63]
S.ElRouayhebandK.Ramchandran,\Fractionalrepetitioncodesforrepairindis-
tributedstoragesystems,"in
201048thAnnualAllertonConferenceonCommunica-
tion,Control,andComputing(Allerton)
,2010,pp.1510{1517.
153
[64]
I.Tamo,Z.Wang,andJ.Bruck,\Mdsarraycodeswithoptimalrebuilding,"in
2011
IEEEInternationalSymposiumonInformationTheoryProceedings(ISIT)
,2011,pp.
1240{1244.
[65]
V.R.Cadambe,C.Huang,S.A.Jafar,andJ.Li,\Optimalrepairofmdscodesin
distributedstorageviasubspaceinterferencealignment,"
Available:arXiv:1106.1250
,
2011.
[66]
D.Papailiopoulos,A.Dimakis,andV.Cadambe,\Repairoptimalerasurecodes
throughhadamarddesigns,"
IEEETransactionsonInformationTheory
,vol.59,no.5,
pp.3021{3037,2013.
[67]
N.Shah,K.V.Rashmi,andP.Kumar,\Aclassofregeneratingcodesfor
distributedstorage,"in
2010IEEEInternationalSymposiumonInformationTheory
Proceedings(ISIT)
,2010,pp.1943{1947.
[68]
K.ShumandY.Hu,\Existenceofminimum-repair-bandwidthcooperativeregenerat-
ingcodes,"in
2011InternationalSymposiumonNetworkCoding(NetCod)
,2011,pp.
1{6.
[69]
A.WangandZ.Zhang,\Exactcooperativeregeneratingcodeswithminimum-repair-
bandwidthfordistributedstorage,"in
INFOCOM,2013ProceedingsIEEE
,2013,pp.
400{404.
[70]
H.Hou,K.W.Shum,M.Chen,andH.Li,\Basicregeneratingcode:Binaryaddition
andshiftforexactrepair,"in
2013IEEEInternationalSymposiumonInformation
TheoryProceedings(ISIT)
,2013,pp.1621{1625.
[71]
Y.-L.Chen,G.-M.Li,C.-T.Tsai,S.-M.Yuan,andH.-T.Chiao,\Regeneratingcode
basedp2pstorageschemewithcaching,"in
ICCIT'09.FourthInternationalConfer-
enceonComputerSciencesandConvergenceInformationTechnology,2009
,2009,pp.
927{932.
[72]
Y.Wu,A.G.Dimakis,andK.Ramchandran,\Deterministicregeneratingcodesfor
distributedstorage,"in
45thAnnu.AllertonConf.Control,Computing,andCommu-
nication
,2007.
[73]
A.DuminucoandE.Biersack,\Apracticalstudyofregeneratingcodesforpeer-to-peer
backupsystems,"in
ICDCS'09.29thIEEEInternationalConferenceonDistributed
ComputingSystems,2009
,June2009,pp.376{384.
[74]
K.Shum,\Cooperativeregeneratingcodesfordistributedstoragesystems,"in
2011
IEEEInternationalConferenceonCommunications(ICC)
,2011,pp.1{5.
[75]
Y.WuandA.G.Dimakis,\Reducingrepairforerasurecoding-basedstorage
viainterferencealignment,"in
IEEEInternationalSymposiumonInformationTheory,
2009.ISIT2009.
,2009,pp.2276{2280.
154
[76]
N.Shah,K.Rashmi,P.Kumar,andK.Ramchandran,\Interferencealignmentin
regeneratingcodesfordistributedstorage:Necessityandcodeconstructions,"
IEEE
TransactionsonInformationTheory
,vol.58,pp.2134{2158,2012.
[77]
F.OggierandA.Datta,\Byzantinefaulttoleranceofregeneratingcodes,"in
2011
IEEEInternationalConferenceonPeer-to-PeerComputing(P2P)
,2011,pp.112{121.
[78]
S.Pawar,S.ElRouayheb,andK.Ramchandran,\Securingdynamicdistributedstor-
agesystemsagainsteavesdroppingandadversarialattacks,"
IEEETransactionson
InformationTheory
,vol.57,pp.6734{6753,2011.
[79]
Y.Han,R.Zheng,andW.H.Mow,\Exactregeneratingcodesforbyzantinefault
toleranceindistributedstorage,"in
ProceedingsIEEEINFOCOM
,2012,pp.2498{
2506.
[80]
H.ChenandP.Lee,\Enablingdataintegrityprotectioninregenerating-coding-based
cloudstorage,"in
2012IEEE31stSymposiumonReliableDistributedSystems(SRDS)
,
2012,pp.51{60.
[81]
C.CachinandS.Tessaro,\Optimalresilienceforerasure-codedbyzantinedistributed
storage,"in
DSN2006.InternationalConferenceonDependableSystemsandNet-
works,2006
,2006,pp.115{124.
[82]
M.Abd-El-Malek,G.Ganger,G.Goodson,M.Reiter,andJ.Wylie,\Lazyv
infault-tolerantdistributedstoragesystems,"in
SRDS2005.24thIEEESymposium
onReliableDistributedSystems,2005
,2005,pp.179{190.
[83]
K.Rashmi,N.Shah,K.Ramchandran,andP.Kumar,\Regeneratingcodesforer-
rorsanderasuresindistributedstorage,"in
InternationalSymposiumonInformation
Theory(ISIT)2012
,2012,pp.1202{1206.
[84]
L.NutmanandM.Langberg,\Adversarialmodelsandresilientschemesfornetwork
coding,"in
ISIT2008.IEEEInternationalSymposiumonInformationTheory,2008.
,
July2008,pp.171{175.
[85]
O.KosutandL.-W.Kao,\Ongeneralizedactiveattacksbycausaladversariesin
networks,"in
InformationTheoryWorkshop(ITW),2014IEEE
,Nov2014,pp.247{
251.
[86]
P.WangandR.Safavi-Naini,\Antcodeforadversarialwiretapchannel,"in
InformationTheoryWorkshop(ITW),2014IEEE
,Nov2014,pp.40{44.
[87]
J.Ren,\Onthestructureofhermitiancodesanddecodingforbursterrors,"
IEEE
TransactionsonInformationTheory
,vol.50,pp.2850{2854,2004.
[88]
J.Li,T.Li,andJ.Ren,\Beyondthemdsboundindistributedcloudstorage,"in
INFOCOM,2014ProceedingsIEEE
,April2014,pp.307{315.
[89]
S.LinandD.J.Costello,
ErrorControlCoding
,2nded.PrenticeHall,June2004.
155
[90]
L.Chen,R.Carrasco,andM.Johnston,\Soft-decisionlistdecodingofhermitian
codes,"
IEEETransactionsonCommunications
,vol.57,no.8,pp.2169{2176,2009.
[91]
R.Blahut,\Oncodescontaininghermitiancodes,"in
1995IEEEInternationalSym-
posiumonInformationTheory
,1995,pp.101{.
[92]
J.Li,C.Yang,D.Tang,T.Li,andJ.Ren,\Characterizationoflinearnetworkcoding
forpollutiondetection,"in
GlobalCommunicationsConference(GLOBECOM),2012
IEEE
,2012,pp.1066{1071.
[93]
T.DierksandC.Allen,\TheTLSprotocol,version1.0,"RFC-2246,January1999.
[94]
W.NS2,http://nsnam.isi.edu/nsnam/index.php/Main
P
age,2010.
[95]
R.G.Gallager,\Low-densityparity-checkcodes,"
IRETrans.Inf.Theory
,vol.IT-8,
no.1,pp.21{28,January1962.
[96]
C.E.Shannon,\Amathematicaltheoryofcommunication,"
BellSyst.Tech.J.
,vol.27,
pp.533{547,623{656,JulyandOctober1948.
[97]
C.Jones,A.Matache,T.Tian,J.Villasenor,andR.Wesel,\Theuniversalityofldpc
codesonwirelesschannels,"in
MilitaryCommunicationsConference,2003.MILCOM
'03.2003IEEE
,vol.1,2003,pp.440{445Vol.1.
[98]
M.Franceschini,G.Ferrari,andR.Raheli,\Doestheperformanceofldpccodesdepend
onthechannel?"
IEEETransactionsonCommunications
,vol.54,no.12,pp.2129{
2132,2006.
[99]
D.Spielman,\Linear-timeencodableanddecodableerror-correctingcodes,"
IEEE
TransactionsonInformationTheory
,vol.42,no.6,pp.1723{1731,1996.
[100]
M.Luby,M.Mitzenmacher,A.Shokrollahi,D.Spielman,andV.Stemann,\Practi-
calloss-resilientcodes,"in
STOC'97Proceedingsofthetwenty-ninthannualACM
symposiumonTheoryofcomputing
,1997,pp.150{159.
[101]
M.Luby,M.Mitzenmacher,M.Shokrollahi,andD.Spielman,\Improvedlow-density
parity-checkcodesusingirregulargraphs,"
IEEETransactionsonInformationTheory
,
vol.47,no.2,pp.585{598,2001.
[102]
X.-Y.Hu,E.Eleftheriou,D.-M.Arnold,andA.Dholakia,timplementations
ofthesum-productalgorithmfordecodingldpccodes,"in
GlobalTelecommunications
Conference,2001.GLOBECOM'01.IEEE
,vol.2,2001,pp.1036{1036Evol.2.
[103]
H.Pishro-NikandF.Fekri,\Ondecodingoflow-densityparity-checkcodesoverthe
binaryerasurechannel,"
IEEETransactionsonInformationTheory
,vol.50,no.3,pp.
439{454,2004.
[104]
L.Bazzi,T.Richardson,andR.Urbanke,\Exactthresholdsandoptimalcodesforthe
binary-symmetricchannelandgallager'sdecodingalgorithma,"
IEEETransactions
onInformationTheory
,vol.50,no.9,pp.2010{2021,2004.
156
[105]
G.Liva,E.Paolini,B.Matuz,andM.Chiani,\Adecodingalgorithmforldpccodes
overerasurechannelswithsporadicerrors,"in
201048thAnnualAllertonConference
onCommunication,Control,andComputing(Allerton)
,2010,pp.458{465.
[106]
T.Richardson,M.Shokrollahi,andR.Urbanke,\Designofcapacity-approachingir-
regularlow-densityparity-checkcodes,"
IEEETransactionsonInformationTheoryn
,
vol.47,no.2,pp.619{637,2001.
[107]
M.G.Luby,M.Mitzenmacher,andM.A.Shokrollahi,\Analysisofrandomprocesses
viaand-ortreeevaluation,"in
InProceedingsofthe9thAnnualACM-SIAMSympo-
siumonDiscreteAlgorithms
,1998,pp.364{373.
[108]
M.Luby,M.Mitzenmacher,A.Shokrollah,andD.Spielman,\Analysisoflowdensity
codesandimproveddesignsusingirregulargraphs,"in
STOC'98Proceedingsofthe
thirtiethannualACMsymposiumonTheoryofcomputing
,1998,pp.249{258.
[109]
T.RichardsonandR.Urbanke,\Thecapacityoflow-densityparity-checkcodesunder
message-passingdecoding,"
IEEETransactionsonInformationTheory
,vol.47,no.2,
pp.599{618,2001.
[110]
L.M.Tanner,\Arecursiveapproachtolowcomplexitycodes,"
IEEETrans.Inf.
Theory
,vol.27,no.5,pp.533{547,1981.
[111]
M.Luby,M.Mitzenmacher,A.Shokrollahi,andD.Spielman,terasurecor-
rectingcodes,"
IEEETrans.Inf.Theory
,vol.47,pp.569{584,2001.
[112]
T.H.Cormen,C.E.Leiserson,R.L.Rivest,andC.Stein,
IntroductiontoAlgorithms
,
3rded.TheMITPress,2009.
[113]
D.DabiriandI.Blake,\Fastparallelalgorithmsfordecodingreed-solomoncodesbased
onremainderpolynomials,"
IEEETransactionsonInformationTheory
,vol.41,no.4,
pp.873{885,Jul1995.
157