Distributed file systems dfs

1,417 views 12 slides Apr 08, 2021
Slide 1
Slide 1 of 12
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12

About This Presentation

Distributed file systems dfs


Slide Content

Distributed File Systems

Adistributedfilesystemisaresourcemanagement
componentofadistributedoperatingsystem.Itimplementsa
commonfilesystemthatcanbesharedbyalltheautonomous
computersinthesystem.
DISTRIBUTED FILE SYSTEM
Twoimportantgoals:
1.Networktransparency–toaccessfilesdistributedovera
network.Ideally,usersdonothavetobeawareofthe
locationoffilestoaccessthem.
2.HighAvailability-toprovidehighavailability.Usersshould
havethesameeasyaccesstofiles,irrespectiveoftheir
physicallocation.

ARCHITECTURE
Inadistributedfilesystem,filescanbestoredatany
machineandthecomputationcanbeperformedatany
machine.
Thetwomostimportantservicespresentinadistributedfile
systemarenameserverandcachemanager.
Anameserverisaprocessthatmapsnamesspecifiedby
clientstostoredobjectssuchasfilesanddirectories.
Themapping(alsoreferredtoasnameresolution)
Acachemanagerisaprocessthatimplementsfilecaching.
Infilecaching,acopyofdatastoredataremotefileserveris
broughttotheclient’smachinewhenreferencedbythe
client.
DISTRIBUTED FILE SYSTEM

Cachemanagerscanbepresentatbothclientsandfileservers.
Cachemanagersattheserverscachefilesinthemainmemory
toreducedelaysduetodisklatency.
Ifmultipleclientsareallowedtocacheafileandmodifyit,
thecopiescanbecomeinconsistent.
Toavoidthisinconsistencyproblem,cachemanagersatboth
serversandclientscoordinatetoperformdatastorageand
retrievaloperations.

Architecture of a Distributed File System

DISTRIBUTED FILE SYSTEM
Arequestbyaprocesstoaccessadatablockispresentedto
thelocalcache(clientcache)ofthemachine(client)on
whichtheprocessisrunning.
Iftheblockisnotinthecache,thenthelocaldisk,if
present,ischeckedforthepresenceofthedatablock.
Iftheblockispresent,thentherequestissatisfiedandthe
blockisloadedintotheclientcache.
Iftheblockisnotstoredlocally,thentherequestispassed
ontotheappropriatefileserver
Theserverchecksitsowncacheforthepresenceofthedata
blockbeforeissuingadiskI/Orequest.
Thedatablockistransferredtotheclientcacheinanycase
andloadedtotheservercacheifitwasmissingintheserver
cache.

MECHANISMS FOR BUILDING DISTRIBUTED
FILE SYSTEM
Mounting
Amountmechanismallowsbindingtogetherofdifferent
filenamespacestoformasinglehierarchicallystructuredname
space.
Two approaches to maintain the mount information:
Mount information can be maintained at clients, in which
case each client has to individually mount every required file
system. This approach is employed in the Sun network file
system. Since each client can mount a file system at any node
in the name space tree, every client need not necessarily see
an identical filename space.

Mount information can be maintained at servers, in which
case it is possible that every client sees an identical filename
space. If files are moved to a different server, then mount
information need only be updated at the servers. In the first
approach, every client needs to update its mount table.

Cachingiscommonlyemployedindistributedfilessystemsto
reducedelaysintheaccessingofdata.
Infilecaching,acopyofdatastoredataremotefileserveris
broughttotheclientwhenreferencedbytheclient.
Thetemporallocalityofreferencereferstothefactthatafile
recentlyaccessedislikelytobeaccessedagaininthenear
future.
Datacaneitherbecachedinthemainmemory(servercache)at
theserverstoreducediskaccesslatency.
Cachingimprovesfilessystemperformancebyreducingthe
delayinaccessingdata.
CACHING

Analternativeapproachistotreatcacheddataashints.
Inthiscase,cacheddataarenotexpectedtobecompletely
accurate.
However,validcacheentriesimproveperformance
substantiallywithoutincurringthecostofmaintaining
costconsistency.
Theclassofapplicationsthatcanutilizehintsarethose
whichcanrecoverafterdiscoveringthatthecacheddata
areinvalid.
HINTS

Transferringdatainbulkreducestheprotocolprocessingoverhead
atbothserversandclients.
Inbulkdatatransfer,multipleconsecutivedatablocksare
transferredfromserverstoclientsinsteadofjusttheblock
referencedbyclients.
Whilefilecachingamortizesthehighcostofaccessingremote
serversoverseverallocalreferencestothesameinformation
Bulktransferamortizestheprotocolprocessingoverheadanddisk
seektimeovermanyconsecutiveblocksofafile.
Bulktransfersreducefileaccessoverheadthroughobtaininga
multiplenumberofblockswithasingleseek;byformattingand
transmittingamultiplenumberoflargepacketsinasinglecontext
switch;andbyreducingthenumberofacknowledgementsthat
needtobesent.
BULK DATA TRANSFER

Encryptionisusedforenforcingsecurityindistributed
systems.
TheworkofNeedhamandSchroederisthebasisformostof
thecurrentsecuritymechanismsindistributedsystems.
Intheirscheme,twoentitieswishingtocommunicatewith
eachotherestablishakeyforconversationwiththehelpof
anauthenticationserver.
Itisimportanttonotethattheconversationkeyis
determinedbytheauthenticationserver,butisneverspent
inplain(unencrypted)texttoeitheroftheentities.
ENCRYPTION