Japan Advanced Institute of Science and Technology
JAIST Repository
https://dspace.jaist.ac.jp/
Title
分散環境におけるフォールトトレラントソフトウエアの構成法に関する研究
Author(s)
伊関, 浩Citation
Issue Date
1997‑03Type
Thesis or DissertationText version
authorURL
http://hdl.handle.net/10119/1014Rights
Description
Supervisor:片山 卓也, 情報科学研究科, 修士in Distributed System
Ko Iseki
Scho ol of Information Science,
Japan Advanced Institute of Science and Technology
February 14,1997
Keywords: fault tolerance,software faulttolerance,FTAG,CML.
The increased useof computersand ourincreased relianceonthem haveledtoaneed
for highly reliable computer systems. Once failure o ccured in computer system, weneed
the system that to guard the failure inuence the whole of system. Even if system has
faluire init ,system isgiving the service. Then this prop ertyis calledfault tolerant. The
goal of fault tolerance isto avoid system failure,evenif faultsare present. There are two
fault tolerance approach: hardware fault tolerance and software fault tolerance. In this
paper, we focus on software fault tolerance. Software fault tolerance is a structure that
constructs to continue providing service despite the existence of software faults and/or
operational faults.
Building fault tolerance software by functional approach, there is many advantage
compared with imperative approach. FTAG(Fault Tolerant AttributeGrammers) model
is a model that functional model for writing fault tolerant software that is based at-
tribute grammers. As described there,this mo del can facilitate the writing of programs
that use various mechanismsfor dealingwith software faultsand/or faults inthe under-
lying computation platform. These include recovery blocks,N{version programing and
process replication. With this approach , a program is written as a series of module do-
compositions,withprovisionsfor redoingandreplicatingmodulesused toimplementfault
tolerance requirements.
This FTAG model ,however, is not yet implemented completely computer systems.
Therfore,itis necessarytoimplementFTAGmodeltocomputer system, and makesome
experience onit.
The goalof this paperistoshowavarageoffaulttolerance basedonfunctionalmo del
andhowtoconstructafaulttolerantsoftwarebyimplementationonlooselycoupledmulti
processor system.
Copyright c
1997byKoIseki
tions called mo dules. Computing order of modules is determined by attributes' dep en-
dence. And, mo dules that no dependency of attributes is possible that running in con-
currently. Accordingly, it is to be desired that the language to implement FTAG have
architecture of concurrency. In this paper, we will use CML(concurrentML).
CML is a language for concurrent programing. It is an extension of Standard ML
(SML), and is implemented on top of Standerd ML of New Jersey(SML/NJ). A CML
program consists of a number of threads, which use message passing on typ ed channels
tocommunicate. Threadsand channelscan create dynamically,this propretyis useful to
simulate attributes owbetween the threads.
The conversion method forFTAG programinglist isconvert to CMLare as follows.
(1). FTAGprogramisconvertedCMLprogramthathaveamanagementoffaulttolerant
software. This program is actual apply of application software.And this program detect
failure and raise redoing.
(2). Workspacemanager and nodemanager is created thatmanage attributesstored ob-
ject base isrequired by faulttolerant structure redoingand replication.
(3). Composing these twomanagers and CMLprogram whichconvertedbefore, generate
the programwhichrun onCMLinterpreter. Then,thisprogram isfaulttolerantsoftware
in CML.
We call three component that describe above, no de manager, work space manager,
and application applier. These components are important component in fault tolerant
softwareinCML.Eachcomp onentsare communicatewithother componentbychannels.
Channel ismessagepassingwaybetween thread. Inthis system,channelisusedbynodes
of comptational tree to carryattribute values.
No demanagercontrolsthe treethatcreatedbyapplicationappliertoapply functions.
Nodemanagerstoredshapeofcomptationtree. Thenredoingoccuredbydetectingfailure
,make tree to no contradictionin system, redoing operation is execute. Workspaceman-
agerisstore andmanageattribute valuesrequiredwhenredoingand replicationhapp ens.
Application applier is that converted program FTAG to CML. Actually,this component
execute the program of CML.
Messages from the node manager to the workspace manager are required that read
attritute value and require to delete workspace. Messages from work spaces to the node
managerarereportedthatrequiredattributevaluestotheworkspaces. Messagesbetween
other comp oses are required and reported asavob e.
Indecompositionmodules tothreadswithchannels,eachnodecanworkconcurrently.
By this property,this model is possible to implement to distributed system. Based the
architecture we describe above,we show this implementation FTAG model is applicable
to destributed system. And through description of name sarver system, in conrm the
eciency of FTAG model, and itsimplementation of ondistributed systems.