TY - JOUR
T1 - A fault-tolerant protocol for election in chordal-ring networks with fail-stop processor failures
AU - Pan, Yi
AU - Singh, Gurdip
N1 - Funding Information:
Yi Pan’s research was supported in part by the US NSF Research Opportunity Award CCR-9211621. Gurdip Singh’s research was supported in part by NSF Research Initiation Award CCR-9211621 and NSF CAREER Award CCR-9502506. We thank Dr. R.A. Evans for his suggestions to improve the presentation of this paper.
PY - 1997
Y1 - 1997
N2 - Summary & Conclusions -In a distributed computer system, a group of processors is connected by communication links into a network. Each processor (node) of the network has an identity (a unique integer value) that is not related to its position in the network (its address). A processor's identity is known only to the processor. In the problem of leader election, exactly one processor among a network of processors has to be distinguished as the leader. Previously, many efficient election protocols have been proposed for networks with a sense of direction. In particular, the sequential search is used for election in a reliable complete network, and a multi-token search method is used in a faulty complete network. However, election protocols on a faulty ChRgN (chordal ring network) have not been investigated by other researchers. This paper addresses this issue by: studying the problem of leader election in asynchronous ChRgN with a sense of direction and with the presence of undetectable fail-stop processor failures; proposing a new election protocol which a) combines the concept of sequential search and multi-token search techniques, and b) uses an efficient routing algorithm to reduce the total number of messages used. presenting a protocol, for a ChRgN of n processors with / chords/processor and at most/fail-stop faulty processors, with message complexity O(n + (n/l)-log(n) + k-J), where k is the number of processors starting the election process spontaneously and at most f < I processors are faulty; showing that the message complexity of the protocol is optimal within a constant factor when l > log(n). This paper considers only processor failures with fail-stop failures only. Key Words -Distributed algorithm, fault-tolerance, leader election, message complexity, reliable protocol
AB - Summary & Conclusions -In a distributed computer system, a group of processors is connected by communication links into a network. Each processor (node) of the network has an identity (a unique integer value) that is not related to its position in the network (its address). A processor's identity is known only to the processor. In the problem of leader election, exactly one processor among a network of processors has to be distinguished as the leader. Previously, many efficient election protocols have been proposed for networks with a sense of direction. In particular, the sequential search is used for election in a reliable complete network, and a multi-token search method is used in a faulty complete network. However, election protocols on a faulty ChRgN (chordal ring network) have not been investigated by other researchers. This paper addresses this issue by: studying the problem of leader election in asynchronous ChRgN with a sense of direction and with the presence of undetectable fail-stop processor failures; proposing a new election protocol which a) combines the concept of sequential search and multi-token search techniques, and b) uses an efficient routing algorithm to reduce the total number of messages used. presenting a protocol, for a ChRgN of n processors with / chords/processor and at most/fail-stop faulty processors, with message complexity O(n + (n/l)-log(n) + k-J), where k is the number of processors starting the election process spontaneously and at most f < I processors are faulty; showing that the message complexity of the protocol is optimal within a constant factor when l > log(n). This paper considers only processor failures with fail-stop failures only. Key Words -Distributed algorithm, fault-tolerance, leader election, message complexity, reliable protocol
UR - http://www.scopus.com/inward/record.url?scp=0031100702&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0031100702&partnerID=8YFLogxK
U2 - 10.1109/24.589920
DO - 10.1109/24.589920
M3 - Article
AN - SCOPUS:0031100702
SN - 0018-9529
VL - 46
SP - 11
EP - 17
JO - IEEE Transactions on Reliability
JF - IEEE Transactions on Reliability
IS - 1
ER -