A Distributed Algorithm for System-Level Diagnosis (NOT PUBLISHED)
08 April 1988
In this paper, a distributed fault detection algorithm is given for a system of multiple computing elements which communicate via 1) a broadcast bus, or 2) point-to-point links. The algorithm is based upon earlier work [1] which showed that the faulty elements in such systems can be detected using this approach with a remarkably high probability and can be implemented using minimal operational overhead.