University of Wollongong
Browse

Reliability of fault-tolerant systems with parallel task processing

Download (162.08 kB)
journal contribution
posted on 2024-11-15, 06:32 authored by Gregory Levitin, Min Xie, Tieling ZhangTieling Zhang
The paper considers performance and reliability of fault-tolerant software running on a hardware system that consists of multiple processing units. The software consists of functionally equivalent but independently developed versions that start execution simultaneously. The computational complexity and reliability of different versions are different. The system completes the task execution when the outputs of a pre-specified number of versions coincide. The processing units are characterized by different availability and processing speed. It is assumed that they are able to share the computational burden perfectly and that execution of each version can be fully parallelized. The algorithm based on the universal generating function technique is used for determining the distribution of system task execution time. This algorithm allows analysts to evaluate complex hardware-software reliability and performance indices such as expected task execution time and probability that the task is completed within a given time. Illustrative examples are also presented.

History

Citation

Levitin, G., Xie, M. & Zhang, T. (2007). Reliability of fault-tolerant systems with parallel task processing. European Journal of Operational Research, 177 (1), 420-430.

Journal title

European Journal of Operational Research

Volume

177

Issue

1

Pagination

420-430

Language

English

RIS ID

78900

Usage metrics

    Categories

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC