Parallel Redundancy

Parallel redundancy, a technique for enhancing system reliability and fault tolerance, focuses on creating multiple copies of a system or component, all operating concurrently. Current research explores optimizing redundancy allocation strategies, particularly in resource-constrained environments like cloud computing and embedded systems, often employing techniques like pruning and selective duplication to minimize computational overhead. This work is significant for improving the robustness and efficiency of various applications, ranging from safety-critical systems (e.g., autonomous vehicles) to data processing and machine learning models, where resilience to failures is paramount.

Papers