A centralized self-adaptive fault tolerance approach based on feedback control for multiagent systems

Our research introduces a self-adaptive fault tolerance approach for multiagent systems that enables the system to avoid crash failures. It is a replication-based approach that exploits a feedback control loop and a proportional (P) controller within a replication infrastructure. Thus, we are able to both observe the agents' behaviors to estimate criticalities and determine the number of replicas in replica groups with respect to the agents' criticalities and the number of available resources. Thus, agents that are to be replicated and the numbers of replicas in replica groups are automatically and adaptively identified in dynamic environments. We implement this approach to demonstrate performance gained in a set of experiments undertaken in different operating conditions.