Management System
The management system is the central component of CLSE and manages the overall execution of the framework. It is a distributed system that consist of N >= 1 physical servers connected through an IP network. One of the servers is designated to be the “leader” and the other servers are “workers”. Workers can perform local management actions but not actions that affect the overall system state. These actions are routed to the leader, which applies them sequentially to ensure consistent updates to the system state.
The leader is elected using a leader election protocol that uses the metastore for coordination (see Fig. 3). A new leader is elected by a quorum whenever the current leader fails or becomes unresponsive. This means that the management system tolerates up to N/2-1 failing servers.
Each server of the management system executes the following services: (1) an instance of the emulation system; (2) a replica of the metastore; (3) an instance of the simulation system; (4) an HTTP server; and (5), a set of monitoring services, namely Prometheus, Grafana, cAdvisor, and Node exporter (see Fig. 4).
- Previous
- Next