In network-cloud ecosystems, large-scale failures affecting network carrier and datacenter (DC) infrastructures can severely disrupt cloud services. Post-disaster cloud service restoration requires cooperation among carriers and DC providers (DCPs) to minimize downtime. Such cooperation is challenging due to proprietary and regulatory policies, which limit access to confidential information (detailed topology, resource availability, etc.). Accordingly, we introduce a third-party entity, a provider-neutral exchange, which enables cooperation by sharing abstracted information. We formulate an optimization problem for DCP–carrier cooperation to maximize service restoration while minimizing restoration time and cost. We propose a scalable heuristic, demonstrating significant improvement in restoration efficiency with different topologies and failure scenarios.
more »
« less
This content will become publicly available on March 1, 2026
Service Restoration in Multi-Entity Network-Cloud Ecosystems: How to Cooperate?
In network-cloud ecosystems, cooperation among different entities, for example, network carriers and datacenter providers (DCPs), is crucial to enhance resiliency, especially during large-scale failures or congestion. However, such cooperation is constrained by limited visibility of confidential information, for example, network topology, resource availability, and so on, of different entities owing to proprietary and regulatory policies. To facilitate cooperation, we present and discuss the role of a third-party entity, called provider neutral exchange (PNE), which acts as a broker/mediator and enables cooperation among multiple entities by sharing abstracted (instead of detailed) information of individual entities. We design novel cooperation strategies for post-disaster service restoration and categorize them as: multi-carrier cooperation and DCP-carrier cooperation. Results under different failure scenarios show benefits of cooperation in terms of service-restoration efficiency, restoration time, and restoration cost.
more »
« less
- Award ID(s):
- 2210384
- PAR ID:
- 10646192
- Publisher / Repository:
- IEEE
- Date Published:
- Journal Name:
- IEEE Communications Magazine
- Volume:
- 63
- Issue:
- 3
- ISSN:
- 0163-6804
- Page Range / eLocation ID:
- 129 to 135
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Large-scale network-cloud ecosystems are fundamental infrastructures to support future 5G/6G services, and their resilience is a primary societal concern for the years to come. Differently from a single-entity ecosystem (in which one entity owns the whole infrastructure), in multi-entity ecosystems (in which the networks and datacenters are owned by different entities) cooperation among such different entities is crucial to achieve resilience against large-scale failures. Such cooperation is challenging since diffident entities may not disclose confidential information, e.g., detailed resource availability. To enhance the resilience of multi-entity ecosystems, carriers are important as all the entities rely on carriers’ communication services. Thus, in this study we investigate how to perform carrier cooperative recovery in case of large-scale failures/disasters. We propose a two-stage cooperative recovery planning by incorporating a coordinated scheduling for swift recovery. Through preliminary numerical evaluation, we confirm the potential benefit of carrier cooperation in terms of both recovery time and recovery cost/burden reduction.more » « less
-
To accommodate the growing demand for cloud services, telecom carriers’ networks and datacenter (DC) facilities form large network–cloud ecosystems (ecosystems for short) physically supporting these services. These large-scale ecosystems are continuously evolving and must be highly resilient to support critical services. Open and disaggregated optical-networking technologies promise to enhance the interoperability across telecom carriers and DC operators, thanks to their open interfaces in both the data plane and control/management plane. In the first part of this paper, we focus on a single entity (e.g., a telecom carrier or an emerging telecom/DC partnership company) that owns both the network and DC infrastructures in the ecosystem. We introduce a solution by leveraging open and disaggregated technologies to enhance the resilience of the optical networks within a multi-vendor and multi-domain ecosystem. In the second part of this paper, we consider the case when the networks and DCs are owned by different entities. Also, in this case, cooperation among datacenter providers (DCPs) and carriers is crucial to provide failure/disaster resilience to today’s cloud services. However, such cooperation is more challenging since DCPs and carriers, being different entities, may not disclose confidential information, e.g., detailed resource availability. Hence, we introduce a solution to enhance the resilience of such multi-entity ecosystems through cooperation between DCPs and carriers without violating confidentiality.more » « less
-
Large-scale carrier networks are fundamental ICT infrastructures that support future 5G/6G services, and their resilience is a primary societal concern. Differently from single-carrier networks (in which one carrier owns multiple networks), in multi-carrier network ecosystems (in which the networks in the fields are operated by different carriers), cooperation among such different carriers is crucial to achieve resilience against large-scale failures. However, such cooperation is challenging since carriers may not disclose confidential information, e.g., detailed resource availability. In this study, we investigate how to perform carrier cooperative recovery in the case of large-scale failures/disasters. We propose two-stage carrier-carrier cooperative recovery planning by incorporating a coordinated scheduling for faster recovery. Through numerical evaluation, we confirm the potential benefit of carrier cooperation in terms of both recovery time and recovery cost reduction.more » « less
An official website of the United States government
