resilient optical networks: principles and...

5
Abstract—This paper presents the principles of designing resilient optical transport networks including failure scenarios, survivability architectures, routing and wavelength assignment problem and implementation approaches. Furthermore, the paper addresses current and future research challenges including covering multiple simultaneous failures, QoS-based RWA, robustness and future demand uncertainty accommodation, and quality of service issues in deployment of resilient backbones for next generation telecommunication networks. Index Terms—Optical networks, resilient systems, failure scenarios, principles and state-of-the art. I. INTRODUCTION HE optical networks are prone to failures and unexpected failures of network components such as link and node failures at optical layer can potentially lead to a catastrophic loss of data and revenues producing an unacceptable deterioration in the delivered quality of service (QoS). Therefore, one of the most important optical network design issues is survivability [1], which is the ability of a network to provide continuous service at an acceptable level in the presence of different failure scenarios [2, 3]. Some reasons of designing resilient optical transport networks could be summarized as; 1) The increasing demands of end-users of client networks to have reliable communications and services with fair quality [4]. 2) The unexpected failures of network components such as link and node failures at the optical layer may result in multiple failures at client layers. 3) The aggregation bandwidths of the order of several Tb/s per fiber, using dense-wavelength-division-multiplexing (DWDM) technique, causes enormous data and revenue loss in the event of network’s failure. 4) Fault tolerance and traffic restoration at optical transport layer have several advantages, such as shorter restoration time, efficient resource utilization, and protocol transparency, over that at the client layer. Yousef S. Kavian is with Electrical Engineering Department, Faculty of Engineering, Shahid Chamran University, Ahvaz, Iran (e-mail: [email protected] ). II. PRINCIPLES The survivable networks employ redundant resources like spare ligthpaths and switches to cover the failures which increase the network cost planning. A. Failure Models and Redundancy The network failures are classified in three types; transient, temporary and permanent failures which affect the network redundancy. The researches on survivability in DWDM networks focus on permanent failures employing permanent redundancies like permanent redundant bandwidth and spare switches for failure covering. Three main considering failure models are; The Single Link Failure Model This is the more realistic and probable failure model for DWDM networks [5], where the fiber may cut by workers, fire, earthquake, changing environmental condition like temperature, humidity. In this model the fiber cut in a link of the network is repaired before another fiber cut assumed to occur in the network. The link failures are assumed to be independent of each other and the probability of a single link failure is the same for all links and; it has uniform distribution function. The Node Failure Model The node (OXC, OADM, transmitter, receiver, and amplifier) failure model is also a real issue that needs to be considered in designing fault tolerant backbone networks [6]. While the probability of a node failure is generally much smaller than a link failure due to the built-in redundancy of many network equipments but node failure is still possible and will cause server service disruption. B. Survivability Architectures The hierarchy of optical mesh transport network protection and restoration options is illustrated in Fig. 1. Protection and Restoration Architectures Two main survivability architectures called protection and restoration provide the basis of the different schemes to the design of fault tolerant optical transport networks. Protection architectures [7] establish working and spare lightpaths for arrival requests during network configuration before network Resilient Optical Networks: Principles and State-of-the-Art Yousef S. Kavian T Page 147 /183

Upload: others

Post on 29-Sep-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Resilient Optical Networks: Principles and State-of-the-Artseminars.usb.ac.ir/Files/iccome/en-us/Document/... · failures at optical layer can potentially lead to a catastrophic loss

Abstract—This paper presents the principles of designing

resilient optical transport networks including failure scenarios, survivability architectures, routing and wavelength assignment problem and implementation approaches. Furthermore, the paper addresses current and future research challenges including covering multiple simultaneous failures, QoS-based RWA, robustness and future demand uncertainty accommodation, and quality of service issues in deployment of resilient backbones for next generation telecommunication networks.

Index Terms—Optical networks, resilient systems, failure scenarios, principles and state-of-the art.

I. INTRODUCTION HE optical networks are prone to failures and unexpected failures of network components such as link and node

failures at optical layer can potentially lead to a catastrophic loss of data and revenues producing an unacceptable deterioration in the delivered quality of service (QoS). Therefore, one of the most important optical network design issues is survivability [1], which is the ability of a network to provide continuous service at an acceptable level in the presence of different failure scenarios [2, 3]. Some reasons of designing resilient optical transport networks could be summarized as; 1) The increasing demands of end-users of client networks to have reliable communications and services with fair quality [4]. 2) The unexpected failures of network components such as

link and node failures at the optical layer may result in multiple failures at client layers.

3) The aggregation bandwidths of the order of several Tb/s per fiber, using dense-wavelength-division-multiplexing (DWDM) technique, causes enormous data and revenue loss in the event of network’s failure.

4) Fault tolerance and traffic restoration at optical transport layer have several advantages, such as shorter restoration time, efficient resource utilization, and protocol transparency, over that at the client layer.

Yousef S. Kavian is with Electrical Engineering Department, Faculty of Engineering, Shahid Chamran University, Ahvaz, Iran (e-mail: [email protected] ).

II. PRINCIPLES The survivable networks employ redundant resources like spare ligthpaths and switches to cover the failures which increase the network cost planning.

A. Failure Models and Redundancy The network failures are classified in three types; transient,

temporary and permanent failures which affect the network redundancy. The researches on survivability in DWDM networks focus on permanent failures employing permanent redundancies like permanent redundant bandwidth and spare switches for failure covering. Three main considering failure models are; The Single Link Failure Model

This is the more realistic and probable failure model for DWDM networks [5], where the fiber may cut by workers, fire, earthquake, changing environmental condition like temperature, humidity. In this model the fiber cut in a link of the network is repaired before another fiber cut assumed to occur in the network. The link failures are assumed to be independent of each other and the probability of a single link failure is the same for all links and; it has uniform distribution function.

The Node Failure Model

The node (OXC, OADM, transmitter, receiver, and amplifier) failure model is also a real issue that needs to be considered in designing fault tolerant backbone networks [6]. While the probability of a node failure is generally much smaller than a link failure due to the built-in redundancy of many network equipments but node failure is still possible and will cause server service disruption.

B. Survivability Architectures The hierarchy of optical mesh transport network protection

and restoration options is illustrated in Fig. 1.

Protection and Restoration Architectures Two main survivability architectures called protection and

restoration provide the basis of the different schemes to the design of fault tolerant optical transport networks. Protection architectures [7] establish working and spare lightpaths for arrival requests during network configuration before network

Resilient Optical Networks: Principles and State-of-the-Art

Yousef S. Kavian

T

Page 147 /183

Page 2: Resilient Optical Networks: Principles and State-of-the-Artseminars.usb.ac.ir/Files/iccome/en-us/Document/... · failures at optical layer can potentially lead to a catastrophic loss

operation, whereas restoration architectures [8] use network state variables to assign spare lightpaths in the event of failure. Whilst the former is static and offers a faster restoration time, the latter is dynamic with greater bandwidth efficiency. Furthermore, the restoration architecture is more flexible against network configuration changes and may cover multiple link and node failures. Link, path and sub-path survivability

Both protection and restoration architectures can be further divided into link, path and sub-path survivability schemes. In link survivability scheme, the traffic is rerouted only around the failed link. In path survivability scheme [9], the working and the spare lightpaths must be link disjoint so in the event of failure on working lightpath the traffic is rerouted through its spare lightpath. Whilst the former provides efficient utilization of redundant resources and lower propagation delay, the latter provides lower switching delay. In the sub-path scheme [10] the whole network is divided into different domains where a ligthpath segment in one domain must be protected by the resources in the same domain. Compared to path protection, sub-path protection can achieve high scalability and faster recovery time for a modest sacrifice in resource utilization [11, 12]. Dedicated and shared protection

Furthermore the protection architecture is classified into two major categories, dedicated protection and shared

protection. In dedicated protection backup resources along spare lightpath are especially dedicated for that lightpath and can not be utilized by other spare ligthpaths, but in shared segment [13, 14] and shared path [15] protection schemes redundant resources can be shared between several spare ligthpaths. Dedicated protection is faster than shared protection, but shared protection is more efficient for resource utilization. The problem of survivable light path provisioning using shared path protection in optical mesh networks employing WDM was investigated in [16].

Shared mesh WDM optical networks with partial wavelength conversion capability has been considered in [17]. In addition in [18] the state of the art progress in developing both shared path protection (SPP) and segment shared protection (SSP) were reviewed.

C. Routing and Wavelength Assignment Problem The routing and wavelength assignment (RWA) problem

lies at the heart of designing the wavelength-routed DWDM optical transport networks [19] where a variety of services including voice, data and video are multiplexed together and routed from origin to destination over a common infrastructure called lightpaths or an all- optical communication channels without any optical-electronic-optical conversion and buffering at the intermediate nodes, known as wavelength routing. Establishment of a lightpath

Protection

Link

Restoration

Link

Sub-path

Path

Dedicated

Shared

Path

Dedicated

Shared

Sub-path

Dedicated

SharedFailure Recovery Schemes

Fig. 1. The survivability hierarchy in optical transport network

Page 148 /183

Page 3: Resilient Optical Networks: Principles and State-of-the-Artseminars.usb.ac.ir/Files/iccome/en-us/Document/... · failures at optical layer can potentially lead to a catastrophic loss

consists of choosing a route and assigning wavelengths known as an NP-hard problem. Static and Dynamic RWA

The RWA problem is called static where the demand matrix is known and static and a set of lightpaths should be established between demand node pairs while the objective is network resource optimization like minimizing the number of allocated wavelength channels. The static RWA problem is employed in protection architectures. The RWA problem is dynamic when the traffic pattern is dynamic and changes duration network operation, and the objective is network resource optimization like minimizing the number of allocated wavelength channels and minimizing blocking ratio [20, 21].

D. Implementing Approaches

Heuristics and ILP approaches The integer linear programming (ILP) approach has been

extensively employed for modeling different survivability schemes. The survey papers [22], [23] and [24] may be referred to in order to present an indication of the body of work in the literature. The ILP approach works well for small problems with linear objective functions and constraints, and for small networks where the number of decision variables increases dramatically by increasing the size of networks. However, the limitations of ILP models have led to the introduction of a diverse range of heuristic algorithms. The Intelligent Techniques

The other successful approaches are based on intelligent techniques especially genetic algorithms. A genetic algorithm (GA) is a stochastic global search method based on natural biological evolution and genetics [25] that may be employed to solve optimization problems that are not well suited for standard optimization algorithms, including those where the objective function is discontinuous or non differentiable. The complexity of problem, the abilities of genetic algorithm to search solution for NP-hard problems in large-scale search space [26], and the successful background of genetic algorithms in designing communication networks [27] lead to provide genetic algorithm based models for solving RWA problems in designing the survivable DWDM optical transport networks. Encoding the working and spare lightpaths into chromosomes, creating next generation using crossover, mutation and selection operators, population initializing, developing termination rules and construction the fitness function are the main problems for implementation a GA approach for designing survivable DWDM optical transport networks [28].

III. CURRENT AND FUTURE RESEARCH CHALLENGES

A. The Multiple-link Failure

Another failure scenario that should be considered in next generation high speed telecommunication networks is multiple near simultaneous link failures where , for example, a new failure would happen before repairing former occurred failure in the network or more than one link be affected in the network during a natural earthquake disaster [29]-[31]. Here for the case of M-failure the network topology should be (M+1)-connected. For example in case of 2-link failure, the network should be 3-connected that means it needs to establish three link-disjoint paths between each node pair. As shown in Fig. 2, during design of a survivable network, in addition to setting up a working lightpath, (4-6), between each original and destination node pair, two link-disjoint spare lightpath,(4-5-6),(4-3-2-1-6), needs to be established between the node pair, too. The diverse spare ligthpaths are employed as the means to restore the traffic demand when two links affected in the network.

B. QoS based RWA The resource reservation routing algorithms provide

lightpaths which minimize the bandwidth (wavelength) utilization and they may not be suitable to accommodate the QoS requirements [32]-[33]. The effect of QoS requirements, propagation delay optimization, on solutions of RWA problem is compared with resource (bandwidth) optimization. The topology of the network has been shown in Fig. 3 that

Fig. 2. A survivable network: double link failures covering

Fig. 3. Resource reservation and QoS-based RWA

Page 149 /183

Page 4: Resilient Optical Networks: Principles and State-of-the-Artseminars.usb.ac.ir/Files/iccome/en-us/Document/... · failures at optical layer can potentially lead to a catastrophic loss

includes 6 nodes and 8 links. Table I describes the weight of connecting links in fiber kilometers. The demand is a connecting request between node pair (2, 6) for 3 wavelengths bandwidth. The results are tabulated in Table II. The adjusted delay on each link was assumed to be 1 ms/km. The resource reservation based RWA tried to establish bandwidth efficient lightpath where the QoS based RWA found ligthpath with low latency time. In reservation based RWA the number of allocated wavelengths are 6 wavelengths where the propagation delay of the request is 24ms. In QoS based RWA the number of allocated wavelengths are 9 wavelengths where the propagation delay of the request is 17ms. Consequently, the QoS based RWA tries to establish lightpath with lower latency time by extending network resource utilization that is assured through bandwidth trading.

C. Robustness and Accommodation of demand uncertainty Rising Internet usage is producing increased traffic

uncertainty that must be accommodated in optical core design optimization. The delivery of the required quality of service (QoS) by client networks such as Internet network providers faces uncertain environment capacity problems where demand volume changes extensively over different periods of network operation. Therefore, existing network capacity is unable to accommodate the demands at all times due to the natural variability of unpredictable loads in periods of high demand. Planning for robust DWDM transport networks capable of meeting future uncertain demands is, therefore, a necessity [34]-[37].

D. The Quality of Service (QoS) Issues The quality of service (QoS) refers to the ability of a

network to enforce preferential treatment through classification.

TABLE I THE WEIGHT OF CONNECTING LINKS IN KILOMETERS

Link Length (km) (1,2) 12 (1,6) 12 (2,3) 4 (3,4) 3 (3,5) 8 (4,5) 7 (4,6) 10 (5,6) 14

TABLE II RESOURCE RESERVATION AND QOS-BASED RWA

Objective Lightpath

Path/Wavelength Lightpath Properties (Bandwidth, Delay)

Bandwidth 2-1-6/( 321 ,, λλλ ) (6,24)

Propagation Delay 2-3-4-6/( 321 ,, λλλ ) (9,17)

The QoS of a network can be studied in the context of any combination of properties of a service delivered to the end users including availability, delay, jitter, loss ratio and throughput [38, 39]. Although survivability of DWDM networks improves the client network’s QoS by tolerance against component failures and avoiding losing data and revenue but also the survivability parameters including restoration time, restoration bandwidth, restoration granularity, and call acceptance ratio will affect the delivering QoS. The QoS at optical layer is differentiated to differentiated reliability, reliable connection, quality of protection, and quality of recovery [40]. The important role of quality of service (QoS) in deployment of a resilient DWDM backbone for global networks requires critical design-phase planning optimization. In general, QoS requirement extends network resource utilization that is assured through bandwidth trading [41].

IV. CONCLUSION The survivability is an important and critical issue that

must be taken in deployment of DWDM optical transport networks as backbones for reliable high speed next generation telecommunication networks due to their ability to carry large traffic volumes. This paper reviewed some major issues of designing fault tolerant DWDM optical transport networks for service provision at acceptable level of performance in the presence of failures. Furthermore, the paper addressed some current and future research challenges in deployment of resilient DWDM backbones for next generation telecommunication networks.

REFERENCES [1] D. Zhou, and S. Subramaniam, “Survivability in Optical Networks”,

IEEE Network, vol. 14, pp. 16-23, 2000. [2] O. Gerstel, R. Ramasawami, “Optical Layer Survivability- An

implementation perspective”, IEEE Journal on Selected Areas in Communications, vol. 18, pp. 1885-1899, 2000.

[3] J. Zhang, B. Mukherjee, “Review of Fault Management in WDM Mesh Networks: Basic Concepts and Research Challenges”, IEEE Network, vol. 18, pp. 41-48, 2004.

[4] M. M´edard, and S. S. Lumetta, “Network Reliability and Fault Tolerance”, MIT, Laboratory for Information and Decision Systems, Techniqual Report, 2002.

[5] R. Shenai, and K. Sivalingam, “Hybrid survivability approaches for optical WDM mesh networks”, Journal of Lightwave Technology, Vol. 23, No. 10, pp. 3046-3055,2005.

[6] Y. Wang, T. H. Cheng, and B. Mukherjee, “Dynamic Routing and Wavelength Assignment Scheme for Protection against Node Failure”, Proceeding of IEEE GLOBECOM’03, pp. 2585-2589, 2003.

[7] S. Ramamuthy, and B. Mukherjee, “Survivable WDM mesh networks: Part I-Protection,” Proceeding of IEEE Infocom Conference, New York, vol. 2, pp. 744-751, 1999.

[8] S. Ramamuthy, and B. Mukherjee, “Survivable WDM mesh networks: Part II-Restoration,” Proceeding of IEEE ICC Conference, pp.2023-2030, 1999.

[9] Y. Xin, G. N. Rouskas, “A Study of Path Protection in Large-Scale Optical Networks”, Kluwer Academic Publishers, Photonic Network Communications, vol. 7, no. 3, pp. 267-278, 2004.

[10] C. Ou, H. Zang, N. K. Singhal, K. Zhu, L. H. Sahasrabuddhe, R. A. MacDonald, B. Mukherjee,“ Subpath Protection for Scalability and

Page 150 /183

Page 5: Resilient Optical Networks: Principles and State-of-the-Artseminars.usb.ac.ir/Files/iccome/en-us/Document/... · failures at optical layer can potentially lead to a catastrophic loss

Fast Recovery in Optical WDM Mesh Networks”, IEEE Journal on Selected Areas in Communications, vol. 22, no. 9, pp. 1859–1875, 2004.

[11] V. Anand, S. Chauhan, and C. Qiao, “Sub-path protection: A new framework for optical layer survivability and its quantitative evaluation,” Dept. CSE, SUNY Buffalo, Tech. Rep. 2002-01, 2002.

[12] R. He, H. Wen, L. Li et al. “Shared sub-path protection algorithm in traffic-grooming WDM mesh networks,” Photon. Network Communications, vol. 8, no. 3, pp. 239–249, Nov. 2004.

[13] D. Xu, Y. Xiong, and C. Qiao, “Novel algorithms for shared segment protection,” IEEE Journal on Selected Areas in Communications, vol. 21, no. 8, pp. 1320–1331, 2003.

[14] P.-H Ho, J. Tapolcai, and T. Cinkler, “Segment shared protection in mesh communications networks with bandwidth guaranteed tunnels,” IEEE/ACM Transaction on Network, vol. 12, no. 6, pp. 1105–1118, 2004.

[15] P.-H. Ho and H. T. Mouftah, “On optimal diverse routing for shared protection in mesh WDM networks,” IEEE transactions on Reliability, vol. 53, no. 6, pp. 216–225, 2004.

[16] C. Ou, J. Zhang, H. Zang, L. H. Sahasrabuddhe, and B. Mukherjee, “New and Improved Approaches for Shared-Path Protection in WDM Mesh Networks”, Journal of Lightwave Technology, vol. 22, no. 5, pp. 1223-1232, 2004.

[17] P.-H. Ho, H. T. Mouftah, “A novel survivable routing algorithm for shared segment protection in mesh WDM networks with partial wavelength conversion”, IEEE Journal on Selected Areas in Communications, vol. 22, no. 8, pp. 1548-1560, 2004.

[18] P.-H. Ho “State-of-the-art Progress in Developing Survivable Routing Schemes in Mesh WDM Networks”, On optimal diverse routing for shared protection in mesh WDM networks,” IEEE Communications, pp. 2-16, vol. 6, no. 4, 2004.

[19] R. Ramaswami, and K. N. Sivarajan, “Routing and Wavelength Assignment in All-optical Networks”, IEEE/ACM Transaction on Networking, vol. 3, pp. 489-500, 1995.

[20] H. Zang, J. Jue, and B. Mukherjee, “A review of routing and wavelength assignment approaches for wavelength-routed optical WDM networks,” SPIE Optical Networks Magazine, vol. 1, no. 1, pp. 47–60, Jan. 2000.

[21] J. Kennington, E. Olinick, A. Ortynski, and G. Spiride, “Wavelength routing and assignment in a survivable WDM mesh network”, Operation Research, vol. 51, no. 1, pp. 67-79, 2003.

[22] S. Ramamurthy, L. Sahasrabuddhe, and B. Mukherjee, “Survivable WDM Mesh Networks”, Journal of Lightwave Technology, vol. 21, no. 4, pp. 870-883, 2003.

[23] J. L. Kenningtona, E. V. Olinick, and G. Spiride, “Basic mathematical programming models for capacity allocation in mesh-based survivable networks”, International journals of management sciences, pp. 1-16, 2006.

[24] J. Kennington, and E. Olinick, “A Survey of Mathematical Programming Models for Mesh-based Survivable Networks”, Southern Methodist University, Technical Report 04-EMIS-13, pp. 1-33, 2004.

[25] D. E. Goldberg, Genetic algorithms in search, optimization, and machine learning, Addison-Wesley, 1989.

[26] K. A. De Jong, and W. M. Spears, “Using Genetic Algorithm to Solve NP-Complete Problems”, Proceeding of Third International Conference on Genetic Algorithms (ICGA), 1989.

[27] J. Arabas, S. Kozdrowski, “Applying an Evolutionary Algorithm to Telecommunication Network Design,” IEEE Transactions on Evolutionary Computation, vol. 5, no. 4, pp. 309-323, 2001.

[28] Y. S. Kavian, H. F. Rashvand, W. Ren, M. S. Leeson, E. L. Hines, M. Naderi, “RWA problem for designing DWDM networks delay against capacity optimization,” Electronics Letters, vol. 43, no. 16, pp. 892-893, 2007.

[29] L. Guo, L. Li, J. Cao, H. Yu, and X. Wei, “On Finding Feasible Solutions With Shared Backup Resources for Surviving Double-Link Failures in Path-Protected WDM Mesh Networks”, Journal of Lightwave Technology, vol. 25, no. 25, pp. 287-296, 2007.

[30] J. Zhang, K. Zhu, and B. Mukherjee, “A comprehensive study on backup reprovisioning to remedy the effect of double-link failures in WDM mesh networks,” Proceeding of IEEE ICC, vol. 27, pp. 1654–1658, 2004.

[31] L. Guo, H. Yu, and L. Li, “Protection design for double-link failures in meshed WDM networks,” Acta Eletronica Sinica, vol. 33, no. 5, pp. 883–888, 2005.

[32] A. Autenrieth, A. Kirstadter, “Engineering End-to-End IP Resilience Using Resilience-Differentiated QoS”, IEEE Communications Magazine, vol. 40, no. 1, pp. 50-57, 2002.

[33] Y. Bejerano, Y. Breitbart, A. Orda, R. Rastogi, A. Sprintson, “Algorithms for computing QoS paths with restoration’, IEEE/ACM Transactions on Networking, vol. 13, no. 3, pp. 648-661, 2005.

[34] D. Leung, and W. D. Grover, “Capacity Planning of Survivable Mesh-based Transport Networks under Demand Uncertainty”, Photonic Network Communications, vol. 10, no. 2, pp. 123-140, 2005.

[35] J. L. Kennington, K. R. Lewis, E. V. Olinick, “Robust Solutions for the WDM Routing and Provisioning Problem: Models and Algorithms”, Optical Networks Magazine, vol. 4, no. 2, pp. 74 – 84, 2003.

[36] S. V. Ukkusuri, T. V. Mathew, S. T. Waller, “Robust Transportation Network Design Under Demand Uncertainty”, Computer-Aided Civil and Infrastructure Engineering, vol. 22, pp. 6-18, 2007.

[37] A. Atamturk, M. Zhang, “Two-stage robust network flow and design under demand uncertainty”, Operations Research, 2006.

[38] X. Xipeng, L. M. Ni, “Internet QoS: A Big Picture”, IEEE Network, vol. 13, no. 2, pp. 8-18, 1999.

[39] A. Kaheel, T. Khattab, A. Mohamed, H. Alnuweiri, “Quality-of-service mechanisms in IP-over-WDM networks”, IEEE Communications Magazine, vol. 40, no. 12, pp. 38-43, 2002.

[40] C. V. Saradhi, M. Gurusamy, and L. Zhou, “Differentiated QoS for Survivable WDM Optical Networks”, IEEE Communications Magazine, vol. 42, no. 5, pp. s8-14, 2004.

[41] Y. S. Kavian, H. F. Rashvand, W. Ren, M. Naderi, E. L. Hines, M. S. Leeson, “Genetic algorithm quality of service design in resilient dense wavelength division multiplexing optical networks”, IET Communication, Vol. 2, No. 4, pp. 505–513, 2008.

Page 151 /183