Communication Networks and Service Management in the Era of Artificial Intelligence and Machine Learning
2.3 Learning for Network and Service Management
AI/ML techniques have a long list of applications in network and service management tasks, including (but not limited to) traffic/service classification and prediction for performance management; intrusion and malware identification and attribution for security management; root cause analysis and fault identification/prediction for fault management; and resource/job allocation/assignment for configuration management. As discussed in Chapter, the growth in connected devices, together with new communication technologies from 5G+ to SDN to NFV, is pushing network and service management research to explore new methodologies from the AI/ML field [17].
Given the current advances in networks/services, AI/ML has found its place in performance management tasks because of its ability to learn from big data in order to predict different conditions, aggregate patterns, and identify triggers for operations and management actions. For example, traffic prediction has seen multiple ML‐based applications, from time series forecasting [18] to neural networks [19, 20] to hidden Markov models [21] to genetic algorithms [22]. Moreover, many other performance management tasks have employed AI/ML techniques, including traffic management in the cloud and mobile edge computing, network resource management and allocation, Quality of Service assurance, and congestion control. These leverage the capabilities of AI/ML techniques to learn from temporal and dynamic data [23–26]. Current examples of such developments include Deep Neural Networks [27], transfer learning [28], Deep Reinforcement Learning [15, 29], and stream online learning [30].
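To make the idea of time series forecasting for traffic prediction concrete, the following is a minimal sketch using simple exponential smoothing. It is only one elementary forecasting method and is not necessarily the approach used in the cited works; the function name `forecast_next`, the smoothing factor `alpha`, and the sample traffic values are illustrative assumptions.

```python
from typing import Sequence


def forecast_next(series: Sequence[float], alpha: float = 0.5) -> float:
    """One-step-ahead forecast via simple exponential smoothing.

    `series` holds past traffic measurements (for example, Mbit/s per
    interval); `alpha` in (0, 1] weights recent observations more heavily
    as it grows. Both names are hypothetical, chosen for this sketch.
    """
    if not series:
        raise ValueError("series must contain at least one observation")
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")

    level = series[0]  # initialise the smoothed level with the first value
    for observed in series[1:]:
        # Blend the new observation with the previous smoothed level.
        level = alpha * observed + (1.0 - alpha) * level
    return level  # the final level is the forecast for the next interval


if __name__ == "__main__":
    # Made-up traffic volumes for four consecutive intervals.
    history = [100.0, 120.0, 80.0, 100.0]
    print(forecast_next(history, alpha=0.5))  # 97.5
```

A small `alpha` yields a smoother forecast that reacts slowly to change, while a value close to 1 tracks the most recent measurement; the learned models cited above aim to capture richer temporal structure than this baseline.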
Security management is another network/service management field that has seen early and extensive adoption of AI/ML techniques. Network anomaly detection is a prime example, in which ML techniques are applied for their ability to automatically learn from the data and extract patterns that can be used to identify network anomalies in a timely manner [31]. To this end, temporal correlation [32], wavelet analysis [33], and traditional change point detection [34] approaches are applied to produce normal/malicious traffic models, where the sequence of actions in a time window is used to create profiles using clustering techniques such as Self Organizing Maps [35], K‐means [36], and Gaussian Mixture Models [37]. Moreover, a range of AI/ML techniques have been applied to network intrusion detection, including, but not limited to, Decision Trees, Evolutionary Computing, Bayesian Networks, Support Vector Machines, and, more recently, Deep and Reinforcement Learning [38–43]. Unsupervised learning and stream online learning have been employed for security tasks as well [44, 45]. Other examples of AI/ML applications in security are moving target defence, insider threat detection, and network content filtering [46–48].
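The clustering-based profiling described above can be sketched as follows: per-window feature vectors are grouped with K-means, and a new window is scored by its distance to the nearest cluster centre. This is a minimal illustration under stated assumptions, not the method of the cited works: the two features (packets per second, mean packet size), the sample values, the deterministic initialisation from the first `k` points, and the function names are all hypothetical.

```python
import math
from typing import List, Sequence

Point = Sequence[float]


def _dist(a: Point, b: Point) -> float:
    """Euclidean distance between two feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def kmeans(points: List[Point], k: int, iterations: int = 20) -> List[List[float]]:
    """Plain K-means (Lloyd's algorithm); returns the k centroids."""
    if k < 1 or k > len(points):
        raise ValueError("k must be between 1 and the number of points")
    # Assumption for reproducibility: seed centroids with the first k points.
    centroids = [list(p) for p in points[:k]]
    for _ in range(iterations):
        clusters: List[List[Point]] = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: _dist(p, centroids[i]))
            clusters[nearest].append(p)
        for i, members in enumerate(clusters):
            if members:  # keep the old centroid if a cluster is empty
                centroids[i] = [sum(col) / len(members) for col in zip(*members)]
    return centroids


def anomaly_score(window: Point, centroids: List[List[float]]) -> float:
    """Distance from a window's features to the closest learned profile."""
    return min(_dist(window, c) for c in centroids)


if __name__ == "__main__":
    # Made-up (packets per second, mean packet size in bytes) per time window.
    normal_windows = [(10.0, 500.0), (100.0, 60.0), (12.0, 510.0),
                      (98.0, 64.0), (11.0, 495.0), (102.0, 58.0)]
    profiles = kmeans(normal_windows, k=2)
    print(anomaly_score((11.0, 500.0), profiles))  # small: fits a profile
    print(anomaly_score((900.0, 40.0), profiles))  # large: far from both
```

In practice the features would usually be normalised so that no single one dominates the distance, and the threshold separating normal from anomalous scores would have to be tuned on labelled or trusted traffic.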
In fault management, the prediction and diagnosis of faults have attracted widespread use of AI/ML techniques, from online learning for change point detection to Neural Networks to Hidden Markov Models to Decision Trees, as well as several unsupervised learning algorithms [49–53]. Additionally, other AI/ML techniques have been introduced specifically for fault prediction, automated fault mitigation, and root cause analysis [54–57].
The application of AI/ML techniques has been slower in configuration management tasks. However, as discussed earlier, with the introduction of NFV and SDN technologies, this is changing [58–60]. With initiatives such as Intent Based Networking [61] and Zero Touch Networking [62], widespread usage of AI/ML has been seen in wireless networks. Other example tasks in configuration management employing ML are service configuration management, network load balancing, and routing [63–68].
In summary, AI/ML techniques have been applied to a growing number of network and service management tasks over the last decade [69]. However, there are still challenges that need to be resolved before such techniques can be used successfully in production environments. One challenge is obtaining high-quality data for training and evaluating ML techniques for network and service management functions. Even though network/service data is plentiful and diverse in the real world, most of the time it is difficult to obtain such data with ground truth. In turn, this not only makes it difficult to evaluate AI/ML techniques but also raises privacy and trust issues. Another challenge is that in today's networks/services, data are generated nonstop at high volume and velocity. They include stationary and non‐stationary behaviors superimposed on one another, and they evolve continuously as new protocols and technologies are introduced over time. All of this is reflected in the data in one form or another: as gradual drifts in user/system behavior, or as sudden shifts caused, for example, by a malfunctioning device or a denial of service attack on a particular network or service. This means that AI/ML techniques need to take these dynamics and changes into account and learn under the aforementioned conditions in order to be deployed successfully. Yet another challenge is the need for human experts (from network engineers to security analysts to network/service managers) to understand and trust AI/ML-based systems and tools. This requires transparent AI/ML techniques that support expert involvement and trust, which is of utmost importance for the widespread and successful deployment of AI/ML techniques in network and service management.
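As an illustration of how a sudden shift in streaming network data might be flagged, the sketch below implements the Page-Hinkley test, a classical sequential change detector often used in stream learning. The text above does not prescribe this method; it is shown only as one possible option. The class and function names, the parameters `delta` and `threshold`, and the synthetic stream are assumptions made for this example, and this variant detects upward shifts in the mean only.

```python
from typing import Iterable, Optional


class PageHinkley:
    """Page-Hinkley test for an upward shift in the mean of a stream."""

    def __init__(self, delta: float = 0.005, threshold: float = 50.0) -> None:
        self.delta = delta          # tolerated magnitude of change
        self.threshold = threshold  # alarm level (lambda in the literature)
        self.count = 0
        self.mean = 0.0             # running mean of all values seen so far
        self.cumulative = 0.0       # cumulative deviation from the mean
        self.minimum = 0.0          # smallest cumulative deviation so far

    def update(self, value: float) -> bool:
        """Consume one observation; return True if a change is signalled."""
        self.count += 1
        self.mean += (value - self.mean) / self.count
        self.cumulative += value - self.mean - self.delta
        self.minimum = min(self.minimum, self.cumulative)
        return self.cumulative - self.minimum > self.threshold


def first_alarm(stream: Iterable[float], detector: PageHinkley) -> Optional[int]:
    """Index of the first observation that triggers an alarm, or None."""
    for index, value in enumerate(stream):
        if detector.update(value):
            return index
    return None


if __name__ == "__main__":
    # Synthetic metric: stable at 10, then a sudden shift to 20
    # (for instance, request latency after a device malfunction).
    stream = [10.0] * 50 + [20.0] * 50
    print(first_alarm(stream, PageHinkley()))  # alarm shortly after index 50
```

A lower `threshold` signals changes sooner at the cost of more false alarms; gradual drifts typically call for adaptive or windowed detectors rather than this simple test.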
Finally, these challenges also create opportunities, in the form of a need for transparent, robust, and dependable AI/ML-based techniques for network and service management. To this end, we have already started to see applications of stream learning, adversarial learning, and transfer learning in network and service management solutions. Furthermore, research on transparent, secure, and robust AI/ML techniques has gained considerable momentum in the ML community. Given the scale and dynamics of today's networks/services, we envision that the application of AI/ML techniques will become increasingly ubiquitous and central to the operations and management of future services and networks. In the following Chapters –, we will introduce the current state and the new trends of AI/ML applications in network and service management.