Deep Learning-based Network Traffic Prediction for Secure Backbone Networks in Internet of Vehicles
XIAOJIE WANG, School of Communication and Information Engineering, Chongqing University of Posts
and Telecommunications, Chongqing 400065, China.
LAISEN NIE* , School of Electronics and Information, Northwestern Polytechnical University, Xi’an, 710072,
China.
ZHAOLONG NING* , School of Communication and Information Engineering, Chongqing University of
Posts and Telecommunications, Chongqing 400065, China.
LEI GUO, School of Communication and Information Engineering, Chongqing University of Posts and
Telecommunications, Chongqing 400065, China.
GUOYIN WANG, Chongqing Key Laboratory of Computational Intelligence, Chongqing University of
Posts and Telecommunications, Chongqing 400065, China.
XINBO GAO, Chongqing Key Laboratory of Image Cognition, Chongqing University of Posts and Telecommunications, Chongqing 400065, China.
NEERAJ KUMAR, Department of Computer Science and Engineering, Thapar Institute of Engineering and Technology, Patiala, India.
Internet of Vehicles (IoV), as a special application of Internet of Things (IoT), has been widely used for Intelligent Transportation System (ITS), which leads to complex and heterogeneous IoV backbone networks. Network traffic prediction techniques are crucial for efficient and secure network management, such as routing, network planning, and anomaly and intrusion detection. This paper studies the problem of end-to-end network traffic prediction in IoV backbone networks and proposes a deep learning-based method. The constructed system considers the spatio-temporal features of network traffic and can capture its long-range dependence. Furthermore, a threshold-based update mechanism built on Q-learning is put forward to improve the real-time performance of the designed method. The effectiveness of the proposed method is evaluated on a real network traffic data set.
Authors’ addresses: Xiaojie Wang, School of Communication and Information Engineering, Chongqing University of Posts and
Telecommunications, Chongqing 400065, China.; Laisen Nie, nielaisen@nwpu.edu.cn, School of Electronics and Information,
Northwestern Polytechnical University, Xi’an, 710072, China.; Zhaolong Ning, zhaolongning@gmail.com, School of Communi-
cation and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China.;
Lei Guo, School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications,
Chongqing 400065, China.; Guoyin Wang, Chongqing Key Laboratory of Computational Intelligence, Chongqing University
of Posts and Telecommunications, Chongqing 400065, China.; Xinbo Gao, Chongqing Key Laboratory of Image Cognition,
Chongqing University of Posts and Telecommunications, Chongqing 400065, China.; Neeraj Kumar, Department of Computer Science and Engineering, Thapar Institute of Engineering and Technology, Patiala, India.
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee
provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and
the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored.
Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires
prior specific permission and/or a fee. Request permissions from permissions@acm.org.
© 2020 Association for Computing Machinery.
Additional Key Words and Phrases: Internet of vehicles, traffic prediction, network security, deep learning
1 INTRODUCTION
Cloud-based Internet of Vehicles (IoV), which integrates Internet of Things (IoT) and cloud computing, has been rapidly developed for Intelligent Transportation System (ITS) [6, 11, 16, 19, 23, 26, 29]. IoVs can provide reliable communications among vehicles for safety-related applications, as well as network access to infrastructures and pedestrians. IoV backbone networks carry the aggregated end-to-end network traffic generated by a large number of sensors and the safety monitoring systems of On-Board Units (OBUs) [9], as shown in Fig. 1. In this case, the IoV backbone network has become much more complex
and heterogeneous. As a crucial component of an IoV, the network security and management system is
necessary to provide reliable data transmissions for secure IoVs. Network management operations usually
collect network states at first, and then make decisions for network management according to the reported
data [5, 12, 28, 33]. End-to-end network traffic is a crucial network state, which is the basis for network
planning and predictive routing [2, 31].
With the development of IoVs, more and more data have been generated for implementing ITS, which raises challenges in network security. For instance, an attacker may intrude into an IoV and steal information related to users' privacy, leading to tremendous economic losses for users. Consequently, many network security applications, such as Intrusion Detection Systems (IDS), have been deployed in the IoV backbone network to protect it from various kinds of threats. Besides, anomaly detection techniques are leveraged to implement anomaly-based intrusion detection and to detect possible damages in networks [8, 13, 30]. In practice, both
intrusion and anomaly detection systems need end-to-end network traffic data as an input parameter to
carry out the corresponding functions. For instance, an IDS can identify the Distributed Denial of Service
attack (DDoS) by detecting the anomalous behaviors of end-to-end network traffic.
Thereby, precise network traffic prediction is useful for maintaining IoVs. A Traffic Matrix (TM), which
shows the size of network traffic among Origin-Destination (OD) nodes, is the mathematical representation
of network traffic [20]. In a TM, each row vector reveals the time-varying features of the corresponding
OD flow. A TM obeys manifold statistical features, e.g., spatial, temporal and spatio-temporal features. These intricate statistical features give rise to the main challenges in network traffic prediction. Initially, statistics-based methods, such as Gaussian and Poisson models, were proposed to predict network traffic. However, the statistical features of TM become much more complex as more services are supported in IoVs. Then, Machine Learning (ML)-based methods emerged for network traffic prediction, such as Principal Component Analysis (PCA) [24], Convolutional Neural Network (CNN) [32] and Long Short-Term Memory (LSTM) methods [22]. These methods capture several statistical features jointly to decrease the network traffic prediction error.
Nevertheless, they are not appropriate for predicting network traffic in IoVs. The corresponding main
challenges can be summarized as follows [7, 10, 25, 27]:
[Fig. 1. An IoV backbone network: the Internet, the network management system, a firewall and routers, connected to RSUs and 5G base stations through V2I links.]
∙ The statistical features of network traffic in secure IoVs obey a superposition of various distributions. Various kinds of data in a secure IoV lead to complex statistical features of network traffic, and these complex features result in unfaithful predictors when previous methods are applied.
∙ The ML-based methods cannot meet the demands of real-time performance. For a secure IoV, the topology changes frequently due to the quick movement of vehicles. Meanwhile, a secure IoV needs fast responses to any unlawful attacks. Hence, the real-time performance of network traffic prediction is crucial for secure IoVs. Nevertheless, ML-based methods usually extract the features of network traffic by collecting many data samples. Meanwhile, to guarantee prediction accuracy, deep architectures are often updated persistently. Training these deep architectures on massive data is time-consuming and expensive in network resources (e.g., computation and communication resources).
∙ It is significantly difficult for existing ML-based methods to extract the features of network traffic using few traffic samples. With limited communication resources, sampling and transmitting massive training data are uneconomical for secure IoVs.
Motivated by the above challenges, we investigate the network traffic prediction problem, and design a
network traffic prediction system for secure IoVs. The proposed system uses CNN and LSTM to optimize the
accuracy and real-time performance of network traffic prediction by extracting multiple statistical features
of TM jointly and implementing intermittent model updates. To extract the multiple statistical features
of TM jointly, a deep architecture integrating CNN and LSTM is built, in which CNN and LSTM are
employed to track the spatio-temporal and time-varying features of TM. LSTM is a particular paradigm of
Recurrent Neural Network (RNN), which is able to capture the relationship between several input elements
by the feedbacks within the same layer. Based on this advantage, we explore the time-varying features of
network traffic through LSTM. Moreover, the TM, consisting of all the OD flows, expresses spatial and spatio-temporal features caused by routing algorithms and network configurations. Hence, we take advantage of CNN to portray the spatio-temporal features of TM. To obtain the best predictors, each predictor is in practice calculated after a training process. However, each training of the designed deep architecture is expensive in terms of real-time performance. Thereby, we train the deep architecture intermittently, and propose a threshold-based mechanism to determine the time intervals between two training processes [17].
The following summarizes the main contributions of our work:
∙ To predict network traffic of a secure IoV backbone network accurately, we design a deep architecture
utilizing CNN and LSTM to trace the temporal and spatio-temporal features of TM jointly.
∙ To improve the real-time performance of the proposed method, a threshold-based mechanism is
proposed to update the deep architecture discretely. A Reinforcement Learning (RL) algorithm is
proposed to determine the threshold.
∙ We evaluate the designed scheme on a real network traffic data set sampled from the Abilene network and on our constructed testbed, which is leveraged to imitate the scenario of a secure IoV. According to the evaluation, our method can capture the long-term network traffic in secure IoVs.
The rest of this paper is organized as follows. The related work is reviewed in Section 2. Section 3 provides the system model of network traffic prediction. Section 4 presents the proposed method, and Section 5 gives the evaluation of the designed deep architecture. Finally, we conclude our work and discuss future directions in Section 6.
2 RELATED WORK
2.1 Shallow Learning-based Methods
Initially, researchers model OD traffic by simple statistical models, e.g., Gaussian and Poisson models,
to extract network traffic features. Then, the problem of network traffic prediction is converted into a
parameter estimation problem. However, simple models cannot model nonlinear distributions of network
traffic [1, 4, 21, 24].
ML techniques have been widely used in network traffic prediction, owing to their remarkable ability in modeling complex statistical distributions. Initially, various shallow learning-based approaches emerged to solve the network traffic prediction problem, such as shallow Neural Network (NN) and PCA. These methods try to fit the statistical features of network traffic, consisting of linear and nonlinear features. In-depth analysis shows that current network traffic yields multiple distributions, such as multi-fractal and heavy-tailed distributions. Capturing these statistical features can enhance the accuracy of network traffic prediction remarkably. However, traditional shallow learning approaches cannot track these features effectively. For instance, the PCA method deals with the problem of network traffic prediction by singular value decomposition, and neglects non-principal components with small singular values. Namely, it relies on the spatial correlation of TM. The NN method is mainly designed to adapt to the changes of TM with respect to the time interval.
To enhance the accuracy of network traffic prediction, hybrid model-based methods have emerged, in which researchers model network traffic with more than two models. In [1], the authors proposed two hybrid methods based on Fanout Estimation (FE) and Iterative Proportional Fitting (IPF), respectively. The IPF method is adapted to track the time-varying features of OD network traffic. By contrast, the FE method can extract the spatial features of TM. Both are combined with other state-of-the-art modeling techniques for network traffic feature extraction, i.e., tomogravity, entropy maximization and shallow NN. Besides, the authors in [18] proposed a network traffic feature extraction method using an artificial NN and a genetic algorithm. They first model OD flows by an autoregressive model with exogenous inputs, and then calculate the parameters (i.e., weights and biases) by the genetic algorithm.
3 SYSTEM MODEL
As mentioned before, TM reveals the volume of traffic that flows among all OD pairs. In this paper, we
denote TM by 𝑋 for a secure IoV backbone network. Its element is 𝑋𝑛,𝑡 , where 𝑛 is the index of an OD
flow and 𝑡 denotes time slot. Generally, 𝑡 is a time interval, and then 𝑋𝑛,𝑡 shows the average traffic within
time interval 𝑡. If the network is made up of 𝑁 nodes and 𝐾 links, then 𝑛 = 1, 2, 3, ..., 𝑁 2 . Moreover, if TM
contains 𝑇 time slots of network traffic in a secure IoV backbone network, then 𝑡 = 1, 2, 3, ..., 𝑇 . Link load
expresses the aggregation of OD network traffic, and it conforms to a linear relationship with respect to TM.
This relationship can be denoted by:
𝑌 = 𝑅𝑋, (1)
where $R$ is the so-called routing matrix. The routing matrix consists of 0s and 1s, assuming that each OD flow is transmitted along a single path of the IoV backbone network. The element of $R$ is denoted
by 𝑅𝑘,𝑛 . When 𝑅𝑘,𝑛 = 1, the 𝑛th OD flow passes link 𝑘 (𝑘 = 1, 2, ..., 𝐾), otherwise 𝑅𝑘,𝑛 = 0. Obviously,
the sizes of link load and routing matrices are 𝐾 × 𝑇 and 𝐾 × 𝑁 2 , respectively. TM estimation techniques
calculate TM 𝑋 via 𝑌 and 𝑅. Symbols 𝑌 and 𝑅 are available, because they can be obtained from the
simple network management protocol and routing configurations, respectively. In this paper, we utilize link
load and routing matrix to build a threshold for deep architecture update.
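As a minimal illustration of Eq. (1), the following NumPy sketch builds a toy routing matrix from a handful of flow paths and aggregates a traffic matrix into link loads; the topology, paths and variable names are illustrative assumptions rather than the configuration used in the paper.

```python
# Sketch: link loads as the linear aggregation Y = R X of OD traffic (Eq. (1)).
# The 3-node topology, paths and names below are illustrative assumptions.
import numpy as np

K, N, T = 3, 3, 4                      # links, nodes, time slots
paths = {                              # OD flow index -> links it traverses
    0: [0],        # flow 0 uses link 0
    1: [0, 1],     # flow 1 uses links 0 and 1
    2: [2],        # flow 2 uses link 2
}
R = np.zeros((K, N * N))               # routing matrix, K x N^2
for flow, links in paths.items():
    R[links, flow] = 1                 # R[k, n] = 1 iff flow n passes link k

X = np.random.rand(N * N, T)           # traffic matrix, N^2 x T
Y = R @ X                              # link loads, K x T
```

In practice, $R$ would be derived from the routing configurations and $Y$ from SNMP link counters, as described above.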
The problem of network traffic prediction in an IoV backbone network is to calculate $X_{n,t}$ according to the previous network traffic $(X_{n,t-1}, X_{n,t-2}, \ldots, X_{n,t-E})$ of each OD flow, which can be denoted by:
$$X_{n,t} = f\left(X_{n,t-1}, X_{n,t-2}, \ldots, X_{n,t-E}\right). \qquad (2)$$
Then this problem is defined as fitting function $f(\cdot)$ with respect to $X_{n,t-1}, X_{n,t-2}, \ldots, X_{n,t-E}$. Eq. (2) mainly shows the problem of network traffic prediction according to a sequence. Namely, fitting this function mainly considers the temporal features of OD flows. By expanding to other features, the network traffic prediction problem can be defined as:
$$X_{n,t} = f\left(X_{t-1}, X_{t-2}, \ldots, X_{t-E}\right), \qquad (3)$$
where vector 𝑋𝑡−𝑒 (𝑒 = 1, 2, ..., 𝐸) is a snapshot of network traffic over time slot 𝑡 − 𝑒. Symbol 𝐸 is the
length of prior network traffic. Different from the problem of network traffic prediction shown by Eq. (2), we
consider the spatial and spatio-temporal features of network traffic. In this case, the problem of network
traffic prediction is modeled by calculating a traffic element according to previous snapshots of network
traffic.
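For illustration, the sketch below shows one way to form prediction samples under the formulation of Eq. (3): each input is a window of $E$ previous TM snapshots and the target is the next $L_p$ values of a given OD flow. The indexing and helper names are assumptions for exposition, not the exact construction of Algorithm 2.

```python
# Sketch: build (input window, target) pairs for one OD flow from a TM.
import numpy as np

def make_samples(X, flow_n, E, L_p):
    # X: TM of shape (N^2, T); returns (inputs, targets) for OD flow `flow_n`.
    inputs, targets = [], []
    for m in range(X.shape[1] - E - L_p + 1):
        inputs.append(X[:, m:m + E])                     # E previous snapshots
        targets.append(X[flow_n, m + E:m + E + L_p])     # next L_p values of flow n
    return np.stack(inputs), np.stack(targets)
```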
4 OUR METHODOLOGY
Traditional prediction approaches usually calculate network traffic by extracting the time-varying features of
OD flows to fit the function in Eq. (2). Different from previous approaches, we combine CNN and LSTM,
and design a deep architecture to predict the network traffic of secure IoVs by means of extracting two
features, i.e., the spatio-temporal and temporal features. The designed deep architecture for network traffic
prediction is shown in Fig. 2. It contains 6 hidden layers, i.e., a convolutional layer, a subsampling layer, an
LSTM layer, two fully connected layers and a dropout layer. In our method, we first preprocess the network
traffic data set to normalize and centralize it, that is:
$$X_{n,t} = \frac{X_{n,t} - \mu_n}{\left|X_{n,t}\right|}, \qquad (4)$$
where 𝜇𝑛 is the average value of OD flow 𝑛, and can be computed by the prior of network traffic.
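A small sketch of the preprocessing in Eq. (4), assuming $\mu_n$ is estimated from the historical portion of the TM (variable names are illustrative):

```python
# Preprocessing sketch for Eq. (4): centre each OD flow by its prior mean mu_n
# and normalise element-wise (an epsilon guards against zero-valued elements).
import numpy as np

def preprocess(X_hist, X, eps=1e-9):
    mu = X_hist.mean(axis=1, keepdims=True)   # per-flow mean from prior traffic
    return (X - mu) / (np.abs(X) + eps)       # element-wise, as in Eq. (4)
```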
[Fig. 2. The designed deep architecture: input data, a convolutional layer (6@5 × 5), a subsampling layer (6@2 × 2), an LSTM layer, two fully connected layers with a dropout layer in between, and the output layer.]
In general, the output maps of a convolutional layer are computed by
$$a_j^{(l)} = \sigma\left(a_i^{(l-1)} * k_j^{(l)} + b_j^{(l)}\right), \qquad (5)$$
where $k_j^{(l)}$ and $b_j^{(l)}$ are the $j$th convolution kernel of the $l$th convolutional layer and its bias, respectively. Variable $a_i^{(l-1)}$ is the output map of the $(l-1)$th layer, and function $\sigma(\cdot)$ is the activation function.
As mentioned before, CNN is suitable for handling 2-dimensional data. Thereby, we utilize it to carry
out the TM-oriented spatio-temporal feature extraction. In our deep architecture shown in Fig. 2, the
convolutional layer, following the input layer, consists of 6 convolution kernels which learn 5 × 5 spatial
dimensions. To predict network traffic, the tanh function is used as the activation function on the convolutional
layer. Then, the output maps are:
$$a_j^{(1)} = \tanh\left(X^{(m)} * k_j^{(1)} + b_j^{(1)}\right), \quad j \in \{6\}, \qquad (6)$$
where 𝑋 (𝑚) is the training data set constructed by the prior of TM, and set {6} denotes the set {1, 2, ..., 6}.
Following the convolutional layer, a subsampling layer is built to carry out the average pooling with a factor
of 2. Then, the output maps of this average pooling are:
$$a_j^{(2)} = \mathrm{average}\left(a_j^{(1)}\right), \quad j \in \{6\}. \qquad (7)$$
The LSTM layer is developed from the Recurrent Neural Network (RNN), whose hidden state and output can be written as
$$\begin{cases} f_t = \sigma\left(U X_t + W f_{t-1} + b_f\right), \\ a_t = \sigma\left(V f_t + c_f\right), \end{cases} \qquad (8)$$
where $X_t$ is the input data, and $U$, $V$ and $W$ are the weights of the RNN. Variables $b_f$ and $c_f$ are biases, and $a_t$ is the final output map of the RNN. LSTM takes advantage of three gates to control the contents of the unit state. They are the forget gate, the input gate and the output gate, whose weights are denoted by $W^f$, $W^i$ and $W^o$, respectively, and they can be denoted by:
$$\begin{cases} g_t = \sigma\left(W^f\left[f_{t-1}; X_t\right] + b^f\right), \\ i_t = \sigma\left(W^i\left[f_{t-1}; X_t\right] + b^i\right), \\ o_t = \sigma\left(W^o\left[f_{t-1}; X_t\right] + b^o\right), \end{cases} \qquad (9)$$
where [·; ·] denotes the combination of two matrices. Variables 𝑏𝑓 , 𝑏𝑖 and 𝑏𝑜 are the biases for three gates,
respectively. Finally, the current cell state and the final output map of LSTM cell are:
$$\begin{cases} c_t = c_{t-1}\, g_t + i_t \tanh\left(W\left[f_{t-1}; X_t\right] + b^c\right), \\ a_t = o_t \tanh\left(c_t\right), \end{cases} \qquad (10)$$
where 𝑐𝑡−1 is the previous cell state.
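The following element-wise sketch walks through one LSTM step following Eqs. (9) and (10); the shapes and the weight layout acting on the concatenation $[f_{t-1}; X_t]$ are assumptions for illustration.

```python
# Sketch of one LSTM step following Eqs. (9)-(10); shapes are illustrative.
import numpy as np

def lstm_step(x_t, f_prev, c_prev, W_f, W_i, W_o, W_c, b_f, b_i, b_o, b_c):
    sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))
    z = np.concatenate([f_prev, x_t])                    # [f_{t-1}; X_t]
    g_t = sigmoid(W_f @ z + b_f)                         # forget gate
    i_t = sigmoid(W_i @ z + b_i)                         # input gate
    o_t = sigmoid(W_o @ z + b_o)                         # output gate
    c_t = c_prev * g_t + i_t * np.tanh(W_c @ z + b_c)    # cell state, Eq. (10)
    a_t = o_t * np.tanh(c_t)                             # output map
    return a_t, c_t
```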
In our method, we extract the spatio-temporal and time-varying features of TM by CNN and LSTM,
respectively. As shown in Fig. 2, the output maps of the subsampling layer are 6 matrices. To connect them to the LSTM layer, we unfold these matrices into a vector and use it as the input of the LSTM layer. After that, the
output maps of LSTM can be shown as follows:
$$\begin{cases} g_t = \sigma\left(W^f\left[f_{t-1}; a_j^{(2)}\right] + b^f\right), \\ i_t = \sigma\left(W^i\left[f_{t-1}; a_j^{(2)}\right] + b^i\right), \\ o_t = \sigma\left(W^o\left[f_{t-1}; a_j^{(2)}\right] + b^o\right), \\ c_t = c_{t-1}\, g_t + i_t \tanh\left(W\left[f_{t-1}; a_j^{(2)}\right] + b^c\right), \\ a^{(3)} = o_t \tanh\left(c_t\right). \end{cases} \qquad (11)$$
On the LSTM layer, there are $3\left(N^2-4\right)/2$ LSTM blocks for temporal feature extraction.
The rest of the proposed deep architecture contains two fully connected layers and one dropout layer
employed between the two fully connected layers. The first fully connected layer is made up of $3\left(N^2-4\right)/2$ neurons, and we also use the tanh function as the activation function for prediction. It can be denoted by:
$$a_i^{(4)} = \tanh\left(W_i^{(4)} a^{(3)} + b_i^{(4)}\right). \qquad (12)$$
The following is the dropout layer used to prevent the proposed deep architecture from overfitting. The
number of neurons on the dropout is the same as the first fully connected layer. It carries out a probabilistic
dormancy mechanism for each neuron of the first fully connected layer. In other words, the output maps of
the dropout layer can be denoted by:
$$a_i^{(5)} = d_i^{(5)} \tanh\left(W_i^{(4)} a^{(3)} + b_i^{(4)}\right), \qquad (13)$$
where $d_i^{(5)}$ obeys a Bernoulli distribution, $d_i^{(5)} \sim Bernoulli(p)$. The fully connected layer connecting with
the output layer determines the length of predicted sequence. The objective of the deep architecture is
to implement regression. Hence, to implement prediction, the tanh function is also utilized as activation
function on this layer. The length of predicted sequence by way of a forward propagation is equal to the
number of neurons on this fully connected layer.
To train the deep architecture, the backpropagation algorithm is applied to our approach. The loss
function used in the backpropagation algorithm is the Mean-Squared-Error (MSE) of prediction, which is
defined as:
$$loss_{MSE} = \frac{1}{I^{(L)}} \sum_{i=1}^{I^{(L)}} \left(h_i - X_{n,i}\right)^2, \qquad (14)$$
where $I^{(L)}$ is the number of neurons on the output layer, and $h_i$ denotes the outputs of the proposed deep architecture.
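A minimal PyTorch sketch of the described CNN+LSTM architecture is given below. The layer sizes follow the description above (6 kernels of size 5 × 5, average pooling by 2, $3(N^2-4)/2$ LSTM units, two fully connected layers with a dropout layer), while the class name, tensor layout and training details are illustrative assumptions, not the authors' implementation.

```python
# A minimal sketch of the CNN+LSTM deep architecture; sizes follow the paper's
# description, names and layout are illustrative assumptions.
import torch
import torch.nn as nn

class CnnLstmPredictor(nn.Module):
    def __init__(self, n_flows: int, window: int, pred_len: int, p_drop: float = 0.5):
        super().__init__()
        self.conv = nn.Conv2d(in_channels=1, out_channels=6, kernel_size=5)  # Eq. (6)
        self.pool = nn.AvgPool2d(kernel_size=2)                              # Eq. (7)
        flat = 6 * ((n_flows - 4) // 2) * ((window - 4) // 2)  # unfolded pooled maps
        hidden = 3 * (n_flows - 4) // 2                        # LSTM units, Eq. (11)
        self.lstm = nn.LSTM(input_size=flat, hidden_size=hidden, batch_first=True)
        self.fc1 = nn.Linear(hidden, hidden)                   # Eq. (12)
        self.drop = nn.Dropout(p=p_drop)                       # Eq. (13)
        self.fc2 = nn.Linear(hidden, pred_len)                 # output layer

    def forward(self, x):
        # x: (batch, n_flows, window) -- a window of E TM snapshots per sample.
        a1 = torch.tanh(self.conv(x.unsqueeze(1)))     # (batch, 6, n_flows-4, window-4)
        a2 = self.pool(a1)                             # average pooling by 2
        seq = a2.flatten(start_dim=1).unsqueeze(1)     # unfold the 6 maps into a vector
        a3, _ = self.lstm(seq)                         # LSTM layer
        a4 = torch.tanh(self.fc1(a3[:, -1]))           # fully connected + tanh
        a5 = self.drop(a4)                             # dropout
        return torch.tanh(self.fc2(a5))                # predicted sequence of length L_p

model = CnnLstmPredictor(n_flows=144, window=50, pred_len=4)
loss_fn = nn.MSELoss()                                 # Eq. (14)
```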
To decide when the deep architecture should be re-trained, we use the Normalized Mean Absolute Error (NMAE) between the measured link load and the link load reproduced from the predictor:
$$NMAE(t) = \frac{\sum_{k=1}^{K}\left|Y_{k,t} - \left(R\hat{X}\right)_{k,t}\right|}{\sum_{k=1}^{K}\left|Y_{k,t}\right|}, \qquad (15)$$
where $\hat{X}$ is the predictor of TM, and $Y$ is the corresponding aggregation traffic (i.e., link load). Compared with other metrics, NMAE can measure the spatial and temporal errors of network traffic prediction. Therefore, we use NMAE as the metric of the threshold-based mechanism. In our method, when the NMAE between a snapshot of link load and its predictor is larger than the threshold, we train the deep architecture. Otherwise, we predict the traffic elements over the next time interval using the previously trained deep architecture.
The threshold is computed by RL to obtain the optimal tradeoff between real-time performance and accuracy. RL has an excellent ability to deal with decision problems, in which it can compute an optimal policy with the maximum reward by implementing iterative actions. It leverages a sample of the environment to guide the following action from the current state to the next, and the relative reward is gained in the meantime. Furthermore, the environment can be updated in light of the obtained reward. The RL problem can be regarded as a Markov Decision Process (MDP) represented by ⟨𝑆, 𝐴, 𝑃 , 𝑅, 𝛾⟩. In detail, 𝑆 and 𝐴 are the state and action spaces, respectively. Symbol 𝑃 is the transition probability matrix, and 𝛾 is the discount factor. 𝑅 is the immediate reward from the current state to the next according to the given policy in the environment. To explore the optimal policy, many algorithms have been proposed. Recently, Q-learning, known as an off-policy learning algorithm, has been introduced into RL and widely developed. In the Q-learning algorithm,
when an agent with state $s$ ($s \in S$) moves to state $\tilde{s}$ ($\tilde{s} \in S$) after carrying out action $a$ ($a \in A$) according to the given policy $\pi(s, a)$, immediate reward $R(s, a)$ can be obtained. The reward with respect to current state $s$ by taking action $a$, denoted by $Q(s, a)$, is a weighted sum of the immediate reward $R(s, a)$ of moving to the next state and the maximum discounted reward attainable from that state:
$$Q(s, a) = R(s, a) + \gamma \max_{\tilde{a} \in \tilde{A}} Q\left(\tilde{s}, \tilde{a}\right), \qquad (16)$$
where $\tilde{a} \in \tilde{A}$, and $\tilde{A}$ is the action set under the next state $\tilde{s}$. The Q-learning algorithm can be regarded as an
optimization problem that finds an optimal policy to maximize the reward $Q(s, a)$. During each iteration of searching for the maximum reward, the reward is updated according to a weighted sum denoted by:
$$Q(s, a) \leftarrow (1 - \alpha)\, Q(s, a) + \alpha \left(R(s, a) + \gamma \max_{\tilde{a} \in \tilde{A}} Q\left(\tilde{s}, \tilde{a}\right)\right), \qquad (17)$$
where $\alpha \in [0, 1]$ is a constant step-size parameter. During each iteration, the sampling of the state can be chosen by way of exploration-only or exploitation-only approaches. The former prefers to sample over a large range of values and determines a state sample according to the uniform distribution. By contrast, the latter selects the state sample with the currently maximum cumulative reward.
The detailed process of computing the threshold based on Q-learning is shown in Algorithm 1. The state $s$ represents the error threshold. The state space is set according to previous prediction errors, i.e., it is
equal to the maximum NMAE among previous predictors. We define the number of updates with respect to
state 𝑠 as 𝑁 (𝑠). Similarly, the NMAE with respect to state 𝑠 can be denoted by 𝑁 𝑀 𝐴𝐸 (𝑠). They can be
obtained by counting the previous data set. Obviously, 𝑁 (𝑠) and 𝑁 𝑀 𝐴𝐸 (𝑠) are monotone decreasing and
increasing functions with respect to 𝑠, respectively. In practice, we hope that 𝑁 𝑀 𝐴𝐸 (𝑠) and 𝑁 (𝑠) are as
small as possible. Therefore, we define the initialized policy as follows:
$$\pi(s, a) = \frac{1}{NMAE(s)\, N(s)}. \qquad (18)$$
By Algorithm 1, we can gain optimal policy 𝜋 * (𝑠, 𝑎), and then set the threshold according to 𝜋 * (𝑠, 𝑎).
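The sketch below illustrates this threshold selection with tabular Q-learning in the spirit of Algorithm 1: states are candidate NMAE thresholds, the reward follows the initialized policy in Eq. (18), favouring both a small $NMAE(s)$ and a small number of updates $N(s)$, and the update rule follows Eq. (17). The action set (move the threshold down, keep it, move it up), the episode structure and the ε-greedy exploration are assumptions, since the full pseudo-code of Algorithm 1 is not reproduced here; the default $\alpha$ and $\gamma$ follow the values used later in the evaluation.

```python
# Sketch: tabular Q-learning over candidate thresholds (illustrative assumptions).
import numpy as np

def q_learning_threshold(thresholds, nmae_of, n_updates_of,
                         alpha=0.5, gamma=0.01, episodes=200, steps=50, eps=0.1):
    # thresholds: candidate values; nmae_of[s], n_updates_of[s]: NMAE(s) and N(s)
    # measured on the previous data set for each candidate state s.
    n = len(thresholds)
    q = np.zeros((n, 3))                           # actions: 0 = down, 1 = stay, 2 = up
    rng = np.random.default_rng(0)
    reward = lambda s: 1.0 / (nmae_of[s] * n_updates_of[s] + 1e-9)   # cf. Eq. (18)
    for _ in range(episodes):
        s = rng.integers(n)                        # exploration: random initial state
        for _ in range(steps):
            a = rng.integers(3) if rng.random() < eps else int(np.argmax(q[s]))
            s_next = int(np.clip(s + a - 1, 0, n - 1))
            # Weighted update of Q(s, a) as in Eq. (17).
            q[s, a] = (1 - alpha) * q[s, a] + alpha * (
                reward(s_next) + gamma * q[s_next].max())
            s = s_next
    return thresholds[int(np.argmax(q.max(axis=1)))]   # threshold of the best state
```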
Up to now, we have presented the proposed deep architecture to solve the traffic prediction problem and
the corresponding optimization method to improve the real-time performance. The details of the prediction method are shown in Algorithm 2. We first construct the training data set from the previous TM $X'$, which is known to us. $X^{(m)}$ is the $m$th sample, and it is an $N^2 \times E$ matrix. Symbol $X'_{*,(m:(m+E-1))}$ denotes
intercepting partial columns of $X'$, i.e., from column $m$ to column $m+E-1$. Similarly, $X'_{n,(m:(m+L_p-1))}$ means
intercepting partial columns (columns $m$ to $m+L_p-1$) from row $n$ of $X'$. Obviously, the output maps of the convolutional and subsampling layers are $\left(N^2-4\right) \times (E-4)$ and $\frac{N^2-4}{2} \times \frac{E-4}{2}$ matrices, respectively. The unfolding layer reshapes the 6 output maps of the subsampling layer into a sequence, which is a $\frac{3\left(N^2-4\right)(E-4)}{2} \times 1$ vector. Each output map $a_j^{(2)}$ is unfolded into a row vector consisting of the $\frac{N^2-4}{2}$ rows of $a_j^{(2)}$ in turn. Aiming at
each OD flow, we build the corresponding training data set for prediction. Then, we train the proposed deep
architecture by the backpropagation algorithm independently for each OD flow prediction. The predictor
can be obtained by forward propagation over the trained deep architecture. After predicting all OD flows, we update the prior TM $X'$ for the next prediction. Meanwhile, we can calculate the NMAE according to the link load, routing information and predictor $\hat{X}$. Besides, we compute the threshold by means of Algorithm 1. If the NMAE is greater than or equal to the threshold, we train the deep architecture with the updated prior TM for the next prediction. Otherwise, we implement the next prediction by forward propagation without re-training the proposed deep architecture. This method is able to decrease the number of training operations and optimize the real-time performance of prediction.
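Algorithm 2 can be summarized by the following online loop, sketched under the assumption that training, forward prediction and the threshold computation are available as separate routines (the helper names are illustrative):

```python
# Sketch of the threshold-based online prediction loop (Algorithm 2, paraphrased).
import numpy as np

def online_prediction(X_prior, R, link_loads, model, train, predict, threshold_fn):
    train(model, X_prior)                               # initial training
    predictions = []
    for t, y_t in enumerate(link_loads):                # one link-load snapshot per slot
        x_hat = predict(model, X_prior)                 # forward propagation only
        predictions.append(x_hat)
        X_prior = np.column_stack([X_prior[:, 1:], x_hat])   # update the prior TM
        nmae = np.abs(y_t - R @ x_hat).sum() / np.abs(y_t).sum()   # Eq. (15)
        if nmae >= threshold_fn():                      # re-train only when needed
            train(model, X_prior)
    return predictions
```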
5 NUMERICAL RESULTS
5.1 Network Traffic Data Set
To evaluate the designed approach, we apply it to the real data set sampled from the Abilene network and from our testbed [21]. The Abilene network includes 12 nodes, and these nodes are connected by 54 internal and external links. Hence, the number of OD flows is 144 in Abilene. We use one week of TM to evaluate
Fig. 3. Real network traffic and its predictor for large OD flow in Abilene.
the proposed approach. The network traffic is sampled at a 10-minute interval. Namely, the TM contains
2016 time slots. Besides, we construct a testbed, which consists of 1 RSU and 12 OBUs, to imitate the scenario of a secure IoV [15]. The topology of our testbed is built according to the open shortest path first algorithm, in which link weights are defined by the general urban path loss model [3]. Meanwhile, the distance between
two OBUs is random. Furthermore, the services provided by the testbed include video and radio. We also
collect end-to-end network traffic with 2016 time slots by using Wireshark. Moreover, two state-of-the-art
network traffic prediction approaches are leveraged for comparison, i.e., the PCA method and the Sparsity
Regularized Matrix Factorization (SRMF) method. The PCA method is a ML-based approach, and leverages
singular value decomposition to capture the spatial features of TM [24]. SRMF is a matrix interpolation
algorithm in fact. It is put forward to reconstruct the missing data during network traffic direct measurement
process. Besides, by setting special parameters, it is also an available network traffic prediction approach by
extracting the spatio-temporal features of TM [21].
In our evaluations, the first 2000 time slots of TM are used as the previous TM to build the training data set, and the rest is for testing. Furthermore, we set $M = 150$, $E = 50$, $L_p = 4$, $\alpha = 0.5$ and $\gamma = 0.01$ in our evaluations. To train the proposed deep architecture, we set the batch size equal to 2 and $p = 0.5$ for the Bernoulli distribution on the dropout layer. We set these parameters empirically to obtain the lowest prediction error. We use MATLAB R2019a to carry out all the evaluations. The evaluations are conducted on a 64-bit Windows 7 machine with an Intel Xeon W-2102 (2.9 GHz) CPU and 32 GB of RAM.
Fig. 4. Real network traffic and its predictor for small OD flow in Abilene.
Fig. 5. Real network traffic and its predictor for large OD flow in testbed.
Fig. 6. Real network traffic and its predictor for small OD flow in testbed.
[Fig. 7. SREs and TREs of CNN+LSTM, PCA and SRMF in Abilene: (a) Spatial Relative Error versus Flow ID; (b) Temporal Relative Error versus Time Slot Order.]
Figs. 3-6 plot the real network traffic and its predictors for large and small OD flows in Abilene and in our testbed. For the large flows, our approach can track the sharp damping faithfully. By contrast, SRMF has a low error, and PCA cannot track this trace at all. For small traffic flows, all approaches incur large prediction errors. In particular, SRMF produces negative predictors, and PCA shows a consistent predictor. Our approach can follow the profile of this traffic flow from time slots 6 to 13.
To provide a quantitative analysis of three approaches, we leverage the Spatial Relative Error (SRE) and
Temporal Relative Error (TRE) defined as:
$$\begin{cases} SRE(n) = \dfrac{\left\| \hat{X}_{n,t} - X_{n,t} \right\|_2}{\left\| X_{n,t} \right\|_2}, \\[2ex] TRE(t) = \dfrac{\left\| \hat{X}_{n,t} - X_{n,t} \right\|_2}{\left\| X_{n,t} \right\|_2}, \end{cases} \qquad (19)$$
where $X_{n,t}$ and $\hat{X}_{n,t}$ are the real network traffic and its predictor, respectively, and the norm runs over time slots $t$ for $SRE(n)$ and over OD flows $n$ for $TRE(t)$. Fig. 7 shows the SREs and TREs
of three approaches. The x-axis in Fig. 7(a) is the index of each OD flow, and it is arranged according to the
means of OD flows from the smallest to the largest ones. We can draw conclusions similar to those from Figs. 3 and 4. SRMF has the largest SREs, especially for small OD flows. The means of the SREs of CNN+LSTM, PCA and SRMF are 0.47, 1.03 and 22.01, respectively. Fig. 7(b) exhibits the TREs of the three approaches. Our method obtains the lowest TRE consistently, while SRMF shows the largest TRE among the three approaches. Fig. 8 displays the Cumulative Distribution Functions (CDF) of the SREs and TREs. From Fig. 8(a), for CNN+LSTM, PCA and SRMF, the SREs are less than 1.63, 1.80 and 15.38, respectively, for 90% of all the OD flows. Meanwhile, in Fig. 8(b), the TREs are less than 0.13, 0.25 and 0.39, respectively, for 80% of the time slots. Similarly, the means of the SREs of the three methods are 2.34, 6.63 and 4.92, as shown in Fig. 9. The TREs of our method exhibit some fluctuations, although they remain the lowest among the three approaches. These fluctuations of TRE can be exhibited clearly in Fig. 10(b).
[Fig. 8. CDFs of the SREs and TREs of CNN+LSTM, PCA and SRMF: (a) Spatial Relative Error; (b) Temporal Relative Error.]
[Fig. 9. SREs and TREs of CNN+LSTM, PCA and SRMF: (a) Spatial Relative Error versus Flow ID, from smallest to largest in mean; (b) Temporal Relative Error versus Time Slot Order.]
[Fig. 10. CDFs of the SREs and TREs of CNN+LSTM, PCA and SRMF: (a) Spatial Relative Error; (b) Temporal Relative Error.]
Moreover, we leverage the prediction bias and the relative sample Standard Deviation (SD) to evaluate
the availability, which are denoted by:
$$\begin{cases} bias(n) = \dfrac{1}{T} \displaystyle\sum_{t=1}^{T} \left(\hat{X}_{n,t} - X_{n,t}\right), \\[2ex] SD(n) = \sqrt{\dfrac{1}{T-1} \displaystyle\sum_{t=1}^{T} \left(error(n) - bias(n)\right)^2}, \end{cases} \qquad (20)$$
where $error(n) = \hat{X}_{n,t} - X_{n,t}$. Fig. 11 shows the biases and SDs of the three approaches. In Fig. 11(a), we sort
all OD flows in descending order according to their averages. From Fig. 11(a), we find that all the approaches show an under-estimate or over-estimate. In particular, CNN+LSTM has smaller biases than the others. Meanwhile, the biases decrease as the means of the OD flows decrease. The biases of SRMF are not consistently related to flow size, as they are in CNN+LSTM and PCA; it shows prominent under-estimates and over-estimates. For small OD flows, SRMF exhibits a remarkable under-estimate. A biased estimator may sometimes have a lower SD, so that its predictors are closer to the real values than those of an unbiased predictor. Thereby, we refer to the SD (or variance) of the bias for further evaluation. In Fig. 11(b), we find that CNN+LSTM and PCA have lower bias but higher SD. An approach with low bias and high SD tends to predict the long-term traffic [24]. Otherwise, when it has high bias and low SD, it is better suited to predicting the short-term traffic. Hence, CNN+LSTM and PCA are suitable for long-term traffic prediction, and SRMF for short-term traffic prediction. As mentioned before, SRMF is a matrix interpolation algorithm used to reconstruct the missing data in TM. Generally, the missing probability of each element is independent and identically distributed. To extend this matrix interpolation algorithm to predict network traffic, it assumes
[Fig. 11. Biases and SDs of CNN+LSTM, PCA and SRMF: (a) Bias versus Flow ID; (b) Bias versus SD in error.]
that the missing elements are a series of columns in TM. In this case, SRMF predicts an element from a snapshot of TM. Hence, it is good at predicting short-term traffic. On the contrary, PCA and CNN+LSTM predict a network traffic element by using a great number of snapshots of TM. For instance, the singular value decomposition is a matrix-oriented operation, and we use CNN as the first hidden layer and employ a matrix as the input of the deep architecture. As a result, PCA and our method are suitable for long-term traffic prediction. We can obtain a similar conclusion from the evaluations in Fig. 12.
Finally, we evaluate the performance improvement ratio for our approach versus PCA and SRMF. The
performance improvement ratio is expressed by:
$$PIR = \frac{\displaystyle\sum_{n=1}^{N^2} \sum_{t=1}^{T} \left| X_{n,t} - \hat{X}^{A}_{n,t} \right| - \displaystyle\sum_{n=1}^{N^2} \sum_{t=1}^{T} \left| X_{n,t} - \hat{X}^{B}_{n,t} \right|}{\displaystyle\sum_{n=1}^{N^2} \sum_{t=1}^{T} \left| X_{n,t} - \hat{X}^{A}_{n,t} \right|}, \qquad (21)$$
where $\hat{X}^{A}_{n,t}$ and $\hat{X}^{B}_{n,t}$ are the predictors obtained via algorithms A and B, respectively. According to Eq. (21), the performance improvement ratios of CNN+LSTM versus PCA and SRMF are 43.34% and 71.47% in Abilene. Moreover,
they are 60.93% and 64.54% in our testbed, respectively.
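For completeness, the evaluation metrics in Eqs. (19) and (21) can be computed with a few lines of NumPy, as sketched below (array shapes are illustrative; X and X_hat are $N^2 \times T$ matrices of real and predicted traffic):

```python
# Sketch of the evaluation metrics: SRE per flow, TRE per time slot (Eq. (19))
# and the performance improvement ratio of algorithm B over algorithm A (Eq. (21)).
import numpy as np

def sre(X, X_hat):
    # Spatial relative error of each OD flow (norm taken over time slots).
    return np.linalg.norm(X_hat - X, axis=1) / np.linalg.norm(X, axis=1)

def tre(X, X_hat):
    # Temporal relative error of each time slot (norm taken over OD flows).
    return np.linalg.norm(X_hat - X, axis=0) / np.linalg.norm(X, axis=0)

def pir(X, X_hat_a, X_hat_b):
    # Positive when algorithm B has a smaller absolute error than algorithm A.
    err_a = np.abs(X - X_hat_a).sum()
    err_b = np.abs(X - X_hat_b).sum()
    return (err_a - err_b) / err_a
```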
[Fig. 12. Biases and SDs of CNN+LSTM, PCA and SRMF in the testbed: (a) Bias versus Flow ID; (b) Bias versus SD in error.]
6 CONCLUSION
In this paper, we have designed a deep architecture for network traffic prediction in secure IoV backbone networks, which jointly extracts two features of TM, i.e., the temporal and spatio-temporal features. Convolutional and subsampling layers
are used for extracting the spatio-temporal features of TM. Meanwhile, an LSTM layer is leveraged to
capture the temporal features. To improve the real-time performance of the proposed deep architecture
for prediction, we propose a threshold-based deep architecture update policy. Furthermore, we propose
a Q-learning algorithm to gain this threshold, in which link loads and routing information are employed
to generate NMAE. In this case, the threshold can be calibrated to obtain an optimal tradeoff between
real-time performance and prediction error. The proposed approach is evaluated by real network traffic data
set from the Abilene backbone network and our constructed testbed. From these evaluations, the proposed
approach can track the trace of end-to-end network traffic precisely.
The 5G-enabled communication network, in which infrastructures are densely deployed, has been incorporated into IoVs to provide adequate communication resources. Hence, network traffic prediction approaches for large-scale and heterogeneous IoV backbone networks are necessary. In this case, the computational complexity of a prediction approach is significantly important for online network traffic prediction. Thereby, a lightweight network traffic prediction approach will be the main focus of our future work.
7 ACKNOWLEDGMENTS
This work was supported in part by the National Key R&D Program of China under Grant 2018YFE0206800,
in part by the National Natural Science Foundation of China under Grants 61936001, 61971084, 62001073
and 61701406, and in part by the Natural Science Foundation of Chongqing under Grants cstc2019jcyj-cxttX0002 and cstc2019jcyj-msxmX0208.
REFERENCES
[1] T. O. Adelani and A. S. Alfa. 2010. Hybrid Techniques for Large-Scale IP Traffic Matrix Estimation. In 2010 IEEE
International Conference on Communications. IEEE, 1–6.
[2] Nikolaos Athanasios Anagnostopoulos, Saad Ahmad, Tolga Arul, Daniel Steinmetzer, Matthias Hollick, and Stefan
Katzenbeisser. 2020. Low-Cost Security for Next-Generation IoT Networks. ACM Trans. Internet Technol. 20, 3,
Article 30 (Sept. 2020), 31 pages. https://doi.org/10.1145/3406280
[3] J. Andrusenko, R. L. Miller, J. A. Abrahamson, N. M. M. Emanuelli, R. S. Pattay, and R. M. Shuford. 2008. VHF
general urban path loss model for short range ground-to-ground communications. IEEE Transactions on Antennas and
Propagation 56, 10 (2008), 3302–3310.
[4] J. Contreras-Castillo, S. Zeadally, and J. A. Guerrero-Ibaez. 2018. Internet of Vehicles: Architecture, Protocols, and
Security. IEEE Internet of Things Journal 5, 5 (2018), 3701–3709.
[5] Giuseppe Faraci, Christian Grasso, and Giovanni Schembra. 2020. Fog in the Clouds: UAVs to Provide Edge Computing
to IoT Devices. ACM Trans. Internet Technol. 20, 3, Article 26 (Aug. 2020), 26 pages. https://doi.org/10.1145/3382756
[6] Qiang He, Xingwei Wang, Zhencheng Lei, Min Huang, Yuliang Cai, and Lianbo Ma. 2019. TIFIM: A Two-stage
Iterative Framework for Influence Maximization in Social Networks. Appl. Math. Comput. 354 (2019), 338 – 352.
https://doi.org/10.1016/j.amc.2019.02.056
[7] Z. Hu, Y. Qiao, and J. Luo. 2018. ATME: Accurate Traffic Matrix Estimation in Both Public and Private Datacenter
Networks. IEEE Transactions on Cloud Computing 6, 1 (2018), 60–73.
[8] B. Hussain, Q. Du, A. Imran, and M. A. Imran. 2020. Artificial Intelligence-Powered Mobile Edge Computing-Based
Anomaly Detection in Cellular Networks. IEEE Transactions on Industrial Informatics 16, 8 (2020), 4986–4996.
[9] H. Khelifi, S. Luo, B. Nour, H. Moungla, Y. Faheem, R. Hussain, and A. Ksentini. 2020. Named Data Networking in
Vehicular Ad Hoc Networks: State-of-the-Art and Challenges. IEEE Communications Surveys & Tutorials 22, 1 (2020),
320–351.
[10] T. Li, J. Yuan, and M. Torlak. 2018. Network Throughput Optimization for Random Access Narrowband Cognitive
Radio Internet of Things (NB-CR-IoT). IEEE Internet of Things Journal 5, 3 (June 2018), 1436–1448. https://doi.org/10.1109/JIOT.2017.2789217
[11] M. Liu, L. Liu, H. Song, Y. Hu, Y. Yi, and F. Gong. 2020. Signal Estimation in Underlay Cognitive Networks for
Industrial Internet of Things. IEEE Transactions on Industrial Informatics 16, 8 (2020), 5478–5488.
[12] X. Liu, L. Che, K. Gao, and Z. Li. 2020. Power System Intra-Interval Operational Security Under False Data Injection
Attacks. IEEE Transactions on Industrial Informatics 16, 8 (2020), 4997–5008.
[13] Marcin Luckner, Maciej Grzenda, Robert Kunicki, and Jaroslaw Legierski. 2020. IoT Architecture for Urban Data-
Centric Services and Applications. ACM Trans. Internet Technol. 20, 3, Article 29 (July 2020), 30 pages. https://doi.org/10.1145/3396850
[14] E. Maggiori, Y. Tarabalka, G. Charpiat, and P. Alliez. 2017. Convolutional Neural Networks for Large-Scale Remote-
Sensing Image Classification. IEEE Transactions on Geoscience and Remote Sensing 55, 2 (2017), 645–657. https://doi.org/10.1109/TGRS.2016.2612821
[15] L. Nie, Z. Ning, X. Wang, X. Hu, Y. Li, and J. Cheng. 2020. Data-Driven Intrusion Detection for Intelligent Internet
of Vehicles: A Deep Convolutional Neural Network-based Method. IEEE Transactions on Network Science and
Engineering (2020), 1–1.
[16] Z. Ning, K. Zhang, X. Wang, L. Guo, X. Hu, J. Huang, B. Hu, and R. Y. K. Kwok. 2020. Intelligent Edge Computing
in Internet of Vehicles: A Joint Computation Offloading and Caching Solution. IEEE Transactions on Intelligent
Transportation Systems (2020), 1–14.
[17] Z. Ning, K. Zhang, X. Wang, M. S. Obaidat, L. Guo, X. Hu, B. Hu, Y. Guo, B. Sadoun, and R. Y. K. Kwok. 2020.
Joint Computing and Caching in 5G-Envisioned Internet of Vehicles: A Deep Reinforcement Learning-Based Traffic
Control System. IEEE Transactions on Intelligent Transportation Systems (2020), 1–12.
[18] A. Omidvar and H. S. Shahhoseini. 2011. Intelligent IP traffic matrix estimation by neural network and genetic algorithm.
In 2011 IEEE 7th International Symposium on Intelligent Signal Processing. IEEE, 1–6.
[19] Carlo Puliafito, Enzo Mingozzi, Francesco Longo, Antonio Puliafito, and Omer Rana. 2019. Fog Computing for the
Internet of Things: A Survey. ACM Trans. Internet Technol. 19, 2, Article 18 (April 2019), 41 pages. https://doi.org/10.1145/3301443
[20] C. Qiu, Y. Zhang, Z. Feng, P. Zhang, and S. Cui. 2018. Spatio-Temporal Wireless Traffic Prediction With Recurrent
Neural Network. IEEE Wireless Communications Letters 7, 4 (Aug 2018), 554–557. https://doi.org/10.1109/LWC.2018.2795605
[21] M. Roughan, Y. Zhang, W. Willinger, and L. Qiu. 2012. Spatio-temporal compressive sensing and Internet traffic
matrices (extended version). IEEE Transactions on Networking 20, 3 (2012), 662–676.
[22] S. K. Singh and A. Jukan. 2018. Machine-learning-based prediction for resource (Re)allocation in optical data
center networks. IEEE/OSA Journal of Optical Communications and Networking 10, 10 (Oct 2018), D12–D28.
https://doi.org/10.1364/JOCN.10.000D12
[23] S. Sinha and C. S. R. Murthy. 2005. Information theoretic approach to traffic adaptive WDM networks. IEEE/ACM
Transactions on Networking 13, 4 (2005), 881–894.
[24] A. Soule, A. Lakhina, N. Taft, K. Papagiannaki, K. Salamatian, A. Nucci, M. Crovella, and C. Diot. 2005. Traffic
matrices: balancing measurements, inference and modeling. In Proceedings of SIGMETRICS 2005. IEEE, 362–373.
[25] D. A. Tedjopurnomo, Z. Bao, B. Zheng, F. Choudhury, and A. K. Qin. 2020. A Survey on Modern Deep Neural Network
for Traffic Prediction: Trends, Methods and Challenges. IEEE Transactions on Knowledge and Data Engineering
(2020), 1–1.
[26] D. Wang, J. Fan, Z. Xiao, H. Jiang, H. Chen, F. Zeng, and K. Li. 2019. Stop-and-Wait: Discover Aggregation Effect
Based on Private Car Trajectory Data. IEEE Transactions on Intelligent Transportation Systems 20, 10 (Oct 2019),
3623–3633. https://doi.org/10.1109/TITS.2018.2878253
[27] J. Wang, J. Tang, Z. Xu, Y. Wang, G. Xue, X. Zhang, and D. Yang. 2017. Spatiotemporal modeling and prediction
in cellular networks: A big data enabled deep learning approach. In IEEE INFOCOM 2017 - IEEE Conference on
Computer Communications. IEEE, 1–9.
[28] X. Wang, Z. Ning, S. Guo, and L. Wang. 2020. Imitation Learning Enabled Task Scheduling for Online Vehicular Edge
Computing. IEEE Transactions on Mobile Computing (2020), 1–1.
[29] J. Weng, J. Weng, Y. Zhang, W. Luo, and W. Lan. 2019. BENBI: Scalable and Dynamic Access Control on the
Northbound Interface of SDN-Based VANET. IEEE Transactions on Vehicular Technology 68, 1 (Jan 2019), 822–831.
https://doi.org/10.1109/TVT.2018.2880238
[30] D. Wu, Z. Jiang, X. Xie, X. Wei, W. Yu, and R. Li. 2020. LSTM Learning With Bayesian and Gaussian Processing for
Anomaly Detection in Industrial IoT. IEEE Transactions on Industrial Informatics 16, 8 (2020), 5244–5253.
[31] S. Yang, Y. Su, Y. Chang, and H. Hung. 2019. Short-Term Traffic Prediction for Edge Computing-Enhanced Autonomous
and Connected Cars. IEEE Transactions on Vehicular Technology 68, 4 (April 2019), 3140–3153. https://doi.org/10.1109/TVT.2019.2899125
[32] C. Zhang, H. Zhang, D. Yuan, and M. Zhang. 2018. Citywide Cellular Traffic Prediction Based on Densely Connected
Convolutional Neural Networks. IEEE Communications Letters 22, 8 (Aug 2018), 1656–1659. https://doi.org/10.1109/LCOMM.2018.2841832
[33] L. Zhang and Y. Liang. 2019. Joint Spectrum Sensing and Packet Error Rate Optimization in Cognitive IoT. IEEE
Internet of Things Journal 6, 5 (Oct 2019), 7816–7827. https://doi.org/10.1109/JIOT.2019.2907993
[34] Z. Zhao, W. Chen, X. Wu, P. C. Y. Chen, and J. Liu. 2017. LSTM network: a deep learning approach for short-term
traffic forecast. IET Intelligent Transport Systems 11, 2 (2017), 68–75.