基于局部张量填充的网络流量数据恢复
首发时间:2019-04-16
摘要:随着网络规模的持续扩大,监测网络中所有节点间的流量数据是不现实的,利用部分测量数据推断出整体网络流量数据成为网络工程中越来越重要的任务之一。张量填充是一种有前景的技术,利用多维数据结构特征,可以更准确地推断出丢失数据。在本文中提出了一种新颖的局部张量填充模型。通过利用局部数据来形成和恢复每个具有低秩结构的子张量,从而提高数据恢复精度。为了克服研究中的挑战,提出了几种新技术,包括基于局部敏感哈希(LSH)有效计算候选锚点,选择适当的锚点来构建子张量,利用因子矩阵进行编码以便于在数据缺失的张量中发现相似性较高的元素,以及相似性较高的局部张量填充和数据融合。最后使用真实数据集进行了实验,实验结果表明在同等采样率和秩填充的情况下,该填充算法与经典的张量平行因子(CP)分解相比,采样数据相对误差、推断数据的误差大约分别降低了7\%、3\%。本论文的研究成果可以用来更加精准的恢复网络流量数据。
For information in English, please click here
Network traffic data recovery based on local tensor completion
Abstract:As the network scale continues to expand, it is unrealistic to monitor the traffic data of all nodes in the network. Inferring the overall network traffic data from some measurement data becomes one of the more and more important tasks in the project. With multidimensional data structure features, tensor completion is a promising technique to infer lost data more accurately. A novel local tensor completion model is proposed in this paper. The data recovery accuracy is improved by using local data to form and recover each of the local tensor with a low rank structure. In order to overcome the challenges in the research, several new techniques are proposed, including efficient calculation of candidate anchors based on local sensitive hash (LSH), selection of appropriate anchor points to construct sub-tensors, and coding of factor matrices to facilitate data loss. The local tensor was found with similar elements, as well as local tensor completion and data fusion . Finally, experiments were carried out using the real data sets. The experimental results show that the relative error of the sampled data and the error of infered data are compared with the classical tensor balance factor (CP) decomposition in the case of the same sampling rate and rank, reduced by 7\% and 3\% respectively. The research results of this paper can be used to recover network traffic data more accurately.
Keywords: Tensor completion Network measurement data Local sensitive hash
基金:
引用
No.****
同行评议
勘误表
基于局部张量填充的网络流量数据恢复
评论
全部评论