中国科学技术大学学报 ›› 2019, Vol. 49 ›› Issue (2): 149-158.DOI: 10.3969/j.issn.0253-2778.2019.02.010

• 原创论文 • 上一篇    下一篇

一种建模社交化点过程序列预测算法

江海洋   

  1. 1.太原理工大学信息与计算机学院,山西晋中 030600;2.太原理工大学大数据学院,山西晋中 030600
  • 收稿日期:2018-09-21 修回日期:2018-12-04 出版日期:2019-02-28 发布日期:2019-02-28
  • 通讯作者: 王莉
  • 作者简介:江海洋,女,1993年生,硕士生. 机器学习、序列预测. E-mail: 2541529703@qq.com
  • 基金资助:
    国家自然科学基金(61872260),山西省重点研发计划国际合作项目(201703D421013)资助.

A modeling socialization point process sequence prediction algorithm

JIANG Haiyang   

  1. 1.College of Information and Computer,Taiyuan University Of Technology, Jinzhong 030600,China; 2.College of Big Data ,Taiyuan University Of Technology, Jinzhong 030600,China
  • Received:2018-09-21 Revised:2018-12-04 Online:2019-02-28 Published:2019-02-28

摘要: 根据序列数据预测下次事件类型和时间是一个值得研究的课题.目前点过程强度函数算法仅从时间维度考虑背景知识和历史影响两个方面,没有从空间维度加入社交关系的影响.针对该问题,提出基于时空深度网络的社交化点过程的序列预测算法(SPSP算法).该模型首先运用双LSTM(long short-term memory)分别建模强度函数的背景知识和历史影响;然后经过联合层将双LSTM输出合并,生成事件类型和时间向量表征;最后在空间维度上加入社交关系网络影响,优化强度函数.通过深度时空社交网络的多次训练,得到最优网络模型.在新浪微博数据集上的实验验证算法的有效性,证明社交化点过程序列预测算法可高效准确预测出事件类型与时间.

关键词: 序列预测, 社交网络, 点过程, LSTM, 时空深度网络

Abstract: Predicting the type and time of the next event according to the sequence data is a subject worth studying.At present, the point process intensity function only considers the background knowledge and historical influence from the time dimension, and has no influence on the social relations from the spatial dimension.Aiming at this problem, a sequence prediction algorithm (SPSP algorithm) is proposed based on the spatio-temporal deep network.In this model, firstly the background knowledge and historical influence of the intensity function are modeled with the dual LSTM (long short-term memory).Then the output of two LSTMs are combined by the union layer to generate the vector representation of event type and time.Finally, the influence of social networks on the spatial dimension is added to optimize the intensity function.Through multiple training of the Spatio-temporal deep neural network, the optimal network model is obtained.Sina weibo data sets are used to verify the validity of the algorithm, and it has been proven by experiments that the proposed algorithm can predict the event type and time efficiently accurately.

Key words: sequence prediction, social networking, point process, LSTM, deep time-space network