Journal of University of Science and Technology of China ›› 2018, Vol. 48 ›› Issue (1): 47-56.DOI: 10.3969/j.issn.0253-2778.2018.01.007

• Original Paper • Previous Articles     Next Articles

A trajectory data density partition based distributed parallel clustering method

WANG Jiayu, ZHANG Zhenyu, CHU Zheng, WU Xiaohong   

  1. 1. School of Software, Xinjiang University, Urumqi 830008;
    2. College of Information Science and Engineering, Xinjiang University, Urumqi 830046)
  • Received:2017-05-20 Revised:2017-06-23 Online:2018-01-01 Published:2018-01-01

Abstract: The development of global positioning technology and location-based service have contributed to the development of trajectory big data. Trajectory clustering is one of the most important trajectory analysis tasks and has been extensively studied. Currently, most of the clustering methods operate in a single-processor mode, and large-scale trajectory data processing is a lengthy process, making it difficult to meet the strong timeliness of the trajectory analysis task. To solve the problem, a distributed parallel clustering method based on trajectory density partition is proposed. Firstly, the whole dataset is abstracted in a rectangular region, and the dataset is divided into several partitions with tasks that have almost the same amount by the transformation of the longest dimension of the rectangle, thus constructing the local datasets for distributed parallel clustering. Then the worker servers implement the DBSCAN clustering algorithm for the local partitions respectively, and the manager server merges and integrates the local clustering results. The experimental results show that the algorithm is effective and improves the computational rate of clustering analysis to a certain degree.

Key words: trajectory big data, distributed clustering, DBSCAN algorithm, clustering algorithm

CLC Number: