An efficient data processing framework for mining the massive trajectory of moving objects

Yuanchun Zhou, Yang Zhang, Yong Ge, Zhenghua Xue, Yanjie Fu, Danhuai Guo, Jing Shao, Tiangang Zhu, Xuezhi Wang, Jianhui Li

Research output: Contribution to journalArticlepeer-review

16 Scopus citations

Abstract

Recently, there has been increasing development of positioning technology, which enables us to collect large scale trajectory data for moving objects. Efficient processing and analysis of massive trajectory data has thus become an emerging and challenging task for both researchers and practitioners. Therefore, in this paper, we propose an efficient data processing framework for mining massive trajectory data. This framework includes three modules: (1) a data distribution module, (2) a data transformation module, and (3) a high performance I/O module. Specifically, we first design a two-step consistent hashing algorithm, which takes into account load balancing, data locality, and scalability, for a data distribution module. In the data transformation module, we present a parallel strategy of a linear referencing algorithm with reduced subtask coupling, easy-implemented parallelization, and low communication cost. Moreover, we propose a compression-aware I/O module to improve the processing efficiency. Finally, we conduct a comprehensive performance evaluation on a synthetic dataset (1.114 TB) and a real world taxi GPS dataset (578 GB). The experimental results demonstrate the advantages of our proposed framework.

Original languageEnglish (US)
Pages (from-to)129-140
Number of pages12
JournalComputers, Environment and Urban Systems
Volume61
DOIs
StatePublished - Jan 1 2017
Externally publishedYes

Keywords

  • Big data
  • Compression contribution model
  • Parallel linear referencing
  • Trajectory of moving object
  • Two step consistent hashing

ASJC Scopus subject areas

  • Geography, Planning and Development
  • Ecological Modeling
  • General Environmental Science
  • Urban Studies

Fingerprint

Dive into the research topics of 'An efficient data processing framework for mining the massive trajectory of moving objects'. Together they form a unique fingerprint.

Cite this