中国科学技术大学学报 ›› 2015, Vol. 45 ›› Issue (10): 813-821.DOI: 10.3969/j.issn.0253-2778.2015.10.003

• 论著 • 上一篇    

基于ADMD融合策略的海洋大数据索引技术研究

黄冬梅,孙乐,赵丹枫   

  1. 上海海洋大学信息学院,上海 2 01306
  • 收稿日期:2015-08-27 修回日期:2015-09-29 接受日期:2015-09-29 出版日期:2015-09-29 发布日期:2015-09-29
  • 通讯作者: 黄冬梅
  • 作者简介:黄冬梅(通讯作者),女,1964年生,教授,研究方向:数据挖掘. E-mail:dmhuang@shou.edu.cn
  • 基金资助:
    国家自然科学基金项目(61272098);上海市自然科学基金项目(13ZR1455800)

A composite index strategy for big marine data based on adaptive method of data merging strategy

HUANG Dongmei, SUN Le, ZHAO Danfeng   

  1. College of Information, Shanghai Ocean University, Shanghai 201306
  • Received:2015-08-27 Revised:2015-09-29 Accepted:2015-09-29 Online:2015-09-29 Published:2015-09-29

摘要: 海洋数据具有多源、多类、多维、海量等特点,是一种典型的大数据,海洋大数据上的快速查询是该领域各类应用的基本需求.提高查询速度的关键是建立一个完善的索引结构,为此提出了一种基于时间间隔B+-tree和HSP-tree的多层索引架构ML-index(multi-layer index),分别制定样本驱动的数据融合机制(adaptive method of data merging strategy)以确定分布式时态数据分区;并基于海洋数据特性、数据单元饱和度等,提出了一种自适应空间划分方法(adaptive space partition),在此基础上建立HSP-tree作为辅助索引.实验验证在海洋数据模式下,提出的多层索引结构保证了海洋数据的查询速度,逼近线性的时间复杂度.

关键词: 海洋大数据, 时间间隔B+-tree索引, 自适应空间划分, AMDM

Abstract: Marine data fall easily into category of Big Data. A basic requirement for various marine monitoring applications is quick retrieval and the establishment of a sound index structure is of great importance. A multi-layer index (ML-index, for short) with regard to time interval B+-tree and hybrid space partition tree (HSP-tree, for short) was proposed. It employs the adaptive method of data merging strategy to optimize the primary key index (i.e. B+-tree). An adaptive space partition method was also proposed on the basis of data characteristics, and data unit capacity particular, for building secondary index, namely, HSP-tree. The experiment result shows that ML-index saves about 2/3 of the time in comparison with two state-of-the-art index methods.

Key words: marine data value function, time interval B+-tree, adaptive space partition, HSP-tree

中图分类号: