中国科学技术大学学报 ›› 2018, Vol. 48 ›› Issue (4): 275-283.DOI: 10.3969/j.issn.0253-2778.2018.04.002

• 论著 • 上一篇    下一篇

MapReduce环境下基于概念分层的概念格并行构造算法

蔡勇,陈红梅,   

  1. 1.西南交通大学信息科学与技术学院,四川成都 611756;
    2.四川省云计算与智能技术高校重点实验室,四川成都 611756
  • 收稿日期:2017-05-23 修回日期:2017-06-24 出版日期:2018-04-30 发布日期:2018-04-30
  • 通讯作者: 陈红梅
  • 作者简介:蔡 勇,男,1990年生,硕士生. 研究方向:数据挖掘. E-mail: yongcai@my.swjtu.edu.cn
  • 基金资助:
    国家自然科学基金(61572406)资助.

A parallel algorithm for constructing concept lattice based on hierarchical concept under MapReduce

CAI Yong, CHEN Hongmei,   

  1. 1. School of Information Science and Technology, Southwest Jiaotong University, Chengdu 611756, China
    2. Key Laboratory of Cloud Computing and Intelligent Technology, Chengdu 611756, China)
  • Received:2017-05-23 Revised:2017-06-24 Online:2018-04-30 Published:2018-04-30

摘要: 概念格是形式概念分析中的核心数据结构.对此提出运用划分分治和分层约束的方法研究MapReduce框架下概念格并行生成算法以有效地构造概念格.将形式背景按对象划分成外延独立子背景后并行计算子背景上的临时概念,融合各节点临时概念形成全局概念.全局概念按照各概念外延基数进行分层,通过分层约束计算概念父子节点的搜索范围和并行搜索各层概念的父子节点,进而构建概念格.算法基于MapReduce框架实现并在公共数据集上进行测试,实验结果表明,基于概念分层方法的概念格并行构造算法能够对大数据形式背景有效地进行处理.

关键词: 概念格, 概念分层, 并行计算, MapReduce

Abstract: Concept lattice is the core data structure of formal concept analysis. #br##br#A parallel algorithm is focused on for constructing concept lattice under the framework of MapReduce using the methods of divide and conquer based on partition and constrains in layers which aim to construct concept lattice effectively. Firstly, sub-formal contexts are formed by partitioning the formal context by objects and the concepts in each sub-formal context are calculated. Then the global concept is formed by merging concepts in different nodes. Next, different layers of concepts are formed by partitioning the global concept. Finally, constraints in different layers are used to compute the scope of search and concept lattice is constructed by searching and merging parent-son nodes in different layers of concepts. The proposed algorithm is realized in the framework of MapReduce. Extensive experiments carried out on public datasets verify the effectiveness of the parallel algorithm based on concept layer to deal with the formal context in big data.

Key words: concept lattice, hierarchical concept, parallel computing, MapReduce

中图分类号: