Computer and Modernization (计算机与现代化)

• Network and Communication

A Similarity Measurement Method for Heterogeneous Information Networks

  

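  1. (Department of Electromechanical Engineering, Changzhou Textile Garment Institute, Changzhou, Jiangsu 213164)
  • Received: 2015-04-30 Published: 2016-03-17 Released: 2016-03-17
  • About the author: Wu Zhuanhua (伍转华, born 1978), male, from Wujin, Jiangsu; lecturer in the Department of Electromechanical Engineering, Changzhou Textile Garment Institute; master's degree; research interests: database technology, data stream mining, and network technology.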

A Measurement of Similarity Search on Heterogeneous Information Networks

  1. (Department of Electromechanical Engineering, Changzhou Textile Garment Institute, Changzhou 213164, China)
  • Received: 2015-04-30 Online: 2016-03-17 Published: 2016-03-17

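Abstract: The emergence of large-scale, interconnected, multi-typed heterogeneous information networks, such as social networks and bibliographic index networks, poses many challenges for similarity search, and the similarity measure is one of the key problems. Existing similarity measures designed for homogeneous networks do not take into account the different semantics of the multiple paths in a network. This paper proposes a new meta-path-based similarity measure that can search for objects of the same type in a heterogeneous network. A meta path is a path composed of a sequence of relations defined between different object types, and it can provide a common foundation for similarity search engines on such networks. Experiments on real data sets show that, compared with unordered similarity measures, the proposed method supports fast path-similarity queries and can be widely applied in social networks and e-commerce.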

Keywords: similarity search, social networks, meta path, heterogeneous networks

Abstract: The emergence of large-scale, interconnected, multi-typed networks such as social networks and bibliographic networks raises many challenges for similarity search, among which the similarity measure is a key issue. Existing similarity measures, which were designed for homogeneous networks, do not consider the different semantics of the multiple paths in a network. This paper proposes a novel similarity measure based on meta paths, which can search for objects of the same type in heterogeneous networks. A meta path is a path consisting of a sequence of relations defined between different object types, and it provides a common basis for similarity search engines on such networks. Experiments on real data sets show that, compared with unordered similarity measures, the proposed method supports fast path-similarity queries, and it can be widely used in social networks and e-commerce.

Key words: similarity search, social networks, meta path, heterogeneous network
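
To make the meta-path idea in the abstract concrete, the following is a minimal Python sketch, not the paper's own algorithm. The abstract does not state the similarity formula, so the sketch assumes a PathSim-style normalization (twice the number of connecting path instances, divided by the sum of each object's self-path count) over the meta path Author-Paper-Author. The toy data and the names `PAPER_AUTHORS`, `count_apa_paths`, and `meta_path_similarity` are hypothetical.

```python
# Hypothetical toy bibliographic network: each paper maps to its authors.
PAPER_AUTHORS = {
    "p1": ["alice", "bob"],
    "p2": ["alice", "bob"],
    "p3": ["alice", "carol"],
    "p4": ["dave"],
}


def count_apa_paths(paper_authors):
    """Count path instances along the meta path Author-Paper-Author.

    counts[(a, b)] is the number of papers that link author a to author b;
    counts[(a, a)] is therefore the number of papers written by a.
    """
    counts = {}
    for authors in paper_authors.values():
        for a in authors:
            for b in authors:
                counts[(a, b)] = counts.get((a, b), 0) + 1
    return counts


def meta_path_similarity(counts, a, b):
    """Assumed PathSim-style score: 2 * paths(a, b) / (paths(a, a) + paths(b, b))."""
    denom = counts.get((a, a), 0) + counts.get((b, b), 0)
    if denom == 0:
        return 0.0  # neither author appears in the network
    return 2 * counts.get((a, b), 0) / denom


COUNTS = count_apa_paths(PAPER_AUTHORS)

if __name__ == "__main__":
    for other in ["bob", "carol", "dave"]:
        print(other, meta_path_similarity(COUNTS, "alice", other))
```

Under this assumed normalization, two authors score highly only when they share many papers relative to their own output, which is one way a meta path can carry semantics that a type-blind measure on a homogeneous network would miss. A different meta path (for example Author-Paper-Venue-Paper-Author) would yield a different notion of similarity over the same data.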
