电信科学 ›› 2016, Vol. 32 ›› Issue (8): 82-89.doi: 10.11959/j.issn.1000-0801.2016197

• 研究与开发 • 上一篇    下一篇

一种适应于非完备标签数据和标签关联性的多标签分类方法

张丽娜1,2,戴灵鹏2,匡泰1   

  1. 1 浙江安防职业技术学院信息工程系,浙江 温州 325016
    2 温州大学生命与环境科学学院,浙江 温州 325035
  • 出版日期:2016-08-20 发布日期:2017-04-26
  • 基金资助:
    浙江省教育科学规划基金资助项目;浙江省自然科学基金资助项目

A multi-label classification method for disposing incomplete labeled data and label relevance

Lina ZHANG1,2,Lingpeng DAI2,Tai KUANG1   

  1. 1 Department of Information Engineering,Zhejiang College of Security Technology,Wenzhou 325016,China
    2 College of Life and Environmental Science,Wenzhou University,Wenzhou 325035,China
  • Online:2016-08-20 Published:2017-04-26
  • Supported by:
    Education Science Department Foundation of Zhejiang Province;The Natural Science Foundation of Zhejiang Province of China

摘要:

多标签分类已在很多领域得到了实际应用,所用标签大多具有很强的关联性,甚至存在非完备标签或部分标签遗失。然而,现有的多标签分类算法难以同时处理这两种情况。基于此,提出一种新的概率模型处理方法,实现同时对具有标签关联性和遗失标签情况进行多标签分类。该方法可以自动获知和掌握多标签的关联性。此外,通过整合遗失的标签信息,该方法能够提供一个自适应策略来处理遗失的标签。在完备标签和非完备标签的数据上进行实验,结果表明,与现有的多标签分类算法相比,提出的方法得到了较好的分类预测评价值。

关键词: 非完备标签, 标签关联性, 多标签分类, 概率模型

Abstract:

Multi-label classification methods have been applied in many real-world fields,in which the labels may have strong relevance and some of them even are incomplete or missing.However,existing multi-label classification algorithms are unable to handle both issues simultaneously.A new probabilistic model that can automatically learn and exploit multi-label relevance was proposed on label relevance and missing label classification simultaneously.By integrating out the missing information,it also provides a disciplined approach to handle missing labels.Experiments on a number of real world data sets with both complete and incomplete labels demonstrated that the proposed method can achieve higher classification and prediction evaluation scores than the existing multi-label classification algorithms.

Key words: incomplete label, label relevance, multi-label classification, probabilistic model

No Suggested Reading articles found!