通信学报 ›› 2021, Vol. 42 ›› Issue (11): 121-132.doi: 10.11959/j.issn.1000-436x.2021186

• 学术论文 • 上一篇    下一篇

基于实例结构的不完备多标签学习

陈天柱1,2,3, 李凤华1,2, 郭云川1,2, 李子孚1   

  1. 1 中国科学院信息工程研究所,北京 100093
    2 中国科学院大学网络空间安全学院,北京 100049
    3 中国电子科技集团公司信息科学研究院,北京 100086
  • 修回日期:2021-10-09 出版日期:2021-11-25 发布日期:2021-11-01
  • 作者简介:陈天柱(1987− ),男,河北秦皇岛人,博士,中国电子科技集团公司工程师,主要研究方向为自然语言处理
    李凤华(1966− ),男,湖北浠水人,博士,中国科学院信息工程研究所研究员、博士生导师,主要研究方向为网络与系统安全、信息保护、隐私计算
    郭云川(1977− ),男,四川营山人,博士,中国科学院信息工程研究所正高级工程师、博士生导师,主要研究方向为访问控制、网络安全
    李子孚(1992− ),女,内蒙古赤峰人,博士,中国科学院信息工程研究所工程师,主要研究方向为网络与系统安全、访问控制
  • 基金资助:
    国家重点研发计划基金资助项目(2018YFB2100402);国家自然科学基金资助项目(U1836203);山东省重点研发计划基金资助项目(2019JZY020127)

Instance structure based multi-label learning with missing labels

Tianzhu CHEN1,2,3, Fenghua LI1,2, Yunchuan GUO1,2, Zifu LI1   

  1. 1 Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China
    2 School of Cyber Security, University of Chinese Academy of Sciences, Beijing 100049, China
    3 Information Science Academy of China Electronics Technology Group Corporation, Beijing 100086, China
  • Revised:2021-10-09 Online:2021-11-25 Published:2021-11-01
  • Supported by:
    The National Key Research and Development Program of China(2018YFB2100402);The National Natural Science Foundation of China(U1836203);ShanDong Provincial Key Research and Development Program(2019JZY020127)

摘要:

针对现有标签缺失下多标签学习方案未能有效解决标签缺失的问题,提出了基于实例结构的不完备多标签学习方案,考虑实例特征和标签结构特点,利用数据标签向量几何相似度来补全缺失标签,利用加权排序来降低正关系学为负关系所带来的模型偏差,并利用低秩结构来俘获模型低秩结构。具体地,通过确保数据预测标签几何相似度与数据标签几何相似度的一致性来俘获数据流型结构;通过度量完备标签下和不完备标签下的排序损失来区分标签与实例的相关程度。实验结果表明,所提方案优于典型的标签缺失下的多标签学习方案,甚至在一些评估标准下其精度比最好对比方案提升了10%以上。

关键词: 多标签学习, 低秩结构, 流型正则, 标签排序

Abstract:

To address the problem that the existing methods in multi-label learning did not efficiently deal with the problems, the instance structure based multi-label learning scheme with missing labels was proposed.By considering the feature and label structure of instance, the similarity of label vectors were exploit to fill the missing labels and the weight rank loss was exploit to reduce the model bias.Meanwhile, the weight rank loss was also exploit to reduce the model bias.More specially, the manifold structure was capture by forcing the consistency of the geometry similarity of labels and one of the predicted labels.By measuring ranking loss for complete labels and incomplete labels, the relevance of label was distinguish to instance.Experiment results show that the superior performances of the proposed approach compared with the state-of-the-art methods and the accuracy is improved by more than 10% compared with the best comparison scheme under some evaluation criteria.

Key words: multi-label learning, low rank structure, manifold structure, label ranking

中图分类号: 

No Suggested Reading articles found!