电信科学 ›› 2023, Vol. 39 ›› Issue (3): 124-134.doi: 10.11959/j.issn.1000-0801.2023045

• 研究与开发 • 上一篇    下一篇

基于K-means聚类的子结构相关适配迁移学习方法

刘昊双1, 张永2,3, 曹莹波1   

  1. 1 辽宁师范大学计算机与信息技术学院,辽宁 大连 116081
    2 湖州师范学院信息工程学院,浙江 湖州 313000
    3 浙江省现代农业资源智慧管理与应用研究重点实验室,浙江 湖州 313000
  • 修回日期:2023-03-15 出版日期:2023-03-20 发布日期:2023-03-01
  • 作者简介:刘昊双(1996- ),女,辽宁师范大学硕士生,主要研究方向为数据挖掘、机器学习
    张永(1975- ),男,博士,湖州师范学院教授、博士生导师,主要研究方向为数据挖掘、智能计算
    曹莹波(1999- ),女,辽宁师范大学硕士生,主要研究方向为机器学习、智能计算
  • 基金资助:
    国家自然科学基金资助项目(61772252);辽宁省教育厅科学研究经费项目(LJKZ0965)

Substructure correlation adaptation transfer learning method based on K-means clustering

Haoshuang LIU1, Yong ZHANG2,3, Yingbo CAO1   

  1. 1 School of Computer &Information Technology, Liaoning Normal University, Dalian 116081, China
    2 School of Information Engineering, Huzhou University, Huzhou 313000, China
    3 Zhejiang Province Key Laboratory of Smart Management & Application of Modern Agricultural Resources, Huzhou 313000, China
  • Revised:2023-03-15 Online:2023-03-20 Published:2023-03-01
  • Supported by:
    The National Natural Science Foundation of China(61772252);Scientific Research Foundation of the Education Depart-ment of Liaoning Province(LJKZ0965)

摘要:

域漂移严重影响了传统机器学习方法的性能,现有的领域自适应方法主要通过全局、类级或样本级分布匹配自适应地调整跨域表示。但全局匹配和类级匹配过于粗糙会导致自适应不足,而样本级匹配受到噪声的影响可能导致过度自适应。基于此,提出了一种基于K均值(K-means)聚类的子结构相关适配(SCOAD)迁移学习算法,首先通过 K-means 聚类分别获得源域和目标域的多个子域,其次寻求子域中心二阶统计量的匹配,最后利用子域内结构对目标域样本进行分类。该方法在传统方法的基础上进一步提高了源域与目标域之间知识迁移的性能。在常用迁移学习数据集上的实验结果表明了所提方法的有效性。

关键词: 迁移学习, 领域自适应, 子结构适配, 聚类

Abstract:

Domain drifts severely affect the performance of traditional machine learning methods, and existing domain adaptive methods are mainly represented by adaptive adjustment cross-domain through global, class-level, or sample-level distribution adaptation.However, too coarse global matching and class-level matching can lead to insufficient adaptation, and sample-level adaptation to noise can lead to excessive adaptation.A substructure correlation adaptation (SCOAD) transfer learning algorithm based on K-means clustering was proposed.Firstly, multiple subdomains of the source domain and the target domain were obtained by K-means clustering.Then, the matching of the second-order statistics of the subdomain center was sought.Finally, the target domain samples were classified by using the subdomain structure.The proposed method approach further improves the performance of knowledge transfer between the source and target domains on top of the traditional approach.Experimental results on common transfer learning datasets show the effectiveness of the proposed method.

Key words: transfer learning, domain adaptation, substructural adaptation, clustering

No Suggested Reading articles found!