Journal on Communications ›› 2018, Vol. 39 ›› Issue (8): 69-82.doi: 10.11959/j.issn.1000-436x.2018148

• Artificial Intelligence and Network Security • Previous Articles     Next Articles

DeepRD:LSTM-based Siamese network for Android repackaged applications detection

Run WANG1,2,Benxiao TANG1,2,Li’na WANG1,2()   

  1. 1 Key Laboratory of Aerospace Information Security and Trusted Computing Ministry of Education,Wuhan University,Wuhan 430072,China
    2 School of Cyber Science and Engineering,Wuhan University,Wuhan 430072,China
  • Revised:2018-06-28 Online:2018-08-01 Published:2018-09-13
  • Supported by:
    The National Natural Science Foundation of China(U1536204);The Central University Basic Business Expenses Special Funding for Scientific Research Project(2042018kf1028);The National High Technology Research and Development Program of China(2015AA016004)

Abstract:

The state-of-art techniques in Android repackaging detection relied on experts to define features,however,these techniques were not only labor-intensive and time-consuming,but also the features were easily guessed by attackers.Moreover,the feature representation of applications which defined by experts cannot perform well to the common types of repackaging detection,which caused a high false negative rate in the real detection scenario.A deep learning-based repackaged applications detection approach was proposed to learn the program semantic features automatically for addressing the above two issues.Firstly,control and data flow analysis were taken for applications to form a sequence feature representation.Secondly,the sequence features were transformed into vectors based on word embedding model to train a Siamese LSTM network for automatically program feature learning.Finally,repackaged applications were detected based on the similarity measurement of learned program features.Experimental results show that the proposed approach achieves a precision of 95.7% and false negative rate of 6.2% in an open sourced dataset AndroZoo.

Key words: repackaging, deep learning, Siamese network, LSTM, security and privacy

CLC Number: 

No Suggested Reading articles found!