Journal on Communications


Research on large-scale duplicate image retrieval for the Internet

  

  • Online: 2014-12-25 Published: 2014-12-15

Abstract: For typical social media applications on the Internet, a large-scale distributed duplicate image retrieval approach based on random projection and block DCT coefficients was proposed. Built on Hadoop, the approach used image signatures generated by random projection to query HBase efficiently and obtain a high-recall set of candidate images. To improve retrieval precision, block DCT coefficients were then used to further filter the candidate images. In experiments on 12 million images, the approach achieved a recall of 98%, a precision of 93.2%, and an average retrieval time of 6.7 s when H=2 and T=150.
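
The two-stage idea described in the abstract (a coarse, high-recall lookup by random-projection signature, followed by a finer block DCT filter) can be illustrated with a minimal single-machine sketch. This is not the authors' implementation: the Hadoop/HBase layer is replaced by an in-memory dictionary, and the function names, the 8x8 block size, the number of retained coefficients, and the thresholds `max_hamming` and `max_dct_dist` are all assumptions made for illustration. They are not claimed to correspond to the paper's H and T.

```python
import numpy as np


def make_planes(dim, n_bits, seed=0):
    """Draw n_bits random Gaussian hyperplanes in a dim-dimensional space."""
    rng = np.random.default_rng(seed)
    return rng.standard_normal((n_bits, dim))


def signature(vec, planes):
    """Binary signature: one sign bit per random projection.

    The vector is mean-centred first (an assumption of this sketch) so that
    overall brightness does not dominate every projection.
    """
    centred = vec - vec.mean()
    return (planes @ centred >= 0).astype(np.uint8)


def hamming(a, b):
    """Number of differing bits between two signatures."""
    return int(np.count_nonzero(a != b))


def dct_matrix(n=8):
    """Orthonormal DCT-II basis matrix of size n x n."""
    k = np.arange(n).reshape(-1, 1)
    i = np.arange(n).reshape(1, -1)
    m = np.cos(np.pi * (2 * i + 1) * k / (2 * n)) * np.sqrt(2.0 / n)
    m[0, :] = np.sqrt(1.0 / n)
    return m


def block_dct_features(img, block=8, keep=2):
    """Concatenate the keep x keep low-frequency DCT coefficients of each block."""
    h, w = img.shape
    d = dct_matrix(block)
    feats = []
    for r in range(0, h - block + 1, block):
        for c in range(0, w - block + 1, block):
            coeffs = d @ img[r:r + block, c:c + block] @ d.T
            feats.append(coeffs[:keep, :keep].ravel())
    return np.concatenate(feats)


def find_duplicates(query, database, planes, max_hamming, max_dct_dist):
    """Stage 1: keep images whose signature is close to the query's.
    Stage 2: keep candidates whose block DCT features are also close."""
    q_sig = signature(query.ravel(), planes)
    q_feat = block_dct_features(query)
    candidates = [
        name for name, img in database.items()
        if hamming(q_sig, signature(img.ravel(), planes)) <= max_hamming
    ]
    return [
        name for name in candidates
        if np.linalg.norm(q_feat - block_dct_features(database[name])) <= max_dct_dist
    ]
```

In this sketch, `database` is a dictionary mapping image names to equally sized 2-D grayscale arrays, and `planes` comes from `make_planes(height * width, n_bits)`. In a distributed setting the signatures would be computed once and stored as lookup keys rather than recomputed per query; that part is deliberately left out here.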
