Big Data Research ›› 2021, Vol. 7 ›› Issue (2): 123-146. doi: 10.11959/j.issn.2096-0271.2021017

• TOPIC: VIRTUAL DATA SPACE FOR HIGH-PERFORMANCE COMPUTING •

GVDS: a global virtual data space for wide-area high-performance computing environments

Limin XIAO1,2, Yao SONG1,2, Guangjun QIN3, Hanjie ZHOU1,2, Chaobo WANG1,2, Bing WEI1,2, Wei WEI4, Zhisheng HUO1,2   

    1 School of Computer Science and Engineering, Beihang University, Beijing 100191, China
    2 State Key Laboratory of Software Development Environment, Beijing 100191, China
    3 Smart City College, Beijing Union University, Beijing 100101, China
    4 School of Computer Science and Engineering, Xi’an University of Technology, Xi’an 710048, China
  • Online: 2021-03-15 Published: 2021-03-01
  • Supported by:
    The National Key Research and Development Program of China (2018YFB0203901)

Abstract:

The wide-area high-performance computing environment is core information infrastructure that supports technological innovation, economic development, and national defense. However, storage resources in such environments are heterogeneous and geographically distributed, which creates barriers between applications and data, so the requirements of unified data management and efficient data access cannot be met. To address this, a method for establishing a virtual data space and a data access optimization method are presented, and a global virtual data space (GVDS) for wide-area high-performance computing environments is implemented. GVDS aggregates geographically distributed, heterogeneous storage resources into a unified virtual data space that provides unified and efficient data access, enabling the sharing and collaborative processing of geographically distributed data in wide-area environments. Experimental results indicate that, compared with state-of-the-art wide-area storage systems in the field of high-performance computing, such as OneData and GFFS, GVDS offers similar functionality and significantly improves read bandwidth.

Key words: global virtual data space, wide-area high-performance computing environment, efficient data access, heterogeneous storage resource

