[1] |
PINTO C , GKOUFAS Y , REALE A ,et al. Hoard:a distributed data caching system to accelerate deep learning training on the cloud[J]. arXiv preprint,2018,arXiv:1812.00669.
|
[2] |
KUMAR A V , SIVATHANU M . Quiver:an informed storage cache for deep learning[C]// Proceedings of the 18th USENIX Conference on File and Storage Technologies (FAST’20). Berkeley:USENIX Association, 2020: 283-296.
|
[3] |
WANG L P , YE S G , YANG B C ,et al. DIESEL:a dataset-based distributed storage and caching system for largescale deep learning training[C]// Proceedings of the 49th International Conference on Parallel Processing. New York:ACM Press, 2020: 1-11.
|
[4] |
ABADI M , AGARWAL A , BARHAM P ,et al. TensorFlow:large-scale machine learning on heterogeneous distributed systems[J]. arXiv preprint,2016,arXiv:1603.04467.
|
[5] |
ABADI M , BARHAM P , CHEN J M ,et al. TensorFlow:a system for large-scale machine learning[J]. arXiv preprint,2016,arXiv:1605.08695.
|
[6] |
PATARASUK P , YUAN X . Bandwidth optimal all-reduce algorithms for clusters of workstations[J]. Journal of Parallel and Distributed Computing, 2009,69(2): 117-124.
|
[7] |
LI Z W , YAN Y L , MO J T ,et al. Performance optimization of in-memory file system in distributed storage system[C]// Proceedings of the 2017 International Conference on Networking,Architecture,and Storage. Piscataway:IEEE Press, 2017.
|
[8] |
LI H Y , GHODSI A , ZAHARIA M ,et al. Tachyon:reliable memory speed storage for cluster computing frameworks[C]// Proceedings of the ACM Symposium on Cloud Computing. New York:ACM Press, 2014: 1-15.
|
[9] |
CHANG X , ZHA L . The performance analysis of cache architecture based on Alluxio over virtualized infrastructure[C]// Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops. Piscataway:IEEE Press, 2018: 515-519.
|
[10] |
HE K M , ZHANG X Y , REN S Q ,et al. Deep residual learning for image recognition[C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE Press, 2016.
|
[11] |
DENG J , DONG W , SOCHER R ,et al. ImageNet:a large-scale hierarchical image database[C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE Press, 2009.
|
[12] |
SERGEEV A , BALSO M D . Horovod:fast and easy distributed deep learning in TensorFlow[J]. arXiv preprint,2018,arXiv:1802.05799.
|
[13] |
LIU Z X , BAI Z H , LIU Z M ,et al. DistCache:provable load balancing for large-scale storage systems with distributed caching[C]// Proceedings of the 17th USENIX Conference on File and Storage Technologies. Berkeley:USENIX Association, 2019: 143-157.
|
[14] |
DONG W J , WEN D X , ZHANG Z . Optimization of cache strategy based on Alluxio remote scenario[J]. Application Research of Computers, 2018,35(10): 3025-3028.
|
[15] |
杨青霖, 吴桂勇, 张广艳 . 分布式存储系统中的数据高效缓存方法[J]. 大数据, 2021,7(2): 147-157.
|
|
YANG Q L , WU G Y , ZHANG G Y . An approach to buffering data efficiently in distributed storage systems[J]. Big Data Research, 2021,7(2): 147-157.
|