Big Data Research ›› 2024, Vol. 10 ›› Issue (1): 1-8. doi: 10.11959/j.issn.2096-0271.2024016
• STRATEGY RESEARCH •
Weimin ZHENG
Abstract:
Three types of computer systems support large model training. Among them, the software ecosystem for systems based on domestic AI chips is still weak. Changing this requires developing 10 key pieces of software, such as AI compilers and parallel acceleration software. In addition, supercomputer-based systems need careful hardware-software co-design to serve large model training well. This article proposes a four-point balanced design for building large model infrastructure to ensure system performance, reliability, and scalability.
Key words: large model training, computer system, supercomputing system, large model infrastructure
CLC Number: TP319
Weimin ZHENG. Four issues to consider in building a computer system supporting large model training[J]. Big Data Research, 2024, 10(1): 1-8.
URL: https://www.infocomm-journal.com/bdr/EN/10.11959/j.issn.2096-0271.2024016
https://www.infocomm-journal.com/bdr/EN/Y2024/V10/I1/1