Data-Algorithm Co-Design for Balancing Accuracy and Efficiency in LargeLanguage Models by Dr. Zhaozhuo Xu
Abstract: As large language models (LLMs) grow in size, recentalgorithmic methods such as quantization, sparsification, and approximate gradient estimation aim to reduce memory and computation costs. However, these gains often come at the expense of accuracy. This talk introduces a data-algorithm co-design approach to address […]