Abstract: Modern deep neural networks, particularly recent large language models, come with massive model sizes that require significant computational and storage ...