My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated
machine-learning deep-neural-networks artificial-intelligence transformer deeplearning sparse-neural-networks large-language-models q-sparse
Updated Aug 14, 2024 - Python