You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardexpand all lines: README.md
+41
Original file line number
Diff line number
Diff line change
@@ -276,6 +276,47 @@ indices = quantizer(x)
276
276
277
277
This repository should also automatically synchronizing the codebooks in a multi-process setting. If somehow it isn't, please open an issue. You can override whether to synchronize codebooks or not by setting `sync_codebook = True | False`
278
278
279
+
### Sim VQ
280
+
281
+
<imgsrc="./images/simvq.png"width="400px"></img>
282
+
283
+
A <ahref="https://arxiv.org/abs/2411.02038">new ICLR 2025 paper</a> proposes a scheme where the codebook is frozen, and the codebook is implicitly generated through a linear projection. The author claims this setup leads to less codebook collapse as well as easier convergence. I have found this to perform even better when paired with <ahref="https://arxiv.org/abs/2410.06424">rotation trick</a> from Fifty et al., and expanding the linear projection to a small one layer MLP. You can experiment with it as so
0 commit comments