Yannic Kilcher
TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters (Paper Explained)