RWKV 5: 1.5B LLM Model With No Attention Mechanism!

Richard Aragon

1 year ago
