Which transformer architecture is best? Encoder-only vs Encoder-decoder vs Decoder-only models

31K views

Efficient NLP

4 weeks ago

Which transformer architecture is best? Encoder-only vs Encoder-decoder vs Decoder-only models

Which transformer architecture is best? Encoder-only vs Encoder-decoder vs Decoder-only models