Variants of Multi-head attention: Multi-query (MQA) and Grouped-query attention (GQA)

22 views

Machine Learning Studio

2 weeks ago

Variants of Multi-head attention: Multi-query (MQA) and Grouped-query attention (GQA)

Variants of Multi-head attention: Multi-query (MQA) and Grouped-query attention (GQA)