ZeRO & Fastest BERT: Increasing the scale and speed of deep learning training in DeepSpeed

66 views

Microsoft Research

2 days ago

ZeRO & Fastest BERT: Increasing the scale and speed of deep learning training in DeepSpeed

ZeRO & Fastest BERT: Increasing the scale and speed of deep learning training in DeepSpeed