Scaling Recurrent Neural Networks to a Billion Parameters with Zero-Order Optimization

30 views

Xiaol.x

9 days ago

Scaling Recurrent Neural Networks to a Billion Parameters with Zero-Order Optimization

Scaling Recurrent Neural Networks to a Billion Parameters with Zero-Order Optimization