913 views
AI Bites
DeepSeek R1 Incentivizing Reasoning Capability in LLMs via Reinforcement Learning (paper explained)
Login with Google Login with Discord