DeepSeek R1 Enhanced 🚀
🐋 DeepSeek has released an updated version of its reasoning-focused R1 model, named R1-0528, again as open source under the MIT license. Although presented as a minor interim update 🤔, this version delivers significant improvements, particularly in mathematics and programming tasks, and stands out as a strong alternative to OpenAI’s o3 and Google’s Gemini 2.5 Pro.
📌 Key Highlights:
Improved Performance on Reasoning Tasks: Thanks to greater depth of thinking (the average number of tokens used per question rose from 12K to 23K), the model’s accuracy on the AIME 2025 mathematics test rose from 70% to 87.5%.
Open Source and Broad Usability: Distributed under the MIT license, the model can be freely used for both academic and commercial purposes, making it highly accessible and flexible for researchers and developers.
Efficient and Cost-Effective Performance: The distilled DeepSeek-R1-0528-Qwen3-8B version is lightweight enough to run on a single GPU, while offering reasoning performance comparable to larger models such as Phi-4.
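🧮 To put the AIME 2025 numbers from the highlights in perspective, here is a minimal Python sketch of the arithmetic. It uses only the rounded figures quoted above; the function name `summarize_gain` and the output keys are illustrative choices, not anything published by DeepSeek.

```python
# Rounded figures quoted in the highlights above (AIME 2025).
OLD_ACC, NEW_ACC = 70.0, 87.5              # accuracy in percent
OLD_TOKENS, NEW_TOKENS = 12_000, 23_000    # average tokens used

def summarize_gain(old_acc, new_acc, old_tokens, new_tokens):
    """Return simple derived metrics for an accuracy/token comparison.

    Hypothetical helper for illustration only.
    """
    point_gain = new_acc - old_acc                      # percentage points
    relative_gain = point_gain / old_acc * 100          # % over the old score
    old_err, new_err = 100 - old_acc, 100 - new_acc     # error rates in percent
    error_reduction = (old_err - new_err) / old_err * 100
    token_ratio = new_tokens / old_tokens               # extra "thinking" cost
    return {
        "point_gain": point_gain,
        "relative_gain_pct": relative_gain,
        "error_reduction_pct": error_reduction,
        "token_ratio": token_ratio,
    }

if __name__ == "__main__":
    for name, value in summarize_gain(OLD_ACC, NEW_ACC, OLD_TOKENS, NEW_TOKENS).items():
        print(f"{name}: {value:.2f}")
```

Read this way, the update gains 17.5 percentage points (a 25% relative improvement) while using roughly 1.9 times as many tokens, so the extra accuracy comes with a higher inference cost.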
💡 Why It Matters:
This move by DeepSeek demonstrates that high-performance AI models are not exclusive to tech giants; smaller, innovative teams can also make significant contributions to the field. Such developments point to a future in which open-source AI systems rival proprietary models in quality, become more widely accessible, and are integrated into more AI-powered products.
