Mistral’s New Model Is Trained to Solve ‘Real GitHub Issues’

Mistral has launched Devstral, a new open-source AI model designed for coding tasks, developed in collaboration with All Hands AI. Devstral achieved a 46.8% score on the SWE-Bench Verified benchmark, outperforming other open-source models and rivals like OpenAI’s GPT-4.1 Mini and Claude 3.5 Haiku. The model is optimized for solving real GitHub issues and can run on readily available hardware, facilitating local deployment; this release follows Mistral's recent partnership with G42 and the unveiling of Mistral Medium 3, emphasizing cost-effectiveness and enterprise deployment.

Analysis

Mistral, the French AI startup, continues its aggressive push in the AI landscape with the launch of Devstral, a new open-source model specifically engineered for coding tasks and developed with All Hands AI. Devstral's reported 46.8% score on the SWE-Bench Verified benchmark is notable, as it surpasses other open-source AI models and also outperforms proprietary models such as OpenAI’s GPT-4.1 Mini and Claude 3.5 Haiku in this specific real-world software issue evaluation. A key attribute of Devstral is its efficiency, designed to run on readily available hardware like a single RTX 4090 or a Mac with 32GB RAM, facilitating local deployment and wider accessibility. This development follows Mistral's recent strategic partnership with G42, an Abu Dhabi-based technology group, aimed at co-developing next-generation AI platforms and infrastructure across Europe, the Middle East, and the Global South, including collaborations with Mohamed bin Zayed University of Artificial Intelligence (MBZUAI). Furthermore, Mistral recently unveiled Mistral Medium 3, an enterprise-focused model emphasizing cost-effectiveness while claiming superior performance over competitors like Meta’s Llama 4 Maverick and achieving over 90% of Claude Sonnet 3.7’s benchmark scores at significantly lower pricing ($0.40 per million input tokens, $2 for output). These moves collectively indicate Mistral's strategy to compete vigorously on both performance and cost, targeting both open-source developer communities and enterprise clients, thereby intensifying competition within the AI industry.

AllMind

AllMind

Mistral’s New Model Is Trained to Solve ‘Real GitHub Issues’

Analysis

AllMind AI Terminal

Market Sentiment

Key Decisions for Investors