A new study by AI research group METR challenges the widely assumed productivity gains of AI coding tools for experienced developers: using the tools *increased* task completion time by 19%, whereas participants had forecast a 24% reduction. The randomized controlled trial, which involved seasoned open-source developers working on complex codebases, suggests that current 'vibe coders' may introduce inefficiencies, such as time spent prompting, and may struggle with large systems. For investors, the findings call into question the immediate ROI of AI software engineering tools and the narrative of universal workflow acceleration, although the researchers caution that AI is evolving rapidly and that other studies have shown positive impacts.
A recent study by the non-profit research group METR offers a significant counterpoint to the prevailing narrative that AI coding tools universally enhance developer productivity. The randomized controlled trial, involving 16 experienced open-source developers working on large code repositories, found that access to state-of-the-art AI tools increased task completion time by 19%, contrary to the participants' forecast of a 24% reduction. The researchers theorize that this inefficiency may stem from time lost to prompting the AI and from the models' struggles with complex codebases. The findings are heavily caveated, however: the study's authors acknowledge that other, larger studies have shown productivity gains, and they emphasize that the rapid pace of AI model improvement could render these results obsolete within months. In addition, while most participants had experience with LLMs, 44% were first-time users of Cursor, the primary tool provided, which could have affected performance despite training. Together with other studies that flag AI-introduced code errors and security vulnerabilities, the report suggests that the path to realizing productivity gains is not linear and that applying AI in specialized, high-stakes environments such as enterprise software development remains a challenge.
Overall Sentiment
mixed
Sentiment Score
-0.15
Ticker Sentiment