Market Impact: 0.55

Apple’s upgraded AI models underwhelm on performance

BABA, GOOGL, GOOG, AAPL, META
Artificial Intelligence, Technology & Innovation, Product Launches, Company Fundamentals

Apple's newly announced AI models, which power Apple Intelligence features, underperform existing models from competitors such as OpenAI, Google, and Meta, according to Apple's own benchmarks. The on-device model was rated comparable to models from Google and Alibaba, while the server model lagged behind OpenAI's GPT-4o and Meta's Llama 4 Scout. Although an expanded training dataset improved tool use, efficiency, and language understanding, the benchmark results reinforce concerns about Apple's ability to compete in the rapidly evolving AI landscape, especially given past delays and customer lawsuits related to AI capabilities.

Analysis

Apple's (AAPL) newly unveiled AI models, "Apple On-Device" and "Apple Server," which are intended for its Apple Intelligence suite, show a performance deficit relative to established competitors, based on the company's own published benchmarks. Human testers rated the text generation quality of the offline "Apple On-Device" model (approximately 3 billion parameters) as merely "comparable" to, not better than, similarly sized models from Google (GOOGL, GOOG) and Alibaba (BABA). More critically, Apple's data center model, "Apple Server," was found to underperform OpenAI's year-old GPT-4o. In image analysis, human evaluators preferred Meta's (META) Llama 4 Scout over "Apple Server," even though Llama 4 Scout itself generally lags behind top-tier models from other AI labs. These results substantiate concerns that Apple's AI research division is struggling to keep pace in the highly competitive AI landscape, concerns compounded by past underwhelming AI features, an indefinitely delayed Siri upgrade, and customer lawsuits alleging unfulfilled AI promises. While Apple notes improvements in tool use, efficiency, and multilingual capabilities (around 15 languages) due to an expanded training dataset, the comparative benchmark performance suggests a persistent gap with leading AI innovators.

Market Sentiment

Overall Sentiment

moderately negative

Sentiment Score

-0.55

Ticker Sentiment

AAPL: -0.80
BABA: 0.10
GOOG: 0.20
GOOGL: 0.20
META: 0.40

Key Decisions for Investors

  • Investors should critically evaluate Apple's capacity to bridge the competitive AI divide, given that its newly announced models underperform existing offerings from key players like OpenAI, Google, and Meta based on Apple's internal testing.
  • Monitor upcoming disclosures on Apple's AI strategy execution, particularly focusing on the tangible user benefits derived from these models and any evidence of accelerated performance improvements in future iterations, which are crucial for maintaining ecosystem loyalty and driving new revenue streams.