Google’s Will Smith double is better at eating AI spaghetti … but it’s crunchy?

Google has launched Veo 3, a new AI video synthesis model that generates synchronized audio tracks for high-definition video clips, a first for major AI video generators. Early tests, including the benchmark of generating a video of Will Smith eating spaghetti, exposed a glitch in Veo 3's sound effects: the faux Smith appears to crunch the spaghetti, probably because of biases in the model's training data. The glitch highlights how difficult it is to ensure balanced and representative training data for generative AI models.

Analysis

Alphabet Inc.'s Google has launched Veo 3, a new AI video synthesis model with a significant new capability: generating synchronized audio tracks for high-definition video clips, a first among major AI video generators. The model produces eight-second clips with integrated voices, dialogue, and sound effects, moving beyond the silent, short-form videos characteristic of earlier models from 2022-2024. Initial tests, including the 'Will Smith eating spaghetti' benchmark popularized by a 2023 ModelScope example, revealed an audio glitch: the synthesized spaghetti sounded as if it were being crunched. The anomaly is attributed to Veo 3's experimental sound effect application and likely stems from imbalances in its training data, in which chewing mouths may have been predominantly paired with crunching sounds. The incident highlights a critical aspect of generative AI: these pattern-matching systems need extensive, well-balanced training data to produce convincing outputs. Unaddressed biases or misrepresentations in the data can lead to unexpected results, so even advanced models need ongoing refinement.
