Google’s generative video model Veo 3 has a subtitles problem
Summary
Google’s new generative video model, Veo 3, introduced the ability to generate sound and dialogue alongside hyperrealistic video clips, and it quickly attracted attention from creatives. However, the model struggles to generate accurate subtitles, which shows how hard it remains to keep audio and on-screen text in sync in AI-generated content. The limitation suggests that multimodal AI systems need further work before they can support seamless video production.