Skip to content

Google Slashes API Prices, Introduces Image-to-Video Feature

Create engaging videos from a single image or text. Google's new Image-to-Video feature makes it more affordable than ever.

In this image we can see a website. There are videos and we can see text.
In this image we can see a website. There are videos and we can see text.

Google Slashes API Prices, Introduces Image-to-Video Feature

Google has slashed prices and introduced new features for its Veo 3 and Veo 3 Fast APIs. The updates include a significant reduction in costs and the addition of an Image-to-Video function, allowing users to create dynamic videos with you from a single image and a text prompt.

The new Image-to-Video function enables users to generate videos in 720p or 1080p resolution from either text or image inputs, with synchronized sound. Additionally, Google has extended the capabilities of Veo 3 and Veo 3 Fast with vertical video support in 9:16 format and 1080p output.

Veo 3 is designed for high image quality, while Veo 3 Fast offers faster processing. These models were previously available via products like the Gemini app, Flow, and Vertex AI and have been used millions of times. The company that integrated these models into the API is Serviceplan Group with their Sōkosumi platform, launched in June 2025. The Image-to-Video function is available from MidJourney since June 25, 2025, initially limited to 5-second clips and accessible via their Discord server.

The new functions are available in a paid preview via the Gemini API, with videos generated from images billed at the same price as text-to-video outputs of the respective model. The cost for Veo 3 with sound has fallen from $0.75 to $0.40 per second, and Veo 3 Fast with sound now costs $0.15 per second. Veo 3 Fast without sound costs $0.10 per second, and Veo 3 without sound costs $0.20 per second.

These updates allow for mobile-optimized videos and high-resolution content generation via the Gemini API, making it more accessible and affordable for users to create engaging visual content.

Read also:

Latest