Convergence India
header banner
Google Enhances Gemini’s Image Generation—New Features You Need to See!
The Gemini 2.0 model is equipped with enhanced reasoning and natural language understanding to generate detailed images.

By Kumar Harshit

on March 13, 2025

After the introduction of native image output in Gemini 2.0 for testers, it has expanded to reach developers across all regions. Google AI Studio currently supports it. It can be accessed through an experimental version of Gemini 2.0 Flash in Google AI Studio and via the Gemini API.

The latest AI model combines multimodal input, enhanced reasoning, and natural language understanding to create images. It is now equipped with features like following a pattern and setting for a given task, editing images with specifics like adding flowers, etc., to text rendering, and recipe generation with images.

Give your stories an AI touch!

The newly updated AI model can generate stories with images, keeping the setting and characters the same throughout. It will add a touch of visual appeal to your stories with images generated that seem to connect over without any anomaly. 

Register to attend our upcoming tech event from March 19th to 21st at: Convergence India 2025: Driving Innovation in Telecom & ICT

Edit On The Go!

The newly updated model allows you to edit the AI-generated images through prompts. This means you can edit the generated images based on color, element, setting, etc. For instance, Google’s blog shows that you can ask it to add flowers and further change them to red tulips. This gives it an edge over the other image generation models in the market. 

Detailed Images 

Unlike many other image generation models, the newly launched model combines world knowledge with advanced reasoning to generate precise and realistic images. This makes it ideal for crafting detailed visuals—such as illustrating a recipe. While it aims for accuracy, its knowledge, like all language models, remains broad and general rather than absolute or exhaustive.

Read about XAI's image generation model "Aurora" at: XAI unveils image generation model Aurora, capable of generating realistic human portraits in seconds

At-Par Text Rendering 

Most image generation models struggle with long text, often producing illegible or misspelled characters. Internal benchmarks show Gemini 2.0 Flash outperforms competitors, making it ideal for ads, social posts, and invitations.