News

Google Expands Gemini 2.0 Flash's Image Generation Capability

Google Expands Gemini 2.0 Flash's Image Generation Capability

May 07, 2025
Google Gemini 2.0 Flash image generation Google AI Studio Gemini API multimodal input natural language understanding developer tools
Google has made Gemini 2.0 Flash's native image generation feature available for developer experimentation through Google AI Studio and the Gemini API, enabling multimodal input, enhanced reasoning, and natural language understanding for creating images.

Google Expands Gemini 2.0 Flash's Image Generation Capability

Video: Google Gemini 2.0 Flash: FREE Image Generation Better Than Midjourney | 2025

Gemini 2.0 Flash's native image generation capability is now available for developer experimentation through Google AI Studio and the Gemini API. This feature was initially introduced to trusted testers in December and has since been expanded to all regions supported by Google AI Studio. Developers can use the experimental version of Gemini 2.0 Flash (gemini-2.0-flash-exp) to explore its multimodal input, enhanced reasoning, and natural language understanding for creating images.

Key features of Gemini 2.0 Flash's image generation include:

  • Text and Images Together: Generate illustrated stories with consistent characters and settings. Provide feedback to retell the story or alter the drawing style.
  • Conversational Image Editing: Edit images through natural language dialogue, enabling iterative refinement and exploration of different ideas.
  • World Understanding: Leverage world knowledge and enhanced reasoning to create detailed and realistic imagery, such as illustrating a recipe.
  • Text Rendering: Accurately render long sequences of text, making it suitable for creating advertisements, social posts, or invitations.

Developers can get started with image generation using the following code snippet:

        
from google import genai
from google.genai import types

client = genai.Client(api_key="GEMINI_API_KEY")
response = client.models.generate_content(
    model="gemini-2.0-flash-exp",
    contents=(
        "Generate a story about a cute baby turtle in a 3d digital art style. "
        "For each scene, generate an image."
    ),
    config=types.GenerateContentConfig(
        response_modalities=["Text", "Image"]
    ),
)
        
    

This feature allows developers to build AI agents, develop apps with beautiful visuals, and brainstorm visual ideas using a single model. Feedback from developers will help finalize a production-ready version of this capability.

Sources

Gemini 2.5 Flash is now in preview - Google Blog We're excited to roll out an early version of Gemini 2.5 Flash today in preview in the Gemini API via Google AI Studio and Vertex AI.
Gemini 2.5 Flash Preview – Vertex AI - Google Cloud Console These capabilities are available in the Gemini 2.5 Flash model: Controlled generation; Function calling; Search; Image generation (preview); Audio generation ( ...
Experiment with Gemini 2.0 Flash native image generation Gemini 2.0 Flash helps you edit images through many turns of a natural language dialogue, great for iterating towards a perfect image, or to ...