OpenAI

The OpenAI backend provides image generation, editing, and analysis capabilities using OpenAI's image models. It supports text-to-image generation with DALL·E 3 and GPT-Image models, image editing with GPT-Image models, and AI-powered image analysis using vision-capable GPT models.

Prerequisites

Before configuring the OpenAI backend, you need to obtain an OpenAI API key:

Go to OpenAI Platform
Sign in with your OpenAI account
Create a new API key or use an existing one
Copy the API key

Then configure it in Talemate:

Open Talemate Settings → Application → OpenAI API
Paste your OpenAI API key in the "OpenAI API Key" field
Save your changes

For additional instructions, see the OpenAI API setup guide.

Configuration

In the Visualizer agent settings, select OpenAI as your backend for text-to-image generation, image editing, image analysis, or any combination of these. Each operation can be configured separately.

Text-to-Image Configuration

For text-to-image generation, configure the following settings:

OpenAI API Key: Your OpenAI API key (configured globally in Talemate Settings)
Model: Select the image generation model to use:
- dall-e-3: OpenAI's DALL·E 3 model (widely available)
- gpt-image-1: OpenAI's GPT-Image model (may require organization verification)
- gpt-image-1-mini: Smaller version of GPT-Image (may require organization verification)

Organization Verification

The gpt-image-1 and gpt-image-1-mini models may require your OpenAI organization to be verified before you can use them. If you encounter errors with these models, you may need to complete OpenAI's organization verification process.

Model Testing Status

Talemate's organization is not verified with OpenAI, and we have not tested the gpt-image-1 and gpt-image-1-mini models. We have confirmed that dall-e-3 works correctly. If you have access to the GPT-Image models and encounter issues, please report them so we can improve support for these models.

The OpenAI backend automatically sets resolution based on the format and model you select:

gpt-image-1 and gpt-image-1-mini:
- Landscape: 1536x1024
- Portrait: 1024x1536
- Square: 1024x1024
dall-e-3:
- Landscape: 1792x1024
- Portrait: 1024x1792
- Square: 1024x1024

Image Editing Configuration

For image editing, configure similar settings but note that DALL·E 3 does not support image editing:

OpenAI API Key: Your OpenAI API key
Model: Select an image editing model:
- gpt-image-1: Full-featured image editing model (may require organization verification)
- gpt-image-1-mini: Smaller image editing model (may require organization verification)

DALL·E 3 Limitations

DALL·E 3 does not support image editing. If you select DALL·E 3 for image editing, you will receive an error. Use gpt-image-1 or gpt-image-1-mini for image editing instead.

Reference Images

OpenAI's image editing models support a single reference image. When editing an image, provide one reference image that will be used as the base for the edit.

Image Analysis Configuration

For image analysis, configure the following:

OpenAI API Key: Your OpenAI API key
Model: Select a vision-capable text model:
- gpt-4.1-mini: Fast analysis model with vision capabilities
- gpt-4o-mini: Alternative vision model option

Analysis Models

Image analysis uses text models that support vision capabilities, not the image generation models. These models can analyze images and provide detailed descriptions, answer questions about image content, and extract information from visual content.

Usage

Once configured, the OpenAI backend will appear in the Visualizer agent status with green indicators showing which capabilities are available.

The status indicators show:

Text to Image: Available when text-to-image backend is configured
Image Edit: Available when image editing backend is configured (shows "References 1" indicating single reference support)
Image Analysis: Available when image analysis backend is configured

Model Recommendations

Text-to-Image

dall-e-3: Most widely available option. Good for general use, though quality may vary.
gpt-image-1: Higher quality option, but requires organization verification. Use if you have access and need better results.
gpt-image-1-mini: Smaller version of GPT-Image, faster generation. Requires organization verification.

Image Editing

gpt-image-1: Best quality for image editing. Requires organization verification.
gpt-image-1-mini: Faster editing option. Requires organization verification.

Image Analysis

gpt-4.1-mini: Recommended default for image analysis. Fast and accurate.
gpt-4o-mini: Alternative option if you prefer this model.

Prompt Formatting

The OpenAI backend uses Descriptive prompt formatting by default. This means prompts are formatted as natural language descriptions rather than keyword lists. Provide detailed, natural language descriptions of what you want to create or edit.