Settings

The Visualizer agent settings are organized into three main sections: General, Prompt Generation, and Styles. Additionally, each backend may have its own configuration options, including resolution presets for local image generation backends.

General Settings

Backend Configuration

The Visualizer agent supports separate backends for different types of operations. You can configure each independently:

Backend (text to image): The backend to use for basic text-to-image generation. This is the primary backend used when generating new images from prompts.
Backend (image editing): The backend to use for contextual image editing. This allows you to modify existing images using reference images.
Backend (image analysis): The backend to use for image analysis. This enables AI-powered analysis of images to extract information.

Backend Selection

Not all backends support all three operations. The dropdown menus will only show backends that support the specific operation. You can use different backends for different operations if needed.

Image Generation Timeout

Controls how long the agent will wait for an image generation request to complete before considering it failed. The default is 300 seconds (5 minutes), and can be adjusted from 1 to 900 seconds (15 minutes).

If a backend doesn't generate an image within this time limit, the generation will be marked as failed and you'll be notified.

Automatic Setup

When enabled, the Visualizer agent will automatically configure backends if your selected client has built-in support for them. For example, some clients like KoboldCpp provide an Automatic1111 API that can be automatically detected and configured.

This setting is enabled by default and helps streamline the setup process when using compatible clients.

Automatic Generation

When enabled, allows the Visualizer agent to automatically generate visual content based on scene context. This can be useful for automatically creating character portraits or scene illustrations as your story progresses.

This setting is disabled by default, giving you full control over when images are generated.

Prompt Generation

The Prompt Generation section contains settings that control how image prompts are created and refined before being sent to the image generation backend.

Fallback Prompt Type

Determines the format used for prompt-only generation when no backends are configured. This setting only affects the output format when generating prompts without actually creating images.

Available options:

Keywords: Generates prompts using keyword-based formatting
Descriptive: Generates prompts using descriptive text formatting

Max. Prompt Generation Length

Controls the maximum token length for AI-generated image prompts. When you use Instruct mode or any feature that asks the AI to create a prompt for your image, this setting limits how long that generated prompt can be.

The default is 1024 tokens, and can be adjusted from 512 to 4096 tokens.

Both keyword-style and descriptive prompts are always generated together, so this limit must accommodate both formats.

Automatic Analysis of References

When enabled, reference images that lack analysis data will be automatically analyzed before being used in prompt generation. This ensures that the AI has detailed information about your reference images when creating prompts, which can lead to better results when generating variations or editing images.

How It Works

Normally, you analyze images manually using the Analyze button in the Visual Library. The analysis text captures details about the image content, which can then be used during prompt generation to help the AI understand what your reference images contain.

With Automatic Analysis of References enabled, any reference images that don't already have analysis data will be analyzed on-the-fly when you start a generation. This is particularly useful when:

You have uploaded images that haven't been analyzed yet
You're using newly saved images as references before analyzing them
You want to ensure all references have analysis data without manually analyzing each one

The analysis results are saved to the asset metadata, so each image only needs to be analyzed once. Future generations using the same reference will use the cached analysis.

Interaction with Prompt Revision

This setting works in conjunction with Perform Extra Revision of Editing Prompts (below). When both settings are enabled and you're generating with references:

First, any unanalyzed references are automatically analyzed
Then, the prompt revision step uses this analysis data to refine your prompt

If you only enable prompt revision without automatic analysis, the revision step will still work but may have less information about unanalyzed references to work with.

Additional AI Queries

Enabling this option adds one AI query per unanalyzed reference image. You must have an image analysis backend configured for this feature to work.

This setting is disabled by default.

Perform Extra Revision of Editing Prompts

When enabled, the AI will refine and simplify image editing prompts based on the provided reference images. This additional processing step can improve generation results by better aligning the prompt with your selected references.

This revision step analyzes the reference images (using their analysis data or tags) and adjusts the prompt to:

Reference characters or elements by their image number instead of re-describing them
Preserve the scene composition and setting from your original prompt
Maintain important context like actions, positioning, and mood
Only describe differences from the reference images when needed

For example, instead of generating a prompt like "Elena, a tall woman with red hair and green eyes wearing a blue dress, stands in a dimly lit tavern," the revised prompt might become "Elena (IMAGE 1) stands in a dimly lit tavern, looking worried" - letting the reference image provide the character's appearance while your prompt defines the scene.

Additional AI Query

This adds an extra AI query to the prompt generation process when reference images are provided.

This setting is enabled by default.

Styles Configuration

The Styles tab allows you to configure visual styles for different types of content. Each style is a template that defines how prompts are formatted and what visual characteristics are applied.

Style Options

Art Style: The default art style to use for visual prompt generation. This can be overridden in scene settings.
Character Card: The style to use for character card visual prompt generation
Character Portrait: The style to use for character portrait visual prompt generation
Scene Card: The style to use for scene card visual prompt generation
Scene Background: The style to use for scene background visual prompt generation
Scene Illustration: The style to use for scene illustration visual prompt generation

Managing Styles

Additional styles can be created and managed in the Templates manager. Styles are reusable templates that define visual characteristics, keywords, and instructions for different types of visual content.

Each style template can include:

Positive keywords (things to include)
Negative keywords (things to avoid)
Descriptive text (detailed descriptions)
Instructions (specific generation instructions)

These styles are applied automatically when generating images based on the visual type you select.

Resolution Presets

Local image generation backends (ComfyUI, SD.Next, and AUTOMATIC1111) include a resolution preset picker that lets you quickly select appropriate image dimensions for your generated images. This feature appears in each backend's configuration section.

How It Works

The resolution preset picker provides settings for three aspect ratios that match the format options available during image generation:

Square: Used for character portraits and icons (e.g., CHARACTER_PORTRAIT visual type)
Portrait: Used for tall images like character cards (e.g., CHARACTER_CARD, SCENE_CARD visual types)
Landscape: Used for wide images like scene backgrounds (e.g., SCENE_BACKGROUND, SCENE_ILLUSTRATION visual types)

Each resolution setting displays two number fields for width and height, along with a dropdown menu button that reveals available presets.

Available Presets

The preset picker includes resolution options optimized for different model types:

Preset	Square	Portrait	Landscape
SD 1.5	512 x 512	512 x 768	768 x 512
SDXL	1024 x 1024	832 x 1216	1216 x 832
Qwen Image	1328 x 1328	928 x 1664	1664 x 928
Z-Image Turbo	2048 x 2048	1088 x 1920	1920 x 1088

Selecting a Preset

To select a resolution preset:

Open the Visualizer agent settings
Navigate to the backend configuration tab (ComfyUI, SD.Next, or AUTOMATIC1111)
Find the Resolutions section
Click the dropdown menu button next to the resolution you want to change
Select the appropriate preset for your model

You can also manually enter custom width and height values by typing directly into the number fields.

Backend-Specific Settings

Resolution presets are configured separately for text-to-image and image editing operations if you're using a backend that supports both (like ComfyUI or SD.Next). This allows you to use different resolutions for each type of generation.

Cloud Backends

Cloud-based backends (Google, OpenAI, OpenRouter) do not have resolution presets because they use fixed or automatically determined resolutions based on the model's capabilities.