FLUX Best Practices

Version 1.0.0
Black Forest Labs
January 2026

Note:
This document is for AI agents and LLMs to follow when working with
FLUX image generation prompting and workflows. Humans may also find it useful,
but guidance here is optimized for automation and consistency.

Abstract

Comprehensive prompting and workflow guide for BFL FLUX image generation models. Covers all FLUX.2 and FLUX.1 models including text-to-image, image-to-image editing, JSON structured prompting, color specification, typography, and multi-reference editing. Each rule includes detailed explanations, examples of effective vs ineffective approaches, and model-specific guidance.

Core Principles - CRITICAL
- 1.1 Core FLUX Prompting Principles
Model Selection - HIGH
- 2.1 FLUX Model Selection Guide
- 2.2 FLUX.1 Model Family
- 2.3 FLUX.2 Model Family
Text-to-Image Prompting - HIGH
- 3.1 Text-to-Image (T2I) Prompting
Image-to-Image Editing - HIGH
- 4.1 Image-to-Image (I2I) Prompting
JSON Structured Prompting - MEDIUM-HIGH
- 5.1 JSON Structured Prompting
Color Specification - MEDIUM
- 6.1 Hex Color Prompting
Typography and Text - MEDIUM
- 7.1 Typography and Text Prompting
Multi-Reference Editing - MEDIUM
- 8.1 Multi-Reference Image Editing
Negative Prompt Alternatives - MEDIUM
- 9.1 Negative Prompt Alternatives

1. Core Principles

Impact: CRITICAL

Universal prompting principles that apply to all FLUX models. Master these before diving into specific techniques.

1.1 Core FLUX Prompting Principles

Impact: MEDIUM

These principles apply to all FLUX models and form the foundation of effective prompting.

FLUX does NOT support negative prompts. Always describe what you WANT, not what you don't want.

Wrong Approach:

a portrait of a woman, no glasses, no hat, no makeup

Correct Approach:

a portrait of a woman with natural skin, clear face, bare head, visible eyes

Build prompts using this structure for consistent results:

Example:

A young woman with flowing auburn hair (subject)
dancing gracefully in mid-leap (action)
in the style of classical oil painting (style)
in a moonlit garden with roses (context)
soft diffused moonlight with subtle rim lighting (lighting)
medium shot, shallow depth of field (technical)
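The structure above can be assembled programmatically. The following is a minimal sketch, not part of any FLUX tooling: the `PromptParts` class and its `compose` method are hypothetical names introduced here, and joining the parts with commas is just one reasonable way to keep the subject first.

```python
from dataclasses import dataclass


@dataclass
class PromptParts:
    """Hypothetical container for the six parts of the prompt structure."""
    subject: str
    action: str = ""
    style: str = ""
    context: str = ""
    lighting: str = ""
    technical: str = ""

    def compose(self) -> str:
        # Keep the documented order so the subject is always front-loaded.
        ordered = [self.subject, self.action, self.style,
                   self.context, self.lighting, self.technical]
        return ", ".join(part.strip() for part in ordered if part.strip())


parts = PromptParts(
    subject="A young woman with flowing auburn hair",
    action="dancing gracefully in mid-leap",
    style="in the style of classical oil painting",
    context="in a moonlit garden with roses",
    lighting="soft diffused moonlight with subtle rim lighting",
    technical="medium shot, shallow depth of field",
)
prompt = parts.compose()
print(prompt)
```

Empty parts are skipped, so the same helper works while a prompt is still being built up one element at a time.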

More specific prompts yield dramatically better results.

Vague: Poor Results

a cat sitting

Specific: Excellent Results

A fluffy orange tabby cat with bright green eyes sitting regally on a vintage
velvet armchair, afternoon sunlight streaming through lace curtains, warm
golden hour lighting, shallow depth of field, shot on medium format film

Write prompts as descriptive prose rather than keyword lists.

Keyword Style: Less Effective

woman, portrait, beautiful, blonde, studio, professional, 8k, detailed

Prose Style: More Effective

A professional studio portrait of a beautiful blonde woman in her thirties,
captured with soft studio lighting that accentuates her features, rendered
in stunning detail with natural skin texture and subtle catchlights in her eyes

Always specify lighting - it has the single greatest impact on image quality.

Natural Lighting:

Golden hour - warm, soft, directional
Overcast - soft, diffused, even
Harsh midday - high contrast, strong shadows
Dappled forest light - speckled, organic patterns

Studio Lighting:

Softbox - even, professional
Rim light - edge definition, separation
Butterfly lighting - beauty, glamour
Rembrandt lighting - dramatic, classic portraits

Atmospheric Lighting:

Volumetric fog - depth, mystery
God rays - dramatic, spiritual
Neon glow - urban, cyberpunk
Candlelight - warm, intimate

Mood-Based Lighting:

Dramatic shadows - tension, noir
High key - bright, airy, clean
Low key - moody, mysterious
Chiaroscuro - strong contrast, painterly

FLUX prioritizes elements that appear earlier in the prompt. Front-load important elements.

Less Effective:

A forest background with soft lighting where a knight in shining armor stands

More Effective:

A knight in shining armor stands in a forest, soft dappled lighting filtering
through the canopy

Optimal prompt length is typically 30-80 words (FLUX can handle up to 512 tokens).

Too short: Lacks direction, generic results
Too long: Can become unfocused
Sweet spot: Enough detail to guide, not so much it confuses
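A rough length check along these lines can catch prompts that fall outside the suggested range. This is a sketch only: `check_prompt_length` is a hypothetical helper, it counts whitespace-separated words rather than model tokens, and it does not check the 512-token limit.

```python
def check_prompt_length(prompt: str, low: int = 30, high: int = 80) -> str:
    """Classify a prompt against the 30-80 word guideline (words, not tokens)."""
    words = len(prompt.split())
    if words < low:
        return "too short"
    if words > high:
        return "too long"
    return "ok"


print(check_prompt_length("a cat sitting"))
```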

Build prompts iteratively:

Start with core subject and action
Add style and medium
Specify lighting and atmosphere
Include technical details
Refine based on results

Change one element at a time to understand what affects your output.

Reference: negative-prompt-alternatives.md

2. Model Selection

Impact: HIGH

Choosing the right FLUX model for your use case. Covers both FLUX.2 (latest) and FLUX.1 (legacy) model families.

2.1 FLUX Model Selection Guide

Impact: MEDIUM

Decision guide for selecting the optimal FLUX model based on your requirements.

| Priority | Recommended Model |
| ------------- | ------------------------------- |
| Speed | FLUX.2 [klein] |
| Quality | FLUX.2 [max] |
| Balance | FLUX.2 [pro] |
| Typography | FLUX.2 [flex] |
| Image Editing | FLUX.2 [klein], [pro], or [max] |
| Local/Free | FLUX.2 [dev] |
| Inpainting | FLUX.1 Fill |

Note: All FLUX.2 models natively support image-to-image editing via reference images. Simply provide your source image(s) as references and describe the desired changes.

By Speed:

| Model | Relative Speed | Best For |
| ----------------- | -------------- | ----------------------- |
| FLUX.2 [klein] 4B | Fastest | Rapid prototyping |
| FLUX.2 [klein] 9B | Very Fast | Better quality previews |
| FLUX.2 [pro] | Medium | Production workflows |
| FLUX.2 [flex] | Medium | Typography tasks |
| FLUX.2 [max] | Slower | Final hero images |

By Quality:

| Model | Quality Level | Trade-off |
| ----------------- | ------------- | -------------------------- |
| FLUX.2 [max] | Highest | Slowest, most expensive |
| FLUX.2 [pro] | High | Good balance |
| FLUX.2 [flex] | High (text) | Specialized for typography |
| FLUX.2 [klein] 9B | Good | Fast, slightly less detail |
| FLUX.2 [klein] 4B | Moderate | Fastest, preview quality |

By Cost:

Credit pricing: 1 credit = $0.01 USD. FLUX.2 uses megapixel-based pricing.

| Model | First MP | Each additional MP | 1 MP T2I | 1 MP I2I (1 MP input) | Best For |
| ----------------- | ------ | ---- | ------- | ------- | --------------------------- |
| FLUX.2 [klein] 4B | 1.4c | 0.1c | $0.014 | $0.015 | High volume, previews |
| FLUX.2 [klein] 9B | 1.5c | 0.2c | $0.015 | $0.017 | High volume, better quality |
| FLUX.2 [pro] | 3c | 1.5c | $0.03 | $0.045 | Production workloads |
| FLUX.2 [max] | 7c | 3c | $0.07 | $0.10 | Hero images, premium |
| FLUX.2 [flex] | 6c | 6c | $0.06 | $0.12 | Typography |

Pricing formula: (firstMP + (outputMP-1) * mpPrice) + (inputMP * mpPrice) in cents
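The formula can be checked against the FLUX.2 table with a short calculation. This sketch assumes the first two numeric columns of that table are the first-megapixel price and the per-additional-megapixel price in cents, which is consistent with the dollar columns. The function name and the model keys are hypothetical, and how the API rounds megapixel counts is not covered here.

```python
# Assumed reading of the table: (first MP price, price per additional MP), in cents.
FLUX2_PRICES_CENTS = {
    "klein-4b": (1.4, 0.1),
    "klein-9b": (1.5, 0.2),
    "pro": (3.0, 1.5),
    "max": (7.0, 3.0),
    "flex": (6.0, 6.0),
}


def flux2_cost_cents(model: str, output_mp: float, input_mp: float = 0.0) -> float:
    """Apply (firstMP + (outputMP-1) * mpPrice) + (inputMP * mpPrice) literally."""
    first_mp, mp_price = FLUX2_PRICES_CENTS[model]
    return (first_mp + (output_mp - 1) * mp_price) + (input_mp * mp_price)


# 1 MP text-to-image, then 1 MP output with one 1 MP reference image.
print(flux2_cost_cents("pro", output_mp=1))
print(flux2_cost_cents("pro", output_mp=1, input_mp=1))
```

For anything beyond a quick estimate, the pricing calculator remains the authoritative source.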

| Model | Price/Image | Use Case |
| -------------------- | ----------- | ----------------------- |
| FLUX.1 Kontext [pro] | $0.04 | Context-aware editing |
| FLUX.1 Kontext [max] | $0.08 | Max quality editing |
| FLUX1.1 [pro] | $0.04 | Standard T2I |
| FLUX1.1 [pro] Ultra | $0.06 | Ultra high-resolution |
| FLUX1.1 [pro] Raw | $0.06 | Candid photography feel |
| FLUX.1 Fill [pro] | $0.05 | Inpainting |
| FLUX.1 [pro] | $0.05 | Original pro model |

Use the pricing calculator at bfl.ai/pricing for exact costs at different resolutions.

Creative Exploration / Ideation:

Recommended: FLUX.2 [klein]

Fast iterations
Quick concept testing
Mood board generation
Exploring prompt variations

Production Marketing Assets:

Recommended: FLUX.2 [pro]

Consistent quality
Reasonable speed
Cost-effective at scale
Reliable for automation

Hero Images / Premium Content:

Recommended: FLUX.2 [max]

Maximum detail
Best coherence
Supports grounding search
Worth the premium for key visuals

Typography / Signage / Posters:

Recommended: FLUX.2 [flex]

Superior text rendering
Adjustable quality settings
Best for readable text
UI mockups and infographics

Character Consistency:

Recommended: FLUX.2 [max] or [pro]

Multi-reference support (up to 8 images via the API; [max] allows 10 in the playground)
Best editing consistency
Maintains identity across scenes
Superior quality over FLUX.1 Kontext

Photo Editing / Retouching:

Recommended: FLUX.2 [klein], [pro], or [max]

Native image-to-image support via references
Style transfer
Object modification
Attribute changes
Better results than FLUX.1 Kontext

Real-Time Information:

Recommended: FLUX.2 [max]

Grounding search feature
Current events
Recent news visualization
Weather/location data

Local Development / Testing:

Recommended: FLUX.2 [dev]

No API costs
Full control
Fine-tuning experiments
Offline capability

Editorial with Typography:

1. FLUX.2 [max] - Generate base image (highest quality)
2. FLUX.2 [flex] - Add text overlay pass

Character-Consistent Series:

1. FLUX.2 [max] - Create character reference
2. FLUX.2 [max]/[pro] - Generate consistent variations using reference images
3. FLUX.2 [klein] - Quick iteration on variations if needed

E-commerce Product Pipeline:

1. FLUX.2 [pro] - Bulk product generations
2. FLUX.2 [pro]/[klein] - Product variations (colors, angles) using references
3. FLUX.2 [flex] - Add promotional text/pricing

Limited Budget:

High volume: FLUX.2 [klein] 4B
Quality needed: FLUX.2 [pro] (best value)

Tight Deadline:

Any task: FLUX.2 [klein]
Quality matters: FLUX.2 [pro]

Maximum Quality Required:

Always: FLUX.2 [max]

Text Must Be Readable:

Always: FLUX.2 [flex]

Editing Existing Images:

Fast edits: FLUX.2 [klein] with reference images
Quality edits: FLUX.2 [max] or [pro] with reference images
Alternative: FLUX.1 Kontext (FLUX.2 preferred)

Rate Limit Sensitivity:

Prefer: FLUX.2 models (24 concurrent limit)
Avoid: FLUX.1 Kontext Max (6 concurrent limit)

Quick Reference:

Speed?      → FLUX.2 [klein]
Quality?    → FLUX.2 [max]
Balance?    → FLUX.2 [pro]
Text?       → FLUX.2 [flex]
Edit?       → FLUX.2 [klein/pro/max] with reference images
Free?       → FLUX.2 [dev]

Key insight: All FLUX.2 models support image editing natively via reference images. FLUX.2 is recommended over FLUX.1 Kontext for editing tasks.
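For automated pipelines, the priority-to-model mapping in this section can be kept as a small lookup table. The sketch below only restates the recommendations given in this guide. `recommend_model` and the lowercase priority keys are hypothetical names, and the returned strings are display labels, not API model identifiers.

```python
QUICK_REFERENCE = {
    "speed": "FLUX.2 [klein]",
    "quality": "FLUX.2 [max]",
    "balance": "FLUX.2 [pro]",
    "text": "FLUX.2 [flex]",
    "edit": "FLUX.2 [klein/pro/max] with reference images",
    "free": "FLUX.2 [dev]",
}


def recommend_model(priority: str) -> str:
    """Return the recommended model label for a priority keyword."""
    key = priority.strip().lower()
    if key not in QUICK_REFERENCE:
        raise ValueError(f"unknown priority: {priority!r}")
    return QUICK_REFERENCE[key]


print(recommend_model("Speed"))
```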

2.2 FLUX.1 Model Family

Impact: MEDIUM

Tip: FLUX.2 models are the latest generation and recommended for most use cases. FLUX.1 models are still available for specific needs.

Guide to FLUX.1 models and their specialized capabilities.

| Model | Purpose | Notes |
|-------|---------|-------|
| FLUX1.1 [pro] | Text-to-image | FLUX.2 [pro] offers improved results |
| FLUX.1 Kontext | Image-to-image | FLUX.2 with references recommended |
| FLUX.1 Kontext Max | Image-to-image | FLUX.2 [max] with references recommended |
| FLUX.1 Fill | Inpainting | Useful for specific inpainting tasks |

FLUX1.1 [pro]:

Fast and reliable text-to-image generation.

Characteristics:

Strong prompt adherence
Production-grade architecture
Consistent, reliable results
Scalable for high-volume
Pricing: $0.04 per image

Prompting Style:

Standard descriptive prompts with clear subject and style specification.

Example Prompt:

A golden retriever puppy playing in autumn leaves, warm afternoon sunlight,
shallow depth of field with bokeh background, joyful expression, professional
pet photography style

FLUX.1 Kontext:

Context-aware image-to-image editing model for transformations and modifications.

Recommendation: FLUX.2 models with reference images provide improved editing results.

Characteristics:

Understands image context
Preserves unedited regions
Style transfer capabilities
Object modification
Basic to complex transformations

Prompting Strategies:

Simple, direct instructions work best.
Be explicit about what to preserve.
For dramatic changes, be specific about preservation.
Reference specific artistic movements.

Describe text placement and integration:

Add the text "OPEN" as a neon sign in the window, red glowing letters
with slight reflection on the glass, matching the nighttime atmosphere

Tips for Kontext:

Be explicit about what should NOT change
Start with simpler edits, build complexity
Specify style preservation when needed
Use for incremental refinement

FLUX.1 Kontext Max:

Advanced multi-reference editing for complex compositions.

Characteristics:

Handles up to 10 reference images
Best editing consistency across references
Complex scene composition
Character consistency maintenance
Rate limit: 6 concurrent requests

Multi-Reference Prompting:

Describe relationships between images naturally.

Reference images by number for precision:

Replace the top half of the person in image 1 with the clothing
from image 2, maintaining the pose and background

Tips for Kontext Max:

Plan your reference images carefully
Use natural language for relationships
Specify which elements come from which image
Leverage for character consistency across scenes

FLUX.1 Fill:

Specialized tool for object removal and area completion.

Characteristics:

Clean object removal
Intelligent background completion
Texture-aware filling
Seamless blending

Use Cases:

Remove unwanted objects from photos
Complete partial images
Replace specific regions
Clean up image artifacts

Prompting for Fill:

Describe what should fill the masked area:

Complete with matching ocean waves and sandy beach

Tips for Fill:

Provide context about surrounding areas
Specify texture and pattern continuation
Describe lighting consistency
Use for cleanup and removal tasks

2.3 FLUX.2 Model Family

Impact: MEDIUM

Complete guide to FLUX.2 variants and their optimal prompting strategies.

Key Feature: All FLUX.2 models natively support both text-to-image generation AND image-to-image editing via reference images. There's no need to use legacy FLUX.1 Kontext models for editing tasks.

FLUX.2 [klein]:

Best for rapid prototyping, previews, and high-volume generation.

Characteristics:

4B or 9B parameter versions available
Sub-second generation times
Optimized for speed over maximum detail
No prompt upsampling - be descriptive yourself
Supports up to 4 reference images

Prompting Style: Narrative Prose:

[klein] responds best to descriptive, narrative-style prompts with emphasis on lighting and atmosphere.

Example Prompt:

A cozy coffee shop interior bathed in warm afternoon light, steam rising lazily
from ceramic cups, worn leather armchairs arranged around small wooden tables,
bookshelves lining exposed brick walls, the soft atmosphere of a quiet afternoon
with dust motes floating in sunbeams through tall windows

Tips for [klein]:

Write like a novelist describing a scene
Front-load your subject (word order critical)
Emphasize lighting descriptions heavily
Keep prompts moderately detailed (40-70 words)

FLUX.2 [max]:

Premium model for final production assets and maximum detail.

Characteristics:

Highest detail and coherence
Best editing consistency
Vast world knowledge
Includes grounding search (real-time web data)
Strongest prompt following
Supports up to 8 reference images (API), 10 (playground)

Prompting Style: Technical + Descriptive:

[max] excels with detailed technical specifications combined with descriptive prose.

Example Prompt:

Portrait of a weathered fisherman, age 70, deep wrinkles telling stories of
decades at sea, salt-and-pepper beard with streaks of white, wearing a navy
cable-knit sweater with visible wool texture. Shot on Hasselblad X2D with
90mm f/2.8 lens at f/4, golden hour natural light from the left creating
strong rim lighting, shallow depth of field with soft bokeh from harbor
lights behind, Kodak Portra 400 color science with natural grain

Tips for [max]:

Include camera and lens specifications for photorealism
Specify film stock or digital sensor characteristics
Use technical photography terms (aperture, focal length)
Leverage grounding search for current events: "news photo of [recent event]"

FLUX.2 [pro]:

Optimal balance of quality and speed for production workflows.

Characteristics:

Good quality-to-speed ratio
Reliable, consistent output
Suitable for batch processing
Supports prompt upsampling
Supports up to 8 reference images

Prompting Style: Balanced Detail:

Standard detailed prompts work well without excessive technical specification.

Example Prompt:

A modern minimalist living room with floor-to-ceiling windows overlooking
a city skyline at dusk, clean white furniture with subtle textures, a single
statement plant in the corner, warm ambient lighting from hidden sources,
architectural photography style with clean lines and balanced composition

Tips for [pro]:

Balance specificity with generation speed
Good for template-based prompt systems
Enable prompt upsampling for enhanced results
Consistent quality for production pipelines

FLUX.2 [flex]:

Optimized for text rendering and typographic content.

Characteristics:

Superior text rendering quality
Handles multiple text elements
Adjustable steps (1-50) and guidance (1.5-10)
Best for signage, posters, UI mockups
Supports up to 8 reference images

Prompting Style: Typography-Focused:

Always quote text and specify font characteristics explicitly.

Example Prompt:

A modern minimalist poster design with the headline "DESIGN SUMMIT 2025"
in bold condensed sans-serif typography centered in the upper third,
subtitle "Innovation Meets Creativity" in lighter weight below,
date "MARCH 15-17" in small caps at the bottom, all text in white
on a gradient background transitioning from deep purple #4A0080 to
coral #FF6B6B, clean geometric accent lines, professional print quality

Tips for [flex]:

Always quote exact text: "Your Text Here"
Specify font style: serif, sans-serif, script, display, monospace
Describe text hierarchy: headline, subhead, body
Include placement: centered, left-aligned, upper third
Adjust steps (higher = better quality) and guidance (higher = stricter)
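Because [flex] exposes adjustable steps and guidance, an automated workflow may want to validate both before submitting a request. The following is a minimal sketch using the ranges stated above (steps 1-50, guidance 1.5-10). `validate_flex_settings` is a hypothetical helper, and the actual API field names may differ.

```python
def validate_flex_settings(steps: int, guidance: float) -> None:
    """Raise ValueError if either setting is outside the documented range."""
    if not 1 <= steps <= 50:
        raise ValueError(f"steps must be between 1 and 50, got {steps}")
    if not 1.5 <= guidance <= 10:
        raise ValueError(f"guidance must be between 1.5 and 10, got {guidance}")


# Passes silently when both values are in range.
validate_flex_settings(steps=40, guidance=5.0)
```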

FLUX.2 [dev]:

For local development, testing, and non-commercial use.

Characteristics:

Open weights on Hugging Face
Runs locally (~13GB VRAM recommended)
Full customization available
Free for non-commercial use
Base variants available (undistilled) for fine-tuning

Prompting Style: Standard:

Same prompting patterns as [pro] work well.

Tips for [dev]:

Use for development and testing before production
Experiment with prompt variations
Good for fine-tuning experiments
Check license for commercial use restrictions

All FLUX.2 models support image editing via reference images. This replaces the need for legacy FLUX.1 Kontext models.

How It Works:

Provide your source image(s) as reference images
Describe the desired changes in your prompt
The model preserves context while applying edits

Example: Style Transfer:

Reference: [your source image]
Prompt: Transform this image into a watercolor painting style,
maintaining the exact composition and subject positioning

Example: Object Modification:

Reference: [your source image]
Prompt: Change the car color to red while keeping everything else identical

Example: Character Consistency:

Reference: [character reference image]
Prompt: The same person from the reference image walking through
a busy Tokyo street at night, neon lights reflecting on wet pavement

Model Selection for Editing:

| Use Case | Recommended Model |
|----------|-------------------|
| Quick iterations/previews | FLUX.2 [klein] |
| Production quality edits | FLUX.2 [pro] |
| Maximum quality/complex edits | FLUX.2 [max] |
| Text/typography edits | FLUX.2 [flex] |

3. Text-to-Image Prompting

Impact: HIGH

Crafting effective prompts for generating images from text descriptions.

3.1 Text-to-Image (T2I) Prompting

Impact: MEDIUM

Comprehensive guide to crafting effective text-to-image prompts for FLUX models.

Basic Formula:

[Subject] + [Action] + [Style] + [Context] + [Lighting] + [Technical]

Expanded Framework:

[Main Subject] - who/what is the focus
[Attributes] - characteristics, details, clothing
[Action/Pose] - what they're doing
[Environment] - where, setting, background
[Style/Medium] - artistic approach
[Lighting] - light source, quality, mood
[Composition] - framing, camera angle
[Technical] - camera, lens, film stock

People/Portraits:

A distinguished professor in his 60s with silver hair and round spectacles,
wearing a tweed jacket with leather elbow patches, deep-set thoughtful eyes,
slight smile suggesting hidden wisdom

Animals:

A majestic snow leopard with piercing blue-grey eyes, thick spotted fur
dusted with snowflakes, powerful muscular build, alert posture on a
rocky outcrop

Objects/Products:

A vintage Leica M3 camera with worn brass edges showing decades of use,
black leather covering with patina, sitting on weathered wooden table

Landscapes:

A dramatic fjord at dawn, steep granite cliffs rising from mirror-still
water, wisps of morning mist, distant snow-capped peaks catching first
golden light

Architecture:

A brutalist concrete apartment building in late afternoon light, geometric
shadows creating abstract patterns, warm sunlight contrasting with cool
grey concrete

Photorealistic:

80s film photography, film grain, warm color cast, soft focus, nostalgic

Artistic Styles:

anime style, large expressive eyes, clean linework, cel shading, vibrant palette

Portrait Lighting:

Rembrandt lighting - 45 degree key light creating triangle shadow on cheek
Butterfly lighting - overhead key creating shadow under nose
Split lighting - 90 degree side light, half face in shadow
Loop lighting - slight angle creating small nose shadow

Natural Lighting:

Golden hour - warm, soft, directional light in the hour after sunrise or before sunset
Blue hour - cool, ambient light just after sunset
Overcast - soft, even, diffused lighting
Harsh midday - strong contrast, defined shadows

Atmospheric:

Volumetric light - visible light rays through fog/dust
Rim lighting - backlight creating edge glow
Practical lighting - visible light sources in scene
Neon glow - colorful artificial urban lighting

Camera Bodies:

Shot on Hasselblad X2D - medium format, exceptional detail
Shot on Canon 5D Mark IV - professional DSLR quality
Shot on Leica M10 - rangefinder character, smooth tonality
Shot on iPhone 15 Pro - computational photography look

Lens Characteristics:

85mm f/1.4 - classic portrait, creamy bokeh
24mm f/2.8 - wide angle, environmental
50mm f/1.2 - natural perspective, shallow DOF
135mm f/2 - compressed perspective, smooth background
Macro lens - extreme close-up detail
Tilt-shift lens - miniature effect or architectural correction

Technical Settings:

f/1.4 - extremely shallow depth of field
f/2.8 - moderate background blur
f/8 - sharp throughout, landscape
f/16 - maximum depth of field, allows longer exposures
ISO 100 - clean, no noise
ISO 3200 - visible grain, low light

Framing:

extreme close-up - filling frame with detail
close-up - head and shoulders
medium shot - waist up
full shot - entire body
wide shot - subject in environment
establishing shot - location focus

Angles:

eye level - natural, relatable
low angle - powerful, imposing
high angle - diminished, overview
Dutch angle - tension, unease
bird's eye - pattern, layout
worm's eye - dramatic upward view

Composition Rules:

rule of thirds - subject at intersection points
centered composition - symmetry, stability
leading lines - guiding eye to subject
frame within frame - natural framing elements
negative space - minimalist, breathing room

Editorial Portrait:

A fashion editorial portrait of a young woman with striking features and
high cheekbones, wearing an avant-garde geometric collar in silver, dramatic
side lighting creating strong shadows, shot on Hasselblad with 100mm lens
at f/2.8, studio background with subtle gradient, high fashion magazine style

Product Photography:

A premium wireless headphone product shot, matte black finish with rose gold
accents, floating at slight angle against pure white background, soft even
lighting eliminating harsh shadows, reflection visible on glossy surface below,
commercial catalog style, ultra sharp focus throughout

Landscape:

A misty morning in ancient redwood forest, towering trees disappearing into
fog above, ferns covering forest floor in layers of green, single shaft of
golden sunlight breaking through canopy, shot on large format camera, rich
detail in bark textures, Ansel Adams inspired black and white with deep tones

Architectural:

Modern minimalist beach house at golden hour, floor-to-ceiling glass walls
reflecting sunset colors, clean white concrete and natural wood, infinity
pool merging with ocean horizon, architectural photography style, wide angle
showing full structure, warm evening light

4. Image-to-Image Editing

Impact: HIGH

Techniques for editing and transforming existing images using FLUX.2 models.

4.1 Image-to-Image (I2I) Prompting

Impact: MEDIUM

Guide to effective image-to-image editing with FLUX models.

All FLUX.2 models support image-to-image editing via reference images:

FLUX.2 [klein]: Up to 4 reference images - fast editing
FLUX.2 [pro]: Up to 8 reference images - balanced quality/speed
FLUX.2 [max]: Up to 8 reference images via the API (10 in the playground) - highest quality editing
FLUX.2 [flex]: Up to 8 reference images - best for typography edits

Simply provide your source image as a reference and describe the desired changes. The model understands image context and can modify specific elements while preserving others.

Note: FLUX.2 models are recommended for image editing. They provide better results than the older FLUX.1 Kontext models.

Preferred: Use URLs directly. They are simpler and more convenient than base64.

When you have an image URL, pass it directly to input_image. The API fetches URLs automatically. Both URL and base64 work, but URLs are recommended when available.
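A request body that passes a URL through input_image might be assembled as follows. This is a sketch, not the documented client: only the `input_image` field name comes from this guide, while the `prompt` field name, the `build_edit_payload` helper, and the example URL are assumptions. The endpoint and authentication are deliberately left out, so check the API reference before sending anything.

```python
import json


def build_edit_payload(prompt: str, image_url: str) -> dict:
    """Build a JSON-serializable body that references the source image by URL."""
    if not image_url.startswith(("http://", "https://")):
        raise ValueError("expected an http(s) URL; use base64 for local files")
    # "prompt" is an assumed field name; "input_image" is named in this guide.
    return {"prompt": prompt, "input_image": image_url}


payload = build_edit_payload(
    "Add snow to the ground",
    "https://example.com/street.jpg",  # placeholder URL
)
body = json.dumps(payload)
print(body)
```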

Simple Modifications:

Direct, single-change instructions:

Add snow to the ground

Attribute Changes:

Modifying specific characteristics:

Age the person to appear 20 years older

Explicit Preservation:

When you need to keep specific elements unchanged:

Transform the daytime photo to nighttime, maintaining the exact
composition, colors of the subject's outfit, and lighting direction

Style Preservation:

Preventing unwanted style shifts:

Add rain effects to the scene while preserving the painting's
impressionist brushwork and color palette

Environmental Changes:

Change to spring with cherry blossoms, fresh green leaves,
soft warm lighting

Style Transfer:

Transform into a watercolor painting with soft edges,
transparent washes, and paper texture visible

Subject Modifications:

Make the subject appear younger, around 25 years old

Object Editing:

Swap the coffee mug for an ornate teacup with floral pattern

Adding Text:

Include a wooden sign with hand-painted text "Welcome Home"
mounted above the door

Modifying Text:

Update the poster text to "SUMMER SALE 2025" maintaining the design

For dramatic transformations, consider breaking into steps:

Step-by-Step Approach:

Instead of:

Transform this modern office into a Victorian library with completely
different furniture, add a fireplace, change the lighting to candlelit,
and age the photograph

Try sequential edits:

Change the furniture style to Victorian antique pieces
Add a stone fireplace on the right wall
Transform lighting to warm candlelit atmosphere
Apply vintage photograph aesthetic with sepia tones

Avoid Vague Instructions:

Bad: Make it look better
Good: Increase contrast, add warm color grading, sharpen details

Be Specific About Scope:

Bad: Change the background
Good: Replace the office background with a tropical beach at sunset,
      maintaining the subject's exact position and lighting direction

Explicit Style Preservation:

Bad: Make it nighttime
Good: Transform to nighttime while maintaining the photorealistic style,
      add appropriate artificial lighting sources

Start Simple - Begin with single-element changes
Be Explicit - State what should change AND what should stay
Reference Context - Mention existing elements when relevant
Iterate - Refine through multiple small edits rather than one large one
Preserve Deliberately - Always specify style/composition preservation needs

5. JSON Structured Prompting

Impact: MEDIUM-HIGH

Using structured JSON for complex multi-element scene composition.

5.1 JSON Structured Prompting

Impact: MEDIUM

For complex scenes with multiple elements, spatial relationships, or production automation, use JSON-structured prompts.

Multiple characters with distinct attributes
Precise spatial positioning
Complex scene composition
Reproducible, template-based prompts
Programmatic prompt generation
Production workflows with variable substitution
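Variable substitution into a JSON-style template can be sketched as below. The template fields, the placeholder names, and the `fill_template` helper are illustrative assumptions, not a schema that FLUX requires.

```python
# Hypothetical template: values in braces are placeholders to substitute.
TEMPLATE = {
    "template_version": "1.0",
    "subjects": [
        {"type": "product", "description": "{product}", "position": "{position}"}
    ],
    "scene": {"setting": "{setting}"},
}


def fill_template(node, values: dict):
    """Recursively substitute {placeholders} in every string of the template."""
    if isinstance(node, str):
        return node.format(**values)
    if isinstance(node, list):
        return [fill_template(item, values) for item in node]
    if isinstance(node, dict):
        return {key: fill_template(item, values) for key, item in node.items()}
    return node


filled = fill_template(TEMPLATE, {
    "product": "matte black headphones",
    "position": "centered",
    "setting": "white studio backdrop",
})
print(filled["subjects"][0]["description"])
```

The helper returns a new structure and leaves the template untouched, so one template can be reused across a batch. A missing placeholder value raises `KeyError`, which surfaces incomplete inputs early.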

Flatten your JSON into flowing prose for the actual prompt:

From JSON:

{
  "subjects": [
    {
      "type": "person",
      "description": "elderly craftsman with weathered hands",
      "position": "seated at workbench",
      "action": "carefully carving wood"
    }
  ],
  "scene": { "setting": "traditional workshop", "time": "morning" },
  "technical": { "lighting": "natural window light from right" }
}

To Prompt:

An elderly craftsman with weathered hands, seated at a workbench, carefully
carving wood in a traditional workshop in the morning, with natural window
light from the right

Use JSON structure for template-based generation.

Define explicit spatial relationships:

{
  "composition": {
    "layout": "triangular",
    "focal_point": "center-left intersection",
    "depth_layers": [
      {
        "layer": "foreground",
        "elements": ["flowers in vase"],
        "focus": "soft blur"
      },
      {
        "layer": "midground",
        "elements": ["main subject"],
        "focus": "sharp"
      },
      {
        "layer": "background",
        "elements": ["window", "garden view"],
        "focus": "soft blur"
      }
    ]
  }
}

Use IDs for References - Give subjects IDs when they interact
Separate Concerns - Keep scene, subjects, style, and technical distinct
Be Consistent - Use the same terminology throughout
Include All Details - Don't assume, specify everything
Flatten for Execution - Convert to natural language before sending to model
Version Templates - Track template versions for reproducibility
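The "Flatten for Execution" step can be sketched as a small function that turns the structured fields into a single comma-separated description. This is one possible flattening, assuming the `subjects` / `scene` / `technical` layout used in the JSON example in this section. `flatten_prompt` is a hypothetical helper, and a hand-edited prose version will usually read more naturally.

```python
def flatten_prompt(spec: dict) -> str:
    """Convert a structured prompt spec into one natural-language string."""
    clauses = []
    for subject in spec.get("subjects", []):
        parts = [subject.get("description"), subject.get("position"),
                 subject.get("action")]
        clauses.append(", ".join(part for part in parts if part))
    scene = spec.get("scene", {})
    if scene.get("setting"):
        setting = "in a " + scene["setting"]
        if scene.get("time"):
            setting += " in the " + scene["time"]
        clauses.append(setting)
    lighting = spec.get("technical", {}).get("lighting")
    if lighting:
        clauses.append(lighting)
    return ", ".join(clauses)


spec = {
    "subjects": [{
        "type": "person",
        "description": "elderly craftsman with weathered hands",
        "position": "seated at workbench",
        "action": "carefully carving wood",
    }],
    "scene": {"setting": "traditional workshop", "time": "morning"},
    "technical": {"lighting": "natural window light from right"},
}
print(flatten_prompt(spec))
```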

6. Color Specification

Impact: MEDIUM

Precise color control using hex codes for brand-accurate generations.

6.1 Hex Color Prompting

Impact: MEDIUM

FLUX supports hex color codes (#RRGGBB) for precise color specification, essential for brand consistency and exact color matching.

Include hex codes directly in your prompt with descriptive names:

Use these keywords to indicate color specification:

1. Always Pair with Description:

Never use hex codes alone - include the color name:

Good: #FF6B6B (coral pink)
Bad: #FF6B6B

2. Associate with Specific Objects:

Clearly connect colors to their targets:

A product shot featuring a smartphone with a #1DA1F2 (Twitter blue) case,
resting on a #14171A (near black) matte surface

3. Limit Color Palette:

3-5 colors typically work best. Too many can confuse the model:

Color palette for the scene: #2ECC71 (emerald green), #3498DB (sky blue),
#F1C40F (sunflower yellow), #FFFFFF (pure white)
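Rules 1 and 3 lend themselves to an automated check before a prompt is submitted. The sketch below flags hex codes that are not followed by a parenthesized color name and warns when a prompt uses more than five distinct codes. `check_hex_colors` is a hypothetical helper, and the parenthesized-name convention is simply the one used in the examples in this section.

```python
import re

HEX_CODE = re.compile(r"#[0-9A-Fa-f]{6}\b")
NAME_AFTER = re.compile(r"\s*\([^)]+\)")


def check_hex_colors(prompt: str, max_colors: int = 5) -> list:
    """Return warnings for unnamed hex codes and oversized palettes."""
    warnings = []
    codes = []
    for match in HEX_CODE.finditer(prompt):
        code = match.group(0).upper()
        codes.append(code)
        # The color name must directly follow the code, e.g. "#FF6B6B (coral pink)".
        if not NAME_AFTER.match(prompt, match.end()):
            warnings.append(f"{code} has no color name in parentheses")
    distinct = len(set(codes))
    if distinct > max_colors:
        warnings.append(f"{distinct} distinct colors; 3-5 typically work best")
    return warnings


print(check_hex_colors("a smartphone with a #1DA1F2 case"))
```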

Brand Colors:

Corporate office reception with brand colors prominently featured:
walls in #0066CC (company blue), accent furniture in #FF6600 (company orange),
logo displayed in #FFFFFF (white) against the blue backdrop

Interior Design:

Scandinavian minimalist bedroom with #F5F5F5 (warm white) walls,
#8B4513 (saddle brown) wooden headboard and nightstands,
#708090 (slate gray) linen bedding, and #DAA520 (goldenrod) accent lamp

Fashion:

Editorial fashion photo: model wearing #000000 (black) cashmere turtleneck,
#FF4500 (orange-red) wide-leg wool pants, #C0C0C0 (silver) geometric earrings,
against a #F0F0F0 (light gray) studio backdrop

Product Design:

Premium headphones product shot: #1C1C1E (space gray) aluminum body,
#F5F5F7 (silver) mesh ear cups, #FF9500 (iOS orange) accent ring around controls

Digital Art:

Synthwave cityscape: #FF00FF (magenta) and #00FFFF (cyan) neon signs,
#1A1A2E (deep navy) night sky, #E94560 (hot pink) setting sun on horizon,
#16213E (dark blue) building silhouettes, rain-slicked streets reflecting lights

Data Visualization:

Specify gradients with start and end colors:

Sunset sky gradient from #FF6B6B (coral) at horizon through
#FFA07A (light salmon) to #87CEEB (sky blue) at top

Complementary: Opposite on color wheel

Scene using complementary colors: #3498DB (blue) dominant with
#E67E22 (orange) accents for visual pop

Analogous: Adjacent colors

Harmonious palette using analogous colors: #9B59B6 (purple),
#8E44AD (deep purple), #3498DB (blue) - flowing naturally together

Triadic: Evenly spaced

Vibrant triadic scheme: #E74C3C (red), #F1C40F (yellow),
#3498DB (blue) - balanced and dynamic

Monochromatic: Single hue variations
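When a palette is derived in code rather than picked by hand, the harmony types above can be approximated by rotating hue or varying lightness. This is a rough sketch using Python's standard `colorsys` module. It works on the RGB/HLS hue wheel, so its results can differ from the traditional painter's wheel that the red/yellow/blue triad reflects. The 30-degree analogous spread and the lightness steps are assumptions, not values from this guide.

```python
import colorsys


def _to_hls(hex_code: str) -> tuple[float, float, float]:
    """Convert '#RRGGBB' to (hue, lightness, saturation), each in 0..1."""
    value = hex_code.lstrip("#")
    r, g, b = (int(value[i:i + 2], 16) / 255 for i in (0, 2, 4))
    return colorsys.rgb_to_hls(r, g, b)


def _to_hex(h: float, l: float, s: float) -> str:
    """Convert HLS back to '#RRGGBB', rounding and clamping each channel."""
    channels = colorsys.hls_to_rgb(h % 1.0, l, s)
    return "#" + "".join(f"{min(255, max(0, round(c * 255))):02X}" for c in channels)


def rotate_hue(hex_code: str, degrees: float) -> str:
    """Rotate the hue by the given number of degrees, keeping lightness and saturation."""
    h, l, s = _to_hls(hex_code)
    return _to_hex(h + degrees / 360, l, s)


def complementary(hex_code: str) -> str:
    return rotate_hue(hex_code, 180)


def analogous(hex_code: str, spread: float = 30) -> list[str]:
    # spread is an assumed convention for "adjacent" hues
    return [rotate_hue(hex_code, -spread), rotate_hue(hex_code, spread)]


def triadic(hex_code: str) -> list[str]:
    return [rotate_hue(hex_code, 120), rotate_hue(hex_code, 240)]


def monochromatic(hex_code: str, lightness_values=(0.3, 0.5, 0.7)) -> list[str]:
    """Keep hue and saturation, vary only lightness."""
    h, _, s = _to_hls(hex_code)
    return [_to_hex(h, lightness, s) for lightness in lightness_values]


if __name__ == "__main__":
    print(complementary("#3498DB"))
```

For #3498DB the computed complement is an orange in the same family as the #E67E22 accent used in the complementary example. Whatever values come out, each one still needs a descriptive color name before it goes into a prompt.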

For reference only - always verify current brand guidelines:

# Social Media
Twitter/X Blue: #1DA1F2
Facebook Blue: #1877F2
Instagram Gradient: #833AB4 to #FD1D1D
LinkedIn Blue: #0A66C2

# Tech
Apple Gray: #1C1C1E
Google Blue: #4285F4
Microsoft Blue: #00A4EF
Amazon Orange: #FF9900

# Design
Figma Purple: #A259FF
Dribbble Pink: #EA4C89
Behance Blue: #1769FF

Color Not Accurate:

Add the color name alongside hex
Specify the exact object the color applies to
Use fewer total colors in the prompt

Color Bleeding:

Clearly delineate which objects get which colors
Use spatial descriptions: "the LEFT chair in #color"

Muddy Colors:

Check hex code accuracy
Specify lighting that won't shift colors
Use "maintaining exact color #XXXXXX" for emphasis

7. Typography and Text

Impact: MEDIUM

Rendering text and typography within generated images.

7.1 Typography and Text Prompting

Impact: MEDIUM

Guide to rendering text in FLUX images. Use FLUX.2 [flex] for best typography results.

1. Use Quotation Marks:

Always quote the exact text you want rendered:

Correct: A poster with "HELLO WORLD" in bold letters
Wrong: A poster with HELLO WORLD in bold letters

2. Specify Font Style:

"ADVENTURE" in bold sans-serif typography
"Welcome" in elegant cursive script
"CHAPTER ONE" in classic serif typeface
"CODE" in monospace terminal font
"SALE!" in decorative display lettering

3. Describe Size Hierarchy:

Large headline "BREAKING NEWS" above smaller subtext "Details inside"

4. Indicate Placement:

"OPEN" sign centered in storefront window
"EXIT" text positioned above doorway
"Page 1" in bottom right corner

5. Front-Load Text:

Place text descriptions early in the prompt for better accuracy:

Good: A sign reading "FRESH BREAD" in a bakery window...
Less Good: A bakery window with a sign that says "FRESH BREAD"...
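For automated prompt assembly, the five rules above reduce to a fixed ordering: quoted text, then font style, then placement, with the whole text description placed ahead of the scene. A minimal sketch under that reading follows; `text_clause` and `front_load` are hypothetical helper names, not part of any FLUX API.

```python
def text_clause(text: str, font_style: str, placement: str = "") -> str:
    """Describe rendered text: exact string in quotes, then font style, then placement."""
    clause = f'"{text}" in {font_style}'
    if placement:
        clause += f", {placement}"
    return clause


def front_load(text_description: str, scene: str) -> str:
    """Put the text description first so it appears early in the prompt."""
    return f"{text_description}, {scene}"


if __name__ == "__main__":
    sign = text_clause("OPEN", "bold sans-serif typography", "centered in storefront window")
    print(front_load(sign, "small corner bakery at dusk"))
```

Keeping the quoting inside one helper avoids the most common slip in generated prompts, which is passing the text to render without quotation marks.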

Sans-Serif (Modern/Clean):

"MINIMAL" in clean geometric sans-serif, Swiss modernist style
"TECH SUMMIT" in bold condensed grotesque typeface
"future" in thin lowercase sans-serif, contemporary design

Serif (Classic/Elegant):

"The New Yorker" in traditional serif typeface, editorial masthead
"LUXURY" in high-contrast Didone serif with thin/thick strokes
"Wisdom" in old-style serif with subtle bracketed serifs

Script/Cursive (Decorative):

"With Love" in flowing calligraphic script with flourishes
"Signature" in connected brush script, casual elegance
"Romance" in formal copperplate script, wedding invitation style

Display/Decorative:

"ROCK CONCERT" in distressed vintage concert poster lettering
"CIRCUS" in ornate Victorian display type with decorative elements
"RETRO" in 1970s rounded bubble letters

Handwritten:

"Note to self" in casual handwritten style, slightly imperfect
"Thanks!" in quick marker pen handwriting
"ideas" in sketchy pencil handwriting

Monospace:

"CODE_COMPLETE" in terminal monospace, developer aesthetic
"SYSTEM" in typewriter monospace, vintage tech
"DEBUG" in LCD-style digital monospace

Neon Signs:

Glowing neon sign spelling "OPEN 24/7" in pink neon tubes with
blue outline, slight glow and reflection, night scene

Metallic/3D:

"GOLD" in three-dimensional metallic gold letters with realistic
reflections and subtle shadows, luxury aesthetic

Embossed/Debossed:

"PREMIUM" embossed into leather surface, subtle shadows showing
the raised letterforms

Outlined:

"MODERN" in outline-only letters, no fill, thin white stroke
on dark background

Gradient Text:

"SUMMER" with gradient fill from #FF6B6B (coral) at top to
#4ECDC4 (teal) at bottom

Poster Design:

Event poster with "SUMMER FEST 2025" as large headline in bold
condensed sans-serif at top, "JULY 15-17" as medium subheading
in regular weight, "Central Park, NYC" as small body text at
bottom, all in white text on #FF6B35 (sunset orange) background

Book Cover:

Book cover design: "THE GREAT GATSBY" in elegant art deco gold
lettering centered in upper third, author name "F. SCOTT FITZGERALD"
in smaller gold caps below, #1A1A2E (midnight blue) background
with geometric gold accents

Magazine Cover:

Fashion magazine cover with "VOGUE" in classic serif masthead at top,
cover line "SPRING COLLECTION" in bold sans-serif, "The New Rules of Style"
in lighter weight italic, all in white against dramatic portrait

Signage:

Vintage diner sign: "MEL'S DINER" in red neon script lettering,
"OPEN" below in separate green neon block letters, chrome border,
1950s Americana aesthetic

Business Card:

Minimalist business card with "JOHN SMITH" in medium weight sans-serif,
"Creative Director" in lighter weight below, contact details in small
type at bottom, #2C3E50 (dark blue) text on white background

Centered Composition:

Centered text layout: "WELCOME" in large caps at center,
perfectly balanced with equal margins

Left-Aligned:

Left-aligned text block: "Company Name" as header,
"Tagline goes here" below, flush left alignment

Text on Path:

"GOING IN CIRCLES" text following a circular path around
the center of the design

Text Overlay:

"ADVENTURE AWAITS" in bold white text overlaid on landscape
photograph, positioned in lower third with slight shadow for readability

Steps Parameter:

Higher steps (30-50) = better text quality
Lower steps (10-20) = faster, lower quality

Guidance Parameter:

Higher guidance (6-10) = stricter prompt following
Lower guidance (1.5-4) = more creative interpretation

Recommended Settings:

For clean typography: steps=50, guidance=7
For artistic text: steps=30, guidance=4
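The recommended settings can be kept as named presets so that agents pick them consistently. This is a sketch only: the preset names are hypothetical, and whether a given endpoint accepts parameters called `steps` and `guidance` is an assumption based on the wording above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SamplingSettings:
    steps: int
    guidance: float


# Values copied from the recommended settings above.
PRESETS = {
    "clean_typography": SamplingSettings(steps=50, guidance=7.0),
    "artistic_text": SamplingSettings(steps=30, guidance=4.0),
}


def settings_for(goal: str) -> dict:
    """Return steps/guidance for a goal as a plain dict, ready to merge into request parameters."""
    preset = PRESETS[goal]
    return {"steps": preset.steps, "guidance": preset.guidance}


if __name__ == "__main__":
    print(settings_for("clean_typography"))
```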

Misspelled Words:

Keep text short (1-4 words work best)
Use common words when possible
Repeat the exact text in the prompt

Illegible Text:

Specify larger text size
Use simpler fonts (sans-serif)
Ensure high contrast with background
Use [flex] model

Wrong Font Style:

Be more specific:

Instead of: "text in a nice font"
Use: "text in bold geometric sans-serif similar to Futura"

Text Not Appearing:

Front-load text description in prompt
Put text in quotes
Specify exact placement
Reduce other prompt complexity

8. Multi-Reference Editing

Impact: MEDIUM

Combining multiple reference images for style transfer and composition.

8.1 Multi-Reference Image Editing

Impact: MEDIUM

Guide to using multiple reference images for character consistency, style transfer, and complex compositions.

FLUX.2 models support multiple reference images for advanced editing:

FLUX.2 [klein]: Up to 4 reference images - fast editing
FLUX.2 [pro]: Up to 8 via API - balanced quality/speed
FLUX.2 [max]: Up to 8 via API, 10 in playground - highest quality
FLUX.2 [flex]: Up to 8 via API - best for typography

Note: FLUX.2 models are recommended over FLUX.1 Kontext Max for better results.

Preferred: Use URLs directly - simpler and more convenient than base64.

Pass image URLs directly to input_image, input_image_2, etc.:

The API fetches URLs automatically. Both URL and base64 work, but URLs are recommended when available.
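As an illustration of the URL approach, the sketch below maps a list of image URLs onto the `input_image`, `input_image_2`, ... fields and enforces the per-model limits listed above. It only builds a dictionary; the endpoint, authentication, and other request fields are deliberately left out. The helper name is hypothetical, and continuing the numbering as `input_image_3` and beyond is an assumption based on the "etc." in the text.

```python
# Reference limits from the list above (API values).
MAX_REFERENCES = {"klein": 4, "pro": 8, "max": 8, "flex": 8}


def build_reference_fields(image_urls: list[str], model: str) -> dict[str, str]:
    """Map URLs to input_image, input_image_2, ... fields, enforcing the model's limit."""
    limit = MAX_REFERENCES[model]
    if not image_urls:
        raise ValueError("at least one reference image is required")
    if len(image_urls) > limit:
        raise ValueError(
            f"FLUX.2 [{model}] accepts up to {limit} references, got {len(image_urls)}"
        )
    fields = {}
    for index, url in enumerate(image_urls, start=1):
        # The first field has no numeric suffix; later ones are assumed to be numbered.
        key = "input_image" if index == 1 else f"input_image_{index}"
        fields[key] = url
    return fields


if __name__ == "__main__":
    urls = ["https://example.com/person.png", "https://example.com/cafe.png"]
    print(build_reference_fields(urls, "pro"))
```

The order of the URLs matters, because prompts refer to the references as "image 1", "image 2", and so on.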

Natural Language Description:

Describe relationships between images naturally:

The person from image 1 is sitting at the cafe table from image 2,
wearing the outfit from image 3, with the warm lighting style of image 4

Explicit Indexing:

Reference images by number for precision:

Combine the face from image 1 with the hairstyle from image 2
on the body pose from image 3

Character Consistency:

Maintain the same character across multiple scenes:

The same person from image 1, now seated at a desk in a modern office,
same clothing and hairstyle, different environment

For sequential consistency:

Style Transfer:

Apply the style of one image to another:

Apply the color grading and mood from image 2 to the scene in image 1

Pose Guidance:

Use a reference for body positioning:

The person from image 1 in the exact pose shown in image 2,
placed in the environment from image 3

Object Composition:

Combine elements from multiple images:

Place the product from image 1 on the table setting from image 2,
using the lighting style from image 3

Background Replacement:

Keep the subject from image 1 exactly as shown, replace the background
with the beach scene from image 2, match the lighting naturally

Two Characters:

Image 1 (person A) and image 2 (person B) having a conversation
at a coffee shop table, person A on the left gesturing, person B
on the right listening intently

Group Composition:

The three people from images 1, 2, and 3 standing together for a
group photo, arranged left to right in that order, friendly poses,
outdoor park setting

Selective Attribute Transfer:

The face and expression from image 1, the hairstyle from image 2,
wearing the outfit from image 3, in the pose from image 4

Partial Transfer:

Use a collage input for layout guidance:

Arrange the scene using the layout shown in the collage input:
- Person from image 1 in the left position
- Object from image 2 in the center position
- Background element from image 3 filling the right side

1. Clear Image Roles:

Specify what each reference provides:

Image 1: face/identity reference
Image 2: pose/body reference
Image 3: style/aesthetic reference
Image 4: environment/background reference

2. Quality References:

Use high-quality, clear reference images
Ensure good lighting in references
Avoid heavily processed or filtered images

3. Consistent Lighting:

When combining elements:

...ensure the lighting direction matches across all elements,
with main light source from the upper left

4. Resolution Awareness:

For [pro] API with 9MP total limit:

At 1MP output: up to 8 reference images comfortably
Calculate: input images + output = total MP
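The budget calculation can be worked through in a few lines. This is a sketch under two assumptions that the text does not settle: one megapixel is counted as 1,000,000 pixels, and each reference counts at its full pixel size. The function names are hypothetical.

```python
MEGAPIXEL = 1_000_000   # assumption: 1 MP = one million pixels
TOTAL_LIMIT_MP = 9.0    # [pro] API total limit stated above


def megapixels(width: int, height: int) -> float:
    """Size of one image in megapixels."""
    return width * height / MEGAPIXEL


def total_megapixels(reference_sizes, output_size) -> float:
    """Sum the megapixels of all input images plus the output image."""
    sizes = list(reference_sizes) + [output_size]
    return sum(megapixels(width, height) for width, height in sizes)


def fits_budget(reference_sizes, output_size, limit_mp: float = TOTAL_LIMIT_MP) -> bool:
    """True when inputs plus output stay within the total megapixel limit."""
    return total_megapixels(reference_sizes, output_size) <= limit_mp


if __name__ == "__main__":
    references = [(1000, 1000)] * 8  # eight 1 MP references
    print(total_megapixels(references, (1000, 1000)), fits_budget(references, (1000, 1000)))
```

With eight 1 MP references and a 1 MP output the total is exactly 9 MP, which matches the "up to 8 reference images" figure above; larger references reduce how many fit.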

5. Explicit Relationships:

Don't assume - specify exactly how elements relate:

Create a scene combining:
- The woman from image 1 (keep exact face, expression, hair)
- Wearing the vintage dress from image 2 (exact pattern and cut)
- In the pose from image 3 (seated position, arm placement)
- Set in the library from image 4 (bookshelves, furniture)
- Using the warm lighting style from image 5 (golden hour quality)

Position her in the center of frame, medium shot, looking slightly
to the right with a thoughtful expression.
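When the references and their roles are already held as data, a prompt in this explicit style can be generated instead of handwritten. The helper below is a hypothetical sketch, not part of any FLUX tooling: it takes (image number, element, details to preserve) entries and emits one bullet per reference, followed by optional framing instructions.

```python
def combine_references(elements: list[tuple[int, str, str]], framing: str = "") -> str:
    """Build a prompt listing what to take from each numbered reference image.

    Each element is (image_number, element_description, details_to_preserve).
    """
    lines = ["Create a scene combining:"]
    for image_number, element, details in elements:
        lines.append(f"- {element} from image {image_number} ({details})")
    prompt = "\n".join(lines)
    if framing:
        prompt += "\n\n" + framing
    return prompt


if __name__ == "__main__":
    print(
        combine_references(
            [
                (1, "The woman", "keep exact face, expression, hair"),
                (2, "Wearing the vintage dress", "exact pattern and cut"),
            ],
            framing="Position her in the center of frame, medium shot.",
        )
    )
```

Every bullet names its image number, so no element is left for the model to guess.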

Elements Not Transferring:

Be more specific about which element from which image
Use explicit indexing ("from image 1")
Reduce the number of references and complexity

Inconsistent Blending:

Specify lighting consistency
Describe how elements should interact
Use style references to unify the composition

Identity Drift:

Emphasize key identifying features
Use phrases like "maintaining exact likeness"
Provide multiple angles of the same subject if available

9. Positive Prompt Alternatives

Impact: MEDIUM

Strategies for achieving results without negative prompts, which FLUX does not support.

9.1 Negative Prompt Alternatives

Impact: MEDIUM

FLUX does not support negative prompts. This guide provides positive alternatives for common negative prompt patterns.

Naming an unwanted element, even to exclude it, can make the model focus MORE on it. Instead, describe exactly what you DO want; this gives clearer direction and better results.

For any unwanted element:

Identify what you don't want
Ask: "What would be there instead?"
Describe the positive alternative
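The first step, identifying what is being negated, can be partly automated before a prompt is sent. The sketch below is a heuristic only: it flags phrases that start with "no", "not", or "without" using a regular expression, and it will miss other phrasings such as "nothing in background". The function name is hypothetical.

```python
import re

# Negation openers that usually signal a negative-prompt pattern.
NEGATION = re.compile(r"\b(?:no|not|without)\s+[a-z][a-z\- ]*", re.IGNORECASE)


def find_negations(prompt: str) -> list[str]:
    """Return the negated phrases found in a prompt, in order of appearance."""
    return [match.group(0).strip() for match in NEGATION.finditer(prompt)]


if __name__ == "__main__":
    print(find_negations("a portrait of a woman, no glasses, no hat, no makeup"))
```

Each flagged phrase is a prompt for the second and third steps: decide what should be there instead and describe that.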

People/Crowds:

| Instead of | Use |
|------------|-----|
| "no people" | "empty", "deserted", "solitary", "abandoned" |
| "no crowds" | "quiet", "peaceful", "secluded", "private" |
| "without background people" | "isolated subject", "clean background", "solo figure" |

Example:

Bad: A beach scene, no people
Good: A deserted beach at dawn, pristine untouched sand, solitary seagull

Skin/Appearance:

| Instead of | Use |
|------------|-----|
| "no makeup" | "natural skin", "bare face", "fresh-faced" |
| "no blemishes" | "clear skin", "smooth complexion", "healthy glow" |
| "no wrinkles" | "youthful skin", "smooth features" |

Example:

Bad: Portrait of woman, no makeup, no blemishes
Good: Portrait of a woman with natural clear skin, fresh-faced with a healthy glow

Accessories:

| Instead of | Use |
|------------|-----|
| "no glasses" | "visible eyes", "unobstructed gaze", "clear eye contact" |
| "no hat" | "bare head", "visible hair", "uncovered head" |
| "no jewelry" | "minimal accessories", "understated", "unadorned" |

Example:

Bad: Man portrait, no glasses, no hat
Good: Portrait of a man with clear direct gaze, wind-swept visible hair

Colors:

| Instead of | Use |
|------------|-----|
| "no color" | "monochrome", "black and white", "grayscale" |
| "not colorful" | "muted tones", "subdued palette", "desaturated" |
| "no bright colors" | "neutral tones", "earth tones", "soft pastels" |

Example:

Bad: Landscape photo, no bright colors
Good: Landscape in muted earth tones, soft morning light, desaturated palette

Text/Watermarks:

| Instead of | Use |
|------------|-----|
| "no text" | "clean surfaces", "unmarked", "text-free" |
| "no watermark" | "pristine image", "clean composition" |
| "no logos" | "unbranded", "plain", "logo-free surface" |

Example:

Bad: Product photo, no watermark, no text
Good: Clean product photography with pristine unmarked surfaces, minimal unbranded design

Style/Era:

| Instead of | Use |
|------------|-----|
| "not modern" | "traditional", "classical", "vintage", "historical" |
| "no CGI look" | "photorealistic", "authentic", "natural", "organic" |
| "not cartoonish" | "realistic", "lifelike", "naturalistic" |

Example:

Bad: Building design, not modern, no futuristic elements
Good: Traditional Victorian architecture with classical ornate details and period-accurate features

Quality/Artifacts:

| Instead of | Use |
|------------|-----|
| "no blur" | "sharp focus", "crisp details", "tack-sharp" |
| "no noise" | "clean image", "smooth gradients", "low ISO" |
| "no artifacts" | "pristine quality", "clean render", "flawless" |

Example:

Bad: Portrait, no blur, no noise
Good: Tack-sharp portrait with pristine image quality, smooth skin tones, crisp details

Objects:

| Instead of | Use |
|------------|-----|
| "no cars" | "pedestrian area", "car-free zone", "walking street" |
| "no buildings" | "open landscape", "natural scenery", "wilderness" |
| "no furniture" | "empty room", "bare space", "minimalist interior" |

Example:

Bad: Street scene, no cars, no modern buildings
Good: Historic cobblestone walking street lined with traditional stone buildings from the 1800s

Weather/Environment:

| Instead of | Use |
|------------|-----|
| "no rain" | "clear sky", "dry weather", "sunny day" |
| "no clouds" | "clear blue sky", "cloudless", "perfect visibility" |
| "not dark" | "well-lit", "bright", "daylight", "illuminated" |

Example:

Bad: Outdoor portrait, no rain, no clouds, not dark
Good: Outdoor portrait under clear blue sky on a bright sunny day, perfect natural lighting

Composition:

| Instead of | Use |
|------------|-----|
| "no distractions" | "clean composition", "focused framing", "minimal elements" |
| "nothing in background" | "solid background", "isolated subject", "clean backdrop" |
| "no clutter" | "organized", "tidy", "minimal", "streamlined" |

Example:

Bad: Product shot, no distractions, nothing in background
Good: Product on clean white seamless backdrop, isolated subject, minimal focused composition

Original Negative-Heavy Prompt:

Portrait of a woman, no glasses, no makeup, no wrinkles, no blemishes,
no bright colors, no distracting background, no harsh lighting

Positive Rewrite:

Portrait of a youthful woman with clear natural skin and visible bright eyes,
fresh-faced with a healthy glow, wearing muted earth tones against a soft
blurred neutral background, gentle diffused lighting creating soft shadows

Original Negative-Heavy Prompt:

Landscape photo, no people, no buildings, no power lines, no modern elements,
no overcast sky, no dead trees

Positive Rewrite:

Pristine wilderness landscape with lush green living forest, clear blue sky,
untouched natural scenery stretching to the horizon, peaceful solitude with
only birdsong and wind, golden hour sunlight filtering through healthy foliage

| Unwanted | Positive Alternative |
|----------|----------------------|
| No people | Empty, solitary, deserted |
| No makeup | Natural, fresh-faced, bare |
| No text | Clean, unmarked, pristine |
| No blur | Sharp, crisp, tack-sharp |
| No modern | Traditional, vintage, classical |
| No dark | Bright, well-lit, luminous |
| No busy | Minimal, clean, focused |
| No artificial | Natural, organic, authentic |
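A quick-reference table like this one can also be kept as a lookup so that an agent gets candidate wording whenever a known negative phrase appears. The sketch below covers only four of the rows, uses plain substring matching, and returns suggestions rather than rewriting the prompt, since the positive wording usually needs to be worked into the scene description by hand. The names are hypothetical.

```python
# Subset of the quick-reference table above; keys are lowercase for matching.
QUICK_REFERENCE = {
    "no people": ["empty", "solitary", "deserted"],
    "no makeup": ["natural", "fresh-faced", "bare"],
    "no text": ["clean", "unmarked", "pristine"],
    "no blur": ["sharp", "crisp", "tack-sharp"],
}


def suggest_alternatives(prompt: str) -> dict[str, list[str]]:
    """Return positive alternatives for each known negative phrase found in the prompt."""
    lowered = prompt.lower()
    return {
        phrase: alternatives
        for phrase, alternatives in QUICK_REFERENCE.items()
        if phrase in lowered
    }


if __name__ == "__main__":
    print(suggest_alternatives("A beach scene, no people"))
```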
