FLUX Best Practices
Version 1.0.0
Black Forest Labs
January 2026
Note:
This document is for AI agents and LLMs to follow when working with
FLUX image generation prompting and workflows. Humans may also find it useful,
but guidance here is optimized for automation and consistency.
Abstract
Comprehensive prompting and workflow guide for BFL FLUX image generation models. Covers all FLUX.2 and FLUX.1 models including text-to-image, image-to-image editing, JSON structured prompting, color specification, typography, and multi-reference editing. Each rule includes detailed explanations, examples of effective vs ineffective approaches, and model-specific guidance.
Table of Contents
- Core Principles - CRITICAL
- Model Selection - HIGH
- Text-to-Image Prompting - HIGH
- Image-to-Image Editing - HIGH
- JSON Structured Prompting - MEDIUM-HIGH
- Color Specification - MEDIUM
- Typography and Text - MEDIUM
- Multi-Reference Editing - MEDIUM
- Positive Prompt Alternatives - MEDIUM
1. Core Principles
Impact: CRITICAL
Universal prompting principles that apply to all FLUX models. Master these before diving into specific techniques.
1.1 Core FLUX Prompting Principles
Impact: MEDIUM
These principles apply to all FLUX models and form the foundation of effective prompting.
FLUX does NOT support negative prompts. Always describe what you WANT, not what you don't want.
Build prompts using this structure for consistent results:
Wrong Approach:
a portrait of a woman, no glasses, no hat, no makeup
Correct Approach:
a portrait of a woman with natural skin, clear face, bare head, visible eyes
Example:
A young woman with flowing auburn hair (subject)
dancing gracefully in mid-leap (action)
in the style of classical oil painting (style)
in a moonlit garden with roses (context)
soft diffused moonlight with subtle rim lighting (lighting)
medium shot, shallow depth of field (technical)
More specific prompts yield dramatically better results.
Vague: Poor Results
a cat sitting
Specific: Excellent Results
A fluffy orange tabby cat with bright green eyes sitting regally on a vintage
velvet armchair, afternoon sunlight streaming through lace curtains, warm
golden hour lighting, shallow depth of field, shot on medium format film
Write prompts as descriptive prose rather than keyword lists.
Keyword Style: Less Effective
woman, portrait, beautiful, blonde, studio, professional, 8k, detailed
Prose Style: More Effective
A professional studio portrait of a beautiful blonde woman in her thirties,
captured with soft studio lighting that accentuates her features, rendered
in stunning detail with natural skin texture and subtle catchlights in her eyes
Always specify lighting - it has the single greatest impact on image quality.
Natural Lighting:
-
Golden hour - warm, soft, directional
-
Overcast - soft, diffused, even
-
Harsh midday - high contrast, strong shadows
-
Dappled forest light - specular, organic patterns
Studio Lighting:
-
Softbox - even, professional
-
Rim light - edge definition, separation
-
Butterfly lighting - beauty, glamour
-
Rembrandt lighting - dramatic, classic portraits
Atmospheric Lighting:
-
Volumetric fog - depth, mystery
-
God rays - dramatic, spiritual
-
Neon glow - urban, cyberpunk
-
Candlelight - warm, intimate
Mood-Based Lighting:
-
Dramatic shadows - tension, noir
-
High key - bright, airy, clean
-
Low key - moody, mysterious
-
Chiaroscuro - strong contrast, painterly
FLUX prioritizes elements that appear earlier in the prompt. Front-load important elements.
Less Effective:
A forest background with soft lighting where a knight in shining armor stands
More Effective:
A knight in shining armor stands in a forest, soft dappled lighting filtering
through the canopy
Optimal prompt length is typically 30-80 words (FLUX can handle up to 512 tokens).
-
Too short: Lacks direction, generic results
-
Too long: Can become unfocused
-
Sweet spot: Enough detail to guide, not so much it confuses
Build prompts iteratively:
-
Start with core subject and action
-
Add style and medium
-
Specify lighting and atmosphere
-
Include technical details
-
Refine based on results
Change one element at a time to understand what affects your output.
Reference: negative-prompt-alternatives.md
2. Model Selection
Impact: HIGH
Choosing the right FLUX model for your use case. Covers both FLUX.2 (latest) and FLUX.1 (legacy) model families.
2.1 FLUX Model Selection Guide
Impact: MEDIUM
Decision guide for selecting the optimal FLUX model based on your requirements.
| Priority | Recommended Model |
| ------------- | ------------------------------- |
| Speed | FLUX.2 [klein] |
| Quality | FLUX.2 [max] |
| Balance | FLUX.2 [pro] |
| Typography | FLUX.2 [flex] |
| Image Editing | FLUX.2 [klein], [pro], or [max] |
| Local/Free | FLUX.2 [dev] |
| Inpainting | FLUX.1 Fill |
Note: All FLUX.2 models natively support image-to-image editing via reference images. Simply provide your source image(s) as references and describe the desired changes.
By Speed:
| Model | Relative Speed | Best For |
| ----------------- | -------------- | ----------------------- |
| FLUX.2 [klein] 4B | Fastest | Rapid prototyping |
| FLUX.2 [klein] 9B | Very Fast | Better quality previews |
| FLUX.2 [pro] | Medium | Production workflows |
| FLUX.2 [flex] | Medium | Typography tasks |
| FLUX.2 [max] | Slower | Final hero images |
By Quality:
| Model | Quality Level | Trade-off |
| ----------------- | ------------- | -------------------------- |
| FLUX.2 [max] | Highest | Slowest, most expensive |
| FLUX.2 [pro] | High | Good balance |
| FLUX.2 [flex] | High (text) | Specialized for typography |
| FLUX.2 [klein] 9B | Good | Fast, slightly less detail |
| FLUX.2 [klein] 4B | Moderate | Fastest, preview quality |
By Cost:
Credit pricing: 1 credit = $0.01 USD. FLUX.2 uses megapixel-based pricing.
| Model | 1st MP | +MP | 1MP T2I | 1MP I2I | Volume Recommendation |
| ----------------- | ------ | ---- | ------- | ------- | --------------------------- |
| FLUX.2 [klein] 4B | 1.4c | 0.1c | $0.014 | $0.015 | High volume, previews |
| FLUX.2 [klein] 9B | 1.5c | 0.2c | $0.015 | $0.017 | High volume, better quality |
| FLUX.2 [pro] | 3c | 1.5c | $0.03 | $0.045 | Production workloads |
| FLUX.2 [max] | 7c | 3c | $0.07 | $0.10 | Hero images, premium |
| FLUX.2 [flex] | 6c | 6c | $0.06 | $0.12 | Typography |
| FLUX.2 [dev] | - | - | Free | Free | Local dev (non-commercial) |
Pricing formula:
(firstMP + (outputMP-1) * mpPrice) + (inputMP * mpPrice)in cents
| Model | Price/Image | Use Case |
| -------------------- | ----------- | ----------------------- |
| FLUX.1 Kontext [pro] | $0.04 | Context-aware editing |
| FLUX.1 Kontext [max] | $0.08 | Max quality editing |
| FLUX1.1 [pro] | $0.04 | Standard T2I |
| FLUX1.1 [pro] Ultra | $0.06 | Ultra high-resolution |
| FLUX1.1 [pro] Raw | $0.06 | Candid photography feel |
| FLUX.1 Fill [pro] | $0.05 | Inpainting |
| FLUX.1 [pro] | $0.05 | Original pro model |
Use bfl.ai/pricing calculator for exact costs at different resolutions.
Creative Exploration / Ideation:
Recommended: FLUX.2 [klein]
-
Fast iterations
-
Quick concept testing
-
Mood board generation
-
Exploring prompt variations
Production Marketing Assets:
Recommended: FLUX.2 [pro]
-
Consistent quality
-
Reasonable speed
-
Cost-effective at scale
-
Reliable for automation
Hero Images / Premium Content:
Recommended: FLUX.2 [max]
-
Maximum detail
-
Best coherence
-
Supports grounding search
-
Worth the premium for key visuals
Typography / Signage / Posters:
Recommended: FLUX.2 [flex]
-
Superior text rendering
-
Adjustable quality settings
-
Best for readable text
-
UI mockups and infographics
Character Consistency:
Recommended: FLUX.2 [max] or [pro]
-
Multi-reference support (up to 8-10 images)
-
Best editing consistency
-
Maintains identity across scenes
-
Superior quality over FLUX.1 Kontext
Photo Editing / Retouching:
Recommended: FLUX.2 [klein], [pro], or [max]
-
Native image-to-image support via references
-
Style transfer
-
Object modification
-
Attribute changes
-
Better results than FLUX.1 Kontext
Real-Time Information:
Recommended: FLUX.2 [max]
-
Grounding search feature
-
Current events
-
Recent news visualization
-
Weather/location data
Local Development / Testing:
Recommended: FLUX.2 [dev]
-
No API costs
-
Full control
-
Fine-tuning experiments
-
Offline capability
Editorial with Typography:
1. FLUX.2 [max] - Generate base image (highest quality)
2. FLUX.2 [flex] - Add text overlay pass
Character-Consistent Series:
1. FLUX.2 [max] - Create character reference
2. FLUX.2 [max]/[pro] - Generate consistent variations using reference images
3. FLUX.2 [klein] - Quick iteration on variations if needed
E-commerce Product Pipeline:
1. FLUX.2 [pro] - Bulk product generations
2. FLUX.2 [pro]/[klein] - Product variations (colors, angles) using references
3. FLUX.2 [flex] - Add promotional text/pricing
Limited Budget:
-
High volume: FLUX.2 [klein] 4B
-
Quality needed: FLUX.2 [pro] (best value)
Tight Deadline:
-
Any task: FLUX.2 [klein]
-
Quality matters: FLUX.2 [pro]
Maximum Quality Required:
- Always: FLUX.2 [max]
Text Must Be Readable:
- Always: FLUX.2 [flex]
Editing Existing Images:
-
Fast edits: FLUX.2 [klein] with reference images
-
Quality edits: FLUX.2 [max] or [pro] with reference images
-
Alternative: FLUX.1 Kontext (FLUX.2 preferred)
Rate Limit Sensitivity:
Speed? → FLUX.2 [klein]
Quality? → FLUX.2 [max]
Balance? → FLUX.2 [pro]
Text? → FLUX.2 [flex]
Edit? → FLUX.2 [klein/pro/max] with reference images
Free? → FLUX.2 [dev]
-
Prefer: FLUX.2 models (24 concurrent limit)
-
Avoid: FLUX.1 Kontext Max (6 concurrent limit)
Key insight: All FLUX.2 models support image editing natively via reference images. FLUX.2 is recommended over FLUX.1 Kontext for editing tasks.
2.2 FLUX.1 Model Family
Impact: MEDIUM
Tip: FLUX.2 models are the latest generation and recommended for most use cases. FLUX.1 models are still available for specific needs.
Guide to FLUX.1 models and their specialized capabilities.
| Model | Purpose | Notes |
|-------|---------|-------|
| FLUX1.1 [pro] | Text-to-image | FLUX.2 [pro] offers improved results |
| FLUX.1 Kontext | Image-to-image | FLUX.2 with references recommended |
| FLUX.1 Kontext Max | Image-to-image | FLUX.2 [max] with references recommended |
| FLUX.1 Fill | Inpainting | Useful for specific inpainting tasks |
Fast and reliable text-to-image generation.
Characteristics:
-
Strong prompt adherence
-
Production-grade architecture
-
Consistent, reliable results
-
Scalable for high-volume
-
Pricing: $0.04 per image
Prompting Style:
Standard descriptive prompts with clear subject and style specification.
Example Prompt:
A golden retriever puppy playing in autumn leaves, warm afternoon sunlight,
shallow depth of field with bokeh background, joyful expression, professional
pet photography style
Recommendation: FLUX.2 models with reference images provide improved editing results.
Context-aware image-to-image editing model for transformations and modifications.
Characteristics:
-
Understands image context
-
Preserves unedited regions
-
Style transfer capabilities
-
Object modification
-
Basic to complex transformations
Prompting Strategies:
Add the text "OPEN" as a neon sign in the window, red glowing letters
with slight reflection on the glass, matching the nighttime atmosphere
Simple, direct instructions work best:
Be explicit about what to preserve:
For dramatic changes, be specific about preservation:
Reference specific artistic movements:
Describe text placement and integration:
Tips for Kontext:
-
Be explicit about what should NOT change
-
Start with simpler edits, build complexity
-
Specify style preservation when needed
-
Use for incremental refinement
Advanced multi-reference editing for complex compositions.
Characteristics:
-
Handles up to 10 reference images
-
Best editing consistency across references
-
Complex scene composition
-
Character consistency maintenance
-
Rate limit: 6 concurrent requests
Multi-Reference Prompting:
Replace the top half of the person in image 1 with the clothing
from image 2, maintaining the pose and background
Describe relationships between images naturally:
Reference images by number for precision:
Tips for Kontext Max:
-
Plan your reference images carefully
-
Use natural language for relationships
-
Specify which elements come from which image
-
Leverage for character consistency across scenes
Specialized tool for object removal and area completion.
Characteristics:
-
Clean object removal
-
Intelligent background completion
-
Texture-aware filling
-
Seamless blending
Use Cases:
-
Remove unwanted objects from photos
-
Complete partial images
-
Replace specific regions
-
Clean up image artifacts
Prompting for Fill:
Complete with matching ocean waves and sandy beach
Describe what should fill the masked area:
Tips for Fill:
-
Provide context about surrounding areas
-
Specify texture and pattern continuation
-
Describe lighting consistency
-
Use for cleanup and removal tasks
2.3 FLUX.2 Model Family
Impact: MEDIUM
Complete guide to FLUX.2 variants and their optimal prompting strategies.
Key Feature: All FLUX.2 models natively support both text-to-image generation AND image-to-image editing via reference images. There's no need to use legacy FLUX.1 Kontext models for editing tasks.
| Model | Parameters | Best For | Speed | Reference Images |
|-------|------------|----------|-------|------------------|
| [klein] | 4B/9B | Fast iterations, previews, quick edits | Fastest | Up to 4 |
| [max] | - | Highest quality generation & editing | Slowest | Up to 8-10 |
| [pro] | - | Production balanced | Medium | Up to 8 |
| [flex] | - | Typography, text rendering | Medium | Up to 8 |
| [dev] | - | Local development | Varies | Varies |
Best for rapid prototyping, previews, and high-volume generation.
Characteristics:
-
4B or 9B parameter versions available
-
Sub-second generation times
-
Optimized for speed over maximum detail
-
No prompt upsampling - be descriptive yourself
-
Supports up to 4 reference images
Prompting Style: Narrative Prose:
Klein responds best to descriptive, narrative-style prompts with emphasis on lighting and atmosphere.
Example Prompt:
A cozy coffee shop interior bathed in warm afternoon light, steam rising lazily
from ceramic cups, worn leather armchairs arranged around small wooden tables,
bookshelves lining exposed brick walls, the soft atmosphere of a quiet afternoon
with dust motes floating in sunbeams through tall windows
Tips for [klein]:
-
Write like a novelist describing a scene
-
Front-load your subject (word order critical)
-
Emphasize lighting descriptions heavily
-
Keep prompts moderately detailed (40-70 words)
Premium model for final production assets and maximum detail.
Characteristics:
-
Highest detail and coherence
-
Best editing consistency
-
Vast world knowledge
-
Includes grounding search (real-time web data)
-
Strongest prompt following
-
Supports up to 8 reference images (API), 10 (playground)
Prompting Style: Technical + Descriptive:
[max] excels with detailed technical specifications combined with descriptive prose.
Example Prompt:
Portrait of a weathered fisherman, age 70, deep wrinkles telling stories of
decades at sea, salt-and-pepper beard with streaks of white, wearing a navy
cable-knit sweater with visible wool texture. Shot on Hasselblad X2D with
90mm f/2.8 lens at f/4, golden hour natural light from the left creating
strong rim lighting, shallow depth of field with soft bokeh from harbor
lights behind, Kodak Portra 400 color science with natural grain
Tips for [max]:
-
Include camera and lens specifications for photorealism
-
Specify film stock or digital sensor characteristics
-
Use technical photography terms (aperture, focal length)
-
Leverage grounding search for current events: "news photo of [recent event]"
Optimal balance of quality and speed for production workflows.
Characteristics:
-
Good quality-to-speed ratio
-
Reliable, consistent output
-
Suitable for batch processing
-
Supports prompt upsampling
-
Supports up to 8 reference images
Prompting Style: Balanced Detail:
Standard detailed prompts work well without excessive technical specification.
Example Prompt:
A modern minimalist living room with floor-to-ceiling windows overlooking
a city skyline at dusk, clean white furniture with subtle textures, a single
statement plant in the corner, warm ambient lighting from hidden sources,
architectural photography style with clean lines and balanced composition
Tips for [pro]:
-
Balance specificity with generation speed
-
Good for template-based prompt systems
-
Enable prompt upsampling for enhanced results
-
Consistent quality for production pipelines
Optimized for text rendering and typographic content.
Characteristics:
-
Superior text rendering quality
-
Handles multiple text elements
-
Adjustable steps (1-50) and guidance (1.5-10)
-
Best for signage, posters, UI mockups
-
Supports up to 8 reference images
Prompting Style: Typography-Focused:
Always quote text and specify font characteristics explicitly.
Example Prompt:
A modern minimalist poster design with the headline "DESIGN SUMMIT 2025"
in bold condensed sans-serif typography centered in the upper third,
subtitle "Innovation Meets Creativity" in lighter weight below,
date "MARCH 15-17" in small caps at the bottom, all text in white
on a gradient background transitioning from deep purple #4A0080 to
coral #FF6B6B, clean geometric accent lines, professional print quality
Tips for [flex]:
-
Always quote exact text:
"Your Text Here" -
Specify font style: serif, sans-serif, script, display, monospace
-
Describe text hierarchy: headline, subhead, body
-
Include placement: centered, left-aligned, upper third
-
Adjust steps (higher = better quality) and guidance (higher = stricter)
For local development, testing, and non-commercial use.
Characteristics:
-
Open weights on Hugging Face
-
Runs locally (~13GB VRAM recommended)
-
Full customization available
-
Free for non-commercial use
-
Base variants available (undistilled) for fine-tuning
Prompting Style: Standard:
Same prompting patterns as [pro] work well.
Tips for [dev]:
-
Use for development and testing before production
-
Experiment with prompt variations
-
Good for fine-tuning experiments
-
Check license for commercial use restrictions
All FLUX.2 models support image editing via reference images. This replaces the need for legacy FLUX.1 Kontext models.
How It Works:
-
Provide your source image(s) as reference images
-
Describe the desired changes in your prompt
-
The model preserves context while applying edits
Example: Style Transfer:
Reference: [your source image]
Prompt: Transform this image into a watercolor painting style,
maintaining the exact composition and subject positioning
Example: Object Modification:
Reference: [your source image]
Prompt: Change the car color to red while keeping everything else identical
Example: Character Consistency:
Reference: [character reference image]
Prompt: The same person from the reference image walking through
a busy Tokyo street at night, neon lights reflecting on wet pavement
Model Selection for Editing:
| Use Case | Recommended Model |
|----------|-------------------|
| Quick iterations/previews | FLUX.2 [klein] |
| Production quality edits | FLUX.2 [pro] |
| Maximum quality/complex edits | FLUX.2 [max] |
| Text/typography edits | FLUX.2 [flex] |
3. Text-to-Image Prompting
Impact: HIGH
Crafting effective prompts for generating images from text descriptions.
3.1 Text-to-Image (T2I) Prompting
Impact: MEDIUM
Comprehensive guide to crafting effective text-to-image prompts for FLUX models.
Basic Formula:
[Subject] + [Action] + [Style] + [Context] + [Lighting] + [Technical]
Expanded Framework:
[Main Subject] - who/what is the focus
[Attributes] - characteristics, details, clothing
[Action/Pose] - what they're doing
[Environment] - where, setting, background
[Style/Medium] - artistic approach
[Lighting] - light source, quality, mood
[Composition] - framing, camera angle
[Technical] - camera, lens, film stock
People/Portraits:
A distinguished professor in his 60s with silver hair and round spectacles,
wearing a tweed jacket with leather elbow patches, deep-set thoughtful eyes,
slight smile suggesting hidden wisdom
Animals:
A majestic snow leopard with piercing blue-grey eyes, thick spotted fur
dusted with snowflakes, powerful muscular build, alert posture on a
rocky outcrop
Objects/Products:
A vintage Leica M3 camera with worn brass edges showing decades of use,
black leather covering with patina, sitting on weathered wooden table
Landscapes:
A dramatic fjord at dawn, steep granite cliffs rising from mirror-still
water, wisps of morning mist, distant snow-capped peaks catching first
golden light
Architecture:
A brutalist concrete apartment building in late afternoon light, geometric
shadows creating abstract patterns, warm sunlight contrasting with cool
grey concrete
Photorealistic:
80s film photography, film grain, warm color cast, soft focus, nostalgic
Artistic Styles:
anime style, large expressive eyes, clean linework, cel shading, vibrant palette
Portrait Lighting:
Rembrandt lighting - 45 degree key light creating triangle shadow on cheek
Butterfly lighting - overhead key creating shadow under nose
Split lighting - 90 degree side light, half face in shadow
Loop lighting - slight angle creating small nose shadow
Natural Lighting:
Golden hour - warm, soft, directional light 1 hour before sunset
Blue hour - cool, ambient light just after sunset
Overcast - soft, even, diffused lighting
Harsh midday - strong contrast, defined shadows
Atmospheric:
Volumetric light - visible light rays through fog/dust
Rim lighting - backlight creating edge glow
Practical lighting - visible light sources in scene
Neon glow - colorful artificial urban lighting
Camera Bodies:
Shot on Hasselblad X2D - medium format, exceptional detail
Shot on Canon 5D Mark IV - professional DSLR quality
Shot on Leica M10 - rangefinder character, smooth tonality
Shot on iPhone 15 Pro - computational photography look
Lens Characteristics:
85mm f/1.4 - classic portrait, creamy bokeh
24mm f/2.8 - wide angle, environmental
50mm f/1.2 - natural perspective, shallow DOF
135mm f/2 - compressed perspective, smooth background
Macro lens - extreme close-up detail
Tilt-shift lens - miniature effect or architectural correction
Technical Settings:
f/1.4 - extremely shallow depth of field
f/2.8 - moderate background blur
f/8 - sharp throughout, landscape
f/16 - maximum sharpness, long exposure
ISO 100 - clean, no noise
ISO 3200 - visible grain, low light
Framing:
extreme close-up - filling frame with detail
close-up - head and shoulders
medium shot - waist up
full shot - entire body
wide shot - subject in environment
establishing shot - location focus
Angles:
eye level - natural, relatable
low angle - powerful, imposing
high angle - diminished, overview
Dutch angle - tension, unease
bird's eye - pattern, layout
worm's eye - dramatic upward view
Composition Rules:
rule of thirds - subject at intersection points
centered composition - symmetry, stability
leading lines - guiding eye to subject
frame within frame - natural framing elements
negative space - minimalist, breathing room
Editorial Portrait:
A fashion editorial portrait of a young woman with striking features and
high cheekbones, wearing an avant-garde geometric collar in silver, dramatic
side lighting creating strong shadows, shot on Hasselblad with 100mm lens
at f/2.8, studio background with subtle gradient, high fashion magazine style
Product Photography:
A premium wireless headphone product shot, matte black finish with rose gold
accents, floating at slight angle against pure white background, soft even
lighting eliminating harsh shadows, reflection visible on glossy surface below,
commercial catalog style, ultra sharp focus throughout
Landscape:
A misty morning in ancient redwood forest, towering trees disappearing into
fog above, ferns covering forest floor in layers of green, single shaft of
golden sunlight breaking through canopy, shot on large format camera, rich
detail in bark textures, Ansel Adams inspired black and white with deep tones
Architectural:
Modern minimalist beach house at golden hour, floor-to-ceiling glass walls
reflecting sunset colors, clean white concrete and natural wood, infinity
pool merging with ocean horizon, architectural photography style, wide angle
showing full structure, warm evening light
4. Image-to-Image Editing
Impact: HIGH
Techniques for editing and transforming existing images using FLUX.2 models.
4.1 Image-to-Image (I2I) Prompting
Impact: MEDIUM
Guide to effective image-to-image editing with FLUX models.
All FLUX.2 models support image-to-image editing via reference images:
-
FLUX.2 [klein]: Up to 4 reference images - fast editing
-
FLUX.2 [pro]: Up to 8 reference images - balanced quality/speed
-
FLUX.2 [max]: Up to 8-10 reference images - highest quality editing
-
FLUX.2 [flex]: Up to 8 reference images - best for typography edits
Simply provide your source image as a reference and describe the desired changes. The model understands image context and can modify specific elements while preserving others.
Note: FLUX.2 models are recommended for image editing. They provide better results than the older FLUX.1 Kontext models.
Preferred: Use URLs directly - simpler and more convenient than base64.
When you have an image URL, pass it directly to input_image:
The API fetches URLs automatically. Both URL and base64 work, but URLs are recommended when available.
Simple Modifications:
Add snow to the ground
Direct, single-change instructions:
Attribute Changes:
Age the person to appear 20 years older
Modifying specific characteristics:
Explicit Preservation:
Transform the daytime photo to nighttime, maintaining the exact
composition, colors of the subject's outfit, and lighting direction
When you need to keep specific elements unchanged:
Style Preservation:
Add rain effects to the scene while preserving the painting's
impressionist brushwork and color palette
Preventing unwanted style shifts:
Environmental Changes:
Change to spring with cherry blossoms, fresh green leaves,
soft warm lighting
Style Transfer:
Transform into a watercolor painting with soft edges,
transparent washes, and paper texture visible
Subject Modifications:
Make the subject appear younger, around 25 years old
Object Editing:
Swap the coffee mug for an ornate teacup with floral pattern
Adding Text:
Include a wooden sign with hand-painted text "Welcome Home"
mounted above the door
Modifying Text:
Update the poster text to "SUMMER SALE 2025" maintaining the design
For dramatic transformations, consider breaking into steps:
Step-by-Step Approach:
Transform this modern office into a Victorian library with completely
different furniture, add a fireplace, change the lighting to candlelit,
and age the photograph
Instead of:
Try sequential edits:
-
Change the furniture style to Victorian antique pieces -
Add a stone fireplace on the right wall -
Transform lighting to warm candlelit atmosphere -
Apply vintage photograph aesthetic with sepia tones
Avoid Vague Instructions:
Bad: Make it look better
Good: Increase contrast, add warm color grading, sharpen details
Be Specific About Scope:
Bad: Change the background
Good: Replace the office background with a tropical beach at sunset,
maintaining the subject's exact position and lighting direction
Explicit Style Preservation:
Bad: Make it nighttime
Good: Transform to nighttime while maintaining the photorealistic style,
add appropriate artificial lighting sources
-
Start Simple - Begin with single-element changes
-
Be Explicit - State what should change AND what should stay
-
Reference Context - Mention existing elements when relevant
-
Iterate - Refine through multiple small edits rather than one large one
-
Preserve Deliberately - Always specify style/composition preservation needs
5. JSON Structured Prompting
Impact: MEDIUM-HIGH
Using structured JSON for complex multi-element scene composition.
5.1 JSON Structured Prompting
Impact: MEDIUM
For complex scenes with multiple elements, spatial relationships, or production automation, use JSON-structured prompts.
-
Multiple characters with distinct attributes
-
Precise spatial positioning
-
Complex scene composition
-
Reproducible, template-based prompts
-
Programmatic prompt generation
-
Production workflows with variable substitution
Flatten your JSON into flowing prose for the actual prompt:
From JSON:
{
"subjects": [
{
"type": "person",
"description": "elderly craftsman with weathered hands",
"position": "seated at workbench",
"action": "carefully carving wood"
}
],
"scene": { "setting": "traditional workshop", "time": "morning" },
"technical": { "lighting": "natural window light from right" }
}
To Prompt:
{
"composition": {
"layout": "triangular",
"focal_point": "center-left intersection",
"depth_layers": [
{
"layer": "foreground",
"elements": ["flowers in vase"],
"focus": "soft blur"
},
{
"layer": "midground",
"elements": ["main subject"],
"focus": "sharp"
},
{
"layer": "background",
"elements": ["window", "garden view"],
"focus": "soft blur"
}
]
}
}
Use JSON structure for template-based generation:
Define explicit spatial relationships:
-
Use IDs for References - Give subjects IDs when they interact
-
Separate Concerns - Keep scene, subjects, style, and technical distinct
-
Be Consistent - Use the same terminology throughout
-
Include All Details - Don't assume, specify everything
-
Flatten for Execution - Convert to natural language before sending to model
-
Version Templates - Track template versions for reproducibility
6. Color Specification
Impact: MEDIUM
Precise color control using hex codes for brand-accurate generations.
6.1 Hex Color Prompting
Impact: MEDIUM
FLUX supports hex color codes (#RRGGBB) for precise color specification, essential for brand consistency and exact color matching.
Include hex codes directly in your prompt with descriptive names:
Use these keywords to indicate color specification:
1. Always Pair with Description:
Good: #FF6B6B (coral pink)
Bad: #FF6B6B
Never use hex codes alone - include the color name:
2. Associate with Specific Objects:
A product shot featuring a smartphone with a #1DA1F2 (Twitter blue) case,
resting on a #14171A (near black) matte surface
Clearly connect colors to their targets:
3. Limit Color Palette:
Color palette for the scene: #2ECC71 (emerald green), #3498DB (sky blue),
#F1C40F (sunflower yellow), #FFFFFF (pure white)
3-5 colors typically work best. Too many can confuse the model:
Brand Colors:
Corporate office reception with brand colors prominently featured:
walls in #0066CC (company blue), accent furniture in #FF6600 (company orange),
logo displayed in #FFFFFF (white) against the blue backdrop
Interior Design:
Scandinavian minimalist bedroom with #F5F5F5 (warm white) walls,
#8B4513 (saddle brown) wooden headboard and nightstands,
#708090 (slate gray) linen bedding, and #DAA520 (goldenrod) accent lamp
Fashion:
Editorial fashion photo: model wearing #000000 (black) cashmere turtleneck,
#FF4500 (orange-red) wide-leg wool pants, #C0C0C0 (silver) geometric earrings,
against a #F0F0F0 (light gray) studio backdrop
Product Design:
Premium headphones product shot: #1C1C1E (space gray) aluminum body,
#F5F5F7 (silver) mesh ear cups, #FF9500 (iOS orange) accent ring around controls
Digital Art:
Synthwave cityscape: #FF00FF (magenta) and #00FFFF (cyan) neon signs,
#1A1A2E (deep navy) night sky, #E94560 (hot pink) setting sun on horizon,
#16213E (dark blue) building silhouettes, rain-slicked streets reflecting lights
Data Visualization:
Sunset sky gradient from #FF6B6B (coral) at horizon through
#FFA07A (light salmon) to #87CEEB (sky blue) at top
Specify gradients with start and end colors:
Complementary: Opposite on color wheel
Scene using complementary colors: #3498DB (blue) dominant with
#E67E22 (orange) accents for visual pop
Analogous: Adjacent colors
Harmonious palette using analogous colors: #9B59B6 (purple),
#8E44AD (deep purple), #3498DB (blue) - flowing naturally together
Triadic: Evenly spaced
Vibrant triadic scheme: #E74C3C (red), #F1C40F (yellow),
#3498DB (blue) - balanced and dynamic
Monochromatic: Single hue variations
# Social Media
Twitter/X Blue: #1DA1F2
Facebook Blue: #1877F2
Instagram Gradient: #833AB4 to #FD1D1D
LinkedIn Blue: #0A66C2
# Tech
Apple Gray: #1C1C1E
Google Blue: #4285F4
Microsoft Blue: #00A4EF
Amazon Orange: #FF9900
# Design
Figma Purple: #A259FF
Dribbble Pink: #EA4C89
Behance Blue: #1769FF
For reference only - always verify current brand guidelines:
Color Not Accurate:
-
Add the color name alongside hex
-
Specify the exact object the color applies to
-
Use fewer total colors in the prompt
Color Bleeding:
-
Clearly delineate which objects get which colors
-
Use spatial descriptions: "the LEFT chair in #color"
Muddy Colors:
-
Check hex code accuracy
-
Specify lighting that won't shift colors
-
Use "maintaining exact color #XXXXXX" for emphasis
7. Typography and Text
Impact: MEDIUM
Rendering text and typography within generated images.
7.1 Typography and Text Prompting
Impact: MEDIUM
Guide to rendering text in FLUX images. Use FLUX.2 [flex] for best typography results.
Always quote the exact text you want rendered:
1. Use Quotation Marks:
Correct: A poster with "HELLO WORLD" in bold letters
Wrong: A poster with HELLO WORLD in bold letters
2. Specify Font Style:
"ADVENTURE" in bold sans-serif typography
"Welcome" in elegant cursive script
"CHAPTER ONE" in classic serif typeface
"CODE" in monospace terminal font
"SALE!" in decorative display lettering
3. Describe Size Hierarchy:
Large headline "BREAKING NEWS" above smaller subtext "Details inside"
4. Indicate Placement:
"OPEN" sign centered in storefront window
"EXIT" text positioned above doorway
"Page 1" in bottom right corner
5. Front-Load Text:
Good: A sign reading "FRESH BREAD" in a bakery window...
Less Good: A bakery window with a sign that says "FRESH BREAD"...
Place text descriptions early in the prompt for better accuracy:
Sans-Serif (Modern/Clean):
"MINIMAL" in clean geometric sans-serif, Swiss modernist style
"TECH SUMMIT" in bold condensed grotesque typeface
"future" in thin uppercase sans-serif, contemporary design
Serif: Classic/Elegant
"The New Yorker" in traditional serif typeface, editorial masthead
"LUXURY" in high-contrast Didone serif with thin/thick strokes
"Wisdom" in old-style serif with subtle bracketed serifs
Script/Cursive (Decorative):
"With Love" in flowing calligraphic script with flourishes
"Signature" in connected brush script, casual elegance
"Romance" in formal copperplate script, wedding invitation style
Display/Decorative:
"ROCK CONCERT" in distressed vintage concert poster lettering
"CIRCUS" in ornate Victorian display type with decorative elements
"RETRO" in 1970s rounded bubble letters
Handwritten:
"Note to self" in casual handwritten style, slightly imperfect
"Thanks!" in quick marker pen handwriting
"ideas" in sketchy pencil handwriting
Monospace:
"CODE_COMPLETE" in terminal monospace, developer aesthetic
"SYSTEM" in typewriter monospace, vintage tech
"DEBUG" in LCD-style digital monospace
Neon Signs:
Glowing neon sign spelling "OPEN 24/7" in pink neon tubes with
blue outline, slight glow and reflection, night scene
Metallic/3D:
"GOLD" in three-dimensional metallic gold letters with realistic
reflections and subtle shadows, luxury aesthetic
Embossed/Debossed:
"PREMIUM" embossed into leather surface, subtle shadows showing
the raised letterforms
Outlined:
"MODERN" in outline-only letters, no fill, thin white stroke
on dark background
Gradient Text:
"SUMMER" with gradient fill from #FF6B6B (coral) at top to
#4ECDC4 (teal) at bottom
Poster Design:
Event poster with "SUMMER FEST 2025" as large headline in bold
condensed sans-serif at top, "JULY 15-17" as medium subheading
in regular weight, "Central Park, NYC" as small body text at
bottom, all in white text on #FF6B35 (sunset orange) background
Book Cover:
Book cover design: "THE GREAT GATSBY" in elegant art deco gold
lettering centered in upper third, author name "F. SCOTT FITZGERALD"
in smaller gold caps below, #1A1A2E (midnight blue) background
with geometric gold accents
Magazine Cover:
Fashion magazine cover with "VOGUE" in classic serif masthead at top,
cover line "SPRING COLLECTION" in bold sans-serif, "The New Rules of Style"
in lighter weight italic, all in white against dramatic portrait
Signage:
Vintage diner sign: "MEL'S DINER" in red neon script lettering,
"OPEN" below in separate green neon block letters, chrome border,
1950s Americana aesthetic
Business Card:
Minimalist business card with "JOHN SMITH" in medium weight sans-serif,
"Creative Director" in lighter weight below, contact details in small
type at bottom, #2C3E50 (dark blue) text on white background
Centered Composition:
Centered text layout: "WELCOME" in large caps at center,
perfectly balanced with equal margins
Left-Aligned:
Left-aligned text block: "Company Name" as header,
"Tagline goes here" below, flush left alignment
Text on Path:
"GOING IN CIRCLES" text following a circular path around
the center of the design
Text Overlay:
"ADVENTURE AWAITS" in bold white text overlaid on landscape
photograph, positioned in lower third with slight shadow for readability
Steps Parameter:
-
Higher steps (30-50) = better text quality
-
Lower steps (10-20) = faster, lower quality
Guidance Parameter:
-
Higher guidance (6-10) = stricter prompt following
-
Lower guidance (1.5-4) = more creative interpretation
Recommended Settings:
For clean typography: steps=50, guidance=7
For artistic text: steps=30, guidance=4
Misspelled Words:
-
Keep text short (1-4 words work best)
-
Use common words when possible
-
Repeat the exact text in the prompt
Illegible Text:
-
Specify larger text size
-
Use simpler fonts (sans-serif)
-
Ensure high contrast with background
-
Use [flex] model
Wrong Font Style:
Instead of: "text in a nice font"
Use: "text in bold geometric sans-serif similar to Futura"
Be more specific:
Text Not Appearing:
-
Front-load text description in prompt
-
Put text in quotes
-
Specify exact placement
-
Reduce other prompt complexity
8. Multi-Reference Editing
Impact: MEDIUM
Combining multiple reference images for style transfer and composition.
8.1 Multi-Reference Image Editing
Impact: MEDIUM
Guide to using multiple reference images for character consistency, style transfer, and complex compositions.
FLUX.2 models support multiple reference images for advanced editing:
-
FLUX.2 [klein]: Up to 4 reference images - fast editing
-
FLUX.2 [pro]: Up to 8 via API - balanced quality/speed
-
FLUX.2 [max]: Up to 8 via API, 10 in playground - highest quality
-
FLUX.2 [flex]: Up to 8 via API - best for typography
Note: FLUX.2 models are recommended over FLUX.1 Kontext Max for better results.
Preferred: Use URLs directly - simpler and more convenient than base64.
Pass image URLs directly to input_image, input_image_2, etc.:
The API fetches URLs automatically. Both URL and base64 work, but URLs are recommended when available.
Natural Language Description:
The person from image 1 is sitting at the cafe table from image 2,
wearing the outfit from image 3, with the warm lighting style of image 4
Describe relationships between images naturally:
Explicit Indexing:
Combine the face from image 1 with the hairstyle from image 2
on the body pose from image 3
Reference images by number for precision:
Character Consistency:
The same person from image 1, now seated at a desk in a modern office,
same clothing and hairstyle, different environment
Maintain the same character across multiple scenes:
For sequential consistency:
Style Transfer:
Apply the color grading and mood from image 2 to the scene in image 1
Apply the style of one image to another:
Pose Guidance:
The person from image 1 in the exact pose shown in image 2,
placed in the environment from image 3
Use a reference for body positioning:
Object Composition:
Place the product from image 1 on the table setting from image 2,
using the lighting style from image 3
Combine elements from multiple images:
Background Replacement:
Keep the subject from image 1 exactly as shown, replace the background
with the beach scene from image 2, match the lighting naturally
Two Characters:
Image 1 (person A) and image 2 (person B) having a conversation
at a coffee shop table, person A on the left gesturing, person B
on the right listening intently
Group Composition:
The three people from images 1, 2, and 3 standing together for a
group photo, arranged left to right in that order, friendly poses,
outdoor park setting
Selective Attribute Transfer:
The face and expression from image 1, the hairstyle from image 2,
wearing the outfit from image 3, in the pose from image 4
Partial Transfer:
Arrange the scene using the layout shown in the collage input:
- Person from image 1 in the left position
- Object from image 2 in the center position
- Background element from image 3 filling the right side
Use a collage input for layout guidance:
1. Clear Image Roles:
Image 1: face/identity reference
Image 2: pose/body reference
Image 3: style/aesthetic reference
Image 4: environment/background reference
Specify what each reference provides:
2. Quality References:
-
Use high-quality, clear reference images
-
Ensure good lighting in references
-
Avoid heavily processed or filtered images
3. Consistent Lighting:
...ensure the lighting direction matches across all elements,
with main light source from the upper left
When combining elements:
4. Resolution Awareness:
For [pro] API with 9MP total limit:
-
At 1MP output: up to 8 reference images comfortably
-
Calculate: input images + output = total MP
5. Explicit Relationships:
Create a scene combining:
- The woman from image 1 (keep exact face, expression, hair)
- Wearing the vintage dress from image 2 (exact pattern and cut)
- In the pose from image 3 (seated position, arm placement)
- Set in the library from image 4 (bookshelves, furniture)
- Using the warm lighting style from image 5 (golden hour quality)
Position her in the center of frame, medium shot, looking slightly
to the right with a thoughtful expression.
Don't assume - specify exactly how elements relate:
Elements Not Transferring:
-
Be more specific about which element from which image
-
Use explicit indexing ("from image 1")
-
Reduce the number of references and complexity
Inconsistent Blending:
-
Specify lighting consistency
-
Describe how elements should interact
-
Use style references to unify the composition
Identity Drift:
-
Emphasize key identifying features
-
Use phrases like "maintaining exact likeness"
-
Provide multiple angles of the same subject if available
9. Positive Prompt Alternatives
Impact: MEDIUM
Strategies for achieving results without negative prompts, which FLUX does not support.
9.1 Negative Prompt Alternatives
Impact: MEDIUM
FLUX does not support negative prompts. This guide provides positive alternatives for common negative prompt patterns.
Negative prompts can actually make models focus MORE on unwanted elements. Instead, describe exactly what you DO want - this gives clearer direction and better results.
For any unwanted element:
-
Identify what you don't want
-
Ask: "What would be there instead?"
-
Describe the positive alternative
People/Crowds:
| Instead of | Use |
|-----------|-----|
| "no people" | "empty", "deserted", "solitary", "abandoned" |
| "no crowds" | "quiet", "peaceful", "secluded", "private" |
| "without background people" | "isolated subject", "clean background", "solo figure" |
Example:
Bad: A beach scene, no people
Good: A deserted beach at dawn, pristine untouched sand, solitary seagull
Skin/Appearance:
| Instead of | Use |
|-----------|-----|
| "no makeup" | "natural skin", "bare face", "fresh-faced" |
| "no blemishes" | "clear skin", "smooth complexion", "healthy glow" |
| "no wrinkles" | "youthful skin", "smooth features" |
Example:
Bad: Portrait of woman, no makeup, no blemishes
Good: Portrait of a woman with natural clear skin, fresh-faced with a healthy glow
Accessories:
| Instead of | Use |
|-----------|-----|
| "no glasses" | "visible eyes", "unobstructed gaze", "clear eye contact" |
| "no hat" | "bare head", "visible hair", "uncovered head" |
| "no jewelry" | "minimal accessories", "understated", "unadorned" |
Example:
Bad: Man portrait, no glasses, no hat
Good: Portrait of a man with clear direct gaze, wind-swept visible hair
Colors:
| Instead of | Use |
|-----------|-----|
| "no color" | "monochrome", "black and white", "grayscale" |
| "not colorful" | "muted tones", "subdued palette", "desaturated" |
| "no bright colors" | "neutral tones", "earth tones", "soft pastels" |
Example:
Bad: Landscape photo, no bright colors
Good: Landscape in muted earth tones, soft morning light, desaturated palette
Text/Watermarks:
| Instead of | Use |
|-----------|-----|
| "no text" | "clean surfaces", "unmarked", "text-free" |
| "no watermark" | "pristine image", "clean composition" |
| "no logos" | "unbranded", "plain", "logo-free surface" |
Example:
Bad: Product photo, no watermark, no text
Good: Clean product photography with pristine unmarked surfaces, minimal unbranded design
Style/Era:
| Instead of | Use |
|-----------|-----|
| "not modern" | "traditional", "classical", "vintage", "historical" |
| "no CGI look" | "photorealistic", "authentic", "natural", "organic" |
| "not cartoonish" | "realistic", "lifelike", "naturalistic" |
Example:
Bad: Building design, not modern, no futuristic elements
Good: Traditional Victorian architecture with classical ornate details and period-accurate features
Quality/Artifacts:
| Instead of | Use |
|-----------|-----|
| "no blur" | "sharp focus", "crisp details", "tack-sharp" |
| "no noise" | "clean image", "smooth gradients", "low ISO" |
| "no artifacts" | "pristine quality", "clean render", "flawless" |
Example:
Bad: Portrait, no blur, no noise
Good: Tack-sharp portrait with pristine image quality, smooth skin tones, crisp details
Objects:
| Instead of | Use |
|-----------|-----|
| "no cars" | "pedestrian area", "car-free zone", "walking street" |
| "no buildings" | "open landscape", "natural scenery", "wilderness" |
| "no furniture" | "empty room", "bare space", "minimalist interior" |
Example:
Bad: Street scene, no cars, no modern buildings
Good: Historic cobblestone walking street lined with traditional stone buildings from the 1800s
Weather/Environment:
| Instead of | Use |
|-----------|-----|
| "no rain" | "clear sky", "dry weather", "sunny day" |
| "no clouds" | "clear blue sky", "cloudless", "perfect visibility" |
| "not dark" | "well-lit", "bright", "daylight", "illuminated" |
Example:
Bad: Outdoor portrait, no rain, no clouds, not dark
Good: Outdoor portrait under clear blue sky on a bright sunny day, perfect natural lighting
Composition:
| Instead of | Use |
|-----------|-----|
| "no distractions" | "clean composition", "focused framing", "minimal elements" |
| "nothing in background" | "solid background", "isolated subject", "clean backdrop" |
| "no clutter" | "organized", "tidy", "minimal", "streamlined" |
Example:
Bad: Product shot, no distractions, nothing in background
Good: Product on clean white seamless backdrop, isolated subject, minimal focused composition
Original Negative-Heavy Prompt:
Portrait of a woman, no glasses, no makeup, no wrinkles, no blemishes,
no bright colors, no distracting background, no harsh lighting
Positive Rewrite:
Portrait of a youthful woman with clear natural skin and visible bright eyes,
fresh-faced with a healthy glow, wearing muted earth tones against a soft
blurred neutral background, gentle diffused lighting creating soft shadows
Original Negative-Heavy Prompt:
Landscape photo, no people, no buildings, no power lines, no modern elements,
no overcast sky, no dead trees
Positive Rewrite:
Pristine wilderness landscape with lush green living forest, clear blue sky,
untouched natural scenery stretching to the horizon, peaceful solitude with
only birdsong and wind, golden hour sunlight filtering through healthy foliage
| Unwanted | Positive Alternative |
|----------|---------------------|
| No people | Empty, solitary, deserted |
| No makeup | Natural, fresh-faced, bare |
| No text | Clean, unmarked, pristine |
| No blur | Sharp, crisp, tack-sharp |
| No modern | Traditional, vintage, classical |
| No dark | Bright, well-lit, luminous |
| No busy | Minimal, clean, focused |
| No artificial | Natural, organic, authentic |