AI Skill Report Card

Directing Image Generation

A-88·Jun 12, 2026·Source: Web

Quick Start

Input: "futuristic city at night" Output: Complete brief with cinematic lighting strategy, specific architectural elements, color temperature (cool blue 5600K), camera specs (35mm wide establishing shot), and production-ready prompt.

Workflow

  • Decode the core visual intent
  • Identify missing strategic elements
  • Determine optimal visual approach
  • Research relevant style references
  • Generate 3 distinct creative directions
  • Evaluate each for impact and feasibility
  • Select strongest concept with rationale
  • Refine chosen direction
  • Subject: Define primary focus and secondary elements
  • Environment: Establish setting, scale, atmosphere
  • Composition: Frame structure, rule of thirds, focal points
  • Camera: Angle, lens choice, depth of field
  • Lighting: Direction, quality, color temperature, mood
  • Color: Palette strategy, saturation, contrast
  • Textures: Surface qualities, material properties
  • Style: Reference artists, movements, techniques
  • Format: Aspect ratio, resolution, end use
  • Negative prompt covers common AI failures
  • Typography-safe areas identified
  • Three variations created (commercial/artistic/experimental)
  • Technical specifications confirmed

Examples

Example 1: Input: "robot chef cooking" Output:

  • Creative Diagnosis: Humanization of AI through domestic activity
  • Strategic Direction: Premium lifestyle photography meets sci-fi elegance
  • Selected Concept: Michelin-star kitchen with precision robotics
  • Production Prompt: "Professional humanoid robot chef with brushed titanium finish, precisely julienning vegetables in award-winning restaurant kitchen, warm 3200K pendant lighting, shot with 85mm lens at f/2.8, shallow depth of field isolating subject, rich copper and steel color palette..."
  • Negative: "cartoon, toy-like, clumsy, messy kitchen, harsh lighting, amateur photography"

Example 2: Input: "magical forest portal"

  • Selected Concept: Ancient growth redwoods with ethereal dimensional gateway
  • Production Prompt: "Towering 500-year-old redwood grove with luminous interdimensional portal carved into massive trunk, volumetric light rays piercing morning mist, shot with 24mm wide-angle lens, low angle emphasizing tree scale, emerald and gold color palette, photorealistic with subtle fantasy elements..."

Best Practices

Creative Direction:

  • Start with emotional core, then build visuals
  • Choose one primary mood/feeling to amplify
  • Research real-world references for authenticity
  • Consider the image's final purpose and audience

Technical Specifications:

  • Match lighting color temperature to mood (warm=cozy, cool=modern)
  • Use specific lens focal lengths (24mm=epic, 85mm=intimate, 200mm=compressed)
  • Define depth of field clearly (f/1.4=dreamy, f/8=sharp throughout)
  • Specify material properties (matte/glossy, rough/smooth)

Prompt Engineering:

  • Lead with strongest visual element
  • Use photography terms for realism
  • Include artist names sparingly but strategically
  • Balance detail with brevity (150-250 words optimal)

Common Pitfalls

Avoid These Creative Mistakes:

  • Mixing incompatible art styles ("photorealistic anime oil painting")
  • Vague lighting descriptions ("good lighting")
  • Generic compositions without focal strategy
  • Style reference overload (more than 2-3 references)
  • Ignoring negative prompts (leads to AI artifacts)

Technical Issues:

  • Forgetting aspect ratio for intended use
  • Over-specifying minor details
  • Underspecifying crucial elements
  • Not accounting for text/logo placement areas
  • Missing quality control for common AI failures (hands, text, proportions)

Workflow Problems:

  • Jumping to final prompt without concept development
  • Not considering three distinct approaches
  • Skipping the creative diagnosis phase
  • Failing to match technical specs to creative vision
0
Grade A-AI Skill Framework
Scorecard
Criteria Breakdown
Quick Start
15/15
Workflow
15/15
Examples
18/20
Completeness
12/20
Format
15/15
Conciseness
13/15