AI Skill Report Card
Directing Image Generation
Quick Start
Input: "futuristic city at night" Output: Complete brief with cinematic lighting strategy, specific architectural elements, color temperature (cool blue 5600K), camera specs (35mm wide establishing shot), and production-ready prompt.
Workflow
Creative Analysis Phase
- Decode the core visual intent
- Identify missing strategic elements
- Determine optimal visual approach
- Research relevant style references
Concept Development
- Generate 3 distinct creative directions
- Evaluate each for impact and feasibility
- Select strongest concept with rationale
- Refine chosen direction
Production Brief Creation
- Subject: Define primary focus and secondary elements
- Environment: Establish setting, scale, atmosphere
- Composition: Frame structure, rule of thirds, focal points
- Camera: Angle, lens choice, depth of field
- Lighting: Direction, quality, color temperature, mood
- Color: Palette strategy, saturation, contrast
- Textures: Surface qualities, material properties
- Style: Reference artists, movements, techniques
- Format: Aspect ratio, resolution, end use
Quality Control
- Negative prompt covers common AI failures
- Typography-safe areas identified
- Three variations created (commercial/artistic/experimental)
- Technical specifications confirmed
Examples
Example 1: Input: "robot chef cooking" Output:
- Creative Diagnosis: Humanization of AI through domestic activity
- Strategic Direction: Premium lifestyle photography meets sci-fi elegance
- Selected Concept: Michelin-star kitchen with precision robotics
- Production Prompt: "Professional humanoid robot chef with brushed titanium finish, precisely julienning vegetables in award-winning restaurant kitchen, warm 3200K pendant lighting, shot with 85mm lens at f/2.8, shallow depth of field isolating subject, rich copper and steel color palette..."
- Negative: "cartoon, toy-like, clumsy, messy kitchen, harsh lighting, amateur photography"
Example 2: Input: "magical forest portal"
- Selected Concept: Ancient growth redwoods with ethereal dimensional gateway
- Production Prompt: "Towering 500-year-old redwood grove with luminous interdimensional portal carved into massive trunk, volumetric light rays piercing morning mist, shot with 24mm wide-angle lens, low angle emphasizing tree scale, emerald and gold color palette, photorealistic with subtle fantasy elements..."
Best Practices
Creative Direction:
- Start with emotional core, then build visuals
- Choose one primary mood/feeling to amplify
- Research real-world references for authenticity
- Consider the image's final purpose and audience
Technical Specifications:
- Match lighting color temperature to mood (warm=cozy, cool=modern)
- Use specific lens focal lengths (24mm=epic, 85mm=intimate, 200mm=compressed)
- Define depth of field clearly (f/1.4=dreamy, f/8=sharp throughout)
- Specify material properties (matte/glossy, rough/smooth)
Prompt Engineering:
- Lead with strongest visual element
- Use photography terms for realism
- Include artist names sparingly but strategically
- Balance detail with brevity (150-250 words optimal)
Common Pitfalls
Avoid These Creative Mistakes:
- Mixing incompatible art styles ("photorealistic anime oil painting")
- Vague lighting descriptions ("good lighting")
- Generic compositions without focal strategy
- Style reference overload (more than 2-3 references)
- Ignoring negative prompts (leads to AI artifacts)
Technical Issues:
- Forgetting aspect ratio for intended use
- Over-specifying minor details
- Underspecifying crucial elements
- Not accounting for text/logo placement areas
- Missing quality control for common AI failures (hands, text, proportions)
Workflow Problems:
- Jumping to final prompt without concept development
- Not considering three distinct approaches
- Skipping the creative diagnosis phase
- Failing to match technical specs to creative vision