Whisk: Google's Image-Based AI Generator


Jan 08, 2025

What is happening?

Google Labs launched Whisk, an image-to-image generator that lets users upload images and get back a single AI-generated image that combines them.

Traditional AI image generators require carefully crafted text prompts, which creates a high barrier to entry.
Many users struggle to translate their visual ideas into effective text descriptions.
Artists and creatives need to explore visual ideas quickly.

Whisk breaks image generation down into three simple parts:
- what you want to create (subject)
- where you want it to be (scene)
- how you want it to look (style)
Image credit: digestible UX
Each part can be defined by dropping in an image or picking one of Google's suggestions.
Text prompts are still available if users want them, but they are optional.
Users can edit the auto-generated prompts.
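
For teams thinking about a similar input model in their own product, the three-part structure can be sketched as a small data model. This is only an illustration, not Whisk's actual implementation or API: the names `Slot`, `GenerationRequest`, and `build_prompt` are hypothetical, and the assumption that each part carries its own editable, auto-generated caption is ours.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Slot:
    """One input part, filled by an uploaded or suggested image (hypothetical model)."""
    image: Optional[str] = None    # assumed: path or ID of the chosen image
    caption: Optional[str] = None  # assumed: auto-generated description the user may edit


@dataclass
class GenerationRequest:
    subject: Slot                      # what you want to create
    scene: Slot                        # where you want it to be
    style: Slot                        # how you want it to look
    extra_text: Optional[str] = None   # optional free-form text prompt


def build_prompt(req: GenerationRequest) -> str:
    """Combine the captions of the filled parts, plus any optional text, into one prompt."""
    parts = []
    for label, slot in (("subject", req.subject), ("scene", req.scene), ("style", req.style)):
        if slot.caption:  # skip parts the user left empty
            parts.append(f"{label}: {slot.caption}")
    if req.extra_text:
        parts.append(req.extra_text)
    return "; ".join(parts)


request = GenerationRequest(
    subject=Slot("cat.png", "a ginger cat"),
    scene=Slot("beach.png", "a sunny beach"),
    style=Slot("ink.png", "ink sketch"),
)
print(build_prompt(request))

# Editing an auto-generated caption changes only that part of the prompt.
request.subject.caption = "a black cat"
print(build_prompt(request))
```

The point of the sketch is that the user only ever makes three small choices, and writing text stays an optional extra rather than the entry point.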

How can you reduce users' cognitive load by aligning with their natural mental model? Review the complexity of your product.