Skip to main content

Google Whisk is a new way to create AI visuals using image prompts –here's how to try it

Web Hosting & Remote IT Support
  • Google Whisk uses images as inputs instead of text-based prompts
  • It's built on Google’s Imagen 3 generative AI model
  • The experimental tool is free to try for users in the US

Google’s new AI tool makes it easier to create and remix your visual concepts. Instead of asking you to describe what’s in your mind’s eye, Whisk lets you input three image prompts: one for subject, one for scene and one for style. Whisk takes care of the rest, making it a more intuitive way to experiment with different ideas.

While most of the best AI image generators require you to write a detailed prompt, Whisk handles that behind the scenes. When you drop pictures into the web-based Whisk interface as inspiration, Google’s Gemini model automatically analyzes them and writes a detailed caption for each. These are then fed into the Imagen 3 model, to create a matching image.

For example, you could drop in an image of a car as the subject and a photo of a rural landscape for the scene. You could them add a watercolor as the style to see what Whisk creates. Hit the button and you’ll get a pair of images based on your inputs.

From here, it’s easy to remix the images. The interface allows you to specify additional text-based details to tweak the outcomes. You can also easily drop in different source images or roll the dice if you’re in need of inspiration. New results appear in pairs in the feed, making it an intuitive way to ideate. You can also choose to refine images by revealing the text prompt and adding more details.

Whisk it up

While Whisk is designed to eliminate the need for text-based prompts, Google includes the option to refine the written prompts because results won’t always match up to the source material.

In a blog post about the experimental tool, Google explains that Whisk, “captures your subject’s essence, not an exact replica.” It’s only as effective as Gemini’s analysis of the images you submit. While this is generally very impressive, it also isn’t able to get inside your mind: you might expect Whisk to pull out one detail from an image, where it focuses on another.

The post explains further: “Since Whisk extracts only a few key characteristics from your image, it might generate images that differ from your expectations. For example, the generated subject might have a different height, weight, hairstyle or skin tone. We understand these features may be crucial for your project and Whisk may miss the mark, so we let you view and edit the underlying prompts at any time.”

Even with these shortcomings, Whisk an interesting application of Google’s existing AI tools. The underlying generative models are the same as if you were chatting with Gemini via its text interface. By relying on image inputs, though, Whisk is a more accessible and intuitive way for visual creators to play with their ideas.

Based on early feedback from digital creatives, Google refers to Whisk as “a new type of creative tool” which is intended for “rapid visual exploration, not pixel-perfect edits.”

How to try Google Whisk

Google Whisk is currently only available to users in the US. If you’re based there, you can try it out via your web browser at labs.google/whisk.

The experimental tool is completely free to play with. Data from your experience with Whisk will be fed back to Google to help refine and develop future AI products.

You might also like...



via Hosting & Support

Comments

Popular posts from this blog

Microsoft, Google, and Meta have borrowed EV tech for the next big thing in data centers: 1MW watercooled racks

Web Hosting & Remote IT Support Liquid cooling isn't optional anymore, it's the only way to survive AI's thermal onslaught The jump to 400VDC borrows heavily from electric vehicle supply chains and design logic Google’s TPU supercomputers now run at gigawatt scale with 99.999% uptime As demand for artificial intelligence workloads intensifies, the physical infrastructure of data centers is undergoing rapid and radical transformation. The likes of Google, Microsoft, and Meta are now drawing on technologies initially developed for electric vehicles (EVs), particularly 400VDC systems, to address the dual challenges of high-density power delivery and thermal management. The emerging vision is of data center racks capable of delivering up to 1 megawatt of power, paired with liquid cooling systems engineered to manage the resulting heat. Borrowing EV technology for data center evolution The shift to 400VDC power distribution marks a decisive break from legacy sy...

The Apple Watch ban is lifted, on appeal – but the reprieve might only be temporary

Web Hosting & Remote IT Support The Apple Watch ban story has developed quickly over the last week and a bit, and there's now a new twist: the US Court of Appeals is putting a pause on the US sales and import ban while it reviews the case, which means the Apple Watch 9 and Apple Watch Ultra 2 can go back on sale for the time being. "We are thrilled to return the full Apple Watch lineup to customers in time for the new year," an Apple spokesperson told TechRadar. "We are pleased the US Court of Appeals for the Federal Circuit has stayed the exclusion order while it considers our request to stay the order pending our full appeal." The watches in question are now once again available from "select" Apple Stores, and will also be going on sale from the Apple website from 12pm PT / 3pm ET on Thursday, December 28 (that's 8pm in the UK, and early on December 29 in Australia). All Apple Stores should have stock by the weekend. As for how long t...

The Samsung Galaxy Ring could go into production as soon as next month

Web Hosting & Remote IT Support With the dust beginning to settle from the huge Samsung Unpacked 2023 event, we can turn our attention towards what Samsung might have planned next: and a smart ring seems to be in the company's near future. As per a report from South Korean outlet The Elec (via SamMobile ), mass production on a Samsung Galaxy Ring could begin as early as August, with a decision imminent on the schedule for getting the wearable manufactured and out to consumers. A full launch is slated for some point during 2024 though, rather than 2023. The nature of the device means that it'll need to clear several regulatory hurdles before it can go on sale and start tracking various vital statistics. An early 2024 launch would put the Galaxy Ring on a similar schedule to the Samsung Galaxy S24 – and it would therefore make sense to launch both gadgets at the same time, perhaps in January or February if Samsung follows its 2023 routine. The story so far Rumors ar...