26 May 2022

Google claims its text-to-image AI delivers 'unprecedented photorealism'

Google has shown off an artificial intelligence system that can create images based on text input. The idea is that users can enter any descriptive text and the AI will turn that into an image. The company says the Imagen diffusion model, created by the Brain Team at Google Research, offers "an unprecedented degree of photorealism and a deep level of language understanding."

This isn't the first time we've seen AI models like this. OpenAI's DALL-E (and its successor) generated headlines as well as images because of how adeptly it can turn text into visuals. Google's version, however, tries to create more realistic images.

Like DALL-E, Imagen is not available to the public. Google doesn't think it's suitable as yet for use by the general population for a number of reasons. For one thing, text-to-image models are typically trained on large datasets that are scraped from the web and are not curated, which introduces a number of problems. 

Imagen is not (yet?) available to the public - but you can have a play! Go HERE, scroll about 60% of the way down the page to the section headed "State-of-the-art text-to-image", then select various (albeit limited) parameters to vary the image. Cool, but somewhat scary - think of the implications... Here's one I prepared earlier:


No comments:

Post a Comment