The Rise of The Machines - Is A.I. challenging Photography ?

The Colorado State Fair hit global news last month. It wasn’t because of some weird animal or corndog incident. It was down the very concept of what we call art. That’s because an Artificial intelligence (A.I.) created artwork walked away with first prize in the Digital Arts segment of the Fine Arts competition. this has led to a lot of outrage and navel gazing in the digital Arts world.

And turns out A.I can do photos now too. But should we worry too ?

If you’ve click on this post you’ve already seen one A.I created photo like image. The street does not exist nor does the night market below. These were generated on webpage on my mobile phone in about 60 seconds from a simple one line prompt.

AI generated image of a medieval Town Square at Night — Image Generated using Stable Diffusion engine with prompt of *“A medieval town square during a night time festival captured on Kodak P3200”*

If you look closely you can see some errors in both images but it take you time to notice and at first glance you can easily be fooled into thinking they are kosher.

But they were both created by Artificial Intelligence using a simple text prompt that I entered on my mobile’s web browser

Officer Momoa - A.I. Generated — Stable Diffusion with Promot *“Jason Momoa as WWII Soldier”*

So it’s not just Art the A.I. can do photos

So What Exactly happened in Colorado ?

As Vice reported on the 29^th of August, Jason Allen, the owner of a tabletop games company walked away with the first prize in the Digital Arts Category at the State Fair.

View this post on Instagram

A post shared by Colorado State Fair (@colostatefair)

No great shakes I hear you say. Not my cuppa tea but he has talent. But to some the problem is how he created it – he used an A.I image generator called Midjourney

And things went a bit crazy.

“Art is dead, dude. It’s over. A.I. won. Humans lost.”

Jason hasn’t exactly helped by saying quotes like ths to the NY times. But there has been lots of posting about the doom of art or that it was a cheat to enter.

To be fair he did create it using a tool and his prompt. It’s likely he will have selected the image over a few botched one and may well have run it again through Midjourney. He then polished that image in photoshop and blew the image up with Gigapixel before printing it onto Canvas. so not exactly a passive partner.

There was some criticism that he didn’t let the Judges know enough. He did use the “Jason Allen via Midjourney” on his pieces but never explicitly said that involved A.I.. However as the NY times reported the judges would still have awarded him the prize.

The Cold Heart of W\r- Mock Book cover made in part with A.I. — Nope I’m not branching out. A Midjourney created image tweaked to book cover

But there is fear and loathing. Some of that may be driven by the potential impact of this on digital artist. Particularly those that rely on small commissions to meet their needs. And there is a point. Say I want an illustration for my new book cover, I can now DIY it with much less skill and much less time. And no artist to pay

But I betcha physical media artist were saying the same thing when Digital artistry took off. And photographers were met with contempt and fear back in the 1800’s for much the same.

A.I. Generated image. — Deux Ex Machina for Photos ?

Art will live on but it has been a cold new dawn for some.

I’m sorry Old Chap let me put down my paper and tell me what is this A.I. Trickery ?

Read All About it. A.I. image . 2022 — This B&W image looks real enough but it’s been created on Stable diffusion using just the prompt *“1930’s B&W photograph of man reading newspaper in a working man’s club”*

We’ve become use to A.I. help with digital editing. Be Honest, many of you will use your photo editing software to adjust level, contrast etc with varying degrees of A.I. support and remove dust etc even off your digitalized analogue photos.

And the big buzz over the last few years has been over A.I. supportered face swap technology, from dodgy Porn deepfakes, to image manipulation in the news to playing around swapping yer mate’s face onto Boris.

Example of face swap — An innocent deepface of my significant other being swapped into a vintage photo using a mediocre standard passport photo as the basis. Done using the Reface App

And the tech behind this is all pretty clever but you give the computer 2 images and get it to map over a small portion.

The rise of A.I. Image generation

But now the A.I. doesn’t need images to work from a simple prompt will do. The woman below doesn’t exist

Virtual Comrade. A.I Generated image, 2022 — Generated by Stable Diffusion using the Prompt *“Photographic Portrait of a female model in Soviet Uniform”*. Great face but symbols on uniform way off.

A.I. image creation is done by machine learning. There are various systems. But in the end these all boil down to the systems reviewing millions of image images and linking those to text prompts. How the AI construction the image can be varied using a random number sequence (generated by a so called seed. If you keep the seed the same you’ll get a identical (or near depending on the software) image.

This Dall-E mini image set illustrates this as all generated for me from just one prompt “Photorealistic Red Apple”

Dall-E mini image of apples. — All 9 apple generated in 1 run but each will have a different “seed”. Some more real than other but this ran off a simple prompt.

And what’s more these tools can run in a web browser on a mobile phone. No need for a powerful PC .

Power to the People or Uncanny Valley ?

And the Ai is getting slicker.

The image below was created in just one run of a person that does not exist (outside Mario games).

Princess Peaches. A.I. generated 2022 — If it wasn’t for the eyes I could start to convince you I’ve been doing studio shots. Image created on Nightcafe’s version of Stable Diffusion with prompt of *” Photographic Portrait of a woman based on Princess Peaches”*

It can get things wrong at time badly but items are improving. And it’s worth noting these images were run off in a web browser. You can do this on a cheap PC, Chromebook or even your smartphone. In fairness these public accessible A.I. are not without issues.

Sometimes you get a great image first go. The big challenge has been photo realistic hands and eyes as this image demonstrates

Wonky image — Not one of my better results. From massive pints to the Adam’s family hand having a night off, 3 fingered hands and those eyes…

But the same A.I. chucked this out later.

Woman in Bar. A.I. Generated — Much improved still minor issues if you look hard enough at the eyes

And if you think that still has an uncanny valley feel take a look at the photo below.

Ghosts in the Machine ?

StyleGAN generated face of a woman. — The image of a young woman by Owlsmcgee, Public domain, via Wikimedia Commons (2019)

The woman in the 2019 photo does not exist. The image was created by StyleGAN a very advanced A.I. system developed by NVidia Researchers. Although this system is not full proof it produces much more realistic images. For more example visit thispersondoesnotexist.com (hit refresh to change the image and watch out for the odd oops )

Dall-E 2 is also being billed as the A.I. image generator to put this into the mainstream.

And also these advanced facial A.I. are now available commercially. companies like Icon8 produce and sell these online. They get used for advertising (no model or photographer fees) and a host of legitimate uses. They even allow you the ability to make your face from a simple features menu (here’s one I did but they charge you to download it)

Okay So What are the issues ?

(1) They suffer from (our) biases at bit

There is an issue within current A.I. and that is bias. In part that’s down to us. The A.I. use massive amounts of images we create to form their algorithms and sadly we tend to have marked stereotyping.

I ran just the prompt “Nurse” in Stable diffusion and got this

A.I generated images of Nurses — 9 Image sheet generated using stable diffusion in Nightcafé using prompt “Nurse”

Now the A.I did a good job here face wise. But they all look like white females. I ran it 3 times again. It was only on the third batch of images that I clearly get a nurse of black and minority ethnic (BAME) background. There were 2 other image in the whole run whom may have not been caucasian but it was hard to tell. And guess what all 36 images appear to be of female nurses.

That’s pretty awful. NHS England identified that 1 in 5 nurse are of a BAME background. And whilst there are issues about gender inclusion in nursing over 1 in 10 nurse in the UK are male. But not according to the 36 image in a row I made via stable diffusion. Not one male nurse.

…and the cause

And that bias is down to us. We’ve made more stereotypical images over the years

Interestingly google has put in algorithms to produce much more diverse image on search So when I searched for nurse on Google images I got a much more diverse set of images in both race and gender.

And it’s not that A.I. algorithms can’t do darker skin tones and yes the nurse generated was spontaneous, but really it shouldn’t require you to race prompt.

Image based on the prompt “Photograph of Naomi Campbell” Rendered in Image Diffusion

Interestingly Google are not allowing public access to Imagen, their A.I generator in part due to this (although oddly they’ve fixed it on their image search). They noted “..the data requirements of text-to-image models have led researchers to rely heavily on large, mostly uncurated, web-scraped datasets. While this approach has enabled rapid algorithmic advances in recent years, datasets of this nature often reflect social stereotypes, oppressive viewpoints, and derogatory, or otherwise harmful, associations to marginalized identity groups.”

(2) Potential for Misinformation and fraud

And yes sadly these are tool that can be used for misinformation. and yes these engines are incredibly powerful allowing state agents, agitators and a whose of folk to to potential use them for mischief or worse.

A few prompts and you can have a fake moon landing. Or worse

Fake Moon Landing film set ? Only as seen by an A.I. Generated in Stable diffusion

I’ve not spent ages doing that but you get the idea how good the software can be. It does good likeness of well known people and AI can be trained.

Margot in the Apocalypse. A.I. Generated 2022 — You’d be easily fooled into wondering what Margot Robbie Movie this is but this is a Stable diffusion created image using the prompt *“Margot Robbie as an Apocalypse Survivor in a ruined City”.* No City but 😲. The only clue is the eyes if you look closely

It’s not just image misinformation. AI created images are being used to create false accounts. In 2009 facebook took down over 900 accounts with AI generated images.

But this isn’t anything new. A.I assisted face swapping technology existed long before Art generators. And before that well there was always photoshop favoured tools of fraudsters agent provocateurs and meme creators Alike.

Star Trek Meme — If Star Trek was British ?

But Photoshopping goes back to the dawn of photography. The Stalin’s USSR was well know for making folk disappear from official photographs. But it dates back further. A good example is this photograph US Civil War General and later President Ulysses S. Grant. It’s his head but it’s not his body nor was the rider in the original image

This image is actually a composite of 3 images made in 1902 long before Adobe was even founded. Grant’s head has been used on an image of another officer on horseback which in turn is layered on the background scene Click on image for full size and more info. “Ulysses S. Grant at City Point”L. C. Handy, Public domain, via Wikimedia Commons

(3) No point the A.I. can do it

AI generated Image of Fjord — Image generated on Stable Diffusion using prompt *“B&W Photograph Fjord with Tri-X”*

Well yes A.I can produce stunning images and the errors will get better, but photorealistic images predate the digital era and never killed of photography.

Looks like a photo (Click on image for original at wikicommons). This is actually an oil painting by one of the 1st wave US Photorealist. *John’s Diner with John’s Chevelle*, oil on canvas, 30×48 inches. John Baeder (2007) , CC BY-SA 3.0, via Wikimedia Commons

For decades digital artist have produced photorealistic images but in fact photorealism goes back to the 1960’s. And photorealism & digital art has bet genuine photos in winning contracts but so does photography pinch some of their markets. A.I. is never going to catch the intimacy of wedding photos, family holiday shoots or, that perfect day in the studio/on the mountains etc.

Yes it will provide another choice to companies for their images but it’s not the first time artificially created images have been used.

Ladies in the Lights. A. Generated image. 2022 — Better than Barcelona’s inclusive beach ad with no photoshopped people complaining. I’m open to offers if you need a new perfume ad shot !!

But could A.I. help photographers?

There is a good chance you use A.I to tweak your images in the the likes of lightroom, photoshop and Luminar. The new AI image generators now can tweak images. You can mask areas such as clothes and get the generator to change those.

Most spectacular is something called outpainting. You can take a core image but then expand the edges of this in frames gradually building up a much bigger image as shown here

The tech is likely to incorporated into improving editing. You can also use the image to create the basis of another

A final thought on A.I. photos –
More Opportunities than things to fear

It’s a new media and it brings great challenges and issues. But it also brings opportunity and I suspect we have much less to fear than we could benefit from it. But it may be tougher in the Digital Arts world, although again may be a gamechanger for others.

And funnily enough folk were saying the same thing when D

Wanna Play with A.I. ?

I think you’re better trying these, than being worried. There are several programs available but i’ll mention the key ones. I’m focusing on the big generators that you can access and this is not an exhaustive list.

All the tools that are available can be run in a web browser.

(1) Dall-E 2

DALL-E 2 generated this image when given the prompt “Teddy bears working on new AI research underwater with 1990s technology” DALL-E, Public domain, via Wikimedia Commons

Arguably the biggest name at the moment, Dall-E 2 offer potentially game changing features like realistic eyes, allowing you to outpaint an image. And I’d love to play with it but I sit on a waiting list with millions over at open.ai

Their Dall-E 2 page has an interactive limited demo and the chance to join that list too. Dall-E 2 generates 4 images per credit. New users currently get 50 free credits and every month get 15 more free. More credits can be bought. The headline figure you read of $15 for 115 credits (460 images) but some report you are encourage to buy way more.

(2) Craiyon (formerly Dall-E mini)

Craiyon isn’t the choice for photorealistic images but it is worth a play

Craiyon is the most accessible and fun of the tools. Open source and completely free to use. But it is the weakest of the tools here. It produces 9 images in a go. But it really struggles with faces and images have a lo res quality. But it’s free and fun.

(3) Stable Diffusion

Stability.ai’s Stable diffusion is current major rival to Dall-E 2 and is useable without a wait. Funnily enough, most images here were created by this A.I. . What’s more there are several platforms using it.

Most Notable are Stability.ai own version which I guess is bleed edge stuff with the most up to date stuff. You get currently 200 free credits. Everytime you generate a 512×512 image on basic settings you use a credit. Make the image bigger, make multiple, use more processing time the cost goes up.

For casual users who want to dabble with no commitment it offers the cheapest block purchase of credit options (1000 cost just a tenner). You can set up a subscription as well which may be cheaper if you use in the long run.

Vintage This Morning. — NightCafe’s Stable diffusion Generated image with prompt *“1920’s B&W photographic portrait Holly Willoughby, Underexposed”*. She’s a British Daytime TV presenter. Eye iris shape just off a bit

NightCafé also gives you access to Image diffusion, but also 3 other A.I. art orientated image generators. You can generate an image in one and use in the others to evolve that image. It gives 20 free credits for new accounts but can get 5 free each day if you log in and claim. And you can get more free credits by engaging in its social content . You can pay but it’s more pricey ($14.99USD for 100 credits). It also offers subscription. It has a much better interface plus it stores your images for you.

(4) Midjourney

Woman In Red – This image was evolved 3 fold from that prompt in Midjourney

I include Midjourney for completeness. It after all is the A.I. that caused all the fuss in Colorado. It does more art orientated images than photo. Again it offers a free trial of 25 credits. But unlike the Stable diffusion offerings you must sign up to subscription for more credits rather than by a block

(5) Google Imagen

Photo of A Corgi Riding a Bicycle in Times Square wearing sunglasses and a hat. Produced on Imagen. Image property of Google and used under fair use for illustrative properties.

Potentially on of the most powerful A.I. image generators on the planet. But you won’t be using it properly any time soon. As Engadget reported – Google worries about how A.I. generators can be exploited to make inappropriate or malicious images and the potential for bias.

You can at least see examples and fiddle with a limited image set.

4 thoughts on “The Rise of The Machines – Is A.I. challenging Photography ?”

Matthew Bowen says:

28/09/2022 at 10:18 pm

Thank you. This was really interesting and I’ve since enjoyed experimenting with Stable Diffusion, Night Cafe and DALL-E. So far I’d say that DALL-E is my favourite, it seems to be both higher quality and more user-friendly…..but to be fair I’ve only spent a couple of hours on each. But great fun….definitely.

Steve says:

01/10/2022 at 2:29 pm

Fascinating, thank you. For me no time spent sitting at a computer beats the feeling of being there in the moment, knowing you are alive, an affirmation of existence with the pain of knowing you might have created something wonderful, or missed the moment entirely. Perhaps that is why I love your photo, I know you were there and it happened, and against the light is always a good one. But at the end of the day I could be easily fooled. However, you are right to help me realise the creative possibilities are huge. Really important as a means of pictorial communication. But can it creat pictures that are the punctum , that pierce the heart. Time will tell.

Pingback: Precisamos falar sobre Arte, Inteligência Artificial, Luciana Vendramini e a Caixa de Pandora - Contraditorium
Nicholas says:

22/11/2022 at 11:43 am

Hmmmm it’s certainly going to become an interesting world! Makes one question the value of human made art vs AI art and all the existential implications….