
As far as I can tell, AI image generation still struggles with some things even after many years of research, and the results are often detectable. Perhaps vocals are easier, though.


It's like CGI: you only recognize the bad examples of it, while the good ones go right past you. I've got plenty of AI generations that fool professional photo retouchers; it just takes more time and some custom tooling.


> I've got plenty of AI generations that fool professional photo retouchers; it just takes more time and some custom tooling.

What’s a good place to find out the SOTA of the custom tooling and workflow?


ComfyUI + Civitai. 4chan and Reddit threads if you want to go deep.
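
For the curious, a minimal sketch of what that kind of pipeline can look like outside a ComfyUI graph, using the Hugging Face diffusers library instead; the checkpoint filename, prompt, and settings below are placeholders, not anything recommended in this thread:

    # Minimal text-to-image sketch with diffusers (assumed setup: an
    # SDXL-class checkpoint downloaded from Civitai as a .safetensors file).
    import torch
    from diffusers import StableDiffusionXLPipeline

    # Load a single-file checkpoint; "model.safetensors" is a placeholder path.
    pipe = StableDiffusionXLPipeline.from_single_file(
        "model.safetensors", torch_dtype=torch.float16
    )
    pipe.to("cuda")

    # Generate one image; prompt and parameters are illustrative only.
    image = pipe(
        prompt="studio portrait photo, natural skin texture",
        negative_prompt="blurry, deformed hands",
        num_inference_steps=30,
        guidance_scale=7.0,
    ).images[0]
    image.save("out.png")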


> It's like CGI

Right. Full of code injection vulnerabilities.


"many years" lol, midjourney only came out like a year and a half ago and the quality has quadrupled in that time.


Generative text-to-image models based on neural networks have been in development since around 2015. DALL-E was the first to gain widespread attention, in 2021, followed by models like Stable Diffusion and Midjourney.

"Quadrupled" is a very specific and quantitative word. What measure are you basing that on?


The recommended resolution went from 512x512 to 1024x1024 in that time span :)
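
(To spell out the arithmetic behind the joke: doubling each side of a square image quadruples the pixel count.)

    old = 512 * 512      # 262,144 pixels
    new = 1024 * 1024    # 1,048,576 pixels
    print(new / old)     # 4.0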


Ah, right. But that's only tangentially related to being able to distinguish AI-generated images. There are tells that are completely separate from resolution, such as getting the correct spacing of the black keys on a piano.


From audio/video editing experience years back: it is much easier to slip some cheap audio cuts past people than visual ones.


This already sounds like something I would have listened to in the 90s, except with too much autotune.



