As far as I can tell, AI image generation still struggles with some things after many years of research and is often detectable. Perhaps vocals are easier, though.
It's like CGI: you only recognize the bad examples, while the good ones go right past you. I've got plenty of AI generations that fool professional photo retouchers; it just takes more time and some custom tooling.
Generative text-to-image models based on neural networks have been in development since around 2015. DALL-E was the first to gain widespread attention, in 2021, followed by later models like Stable Diffusion and Midjourney.
"Quadrupled" is a very specific and quantitative word. What measure are you basing that on?
Ah, right. But that's only tangentially related to being able to distinguish AI-generated images. There are tells that are completely separate from resolution, such as getting the correct spacing of black keys on a piano.