> But this suggests a real breakthrough is needed to keep pace with future requirements.

Not necessarily. Look at papers like the lottery ticket hypothesis - big ML models may be doing better simply because gradient descent isn't doing a good enough job on smaller ones. Better optimizers would go a lot further than just throwing compute at the problem. And even if you can throw compute at it, it's impractical to use something like GPT-3 all the time.