
> But this suggests a real breakthrough is needed to keep pace with future requirements.

Not necessarily. Look at papers like the lottery ticket hypothesis - big ML models may be doing better simply because gradient descent isn't doing a good enough job. Better optimizers would go a lot further than just throwing compute at the problem. And even if you can afford the compute, it's impractical to use something like GPT-3 all the time.


