Hacker News
Small Fine-Tuned Models Are All You Need (oumi.ai)
6 points by stefanwebb 76 days ago | 2 comments


For most enterprise use cases, it is indeed all they need. Why use a slow, expensive, and inaccurate sledgehammer to drive your very specific small nail into the wall?

It just doesn't make sense, most of the time, to use slow, expensive, generic black-box models that aren't optimized for the specific task.


Seems topical given some recent front-page HN articles on fine-tuning. I discuss a large-scale empirical study from 2024 of fine-tuning 7B models to outperform GPT-4 and GPT-3.5-Turbo, as well as arguments for why fine-tuning is coming back into favor.




