
> Not to nitpick words, but ablation is the practice of stripping out features of an algorithm ...

Ablation generally refers to removing parts of a system to see how it performs without them. In the context of an LLM, it can refer to the training data as well as the model itself. I'm not saying it'd be the most cost-effective method, but one could certainly try to create a small coding model by starting with a large one that performs well and seeing what can be stripped out of the training data (obviously a lot!) without hurting performance.
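
For concreteness, here's a rough sketch of what a data ablation like that could look like: hold out one slice of the training corpus at a time, retrain, and compare benchmark scores. The corpus slices are invented and train_and_eval is a made-up placeholder, not anyone's actual pipeline:

    # Hypothetical data-ablation loop: retrain while holding out one slice
    # of the training corpus at a time, then compare benchmark scores.
    def train_and_eval(corpus: dict[str, list[str]]) -> float:
        # Stand-in for the real train + benchmark run; returns a dummy
        # score here so the loop below is actually executable.
        return 0.0

    corpus = {"code": [...], "web_text": [...], "papers": [...], "forums": [...]}
    baseline = train_and_eval(corpus)
    for slice_name in corpus:
        ablated = {k: v for k, v in corpus.items() if k != slice_name}
        delta = train_and_eval(ablated) - baseline
        print(f"without {slice_name}: {delta:+.3f} vs. baseline")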





ML researchers will sometimes vary the size of the training data set to see what happens. It's not common outside of scaling-law research, and even there it's never called “ablation”.
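
To illustrate what that scaling-law practice usually looks like: train at several data-set sizes and fit a power law to the resulting losses. A toy version (all numbers invented for illustration) might be:

    # Toy scaling-law fit: model eval loss as a power law in training-set
    # size, loss(N) = a * (N / 1e6)**(-b) + c. Data points are made up.
    import numpy as np
    from scipy.optimize import curve_fit

    N = np.array([1e6, 1e7, 1e8, 1e9])      # training tokens
    loss = np.array([3.9, 3.1, 2.6, 2.3])   # hypothetical eval losses

    def power_law(n, a, b, c):
        return a * (n / 1e6) ** (-b) + c

    (a, b, c), _ = curve_fit(power_law, N, loss, p0=(2.0, 0.3, 2.0))
    print(f"fitted exponent: {b:.2f}")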


