Hacker Newsnew | past | comments | ask | show | jobs | submit | jandom's commentslogin

Pssst they'll realise scientists hand out here too


Remote friendly is a life changer to young parents… and we saw what happened to that fairly quickly


Didn't the COVID lockdowns lead to a small baby boom for first-time parents?


My neighborhood is a positive data point.


This feels like a poorly controlled experiment: the reverse effect should be studied with a less empathetic model, to see if the reliability issue is not simply caused by the act of steering the model


Hi, author here, this is exactly what we tested in our article:

> Third, we show that fine-tuning for warmth specifically, rather than fine-tuning in general, is the key source of reliability drops. We fine-tuned a subset of two models (Qwen-32B and Llama-70B) on identical conversational data and hyperparameters but with LLM responses transformed to be have a cold style (direct, concise, emotionally neutral) rather than a warm one [36]. Figure 5 shows that cold models performed nearly as well as or better than their original counterparts (ranging from a 3 pp increase in errors to a 13 pp decrease), and had consistently lower error rates than warm models under all conditions (with statistically significant differences in around 90% of evaluation conditions after correcting for multiple comparisons, p<0.001). Cold fine-tuning producing no changes in reliability suggests that reliability drops specifically stem from warmth transformation, ruling out training process and data confounds.


I had the same thought, and looked specifically for this in the paper. They do have a section where they talk about fine tuning with “cold” versions of the responses and comparing it with the fine tuned “warm” versions. They found that the “cold” fine tune performed as good or better than the base model, while the warm version performed worse.


Also its not clear if the same effect appears on larger models like GPT-5, gemini 2.5-pro and whatever the largest most recent Anthropic model is.

The title is an overgeneralization.


Thank you! I did use an existing repo and forked it, so kudos to that guy


If it was possible to beat silicon valley competitors we'd see a lot more unicorns outside silicon valley. You certainly can hide but you can't run ;-)


> a lot more unicorns outside silicon valley

The problem I see is that the massive funding available to "SV Startups" often skews what a success really is. Say you have a startup that's received several rounds of funding and based on those rounds it's now valued at $1B+, what is that valuation really worth if you're burning cash so fast you're only 1 or 2 failed rounds away from bankruptcy. Sure it's nice to be able to say "we have a 1 billion dollar company" but what is that worth if the valuation only holds if investors keep pouring cash into it? IMHO a company that is worth $100M and can stand on it's own feet without constant infusions of cash is inherently more valuable than a company "worth" $1B but needs to make every funding round to keep the lights on.


"Unicorn" is a term meaning a billion dollar valuation, without being listed as a stock. If I go spend a billion dollars on a company it is a unicorn, even if has zero success or sustainability.

So of course there are fewer unicorns outside of SV. The entire concept of a unicorn is tightly coupled to VC dollars.


Do you even have to spend $1B? If you spend $100M for 10% or $10M for 1% of the company will get you to “unicorn” status, technically.


Lets go all the way then - if I spend one dollar on one billionth of a company, it is a unicorn.

OK, someone go launch UaaS - Unicorn as a Service: I'll buy a billionth of any company for a dolla, and you get to claim to be the latest unicorn!


Convincing someone to pay $100 million for 10% of a business is quite a feat. Not for average joes.


Yeah, sorry if my jest wasn't clearly jesting. I'm actually not overly concerned with successful VC-backed companies, I'm just pointing out that the term "unicorn" is fairly contrived.


Actually, this is indicative of the SV mindset where "biggest wins". I hope that's not the case for many businesses, and there are plenty where "successful" means having sustainable income growth, and "win" means "being the obvious choice in your niche".


Most unicorn value is predicated upon the petrodollar/US Power and influence.


Yeah this kind if work is usually meh. It’s unclear if the high concentration is physically relevant in humans, plus it’s not clear if it causes any other problems, and plus it’s not clear how it compares to existing treatments. So maybe promising but v v early


You cannot completely reject it, this is the first step, the next is human clinical trials.


Obligatory XKCD: https://xkcd.com/1217/


Isn't that being a little optimistic? After all if this was good news, it wouldn't be news :-)

Fully appreciating the point: antibody response is not the only response of the immune system.

The other story on HN right now is "AstraZeneca vaccine doesn't prevent B1351 Covid in early trial" – a finding consistent with the results of this study (yes, yes, AZ vaccine != Pfizer/Moderna).

https://www.cidrap.umn.edu/news-perspective/2021/03/astrazen...

So what becomes interesting is: does the AZ vaccine not produce the same T-cell response as Pfizer/Moderna, or are these vaccines in fact broadly similar (mutants don't affect T-cells but affect antibodies). The latter wouldn't be consistent with Skelly 2021.


The article has already been peer-reviewed and published in the "Cell" journal

https://www.sciencedirect.com/science/article/pii/S009286742...


"TIL: Rafael Quintero, a mexican drug trafficker, once tortured two students with ice picks and buried them alive for accidentally walking into his private dinner party."

Some parties just get out of hand...


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: