jonrouach's comments

jonrouach · 2025-12-19T03:34:33 1766115273

or better, like you mentioned, try to convince Exo to develop in the open, so everyone gets any capability as PRs.

geerlingguy · 2025-12-19T05:01:26 1766120486

They are now, this morning they pushed all the code to the Exo repo, and archived the earlier Exo branch. We'll see how open they are now that whatever embargoed work they did with Apple is public..

jonrouach · 2025-11-04T20:27:24 1762288044

yes!

this announcement had such strong gpt-output vibe..

to the "writers": pleeease don't present unedited slop to me. i'm a human, if you want my attention consider using your own voice. i don't want to read what gpt thought would "market" you best.

but this thread did remind me of marimo so that's sweet

jonrouach · 2025-07-16T00:07:27 1752624447

you're sure it's not their "feature" that calling the api with empty string returns random hallucinations?

https://jarbon.medium.com/gpt-prompt-bug-94322a96c574

requilence · 2025-07-16T00:13:15 1752624795

No, definitely not the empty string hallucination bug. These are clearly real user conversations. They start like proper replies to requests, sometimes reference the original question, and appear in different languages.

jonrouach · 2025-07-16T00:46:15 1752626775

i had the exact same behavior back in 2023, it clearly seemed like leakage of user conversations - but it was just a bug with api calls in the software i was using.

https://snipboard.io/FXOkdK.jpg

postalcoder · 2025-07-16T02:53:39 1752634419

There was an issue with conversation leakage, though. It involved some bug with Redis.

I felt like it was a huge deal at the time but it’s surprisingly hard to quickly google it.

Sebguer · 2025-07-16T02:59:06 1752634746

It was the classic "oh no we did caching wrong" bug that many startups bump into. It didn't expose actual conversations though, only their titles: https://openai.com/index/march-20-chatgpt-outage/

postalcoder · 2025-07-16T04:25:04 1752639904

ah there it is. thanks for jogging my memory. funny to think of how niche chatgpt was considered then to now.

JyB · 2025-07-16T00:24:30 1752625470

I don’t see anything here that would prevent an LLM from generating these. Right?

requilence · 2025-07-16T00:50:42 1752627042

In one of the responses, it provided the financial analysis of a not well-known company with a non-Latin name located in a small country. I found this company; it is real and numbers in the response are real. When I asked my ChatGPT to provide a financial report for this company without using web tools, it responded: `Unfortunately, I don’t have specific financial statements for “xxx” for 2021 and 2022 in my training data, and since you’ve asked not to use web search, I can’t pull them live.`.

BoiledCabbage · 2025-07-16T14:18:10 1752675490

> numbers in the response are real.

OpenAI very well may have a bug, but I'm not clear on this part. How do you know the numbers are real?

I understand you know the name of the company is real, but how do you know the numbers are real?

It's way more than anyone should need to do, but the only way I can see someone knowing this is by contacting the owners of the company.

Sebguer · 2025-07-16T01:08:36 1752628116

Do you understand what a hallucination is?

jojobas · 2025-07-16T02:21:21 1752632481

Coming up with accurate financial data that you can't get it to report outright doesn't seem like one.

Sebguer · 2025-07-16T02:50:44 1752634244

Models do not possess awareness of their training data. Also you are taking at face value that it is "accurate".

refulgentis · 2025-07-16T02:25:12 1752632712

I don't understand the wording

Accurate financial data?

How do we know?

What does the model not having the data without web search have to do with the claim that private chats with the data are being leaked?

01HNNWZ0MV43FF · 2025-07-16T03:47:37 1752637657

> I found this company; it is real and numbers in the response are real.

???

refulgentis · 2025-07-16T04:37:48 1752640668

Which of my questions does that answer?

queenkjuul · 2025-07-16T08:32:01 1752654721

That the financial data is accurate?

refulgentis · 2025-07-16T13:42:17 1752673337

It's an ouroboros - he can't verify it's real! If he can, it's online and available by search.

JyB · 2025-07-16T23:36:35 1752708995

Therefore what are the odds that this is just the LLM doing its thing versus "a vulnerability"? Seems like a pretty obvious bet.

addandsubtract · 2025-07-16T10:19:00 1752661140

New Touring Test unlocked! Differentiate between real and fake hallucinations.

DANmode · 2025-07-16T20:13:54 1752696834

So THAT'S what the "GT" means on all of these GPU model names!

jonrouach · on June 15, 2024

is that a method to save context tokens by "baking" the attended in-context-learned into the weights? I'm missing one step of so-what in the paper abstract..

jonrouach · on June 21, 2023

Love this!

I've been iterating on a similar project, Talk-to-Repo.

It uses retrieval-augmented-generation to access the relevant parts of the code, and lets you chat and collect which code pieces you want.

Got stuck at generating good diffs, I'll be sure to look at how you've done it!

btw i started my project by turning another project, "Twitter explainer", on itself. It loaded its own code, i asked it to add new features and copy-pasted the results (with some tweaking and occasional trips to phind.com )... :)

https://github.com/Arjeo-Inc/talk-to-repo

original project by Mark Tenenholtz: https://twitter.com/marktenenholtz/status/165156810719298355...