
Very cool post. If Jeff Geerling is reading this, I wouldn't mind watching a video on each of these ;)


Check out saveitforparts on youtube, he does lots of this stuff.


Gabe just quit his day job to go full-time on SaveItForParts, so hopefully we'll be seeing even more cool stuff in the near future. Me personally, I'm hoping for a collab between him and Jeff. They've had some interaction already (Jeff gave Gabe a spare computer he had lying around), so maybe... That would be epic if it did happen.


They already did a quick collab in Jeff's Pis-in-space video. But I'm definitely hoping for more.


Aaah. I haven't watched that one yet, so I didn't even know. I'll definitely check that out later today.


I think I update Vundle like once every 3 years.


The 1M token context was Gemini's headlining feature. Now, the only thing I'd like Claude to work on is tokens counted towards document processing. Gemini will often bill 1/10th the tokens Anthropic does for the same document.


Agree, but pricing-wise, Gemini 2.5 Pro wins. Gemini input tokens are half the cost of Claude 4's, and output is $5/million tokens cheaper than Claude. But document processing is where it gets significantly cheaper: a 5MB PDF (customer invoice) with Gemini is like 5k tokens vs. 56k with Claude.

The only downside with Gemini (and it's a big one) is availability. We get rate limited by their dynamic QoS all the time, even when we haven't reached our quota. Our GCP sales rep keeps recommending "provisioned throughput," but it's expensive and doesn't fit our workload type. Plus, the VertexAI SDK is kind of a PITA compared to Anthropic's.
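
If you want to sanity-check that document-token gap on your own files, both vendors expose token-counting endpoints. A rough sketch with the current Python SDKs (google-genai and anthropic); the model IDs, PDF path, and prompt below are just placeholders, not anything from this thread:

    # pip install google-genai anthropic
    import base64
    from google import genai
    from google.genai import types
    import anthropic

    pdf_bytes = open("invoice.pdf", "rb").read()  # placeholder path

    # Gemini: count tokens for the PDF attached as an inline part.
    gclient = genai.Client()  # reads the API key from the environment
    g = gclient.models.count_tokens(
        model="gemini-2.5-pro",  # example model ID
        contents=[types.Part.from_bytes(data=pdf_bytes, mime_type="application/pdf")],
    )

    # Claude: count tokens for the same PDF sent as a document block.
    aclient = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment
    a = aclient.messages.count_tokens(
        model="claude-sonnet-4-20250514",  # example model ID
        messages=[{
            "role": "user",
            "content": [
                {"type": "document",
                 "source": {"type": "base64",
                            "media_type": "application/pdf",
                            "data": base64.standard_b64encode(pdf_bytes).decode()}},
                {"type": "text", "text": "Summarize this invoice."},
            ],
        }],
    )

    print("Gemini tokens:", g.total_tokens)
    print("Claude tokens:", a.input_tokens)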


Google products are such a pain to work with from an API perspective that I actively avoid them where possible.


Is there a crowd-sourced sentiment score for models? I know all these scores are juiced like crazy. I stopped taking them at face value months ago. What I want to know is if other folks out there actually use them or if they are unreliable.


Besides the LM Arena Leaderboard mentioned by a sibling comment, if you go to the r/LocalLlama subreddit, you can get a rough, very unscientific sense of how the models perform by reading the comments (and maybe even checking the upvotes). I think the crowd's knee-jerk reaction is unreliable, but that's what you asked for.


Not anymore tho. It used to be the place to vibe-check a model ~1 year ago, but lately it's filled with toxic my-team-vs.-your-team posturing, memes about CEOs (wtf), and generally poor takes on a lot of things.

For a while it was China vs. the world, but lately it's even more divided, with heavy camping on specific models. You can still get some signal, but you have to either ban a lot of accounts or read /new during different time zones to get some of that "i'm just here for the tech stack" vibe from posters.


Yeah, some people just can't stop acting as if tech companies were sports teams, and it gets annoying fast.


I don't really go there much anymore, but when I did, there seemed to be an inordinate amount of Chinese nationalism from young accounts writing odd English.


This has been around for a while: https://lmarena.ai/leaderboard/text/coding


openrouter usage stats


https://openrouter.ai/rankings

The new qwen3 model is not out yet.


Since the ranking is based on token usage, wouldn't it be skewed by the fact that small models' APIs are often used for consumer products, especially free ones? Meanwhile, reasoning models skew it in the opposite direction, but to what extent I don't know.

It's an interesting proxy, but idk how reliable it'd be.


Also, these small models are meant to be run locally, so they're not going to appear on OpenRouter...


Thanks for your comment! I have a few PDFs that I need to generate for groups of users every so often, and since wkhtmltopdf is considered EOL, I've been forced to use Chrome (which sucks to manage). I just rewrote that code to use Typst (via the typst gem) and it's so, so much better.
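
For anyone curious what that workflow looks like, here's a minimal sketch of the same idea. The commenter used Ruby's typst gem; this sketch just writes a .typ file and shells out to the standard `typst compile` CLI instead, and the template, field names, and paths are made up for illustration:

    # Sketch: render a PDF from a Typst template via the `typst` CLI.
    import subprocess
    import tempfile
    import textwrap
    from pathlib import Path

    # Placeholder Typst markup for an invoice-style document.
    TEMPLATE = textwrap.dedent("""
        #set page(margin: 2cm)
        = Invoice {number}

        Customer: {customer}
    """)

    def render_invoice(number: str, customer: str, out_path: str = "invoice.pdf") -> None:
        with tempfile.TemporaryDirectory() as tmp:
            src = Path(tmp) / "invoice.typ"
            src.write_text(TEMPLATE.format(number=number, customer=customer))
            # `typst compile <input> <output>` is the standard CLI invocation.
            subprocess.run(["typst", "compile", str(src), out_path], check=True)

    render_invoice("42", "ACME Corp")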


Looks great! I'm definitely in the market for something like this, and building on top of Helm charts makes me want to try it out.

Can Canine automatically upgrade my Helm charts? That would be killer. I usually stay on cloud-hosted paid plans because remembering to upgrade is not fun. The other reason is that I always seem to need the ops knowledge right after I've forgotten it.


It can apply upgrades, but I don't think it solves your core problem, which is how to perform upgrades safely. Most of the time it's totally fine, but sometimes a config key changes across versions.

Upgrading helm charts without some manual monitoring seems like it might still be an unsolved problem :(
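
One small mitigation (a generic sketch, nothing Canine-specific): render the upgrade with --dry-run first, so a human or a diff tool can see what would change before anything touches the cluster. The release name, chart, and version below are placeholders:

    # Sketch: pre-flight a Helm chart upgrade before applying it for real.
    import subprocess

    def preview_upgrade(release: str, chart: str, version: str, namespace: str = "default") -> None:
        # Renders the manifests the upgrade *would* apply, without changing the cluster.
        subprocess.run(
            ["helm", "upgrade", release, chart,
             "--version", version, "--namespace", namespace, "--dry-run"],
            check=True,
        )

    preview_upgrade("my-postgres", "bitnami/postgresql", "16.2.0")
    # After reviewing the output (or a diff from the helm-diff plugin, if installed),
    # re-run the same command without --dry-run to actually apply the upgrade.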


Congrats to the Crunchy Data team! Thanks for making containerized postgres so easy for years and years. Wish you all the best!


Congrats to the teams! Like others have said, your pricing ends up killing adoption for my company. We ended up self-hosting Airbyte. It ain't perfect but at least we're not paying $10/GB to replicate data within our own VPC.


I'm guessing any useful application of AI has already been adopted by some volunteers. Wikipedia might be able to build tools around the parts that work well, but the volunteers will end up footing the bill for the AI spend. Wikipedia will probably pivot to building an AI research product which they can sell to universities/B2C.


> Wikipedia will probably pivot to building an AI research product which they can sell to universities/B2C.

Why would they do this? All of Wikipedia is publicly available for any use. They literally do not have a competitive advantage (and don't seem interested in one, either).


Exactly. But using AI to summarize articles, stitch them together, etc. under the Wikipedia brand as a product is something they could easily sell. I can totally see a university buying WikiResearch™ for every student.


I don't anticipate them selling anything, ever.

