More

sali0 · 2025-12-21T14:39:47 1766327987

Should have called it Blackmail

sali0 · 2025-08-16T03:42:03 1755315723

Not having this statement results in a compile time error in solidity.

yjftsjthsd-h · 2025-08-16T04:29:15 1755318555

Even if that's true: Okay, so why "UNLICENSED" instead of "CC0-1.0"? Especially for a trivial example? Or I guess MIT or something if you really care about something at the same level as hello world?

sali0 · 2025-07-26T03:10:35 1753499435

No it's called sarcasm

sali0 · 2025-06-15T10:00:07 1749981607

noob question, i'm currently adding telemetry to my backend.

I was at first implementing otel throughout my api, but ran into some minor headaches and a lot of boilerplate. I shopped a bit around and saw that Sentry has a lot of nice integrations everywhere, and seems to have all the same features (metrics, traces, error reporting). I'm considering just using Sentry for both backend and frontend and other pieces as well.

Curious if anyone has thoughts on this. Assuming Sentry can fulfill our requirements, the only thing taht really concerns me is vendor-lockin. But I'm wondering other people's thoughts

srikanthccv · 2025-06-15T10:24:37 1749983077

>I was at first implementing otel throughout my api, but ran into some minor headaches and a lot of boilerplate

OTeL also has numerous integrations https://opentelemetry.io/ecosystem/registry/. In contrast, Sentry lacks traditional metrics and other capabilities that OTeL offers. IIRC, Sentry experimented with "DDM" (Delightful Developer Metrics), but this feature was deprecated and removed while still in alpha/beta.

Sentry excels at error tracking and provides excellent browser integration. This might be sufficient for your needs, but if you're looking for the comprehensive observability features that OpenTelemetry provides, you'd likely need a full observability platform.

vrosas · 2025-06-15T14:48:05 1749998885

Think of otel as just a standard data format for your logs/traces/metrics that your backend(s) emit, and some open source libraries for dealing with that data. You can pipe it straight to an observability vendor that accepts these formats (pretty much everyone does - datadog, stackdriver, etc) or you can simply write the data to a database and wire up your own dashboards on top of it (i.e. graphana).

Otel can take a little while to understand because, like many standards, it's designed by committee and the code/documentation will reflect that. LLMs can help but the last time I was asking them about otel they constantly gave me code that was out of date with the latest otel libraries.

vanschelven · 2025-06-16T14:31:10 1750084270

I'd say "track errors first" [0] and focus on APM later (if at all). If you're worried about Sentry's lock-in, know that there are API-compatible drop-in replacements[1][2] though they are less feature-complete on the APM/observability side.

[0] https://www.bugsink.com/blog/track-errors-first/

[1] https://www.bugsink.com/

[2] https://glitchtip.com/

stackskipton · 2025-06-15T17:25:42 1750008342

Ops type here, Otel is great but if your metrics are not there, please fix that. In particular, consider just import prometheus_client and going from there.

Prometheus is bog easy to run, Grafana understands it and anything involving alerting/monitoring from logs is bad idea for future you, I PROMISE YOU, PLEASE DON'T!

sali0 · 2025-06-16T07:21:16 1750058476

Thank you, this is where I'll likely start.

From other comments as well, seems it's still worth trying to integrate otel. Appreciate everyone's insights

avtar · 2025-06-15T19:13:52 1750014832

> anything involving alerting/monitoring from logs is bad idea for future you

Why is issuing alerts for log events a bad idea?

_kblcuk_ · 2025-06-15T19:26:11 1750015571

It’s trivial to alter or remove log lines without knowing or realizing that it affects some alerting or monitoring somewhere. That’s why there are dedicated monitoring and alerting systems to start with.

sethammons · 2025-06-15T19:46:26 1750016786

Same with metrics.

If you need an artifact from your system, it should be tested. We test our logs and many types of metrics. Too many incidents from logs or metrics changing and no longer causing alerts. Never got to build out my alert test bed that exercises all know alerts in prod, verifying they continue to work.

stackskipton · 2025-06-15T23:53:04 1750031584

Couple of reasons.

Biggest one, sample rate is much higher (every log) and this can cause problems if service goes haywire and starts spewing logs everywhere. Logging pipelines tend to be very rigid as well for various reasons. Metrics are easier to handle as you can step back sample rate, drop certain metrics or spin up additional Prometheus instances.

Logging format becomes very rigid and if the company goes multiple languages, this can be problematic as different languages can behave differently. Is this exception something we care about or not? So we throw more code in attempt to get logging alerting into state that does not drive everyone crazy where if we were just doing "rate(critical_errors[5m] > 10" in Prometheus, we would be all set!

whatevermom · 2025-06-15T10:11:01 1749982261

Sentry isn’t really a full on observability platform. It’s for error reporting only (that is annotated with traces and logs). It turns out that for most projects, this is sufficient. Can’t comment on the vendor lock-in part.

dboreham · 2025-06-15T12:30:18 1749990618

You can run your own sentry server (or at least last time I worked with it you could). But as others have noted sentry is not going to provide the same functionality as OTel.

mdaniel · 2025-06-15T16:54:21 1750006461

The word "can" is doing a lot of work in your comment, based on the now horrific number of moving parts[1] and I think David has even said the self-hosting story isn't a priority for them. Also, don't overlook the license, if your shop is sensitive to non-FOSS licensing terms

1: https://github.com/getsentry/self-hosted/blob/25.5.1/docker-...

sali0 · 2025-06-12T12:02:35 1749729755

Next and RSCs have become some of the most frustrating things I've worked with on frontend. Dealing with FE is already annoying enough, but having to wrestle the magic of Next and then vendor lock-in to Vercel to top it off.

Team is trying out Tanstack router + vite this week. Excited to build a regular ass CSA.

koakuma-chan · 2025-06-12T12:08:16 1749730096

> Next and RSCs have become some of the most frustrating things I've worked with on frontend

What is frustrating about RSCs? They work really well for me and I'm still using Next.js only because of RSCs.

sali0 · 2025-06-13T10:28:15 1749810495

It's fine if you know them well. The unclarity in boundaries between client and server components, and the unintentional complexity that brings, is just frustrating to work with. I will gladly take a CSA any day.

But don't let a random internet stranger detract you. If it works for you, go for it.

sali0 · 2025-06-09T06:11:32 1749449492

URCL is sending me down a rabbithole. Haven't looked super deeply yet, but the most hilarious timeline would be that an IR built for Minecraft becomes a viable compilation target for languages.

sali0 · 2025-05-27T06:20:17 1748326817

I love this, spent more time on it than i thought i would. You should make this into a native mobile game.

cube00 · 2025-05-27T06:24:36 1748327076

This kind of simple game feels like it would be perfect to remain web/PWA based, what benefits do you foresee in a native mobile version?

sali0 · 2025-05-27T06:49:38 1748328578

Would be nice to pop it up on the phone via an app. PWA would also be nice.

But main reason is so he can monetize. Would gladly pay a few dollars for this.

solumunus · 2025-05-27T07:50:04 1748332204

Monetisation.

johnisgood · 2025-05-27T10:14:51 1748340891

With screen blocking ads popping up every 5 seconds? :D

Ylpertnodi · 2025-05-27T11:06:53 1748344013

With 'several' X's on the ads....to make closing them a 'game'.

sali0 · 2025-05-23T16:41:25 1748018485

Partially agree. But I can't help thinking this is the natural lifecycle of protocols. They first start as open projects, proliferate as such, and evolve into standards with governance once they catch on.

What would you call these projects? Open protocols?

sali0 · 2025-05-22T16:54:21 1747932861

I've found myself having brand loyalty to Claude. I don't really trust any of the other models with coding, the only one I even let close to my work is Claude. And this is after trying most of them. Looking forward to trying 4.

SkyPuncher · 2025-05-22T17:15:03 1747934103

Gemini is _very_ good at architecture level thinking and implementation.

I tend to find that I use Gemini for the first pass, then switch to Claude for the actual line-by-line details.

Claude is also far superior at writing specs than Gemini.

leetharris · 2025-05-22T17:25:47 1747934747

Much like others, this is my stack (or o1-pro instead of Gemini 2.5 Pro). This is a big reason why I use aider for large projects. It allows me to effortlessly combine architecture models and code writing models.

I know in Cursor and others I can just switch models between chats, but it doesn't feel intentional the way aider does. You chat in architecture mode, then execute in code mode.

adriand · 2025-05-23T01:55:18 1747965318

I also use Aider (lately, always with 3.7-sonnet) and really enjoy it, but over the past couple of weeks, the /architect feature has been pretty weird. It previously would give me points (e.g. 1. First do this, 2. Then this) and, well, an architecture. Now it seems to start spitting out code like crazy, and sometimes it even makes commits. Or it thinks it has made commits, but hasn't. Have you experienced anything like this? What am I doing wrong?

piperswe · 2025-05-22T21:45:54 1747950354

Cline also allows you to have separate model configuration for "Plan" mode and "Act" mode.

Keyframe · 2025-05-22T18:30:37 1747938637

could you describe a bit how does this work? I haven't had much luck with AI so far, but I'm willing to try.

BeetleB · 2025-05-22T19:19:07 1747941547

https://aider.chat/2024/09/26/architect.html

The idea is that some models are better at reasoning about code, but others are better at actually creating the code changes (without syntax errors, etc). So Aider lets you pick two models - one does the architecting, and the other does the code change.

SirYandi · 2025-05-22T20:41:11 1747946471

https://harper.blog/2025/02/16/my-llm-codegen-workflow-atm/

"tl:dr; Brainstorm spec, then plan a plan, then execute using LLM codegen. Discrete loops. Then magic."

justinbaker84 · 2025-05-22T17:27:31 1747934851

I have been very brand loyal to claude also but the new gemini model is amazing and I have been using it exclusively for all of my coding for the last week.

I am excited to try out this new model. I actually want to stay brand loyal to antropic because I like the people and the values they express.

vFunct · 2025-05-22T17:17:11 1747934231

Yah Claude tends to output 1200+ line architectural specification documents while Gemini tends to output ~600 line. (I just had to write 100+ architectural spec documents for 100+ different apps)

Not sure why Claude is more thorough and complete than the other models, but it's my go-to model for large projects.

The OpenAI model outputs are always the smallest - 500 lines or so. Not very good at larger projects, but perfectly fine for small fixes.

chermi · 2025-05-22T17:41:17 1747935677

I'd interested to hear more about your workflow. I use Gemini for discussing the codebase, making ADR entries based on discussion, ticket generation, documenting the code like module descriptions and use cases+examples, and coming up with detailed plans for implementation that cursor with sonnet can implement. Do you have any particular formats, guidelines or prompts? I don't love my workflow. I try to keep everything in notion but it's becoming a pain. I'm pretty new to documentation and proper planning, but I feel like it's more important now to get the best use out of the llms. Any tips appreciated!

vFunct · 2025-05-22T18:15:52 1747937752

For a large project, the real human trick for you to do is to figure out how to partition it down to separate apps, so that individual LLMs can work on them separately, as if they were their own employees in separate departments.

You then ask LLMs to first write features for the individual apps (in Markdown), giving it some early competitive guidelines.

You then tell LLMs to read that features document, and then write an architectural specification document. Tell it to maybe add example data structures, algorithms, or user interface layouts. All in Markdown.

You then feed these two documents to individual LLMs to write the rest of the code, usually starting with the data models first, then the algorithms, then the user interface.

Again, the trick is to partition your project to individual apps. Also an app isn't the full app. It might just be a data schema, a single GUI window, a marketing plan, etc.

The other hard part is to integrate the apps back together at the top level if they interact with each other...

chermi · 2025-05-23T14:42:06 1748011326

Awesome, thanks! It's interesting how the most effective LLM use for coding kind of enforces good design principles. It feels like good architects/designers are going to be more important than ever.

Edit- Except maybe TDD? Which kind of makes me wonder if TDD was a good paradigm to begin with. I'm not sure, but I'm picturing an LLM writing pretty shitty/hacky code if its goal is just passing tests. But I've never really tried TDD either before or after LLM so I should probably shut up.

rogerrogerr · 2025-05-22T17:23:26 1747934606

> I just had to write 100+ architectural spec documents for 100+ different apps

… whaaaaat?

vFunct · 2025-05-22T17:29:48 1747934988

Huge project..

skybrian · 2025-05-22T17:55:00 1747936500

Big design up front is back? But I guess it's a lot easier now, so why not?

Bjartr · 2025-05-22T17:44:34 1747935874

What's your prompt look like for creating spec documents?

jonny_eh · 2025-05-22T18:27:30 1747938450

Who's reading these docs?

tasuki · 2025-05-22T20:16:29 1747944989

Another LLM, which distills it back into a couple of sentences for human consumption.

causal · 2025-05-22T17:18:48 1747934328

This is exactly my approach. Use Gemini to come up with analysis and a plan, Claude to implement.

cheema33 · 2025-05-22T17:29:40 1747934980

This matches with my experience as well.

MattSayar · 2025-05-22T17:04:57 1747933497

Same. And I JUST tried their GitHub Action agentic thing yesterday (wrote about it here[0]), and it honestly didn't perform very well. I should try it again with Claude 4 and see if there are any differences. Should be an easy test

[0] https://mattsayar.com/personalized-software-really-is-coming...

WillPostForFood · 2025-05-22T17:15:27 1747934127

It would be pretty funny if your "not today. Maybe tomorrow" proposition actually did happen the next day.

MattSayar · 2025-05-22T18:15:21 1747937721

It literally almost did!

fivestones · 2025-05-23T11:14:02 1747998842

Did you try it? Were the results any better?

kilroy123 · 2025-05-22T17:05:51 1747933551

The new Gemini models are very good too.

pvg · 2025-05-22T18:02:16 1747936936

As the poet wrote:

I prefer MapQuest

that's a good one, too

Google Maps is the best

true that

double true!

sergiotapia · 2025-05-22T17:18:18 1747934298

Gemini 2.5 Pro replaced Claude 3.7 for me after using nothing but claude for a very long time. It's really fast, and really accurate. I can't wait to try Claude 4, it's always been the most "human" model in my opinion.

ashirviskas · 2025-05-22T17:27:45 1747934865

Idk I found Gemini 2.5 Breaking code style too often and introducing unneeded complexity on the top of leaving unfinished functions.

qingcharles · 2025-05-22T18:58:50 1747940330

I'm slutty. I tend to use all four at once: Claude, Grok, Gemini and OpenAI.

They keep leap-frogging each other. My preference has been the output from Gemini these last few weeks. Going to check out Claude now.

cleak · 2025-05-22T17:30:03 1747935003

Something I’ve found true of Claude, but not other models, is that when the benchmarks are better, the real world performance is better. This makes me trust them a lot more and keeps me coming back.

Recursing · 2025-05-22T17:21:26 1747934486

I also recommend trying out Gemini, I'm really impressed by the latest 2.5. Let's see if Claude 4 makes me switch back.

dwaltrip · 2025-05-22T17:25:00 1747934700

What's the best way to use gemini? I'm currently pretty happy / impressed with claude code via the CLI, its the best AI coding tool I've tried so far

stavros · 2025-05-22T17:47:33 1747936053

I use it via Aider, with Gemini being the architect and Claude the editor.

alex1138 · 2025-05-22T17:09:34 1747933774

I think "Kagi Code" or whatever it's called is using Claude

javier2 · 2025-05-22T17:23:47 1747934627

I wouldn't go as far, but I actually have some loyalty to Claude as well. Don't even know why, as I think the differences are marginal.

bckr · 2025-05-23T15:04:36 1748012676

It’s possible to get to know the quirks of these models and intuit what will and won’t work, and how to overcome those limitations. It’s also possible to just get to know, like, and trust their voice. I’m certain that brand awareness is also a factor for me in preferring Claude over ChatGPT etc

nico · 2025-05-22T17:41:58 1747935718

I think it really depends on how you use it. Are you using an agent with it, or the chat directly?

I've been pretty disappointed with Cursor and all the supported models. Sometimes it can be pretty good and convenient, because it's right there in the editor, but it can also get stuck on very dumb stuff and re-trying the same strategies over and over again

I've had really good experiences with o4-high-mini directly on the chat. It's annoying going back and forth copying/pasting code between editor and the browser, but it also keeps me more in control about the actions and the context

Would really like to know more about your experience

rasulkireev · 2025-05-22T17:23:06 1747934586

i was the same, but then slowly converted to Gemini. Still not sure how that happened tbh

anal_reactor · 2025-05-22T16:56:22 1747932982

I've been initially fascinated by Claude, but then I found myself drawn to Deepseek. My use case is different though, I want someone to talk to.

miroljub · 2025-05-22T17:03:07 1747933387

I also use DeepSeek R1 as a daily driver. Combined with Qwen3 when I need better tool usage.

Now that both Google and Claude are out, I expect to see DeepSeek R2 released very soon. It would be funny to watch an actual open source model getting close to the commercial competition.

ashirviskas · 2025-05-22T17:30:49 1747935049

Have you compared R1 with V3-0324?

miroljub · 2025-05-27T14:35:03 1748356503

R1 takes more time to answer, but I don't remember a single case where I actually compared answers where R1 was worse than pure V3.

And I don't even have to wait that long. If I watch the thinking, I can spot quickly it misunderstood me and rephrase the question without even waiting for the full response.

jabroni_salad · 2025-05-22T19:57:37 1747943857

A nice thing about Deepseek is that it is so cheap to run. It's nice being able to explore conversational trees without getting a $12 bill at the end of it.

sali0 · 2025-05-22T04:57:55 1747889875

Will give this a try. I was using claude code to make a small game in love2d, and really hoped for a nice lua ecs lib to help manage the game objects (its my first time playing around with game dev).