Hacker News | cv5005's comments

For systems programming the correct way is to have explicit annotations so you can tell the compiler things like:

    void foo(void *a, void *b, int n) {
        assume_aligned(a, 16);
        assume_stride(a, 16);
        assume_distinct(a, b);
        ... go and vectorize!
    }
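
For what it's worth, a rough sketch of how close today's C already gets to those hypothetical annotations, using `restrict` for distinctness and the GCC/Clang builtin `__builtin_assume_aligned` for alignment (the element type and loop body here are made up, and there is no standard equivalent of assume_stride):

    void foo(float *restrict a, const float *restrict b, int n) {
        float *pa = __builtin_assume_aligned(a, 16);
        const float *pb = __builtin_assume_aligned(b, 16);
        for (int i = 0; i < n; i++)
            pa[i] += pb[i];   /* candidate for auto-vectorization */
    }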

LOL, nope. Those annotations must be part of the type system (e.g. `&mut T` in Rust) and must be checked by the compiler (the borrow checker). The language can provide escape hatches like `unsafe`, but they should be rarely used. Without it you get a fragile footgunny mess.

Just look at the utter failure of `restrict`. It was so rarely used in C that it took several years of constant nagging from Rust developers to iron out various bugs in compilers caused by it.
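
For reference, a minimal sketch of what `restrict` is supposed to buy (hypothetical function, not from any particular codebase):

    /* with restrict the compiler may load *b once and keep it in a register;
     * without it, *b must be reloaded after every store through a, since the
     * two pointers might alias */
    void add_twice(int *restrict a, const int *restrict b) {
        *a += *b;
        *a += *b;
    }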


Does make me wonder what restrict-related bugs will be (have been?) uncovered in GCC, if any. Or whether the GCC devs saw what LLVM went through and decided to try to address any issues preemptively.

IIRC at least one of the `restrict` bugs found by Rust was reproduced on both LLVM and GCC.

gcc has had restrict for 25 years I think. I would hope most bugs have been squashed by now.

Possibly? LLVM had been around for a while as well but Rust still ended up running into aliasing-related optimizer bugs.

Now that I think about it some more, perhaps gfortran might be a differentiating factor? Not familiar enough with Fortran to guess as to how much it would exercise aliasing-related optimizations, though.


I think Fortran function arguments are assumed not to alias. I'm not sure if it matches C restrict semantics though.

Yeah, that's why I was wondering whether GCC might have shaken out its aliasing bugs. Sibling seems to recall otherwise, though.

This data is publically available to anyone in Sweden:

Your salary (well, last year's taxable income), debts/credit rating, criminal history, address, phone number, which vehicles and properties you own, and which company boards you're on.

One of organized crime's biggest sources of income these days is scamming rich old folks, because it's so trivial to get all the details needed (and to know who to target) to be a pretty convincing banker, IRS-type agent, etc.

Some of it you have to kind of manually request at various places, but it's all available.

So data breaches aren't really that big of a deal when everything is already public.


Afaik this breach also contained a lot of data about medical conditions related to workplaces.


If I understand correctly the only thing not public that was leaked was the role each person had in the government.


Why would the role within the government not be public? I can't imagine that being treated as a secret.


[flagged]


Europe is not one country. It's like seeing tornadoes in Kansas and assuming that's all of the US.


Sweden*

None of this is public in Germany or the Netherlands.


You don't have to do int-to-pointer casts for MMIO or absolute addresses; you can punt that to the linker.

    extern struct uart UART0;
Then place that symbol at address X in your linker script.
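
A minimal sketch of that approach, with a made-up register layout and address (the real ones come from the device's datasheet):

    #include <stdint.h>

    struct uart {
        volatile uint32_t DR;   /* data register (hypothetical layout) */
        volatile uint32_t SR;   /* status register (hypothetical layout) */
    };

    extern struct uart UART0;   /* never defined in C; the linker provides it */

    /* GNU ld linker script fragment (address is made up):
     *     PROVIDE(UART0 = 0x40011000);
     */

    void uart_putc(char c) {
        while (!(UART0.SR & 1u))   /* hypothetical "TX ready" bit */
            ;
        UART0.DR = (unsigned char)c;
    }

No integer-to-pointer cast appears anywhere; the compiler just emits loads and stores against the symbol, and the linker pins its address.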


>CVE-2025-32463

Looks like a logic bug to me? So Rust wouldn't have helped.

Those are exactly the kind of bugs you might introduce when you do a rewrite.


One great way you can make things more secure is by reducing attack surface. sudo is huge and old, and has tons of functionality that almost no one uses (like --chroot). A from-scratch rewrite with a focus on the 5% of features that 99% of users use means less code to test and audit. Also a newer codebase that hasn't grown and mutated over the course of 35 years is going to be a lot more focused and easier to reason about.


> Sudo is [...] old.

This is a take I've never understood. I get the huge part, but old? Software doesn't age; when it's older it tends to have fewer bugs, not more.


Do you mean doas ?


doas is too different from sudo. For instance, it uses a completely different syntax for its config.

sudo-rs is designed to be a drop-in replacement for maybe 95-99% of the people who have been using sudo.

(I do use doas on my own systems though)


A good type system can prevent all sorts of logic bugs.


And what if I want a vec(int *)? These token-pasting 'generic' macros never work for non-trivial types.


Correct, complex types must be typedef'd. At least, until c2y integrates _Record as per N3332: https://thephd.dev/_vendor/future_cxx/papers/C%20-%20_Record...
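
A minimal sketch of why the paste breaks and what the typedef buys (the vec macro here is hypothetical, not from any particular library):

    #include <stddef.h>

    #define vec(T) struct vec_##T { T *data; size_t len, cap; }

    vec(int) ints;          /* OK: expands to struct vec_int { int *data; ... } */

    /* vec(int *) ptrs;        breaks: pastes into `struct vec_int * { ... }` */

    typedef int *int_ptr;   /* the typedef workaround */
    vec(int_ptr) ptrs;      /* OK: struct vec_int_ptr { int_ptr *data; ... } */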


I am not terribly excited about this proposal. It is overly complex.


I agree, but the current specification is complex too: two identical "tagged" structs are compatible, two identical "untagged" structs are not. And before C23 it was even worse, depending on whether the two structs were defined in the same file or not.

We're applying a patch over a patch over a patch... no surprise the end result looks like a patchwork!
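
A small illustration of the untagged half of that complexity (names are made up):

    typedef struct { int x, y; } a_t;
    typedef struct { int x, y; } b_t;   /* identical members, yet a_t and b_t
                                           are NOT compatible types */

    void take_a(a_t p) { (void)p; }

    void demo(void) {
        b_t q = {1, 2};
        /* take_a(q);   constraint violation: b_t passed where a_t expected */
        (void)q;
    }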


Sure, but _Record would add even more complexity. The tag rules I had changed in C23 were a step to remove complexity, so a step towards cleaning it up. I wasn't able to fix the untagged case, because WG14 had concerns, but I think these can be addressed, making another step. It is always much harder to undo complexity than to add it.


Pick an appropriate base type (uintN_t) for a bitset, make an array of those (K * N/4), and write a couple of inline functions or macros to set and clear those bits.
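
A minimal sketch along those lines, with a made-up capacity and uint64_t as the word type:

    #include <stddef.h>
    #include <stdint.h>
    #include <stdbool.h>

    #define BITSET_BITS 1024u   /* hypothetical capacity */
    #define WORD_BITS   64u

    typedef struct {
        uint64_t words[(BITSET_BITS + WORD_BITS - 1) / WORD_BITS];
    } bitset;

    static inline void bitset_set(bitset *b, size_t i) {
        b->words[i / WORD_BITS] |= UINT64_C(1) << (i % WORD_BITS);
    }

    static inline void bitset_clear(bitset *b, size_t i) {
        b->words[i / WORD_BITS] &= ~(UINT64_C(1) << (i % WORD_BITS));
    }

    static inline bool bitset_test(const bitset *b, size_t i) {
        return (b->words[i / WORD_BITS] >> (i % WORD_BITS)) & 1u;
    }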


SIMD doesn't make much sense as a standard feature/library for a general-purpose language. If you're doing SIMD, it's because you're doing something particular for a particular machine and you want to leverage platform-specific instructions, so that's why intrinsics (or hell, even externally linked blobs written in asm) are the way to go, and C supports that just fine.

But sure, if all you're doing is dot products I guess you can write a standard function that will work on most SIMD platforms, but who cares; use a linalg library instead.
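
For illustration, a dot product written directly against SSE intrinsics (x86 only; the function name and the scalar-tail handling are just one way to do it):

    #include <stddef.h>
    #include <immintrin.h>

    float dot(const float *a, const float *b, size_t n) {
        __m128 acc = _mm_setzero_ps();
        size_t i = 0;
        for (; i + 4 <= n; i += 4)           /* 4 lanes at a time */
            acc = _mm_add_ps(acc, _mm_mul_ps(_mm_loadu_ps(a + i),
                                             _mm_loadu_ps(b + i)));
        float lanes[4];
        _mm_storeu_ps(lanes, acc);
        float sum = lanes[0] + lanes[1] + lanes[2] + lanes[3];
        for (; i < n; i++)                   /* scalar tail */
            sum += a[i] * b[i];
        return sum;
    }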


How does the Rust compiler ensure that when compiling to machine code? Machine code is less safe than C, after all.


Machine code is generally much safer than C - e.g. it usually lacks undefined behaviour. If you're unsure about how a given piece of machine code behaves, it's usually sufficient to test it empirically.


Not true on RISC-V. That's full of undefined behaviour.

But anyway this is kind of off-topic. I think OutOfHere was imagining that this somehow skips the type checking and borrow checking steps which of course it doesn't.


What's all that undefined behavior? Closest I can think of is executing unsupported instructions, but you have to mess up pretty hard for that to happen, and you're not gonna get predictable behavior here anyway (and sane hardware will trap of course; and executing random memory as instructions is effectively UB on any architecture).

(there's a good bit of unpredictable behavior (RVV tail-agnostic elements, specific vsetvl result), but unpredictable behavior includes any multithreading in any architecture and even Rust (among other languages))


Accessing non-existent CSRs is another big one, which also means you can't probe for features.

There's loads more though. Just search for "reserved" in the ISA manual.

Of course a Rust to C compiler is not going to hit any of these. I was just pointing them out.


Fair point on CSRs, though I'd count that as a subset of unsupported/not-yet-specified instructions; pretty sure all of the "reserved"s in the spec are effectively not-yet-defined instructions too, which'll have equivalents in any architecture with encoding space left for future extensions, not at all unique to RISC-V.

But yeah, no try-running-potentially-unsupported-things-to-discover-what-is-supported; essentially a necessary property for an open ISA as there's nothing preventing a vendor from adding random custom garbage in encoding space they don't use.


Yeah I guess the difference is once an instruction/CSR has been defined in x86 or ARM the only two options are a) it doesn't exist, and b) it's that instruction.

In RISC-V it can be anything even after it has been defined.

Actually... I say that, but they do actually reserve spaces in the CSR and opcode maps specifically for custom extensions so in theory they could say it's only undefined behaviour in those spaces and then you would be able to probe in the standard spaces. Maybe.

I think they just don't want people probing though, even though IMO it's the most expedient solution most of the time. Otherwise you have to go via an OS syscall, through the firmware and ACPI tables, device tree or mconfigptr (when they eventually define that).


On getting supported extension status - there's a C API spec that could potentially become an option for an OS-agnostic way: https://github.com/riscv-non-isa/riscv-c-api-doc/blob/main/s.... libc already will want to call whatever OS thing to determine what extensions it can use for memcpy etc, so taking the results from libc is "free".


Not any different from C - a given C compiler + platform will behave completely deterministically and you can test the output and see what it does, regardless of UB or not.


> a given C compiler + platform will behave completely deterministically and you can test the output and see what it does, regardless of UB or not.

Sure[1], but that doesn't mean it's safe to publish that C code - the next version of that same compiler on that same platform might do something very different. With machine code (especially x86, with its very friendly memory model) that's unlikely.

(There are cases like unused instructions becoming used in newer revisions of a processor - but you wouldn't be using those unused instructions in the first place. Whereas it's extremely common to have C code that looks like it's doing something useful, and is doing that useful thing when compiled with a particular compiler, but is nevertheless undefined behaviour that will do something different in a future version)

[1] Build nondeterminism does exist, but it's not my main concern


CPUs get microcode updates all the time, too. Nothing is safe from bitrot unless you’re dedicated to 100% reproducible builds and build on the exact same box you’re running on. (…I’m not, for the record - but the more, the merrier.)


> CPUs get microcode updates all the time, too.

To fix bugs, sure. They don't generally get updates that contain new optimizations that radically break existing machine code, justifying this by saying that the existing code violated some spec.


>To fix bugs, sure.

Maybe your program worked due to the bug they fixed.


Extremely unlikely. CPU bugs generally halt the CPU or fail to write the result or something like that. The Pentium FDIV bug where it would give a plausible but wrong result was a once in a lifetime thing.


Spectre and Meltdown exploits stopped working, too. Some of them on some CPUs, anyway.


Sure. But those were obviously exploits from the start. You wouldn't write code like that accidentally.


Do a web search for rdrand and systemd.


> Do a web search for rdrand and systemd.

RDRAND always returning all-FF is exactly the kind of thing that's an obvious bug, not a plausible-but-wrong result.


The other guy said "Maybe your program worked due to the bug they fixed.". The RDRAND fix achieved exactly that.


It is not terribly hard to generate C code that does not use undefined behavior.


Maybe. But when carefully investigated, the overwhelming majority of C code does in fact use undefined behaviour, and there is no practical way to verify that any given code doesn't.


It is easy to create code where this can be verified. It is difficult to verify for arbitrary code.


No.


Or it could be made faster because certain manual optimizations become possible.

An example would be a table of interned strings that you want to match against (say you're writing a parser). Since standard C says thou shalt not compare pointers with < or > unless they both point into the same 'object', you are forbidden from writing the speed-of-light code:

  char *keywords_begin, *keywords_end;
  if(some_str >= keywords_begin && some_str < keywords_end) ...
Officially sanctioned workarounds would require extra indirection (using indices, for example), which is suboptimal.


You can cast them to uintptr_t and compare them to your heart's desire.
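
A sketch of that workaround (with the caveat that the standard only guarantees the pointer/uintptr_t round trip; the ordering of the converted values is implementation-defined, it just happens to be sane on flat-memory targets):

    #include <stdint.h>
    #include <stdbool.h>

    static bool in_keyword_table(const char *s,
                                 const char *keywords_begin,
                                 const char *keywords_end) {
        uintptr_t p = (uintptr_t)s;
        return p >= (uintptr_t)keywords_begin && p < (uintptr_t)keywords_end;
    }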


You don't need UB for that.

A simple model for both compilers and programmers to understand:

"A variable whose address has not been taken need not be reachable via a random pointer".

I mean that's how an assembly programmer would think - if I put something in r0 I don't expect a store instruction to clobber it.


What you describe there is UB. If you define this in the standard, you are defining a kind of runtime behavior that can never happen in a well-formed program, and the compiler does not have to make a program that encounters this behavior do anything in particular.

