Can I disable the safety checking tag on a bare union without packed?

Sze · June 7, 2024, 1:03am

I am building a data structure which uses a non-tagged/bare union for an array of slots:

pub const Head = packed struct {
    mask: MaskInt,
    meta: MetaInt,
};
pub const Node = union {
    head: Head,
    data: T,
};

nodes:[*]Node,

I could turn this union into a packed union, to get rid of the automatically added safety checking tag in safe modes.

However if I do that T has to be a data type with defined memory layout, but I want to allow the user to use whatever they want.

Basically I want to opt-out of the safety check even in safe modes, because the tag brings more trouble then it helps me.

This is part of data structures meant to store things compactly, if I can’t disable the tag, then I guess I have to avoid using the union completely and instead calculate addresses and offsets directly.
So that I can have both compactness and allow the user to choose an arbitrary type for T.

One of the problems with the tag is that it doubles the size of the nodes.
Using types with vastly different memory layouts while testing and then have everything different in release also seems unhelpful, while checks are on, things get tested, but when they are off things have a different layout.

Another problem is that I no longer can reliably cast the [*]Node pointer to a [*]align(@alignOf(Node)) T pointer, because the beginning of the node is the tag and thus T reinterprets the debug tag info as T, instead of the real value behind the tag.

Ironically the safety feature makes it more difficult to use the union in a way that is helpful to the goal.

I am wondering whether this is just a case where Zig doesn’t want me to use a union?

I guess I might just change it to use head: *Head and then add @sizeOf(Head) to that and forwardAlign to the next [*]T to get to the data.
It just seems a bit unsatisfying having to do this so manually.

If it currently needs to be done manually, I would also be interested if anybody knows about plans to make things like this easier.

mnemnion · June 7, 2024, 1:42am

This works:

// safe union
pub const SafeUnion = union {
    int: i64,
    float: f64,
};

// unsafe union
pub const UnsafeUnion = blk: {
    @setRuntimeSafety(false);
    break :blk union {
        int: i64,
        float: f64,
    };
};

test "safe and unsafe" {
    std.debug.print("\nsize of SafeUnion is {d}\n", .{@sizeOf(SafeUnion)});
    std.debug.print("size of UnsafeUnion is {d}\n", .{@sizeOf(UnsafeUnion)});
}

Sze · June 7, 2024, 1:55am

Thank you, this is exactly what I needed!
I had a look at @setRuntimeSafety, but didn’t think of using it with a block like that.

andrewrk · June 7, 2024, 3:53am

No! This is still illegal, and should be unmarked as a solution.

If it crashes with safety enabled, it means your program has Undefined Behavior without safety.

Extern unions offer type-punning. Bare unions do not. To convert between i64 and f64 your best strategy is @bitCast.

Sze · June 7, 2024, 3:59am

The intent wasn’t to change types, but to avoid having the overhead of the safety checking tag and the change in memory layout that it causes.

Do you still think that is a misuse too?

I am only accessing the active elements I just don’t want my 4 byte nodes to become 8 byte nodes in safe mode, messing up a tight memory layout.

andrewrk · June 7, 2024, 5:51am

If it crashes with safety enabled, it means your program has Undefined Behavior without safety.

dimdin · June 7, 2024, 6:14am

Why not use the following?

pub const Node = extern union {
    head: Head,
    data: T,
};

If T is something special that cannot be used in extern ABI you can compromise using T=usize instead and @bitCast.

mnemnion · June 7, 2024, 8:46am

So, it is the correct answer: the question was how to remove the safety checking tag from a bare union, without forcing the entire build to turn off runtime safety checks. That’s how.

You appear to be concerned that this answer will lead someone who reads it to the wrong conclusions about what to do. So I’ll explain what’s going on here as best I can, and if I get anything wrong, you can correct it.

Zig has several kinds of unions, the one we’re discussing is a ‘bare’ union. This means that the different types inside the union are accessed using the field name. There’s an allocated area of memory which is big enough to hold any one variant of the union, but only one of them at a time.

Like with Zig structs (not extern or packed), a union of this type has no defined memory layout. This makes trying to access the wrong field/type of the union a serious mistake. C style unions, extern in Zig, do have a defined memory layout, so they’re used for type punning sometimes, although for turning an i64 into an f64 and back, Zig has @bitCast, which is what you should use.

If you tried that type pun on UnsafeUnion, in Fast/Small release modes, it might work today, it might not. But it might not work tomorrow. It’s undefined behavior, because the type has no defined layout in memory. So it’s really important that no one does this.

So important that, in safe builds, Zig adds a hidden tag to each instance of a union type. You can’t write code which discriminates on this union, like a switch, that’s what a tagged union is for. The tag is there, and if you access the wrong field of the union, you’ll get a runtime panic.

So the bare union is only to be used if the program you’re writing has some way to distinguish whether an instance of that union will be a specific member. As a toy example, you could allocate a slice of SafeUnion, and just declare that even numbered elements of the slice are .float, and odd numbered ones are .int.

Then you could do this:

for (union_slice, 0..) |u, i| {
    if (i % 2 == 0) {
        // do float stuff:
        _ = u.float;
    } else {
        _ = u.int;
    }
}

And everything is fine.

Thing is, with safety checking, that slice takes twice as much memory as it would without the hidden tag. Since the largest elements (both of them) are word-aligned, each address of an i64 or f64 needs to be align(8), so with the tag, the stride is 16.

Unlike many runtime safety checks, which you might not ever want to turn off, you’ll want to turn this one off eventually. Because if you didn’t want to, you could use a tagged union instead, and be able to switch on the enum, rather than have to track which field applies in some other way. It would be the same size, and more flexible: the slice could be any random mix of ints and floats, and the switch would discriminate correctly between them.

So this:

pub const UntaggedUnion = blk: {
    @setRuntimeSafety(false);
    break :blk union {
        int: i64,
        float: f64,
    };
};

Actually combines several of my favorite things about Zig: block scope with labeled breaks, fine-grained control of memory, and flexible, scope-limited runtime safety behavior.

I’m working on a VM right now, and I managed to squeeze all the opcodes into a tagged union which fits into a single machine word. But some of the “opcodes” are actually bitmasks, so they take up the entire word, and there’s no room for the tag in the enum. But because of how the VM functions, the instruction pointer is never pointed at a bitmask, they just live after instructions which know what to do with them, including advancing the instruction pointer to another opcode which has the tag.

Therefore I made a second, bare union: one field is an opcode, and the other is just a u64. As with the SafeUnion in the first post, this ends up with a stride of 16 bytes.

I’m porting this from my earlier draft in another language, and while I’m rewriting the core VM loop, I’ve kept the safety-checked mode on for the untagged union. That way, if code tries to access a bitmap, thinking it’s an opcode, I’ll get a crash instead of whatever happens when that memory is treated as an opcode (nothing good, surely).

But this will be bad for performance if I leave it that way, because it will double the code size, and the distance of each jump instruction, which is bad for cache locality. What I like about the construct above is that I’ll be able to turn off just the safety tag of the union, and get my properly-sized instructions without giving up on runtime panics of any other sort.

The TigerStyle talk was a good reminder to me that, just because you can turn off runtime safety in production, doesn’t mean you have to, or even should, necessarily. It seems wisest to flip it off carefully, a scope at a time, while benchmarking the result, and only keep the check-free blocks if there’s a clear performance boost. The compiler for the VM, for instance, can probably afford to be, say, 20% slower, if it means that buffer overruns will crash rather than escalate towards pwning the program.

But the core dispatch loop has to be as fast as it can be, and I definitely can’t afford to have the instructions be 16 bytes when the program only needs them to be 8.

So I certainly didn’t mean to give the impression that you should just switch off the safety check and do bad things to undefined memory, where you’ll have no idea what the program will do. Andrew’s right, don’t do that.

chung-leong · June 7, 2024, 11:08am

A big question here is whether runtime safety is an attribute of the union. If it’s not then using a type defined within a runtime-safe-off context outside that context will break the compiler.

nyc · June 7, 2024, 12:35pm

the block thing is a nice trick - i have not seen that before.

nyc · June 7, 2024, 12:37pm

extern is too limiting. i deal with this constantly where i just want a fixed, well defined layout but regular/auto structs don’t give that to you and extern has too much extra baggage. There needs to be a better solution besides the current offering.

it is basically struct coloring., and it is just as annoying with structs as with functions.

Sze · June 7, 2024, 4:26pm

For this use case, the restrictions of what you can put in a extern union or packed union make the program more complicated, reduce the usefulness of the data structure (user is disallowed to use types that should work) and/or require complicated casting that makes using unions unattractive.

To the point where I am likely to calculate my addresses manually instead of using any union, because they bring more restrictions than when I don’t use them.

Basically I would want a unsafe union, that has the same size as its biggest member, the alignment of the highest needed alignment, puts every member at the 0 offset. So I guess it would be considered as having a defined memory layout. (And if it was named differently and had safety checks, those either wouldn’t be intrinsically implemented, or would be at the end of the union)

So that you can have [*]UnsafeUnion, contain a bitmask as the first member (thus you know it is always the bitmask), then you can use that bitmask to be able to tell what is the length of the multi-item slice via @popCount of the set bits, and then by having specific bits mean specific types you can have arbitrary types as members of the slot, you check the bit, index into the multi-item pointer and access the correct active field.

The trouble only starts when you know that everything after the head slot is just data slots and you want to use a slice with the right alignment to provide a view into only these data slots:

const head:[*]UnsafeUnion = try allocAndInitialize(...);
// NOTE this only works for defined memory layouts which bare unions lack
// NOTE this also requires that @sizeOf(Data) == @sizeOf(UnsafeUnion)
const view:[]alignment(slot_alignment) Data = @ptrCast(@alignCast(head[1..getSizeFromMask(head)]));

With the theoretical UnsafeUnion that happens to work because the element is garanteed to be put at the 0 offset (it doesn’t have an unaligned stride) and the size of Data and the union is the same.

But with a safety checked union, the safety check gets aligned and my data is located after it. That ruins the ability to properly align the data within the slot or provide a view into it via a slice (basically we would need a slice with a stride).
And if turning of safety checks to get an unsafe union, is invalid, then unions are useless for this usecase.

Because this becomes easier (without any of the restrictions of packed or extern):

const head:[*]Head = try allocAndInitialize(...);
const data:[*]Data = calculateDataPointer(head); // basically advances forward and aligns by whatever amount is necessary
const view:[]Data = data[0..getDataSizeFromMask(head)];

So I think I will try to change my code towards using that.

chung-leong · June 7, 2024, 5:15pm

Runtime-safety is not an attribute of the type for i8:

const std = @import("std");

const UnsafeI8 = define: {
    @setRuntimeSafety(false);
    break :define i8;
};

pub fn main() void {
    var i: UnsafeI8 = undefined;
    var j: u8 = 255;
    _ = &j;
    i = @intCast(j);
    std.debug.print("{d}\n", .{i});
}

thread 268704 panic: integer cast truncated bits
/home/cleong/Desktop/test.zig:12:9: 0x103524f in main (test)
    i = @intCast(j);
        ^
/home/cleong/.zvm/0.13.0/lib/std/start.zig:514:22: 0x1034a59 in posixCallMainAndExit (test)
            root.main();
                     ^
/home/cleong/.zvm/0.13.0/lib/std/start.zig:266:5: 0x10345c1 in _start (test)
    asm volatile (switch (native_arch) {
    ^
???:?:?: 0x0 in ??? (???)
Aborted (core dumped)

I would like such usage supported personally.

Sze · June 7, 2024, 5:58pm

I think this answers the topic, you can’t disable safety as a property of the type.
(And have the compiler treat that type in an unsafe way everywhere)

I don’t know whether this is generally the case, but that is what @chung-leong’s code seems to hint at.

I think technically with bare unions the observed behavior is different, but the language disallows to treat bare unions as if their memory layout was defined.
Even when turning off safety seems to make the layout empirically predictable, it still is considered UndefinedBehavior by the language.

dimdin · June 7, 2024, 7:07pm

I think I am missing something.

I can understand the restrictions of extern union, but I don’t know any restrictions for packed union.

packed union have the size as its biggest member and puts every member at the 0 offset.
I don’t know about the alignment handling.

What I am understanding is that you have a slice and the stride is needed to move to the next element when the size of the data is smaller than the union.
If I am understanding correctly this works:

const std = @import("std");

pub fn main() void {
    const PU = packed union {
        foo: u32,
        bar: u8,
    };
    const pus: []const PU = &[2]PU{
        PU{ .foo = 1024 + 1 },
        PU{ .foo = 1024 + 2 },
    };
    std.debug.print("{d}, {d}", .{ pus[0].bar, pus[1].bar });
}

Displays 1, 2

I think the key phrase here is: the right alignment.
But how this differs from the above working example?

Sze · June 7, 2024, 8:14pm

I want to allow the user to use normal structs (if they choose), you can’t put a normal struct inside a packed union.

    const Data = struct {
        a: u16,
        b: u16,
    };
    const PU = packed union {
        foo: u32,
        data: Data,
    };
    const pus: []const PU = &[2]PU{
        PU{ .foo = 1024 + 1 },
        PU{ .data = .{ .a = 4, .b = 7 } },
    };
    std.debug.print("{d}, {d}", .{ pus[0].foo, pus[1].data });

Gives me:

packedunion.zig:10:15: error: packed unions cannot contain fields of type 'packedunion.main.Data'
        data: Data,
              ^~~~
packedunion.zig:10:15: note: only packed structs layout are allowed in packed types
packedunion.zig:4:18: note: struct declared here
    const Data = struct {
                 ^~~~~~

With packed union everything is packed, I want to give the user the choice whether their data is packed or a normal struct or a primitive type, or even an enum. The data is only as packed as the type they supply.

I also tried your extern union suggestion, instead with a packed union and using data: std.meta.Int(.unsigned, @bitSizeOf(T)), and then bit casting to it, but then I noticed that you can’t use @bitCast on an enum and instead have to use @intFromEnum, which makes the code more complicated.

I don’t think this code should require any bitcasts anyway, I don’t want to reinterpret any memory, I just want to assign the correct values to specific slots of specific sizes, I don’t really care about the internal layout of the user supplied type. They get a slot where they put their data and can read it back out again.

I shouldn’t have to type erase their data to be able to put the blob of data into my data field, but if I don’t type erase and use a union, then I am forced to know more about the user supplied type then I want to.

It does work, but only for certain allowed types. (sidenote: I don’t want to do type-punning at least not between different field types. I don’t know if using an zero overhead union and casting that to the type of the active field is also called type-punning, the way I see it, is that I want to strip away the union, which only works if it is of the same size)

With the packed union the stride isn’t a problem, because the size is well defined, but it doesn’t allow normal structs.

The union allows normal structs, but you don’t have a well defined size for the union, and you would need a slice with a stride to construct a view that only shows the data field. (It instead shows you the debug tag as if it were data, if you try to pointer cast it)

mnemnion · June 7, 2024, 11:57pm

So this definitely isn’t going to work with a bare union, because it has a size, but it doesn’t have a layout. So you won’t be able to get a single offset inside the union to have a consistent interpretation between the field types of the union. You should be able to make a Zig struct with a field (that would have a consistent offset) that also has another field which is a bare union, and use a tagging scheme on the offset-defined field to decide on the value of the union part of the struct.

Would you be able to write an example of the type you’re looking for in C? obviously not generic, but it isn’t clear to me the specific kind of memory-punning you’re trying to do. I did assume when you asked about disabling the safety check tag, that you had some external way of discriminating the union.

I don’t think that @chung-leong’s example is doing anything which usefully generalizes. Because for a union, you absolutely can disable the tag.

Run this (it won’t segfault):


// unsafe union
pub const UnsafeUnion = blk: {
    @setRuntimeSafety(false);
    break :blk union {
        int: i64,
        float: f64,
    };
};

const union_array: [2]UnsafeUnion = .{ UnsafeUnion{ .int = 64 }, UnsafeUnion{ .float = 1.5 } };

fn runtimePass() []const UnsafeUnion {
    return union_array[0..];
}

test "safe and unsafe" {
    std.debug.print("\nsize of SafeUnion is {}\n", .{@sizeOf(SafeUnion)});
    std.debug.print("size of UnsafeUnion is {}\n", .{@sizeOf(UnsafeUnion)});
    std.debug.print("size of array of UnsafeUnion is {}\n", .{@sizeOf([2]UnsafeUnion)});
    const union_int = UnsafeUnion{ .int = 64 };
    std.debug.print("value of union_int is {}\n", .{union_int.int});
    const union_float = UnsafeUnion{ .float = 3.14159 };
    std.debug.print("value of union_float is {}\n", .{union_float.float});
    const union_slice = runtimePass();
    // this is VERY ILLEGAL and is meant to illustrate the actual behavior
    // I DISAVOW
    std.debug.print("Does a bad thing happen? {}\n", .{union_slice[1].int});
}

The important question for me is this: say we have a bare union just like from the example. This is my question: if a type like this is used only correctly in safety-checked blocks, is it guaranteed to be correct? Or is the whole program undefined?

Because the union type evidently doesn’t have the tag. That much is very clear. As long as the compiler knows that, and doesn’t write safety checks it can’t make, then this is ok.

So just empirically, we’ve confirmed that: the block removes the tag, and we get the “expected” undefined behavior, in a block which is safety checked. If you try this with the SafeUnion variant, it will panic. With UnsafeUnion, it just prints random stuff (from the perspective of the standard).

But is this property guaranteed to hold for any union defined this way, which is used properly: that is, always defined and accessed through the same field? I know that it risks invoking undefined behavior, the big question for me is whether it guarantees the expected behavior given that the type invariant is upheld.

It seems to me like it has to work that way, as in, the future standard would have to forbid ever getting this wrong. Otherwise block level runtime safety isn’t well-defined, or, it would need to be impossible to disable safety checking of bare unions. In which case, as I said in my first post, why have them? If the tag is mandatory you may as well be able to use it.

I’m glad @chung-leong tried that example, but I wouldn’t expect it to work, because it doesn’t change the type. Unions might be unique here in that the physical type of a runtime-checked union is literally different, so (as we’ve seen) it is possible to compile it in a context which isn’t safety checked, and then use it in one which is.

Wrong Union Field Access sure looks to me like the only case where the runtime safety setting active at type creation can follow the type around.

I’m intending to build an entire program on this premise it matters to me if what I’m trying to do is well defined, provided that use of the union is correct.

nyc · June 8, 2024, 12:03am

how does that work? To get a consistent offset, you need to make it extern, then you need to have an extern union, and back to where you can’t have arbitrary structs inside it?

mnemnion · June 8, 2024, 12:28am

I should have said a defined offset, you’re right. It doesn’t need to be consistent in this case, as long as it refers to a well-defined part of memory. Which actual offset that is could change between version of the compiler, or build modes, or what have you.

The idea is that a struct with an internal union will have at least one field which points to consistent memory, and that can be e.g. pointer tagged to discriminate the inner union.

Or it could be a usize-backed packed struct (inside the struct) where the tag is an enum, that has a receiver method which casts it to the correct pointer when it’s needed, masking off the tag bits in some fashion. The VM I’m working on now doesn’t need this, but I’ll need something like that down the line, so I’ve been experimenting a bit with how to make it work.