Custom errors with payloads (with current zig)

Sze · December 18, 2023, 9:08am

Somewhere I read stuff about users wanting errors with payloads, then I read stuff about zigs destructuring syntax for tuples and I began wondering whether using destructuring syntax could help in creating custom errors with payloads. I nerd-sniped myself
So I began an exploration (procrastination) journey…

Here is what I came up with, custom errors are structs like this:

const UnexpectedToken = struct {
    expected: Token,
    got: Token,
    location: Location,

    pub fn init(expected: Token, got: Token, location: Location) @This() {
        return .{ .expected = expected, .got = got, .location = location };
    }
    pub inline fn err(_: @This()) !void {
        return error.UnexpectedToken;
    }
    pub fn format(
        self: UnexpectedToken,
        comptime fmt: []const u8,
        options: std.fmt.FormatOptions,
        writer: anytype,
    ) !void {
        _ = fmt;
        _ = options;
        try writer.writeAll("UnexpectedToken\n");
        try writer.print("    expected: {}\n", .{self.expected});
        try writer.print("    got: {}\n", .{self.got});
        try writer.print("    location: {}\n", .{self.location});
    }
};

The only really essential method is err the rest is just fluff to make initialization/formatting nicer.

How do you write a function returning custom errors?

const payload = customerrors.Payload(.{});
const TokenOrFile = customerrors.Union(.{ UnexpectedToken, FileFetchError });
const NumbersErrors = customerrors.Union(.{ customerrors.AllocationError, TokenOrFile });
const Numbers = payload.Error(*Node, NumbersErrors);
fn parseNumbers(self: *Parser) Numbers.res {
    const choice = self.rng.random().uintLessThan(u16, 10);
    if (choice == 0) {
        return Numbers.fail(UnexpectedToken.init(.NUMBER, .SOMETHING_NOT_ALLOWED_IN_PARENS, .{
            .file = fakefile,
            .line = 10,
            .column = 33,
        }));
    } else if (choice == 1) {
        return Numbers.fail(FileFetchError{ .file = fakefile, .pos = 42 });
    } else {
        var node = self.randomNodeAllocFail() catch {
            return Numbers.fail(customerrors.AllocationError{ .src = @src() });
        };
        node.data = self.rng.random().uintLessThan(u16, 1000);
        return Numbers.success(node);
    }
}

Here customerrors.Union combines N different errors into a union so we can return any of them, we have to do this explicitly because we don’t have a way to infer it, but we can combine unions into bigger unions (they get flattened).

payload.Error(*Node, NumbersErrors) defines that we have a payload *Node and a custom error (union) NumbersErrors. Numbers is a struct with functions that are the interface used for writing functions with custom errors, res is the result type which is always a 2 element tuple (kind of like in go, but try makes it nicer?) where the first is the result of the function (only defined if successful) and the second is an optional of the custom-error type, simplified you can say that success returns .{value, null} and fail returns .{undefined, custom-error} however fail constructs the union instance based on the given type automatically and can “disable” custom errors (e.g. based on build flags) always returning OpaqueError which doesn’t have more information than zig error codes.

How do you use a function?

fn parse(self: *Parser) !void {
    const node1 = try payload.unwrap(self.parseNumbers());

    const node2, const err = self.parseNumbers();
    try payload.check(err);

    const node3, const err2 = self.parseNumbers();
    payload.custom(err2) catch |e| {
        std.debug.print("the custom error was:\n{}\n", .{err2.?});
        std.debug.print("the error code is: {}\n", .{e});
        std.debug.dumpCurrentStackTrace(null); // TODO better stack trace
        return e;
    };

    const node4, const err3 = self.parseNumbers();
    if (err3) |custom_error| {
        std.debug.print("using if on the optional:\n{}\n", .{custom_error});
        std.debug.dumpCurrentStackTrace(null); // TODO better stack trace
        return custom_error.err();
    }

    // use nodes
    std.debug.print("{} {} {} {}\n", .{ node1, node2, node3, node4 });
}

Here payload.unwrap gets the 2-tuple returned from parseNumbers and returns its success value if its successful and on fail it prints the custom error and fails with the zig error code, so unwrap uses the custom error information in a predefined way (it might make sense to have a customization function for this) and converts it to a zig error.

payload.check(err) here err is the custom error and the function prints the custom error and fails with the zig error on failure, otherwise it does nothing.

payload.custom(err2) fails with the zig error without printing anything on failure, otherwise does nothing.

And you also can use if on the optional custom error value.

There are a bunch of things that could be explored further here:

when custom errors are “disabled” how much of the code really disappears from what gets compiled into the program? I haven’t looked into this…
in functions that use custom errors can we capture stack trace information properly and print one unified stack trace that looks similar to a zig error just with more information?
we are converting custom errors to zig errors, maybe the other way around also makes sense sometimes?
what about resources being associated with errors?
should the generated union fields have better names?
switching on the custom error union would be better with predictable field names!?!

Here is my sketch of the idea:

This topic is about what can be done with current zig, however here is a link to an issue about adding a feature to the language: Allow returning a value with an error · Issue #2647 · ziglang/zig · GitHub

I currently don’t have a usecase for errors with payloads (maybe in the future when I revisit my interpreter). Do you use custom errors of some kind? What are your use cases? What features do you miss? Is this useful?

If this idea is useful, maybe we can work together on creating a polished version of this, that can be used as a library.

Tosti · December 18, 2023, 5:10pm

Could you explain why have you decided to use go/odin-like 2-tuples instead of making actual payload a part of the union? Making payload a part of the union prevents from accidentally using it without error checking. This is the case for built-in error unions.

AndrewCodeDev · December 18, 2023, 7:04pm

@Tosti I had the same question - maybe partial success is an option here? You could have another union member representing that too, though.

@Sze The only case I’ve personally worked on had this functionality was with sending network requests (my experience is limited here). I wanted information about why something failed but that had to come from an outside source. I can see the benefit if you do not have all the information about what went wrong locally and you need to further handle things on your end based on that information. What I’m describing here is more like a response, though. Theoretically, you could have a unique integer identifying just about every situation you can think of, but the combinatorics of that can be awful if you have the potential for multiple independent errors.

On a more language level note, it depends on how you interpret the word “error” as a construct in the language. Does it have to require the use of a keyword or can a struct with a string and a bool do the job? If you program a system around that, you can certainly treat those like errors. Maybe what people are more worried about here is that if it’s not a fundamental part of the language (like a keyword) then people won’t use them?

nurpax · December 18, 2023, 7:38pm

A usecase that I wished for error payload was when parsing JSON. The error now doesn’t contain any context on where and why parsing failed. (There is a way to accomplish the same ny passing around a parsing context, but it’s not as handy as the error containing this info.)

Tosti · December 18, 2023, 7:39pm

If it proofs to be useful and many start to use it, everyone will come up with their own solution. One project will have its own generic to store errors with custom payloads, another project – its own. It will be a pain to build an interface between the two.

AndrewCodeDev · December 18, 2023, 7:56pm

That definitely happens with the C-family of languages. People throw everything including the kitchen sink. You have to catch ints, chars, basically whatever someone decided to throw that day… especially in older code. It was bad and I’ve seen youtube tutorials promoting this same idea.

Sze · December 18, 2023, 9:08pm

I think part of the answer is that I haven’t thought that much about it, it felt like there might be an idea here worth exploring, but I couldn’t tell what the details would look like by trying to imagine all the implications, so I decided to explore it with code.
Another part is probably that I had seen this odin vs zig video before:

Does it? To me this seems similar:

const node_tuple = self.parseNumbers();
// have to pry open node_tuple to get to data

const node_result =  self.parseNumbersMadeUpUnionVersion();
// have to pry open node_result to get to data

I think the one benefit of the latter might be if you always unpack the thing with switch statements. The things I like about the former are that you can easily say what is the result and what is the error, instead of the result just being one of the n cases. Also I like the idea of potentially having many different functions being able to use the same type for the error part.

Other things I wonder but haven’t actually looked into are:

does the compiler do clever things for tuples being returned from functions, like treat them as if they were independent output parameters and optimize them individually, if that may be helpful?
what happens when I have a really big error type but a tiny success result or the other way around, can the tuple be optimized better?
if I “disable” these custom errors (reduce them to wrappers of zig errors the example has a flag -Dcustomerrors=false) can I get rid of the overhead of those errors, or close to it? Would this be more difficult if it was all one union, or just different?
are there cases where you can write error handling functions that can be used for multiple functions that return that error type (where having the success result be mixed into the union would prevent reuse?)

Another thing, combining error unions makes sense to me, but how do I combine two things where one of the values isn’t an error but the success thing. Then when I want to convert back to the zig error I need to have some kind of convention / flag, that tells me how to get the error code or succeed. I think in a way you could argue that putting everything in one union is worse, because it puts everything in the same code path, instead of putting special attention on what you are doing with the error, but I am not sure if that is a strong argument, on the flip side you could say, that the switch at least complains about unhandled cases and if you use it for both data and error, you are more likely to handle the error. So I don’t know.

Seems to me without language support for these errors, you can’t really prevent someone from forgetting about an error (sure there are unused variable errors, but if someones lsp just fixups away that error, you could forget about handling it).

Another thing I wanted to explore eventually is whether you can build up more complex error types as you go up the callstack and collect more information, for that it seemed it might be easier to take apart and reassemble that 2-tuple than having to deal with something more “complex”/structured.

But I also kind of like the pattern shown in the github issue, with just creating a struct on the callsite and passing a reference to it via a config argument, sure it seems very adhoc, but I also like its simplicity.

srikumarks · June 27, 2025, 5:41am

Came here since I felt that errors with attached data would be natural in some code I was writing. However, there are perhaps “closer to the metal” concerns that make this not so straightforward.

For starters, union(enum) is essentially a full on sum type and what we’re seeking here is to make errors also a such a sum type instead of a set of integer error codes.

One way we can accomplish this currently is to keep a global union(enum) type value in a module and store information in that union whenever we’re returning an error from within a module’s function. Any function trapping the error can then lookup more details from that associated union(enum). This brings up a key consideration for the error propagation process – the error value MUST be stack only and by-copy to avoid a lot of complexity that will otherwise arise with allocators. For example, if fnA calls fnB which calls fnC and fnB sets up an arena allocator for use by fnC, then fnA can’t receive any allocated error data from fnC … unless the error data is entirely on the stack and by-copy and not allocated using an allocator. This is also why error as a set of integer codes works very well.

So would having a restricted kind of union(enum) for error types (stack-only + by-copy) still be useful enough? I’m not so sure it will be. So first reaction might be to ask people to use some passed-down or global context (such as the global union(enum) mentioned earlier) into which additional error data can be placed.

If people are going to do a lot of that, then having some language around it along with some compiler checks to enforce the stack-only + by-copy constraint on the union(enum) type might be a useful guard against common memory allocation mistakes.

PS: For the record, I think the approach to errors in zig is brilliant and ergonomical w.r.t. the language, and I’d expect more close-to-the-metal languages to start adopting it at some point.

PPS: The “global union(enum)” suggested above was for illustration. It won’t be a thread-safe approach. Passing down a context would be better for thread safety - as long as the “by copy” constraint is adhered to for error data.