The example in the post is using the “direct mode” of std.compress.zstd.Decompress (i.e. giving it a zero-length buffer), which actually has much more unsafety than the post details (the issue is about flate, but it applies to zstd as well):
Any usage of Decompress in “direct mode” that calls into vtable.rebase/discard/readVec is guaranteed to trigger illegal behavior.
See this comment for context. As I commented here, I think there is room for improvement in the documentation surrounding this, but there are also some inherent problems with this use case that will hopefully get some attention soon.
I am confused about the point of that blog post.
This isn’t an issue with the new interfaces.
This is an issue with implementations having potentially complex buffer requirements.
It’s an issue that exists regardless of the new or old API, even if you avoid reader/writer entirely.
You can see that in the old implementation there were multiple buffers; the new interfaces having buffers, plus being able to impose requirements on those buffers, seems to simplify the implementation.
The difference is now it’s exposed to the caller, as the caller has control over the buffer, and is also able to take advantage of its existence for its own purposes.
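For example (a sketch with assumed sizes, against the current 0.15.x std.compress.zstd/std.Io.Reader shape), the caller can size the buffer once and then also borrow from it directly for its own purposes:

```zig
const std = @import("std");

// Sketch: the caller owns the decompressor's buffer, so besides satisfying the
// decompressor's requirements it can also use that buffer itself, e.g. peeking
// at decompressed bytes without copying them anywhere else.
fn inspectStart(input: *std.Io.Reader) !void {
    // Size is an assumption; it has to meet whatever minimum Decompress documents.
    var window: [1 << 20]u8 = undefined;
    var decompress = std.compress.zstd.Decompress.init(input, &window, .{});

    // `peek` returns a slice that borrows directly from `window`.
    const first = try decompress.reader.peek(16);
    _ = first; // inspect magic bytes, sniff a format, etc.
}
```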
I don’t think that’s right. If you take the original snippet:
```zig
fn output(r: *std.Io.Reader) !void {
    const stdout = std.fs.File.stdout();
    var buffer: [???]u8 = undefined;
    var writer = stdout.writer(&buffer);
    _ = try r.stream(&writer.interface, .unlimited);
    try writer.interface.flush();
}
```
What’s the correct buffer size to use? I believe the answer is: it depends. But the issue, and the point of the post, is that there are cases where you might not be able to know.
Possibly this was an issue with the old API too (got an example?), but that doesn’t change the fact that it’s an issue with the new one as well.
Do you think something like this doc comment on zstd.Decompress.init would help?
```zig
/// `buffer` fulfilling the requirements supersedes/obsoletes the requirements on
/// the `Writer.buffer`, but may introduce redundant memcpy calls. This can be useful
/// when you do not know/control the buffer capacity of the `Writer`.
```
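Concretely, what that comment is getting at is something like this (a sketch; the sizes and the exact init shape are assumptions based on the current std):

```zig
const std = @import("std");

// Sketch: give Decompress a buffer that satisfies its own requirements, so the
// downstream Writer's buffer capacity only affects performance, not correctness.
fn decompressToStdout(input: *std.Io.Reader) !void {
    // Size is an assumption; the real minimum is whatever Decompress.init
    // documents, and in practice it may need to be larger or heap-allocated.
    var window: [1 << 20]u8 = undefined;
    var decompress = std.compress.zstd.Decompress.init(input, &window, .{});

    var out_buf: [256]u8 = undefined; // size is now just a throughput knob
    var writer = std.fs.File.stdout().writer(&out_buf);
    _ = try decompress.reader.stream(&writer.interface, .unlimited);
    try writer.interface.flush();
}
```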
In the old API, this “direct” mode wasn’t supported (but, as mentioned in my last comment, there are still a lot of limitations around it currently that are unresolved/unresolvable).
The file reader/writer implementation has almost no requirements; some fallback operations (like when the OS doesn’t have a sendfile syscall) need a non-zero-sized buffer. The rest of the requirements come from your use case and the data you’re working with, which is something only you can know; the implementations can’t. This applies to pretty much all implementations.
With the old API, the implementations had their own buffer(s) if they needed them, and BufferedReader/Writer was usually used with a default size of 4096 due to the convenience functions.
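For reference, the old-API pattern being described looked roughly like this (from memory, against the 0.14-era std, so treat it as approximate):

```zig
const std = @import("std");

// Old-API sketch (pre-0.15): callers typically wrapped the source in
// std.io.bufferedReader (4096-byte default), while the decompressor owned all
// of its own internal buffers.
fn oldStyle(file: std.fs.File) void {
    var buffered = std.io.bufferedReader(file.reader());
    const reader = buffered.reader();
    _ = reader; // a decompressor would wrap `reader` and carry its own state
}
```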
An example. The new zstd decompressor state:
```zig
input: *Reader,
reader: Reader,
state: State,
verify_checksum: bool,
window_len: u32,
err: ?Error = null,
```
The old state:
```zig
source: std.io.CountingReader(ReaderType),
state: enum { NewFrame, InFrame, LastBlock },
decode_state: decompress.block.DecodeState,
frame_context: decompress.FrameContext,
buffer: WindowBuffer,
literal_fse_buffer: [table_size_max.literal]types.compressed_block.Table.Fse,
match_fse_buffer: [table_size_max.match]types.compressed_block.Table.Fse,
offset_fse_buffer: [table_size_max.offset]types.compressed_block.Table.Fse,
literals_buffer: [types.block_size_max]u8,
sequence_buffer: [types.block_size_max]u8,
verify_checksum: bool,
checksum: ?u32,
current_frame_decompressed_size: usize,
```
Some of those types are also a lot of data, most of which is replaced by the buffer in the new interfaces.
The correct buffer size is almost entirely a matter of performance; most implementations, both old and new, work perfectly fine with no buffer.
I feel like I tried passing a buffer, but then it did the same thing in reverse - it placed a buffering requirement on the input source. So the buffer size of the original reader (say a file) now becomes dependent on an unknown. However, I had/have a lot of problems doing this - I keep getting SIGILL (0.16.0-dev.253+4314c9653 and 0.15.1), so maybe I’m wrong and it would solve it.
The copy case would be additive, right? If you have a chain of transforms and they all opt for this, then it’s a potential extra copy at each step.
Also, it isn’t a global solution. It’s specific to flate/zstd. Other readers that impose a buffer requirement on writers wouldn’t necessarily offer the same tradeoff. So for the original snippet of trying to output a Reader, the correct size of the buffer is still impossible to know, because you don’t know whether the Reader has its own “you don’t need to supply your own buffer” mode.
I don’t think this is always true.
I meant something only you could know: the implementation has no idea how you’re using it or with what data, so it can’t document or assert requirements specific to your use case.
It shouldn’t. IMO a lot of combinations are undertested currently though, and in my experience it can be hard to distinguish between “holding it wrong” and a bug in some part of the implementation.
Yep, and those copies were unavoidable with the old API.
The intention is for the implementation to document this:
The implementation chooses:
- What the minimum buffer capacity is asserted to be (perhaps zero)
- Under what conditions an undersized buffer capacity can cause runtime failures (as opposed to assertions)
The implementation documents these things carefully, and asserts as early as possible.
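As a hypothetical example (all names made up), an implementation following that pattern might look like:

```zig
const std = @import("std");

// Hypothetical implementation: the minimum capacity is documented next to
// `init` and asserted immediately, instead of failing somewhere downstream.
const MyDecoder = struct {
    buffer: []u8,

    /// Documented requirement: `buffer.len` is either 0 ("direct mode") or at
    /// least `min_buffer_len`.
    pub const min_buffer_len = 4096;

    pub fn init(buffer: []u8) MyDecoder {
        std.debug.assert(buffer.len == 0 or buffer.len >= min_buffer_len);
        return .{ .buffer = buffer };
    }
};
```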
The API user chooses:
- How to obtain that buffer (stack, static, heap), if any
- How big to make it
Note that buffers are edges, not nodes, in a stream pipeline. That means when an API user is choosing a buffer, they have to manage the expectations of two stream implementations: one on either side of the buffer.
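In the file -> zstd -> stdout case, that means each buffer below is sized with both of its neighbors in mind (a sketch; sizes are assumptions):

```zig
const std = @import("std");

// Sketch of the pipeline; every buffer is an edge between two implementations:
//   file --[read_buf]--> zstd.Decompress --[window_buf]--> stream() --[out_buf]--> stdout
fn pipeline(file: std.fs.File) !void {
    // Edge: file reader <-> decompressor input.
    var read_buf: [4096]u8 = undefined;
    var file_reader = file.reader(&read_buf);

    // Edge: decompressor <-> consumer; also has to satisfy the decompressor's
    // own documented minimum (size assumed here, may need to be larger).
    var window_buf: [1 << 20]u8 = undefined;
    var decompress = std.compress.zstd.Decompress.init(&file_reader.interface, &window_buf, .{});

    // Edge: stream source <-> stdout writer.
    var out_buf: [4096]u8 = undefined;
    var writer = std.fs.File.stdout().writer(&out_buf);
    _ = try decompress.reader.stream(&writer.interface, .unlimited);
    try writer.interface.flush();
}
```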
It certainly doesn’t help that I’m having to link to random issue comments for all this stuff.