What makes "ban returning pointer to stack memory" difficult?

kj4tmp · April 1, 2025, 4:27am

I recall reading somewhere from the zig core team that making returning pointers to stack memory (“dangling pointers”) a compile error was “difficult from a language specification perspective.”

Could someone elaborate more?
Are there legitimate use-cases for returning pointers to stack memory that would be difficult to carve-out space for in the language specification? Could this not be solved by requiring @intFromPtr to return them?

andrewrk · April 1, 2025, 5:46am

fn get() *i32 {
    var x: i32 = 0;
    return &x;
}
fn illegal() void {
    get().* = 42;
}

Here, calling illegal is always a bug. In fact we can reduce it to:

fn illegal() void {
    unreachable;
}

At that point, why even have a function? It’s just the construct unreachable.

It’s not a compile error to have unreachable in your code. It’s a runtime bug to reach unreachable code.

tensorush · April 1, 2025, 6:40am

I think @kj4tmp is asking about how difficult would it be to additionally analyze bodies of functions that return pointers, like fn get() *i32, such that analyzing functions that return pointers to stack memory would through a compile error.

For instance, analyzing either of these functions would be a compile error:

fn get() *i32 {
    var x: i32 = 0;
    return &x;
}

fn get2() *i32 {
    var x: i32 = 0;
    const ptr = &x;
    return ptr;
}

While analyzing any of these would still be fine:

const std = @import("std");

const X: i32 = 0;

fn getStatic() *const i32 {
    return &X;
}

fn getAlloc() !*i32 {
    const x_ptr = try std.heap.page_allocator.create(i32);
    return x_ptr;
}

fn getAlloc2() !**i32 {
    var x_ptr = try std.heap.page_allocator.create(i32);
    const ptr = &x_ptr;
    return ptr;
}

UPD: My bad, getAlloc2 is invalid! This only makes it easier to detect stack pointers, though.

yataro · April 1, 2025, 7:31am

fn getAlloc2() !**i32 {
    var x_ptr = try std.heap.page_allocator.create(i32);
    const ptr = &x_ptr;
    return ptr;
}

I don’t think that getAlloc2 is fine, it returns pointer to stack-allocated pointer x_ptr, isn’t it?

tensorush · April 1, 2025, 7:41am

Yeah, I would ban that case as well. But the Sema’s logic might be that since we’re returning a pointer to a pointer to i32, and the returned i32 value isn’t on stack, then we can make it a valid case, as it is now:

const std = @import("std");

fn getAlloc2() !**i32 {
    var x_ptr = try std.heap.page_allocator.create(i32);
    x_ptr.* = 42;
    const ptr = &x_ptr;
    return ptr;
}

pub fn main() !void {
    std.debug.print("{d}\n", .{(try getAlloc2()).*.*});
}

tensorush · April 1, 2025, 7:50am

Actually, not sure. It may be wrong even now

Zen · April 1, 2025, 8:01am

const ptr = &x_ptr;
return ptr;

Wrong ! It will be invalid after return(it is in stack). How about returning *x_ptr(which should be in heap).

Regards!

yataro · April 1, 2025, 8:14am

Assuming that is a valid case, then

fn get() *i32 {
    var x: i32 = 0;
    return &x;
}

is also a valid case, with the same logic applied. But object (pointer in this case) should be placed somewhere? Where this pointer is stored? Your example crashes on ReleaseSafe. I think it reads leftover value from stack. I don’t think sema really knows if pointer points to heap allocated object.

Edited your example slightly for easier debugging:

It indeed returns pointer to stack memory:

lea     rcx, [rbp - 32]      ; get address of `x_ptr` and write it to `rcx` (`ptr`)
mov     qword ptr [rbx], rcx ; put resulting address of stack allocated `ptr` to `rbx`

EDIT: posted old godbolt link by accident

tensorush · April 1, 2025, 8:19am

Thanks, my bad! I’ve updated the post above.

mnemnion · April 1, 2025, 3:18pm

Here’s a minimal example of where the problem lies.

fn legalToPassTo(ptr: *const Something) Blah {
    var a_blah: Blah = .initial;
    a_bla.ptr = ptr;
    return a_blah;
}

fn legalToPassDown() Blah {
    const a_something: Something = .initial;
    const a_ptr = &a_something;
    return legalToPassTo(a_ptr);
}

Both of these functions are doing perfectly legal things, on their own. Passing a pointer down-stack is perfectly ok, taking a pointer and returning a struct incorporating it is also a-ok. Returning a value directly from a called function? Absolutely normal behavior.

This being the simplest possible example, is already far more difficult to catch than it may initially look like:

fn legalToPassTo2(ptr: *Something) Blah {
    var a_blah: Blah = .initial;
    a_bla.ptr = could_be_anything.maybeClone(ptr);
    return a_blah;
}

Some of you know the punchline: this is lifetime analysis.

kj4tmp · April 1, 2025, 5:09pm

It’s seems to my untrained eye that if the problems solved by Proposal: restricted function types · Issue #23367 · ziglang/zig · GitHub are actually solved (the compiler knowing the full function call graph), then the compiler could also know the lifetimes of all stack allocated memory, and thus detect pointers to invalid stack allocated memory.

Consider the example of an acyclic function call graph. The compiler would only need to check that a given stack variable pointer does not traverse upwards from its point of creation? I’m guessing it looks a bit like reference counting.

kj4tmp · April 1, 2025, 6:54pm

If I understand you correctly, you are saying the reason that use-after-stack-lifetime-expired can not be a compile error is similar to the reason that std.debug.assert(false) cannot be a compile error?

In other words, the (unwritten) language specification currently catagorizes use-after-stack-lifetime-expired as undefined behavior?

mnemnion · April 1, 2025, 6:54pm

// A global:
var must_copy: bool = false;

fn legalToPassTo(ptr: *const Something) Blah {
    var a_blah: Blah = .initial;
    // A bug:
    if (must_copy) 
        a_blah.ptr = ptr;
    else
        a_blah.ptr = ptr.clone();
    return a_blah;
}

fn legalToPassDown() Blah {
    const a_something: Something = .initial;
    const a_ptr = &a_something;
    // doing everything right down here
    must_copy = true;
    const a_blah = legalToPassTo(a_ptr);
    // gotta clean up!
    must_copy = false;
    return a_blah;
}

Demonstrating that this is, in fact, subject to the Halting Problem, is left as an exercise for the interested reader.

kj4tmp · April 1, 2025, 7:27pm

This situation could be caught by an eager analysis, similar to how “unused function parameter” is currently analyzed? The compiler need only prove that it is possible based on the AST? Maybe that would be annoying due to false positives.

pachde · April 1, 2025, 7:45pm

No pointer provenance survives the FFI boundary or bitcasting to some integer.

Dok8tavo · April 1, 2025, 7:46pm

No, because this works:

fn f(param: u8) void {
    if (false)
        use(param);
}

The param can still actually be “unused at runtime”, and similarily, you could still leak a stack pointer at runtime with this approach.

chung-leong · April 1, 2025, 8:14pm

On a more practical front, how hard would it be fill the stack space used by a function with 0xAA on function exit?

Chaz · April 1, 2025, 8:54pm

Seems like there’s a difference between using &x when x is on the stack and when x is not on the stack. I don’t normally suggest language features, but how awful would it be if we had to write it as @someNewTypeOfCast(&x) before using it as a pointer, even for calling some x.func()?

I know it’s functionally just as easy to make a mistake, but with this I could audit them, and maybe along the way I put a few more things into an arena and off the stack. Even a FixedBufferAllocator-backed arena that uses stack memory reduces the number of places to look for this error.

andrewrk · April 1, 2025, 8:57pm

btw there is a plan to catch this at runtime

LucasSantos91 · April 2, 2025, 4:54am

I’m with @kj4tmp , it’s not as hard as people make it out. Gcc and clang’s static analyzer catch these very easily.
Catching every possible situation is unsolvable, as has been pointed out, like due to external functions. But most bugs of this type are not complicated. We could catch the vast majority of them with interprocedural analysis, though it could be costly. But I think even a naive approach, like just looking at the AST, would already catch a whole lot of bugs, while being cheap.
There are no false positives with the AST approach (though it would obviously have a lot of false negatives). And we could just design the interprocedural analysis to not have false positives, that is, do not complain unless you can prove with absolute certainty there is a dangling pointer (once again, such a high bar would mean a lot of false negatives).