Unsafe fields #3458

jhpratt · 2023-07-13T04:45:33Z

Co-authored by @jswrenn.

Fixes #381

Tracking:

Tracking issue for RFC 3458: Unsafe fields rust#132922

juntyr · 2023-07-13T04:59:58Z

text/0000-unsafe-fields.md

+
+// Unsafe field initialization requires an `unsafe` block.
+// Safety: `unsafe_field` is odd.
+let mut foo = unsafe {


I’m not sure I like that the entire struct expression is now inside an unsafe block (though I’m not sure what a better syntax would be). If the safety invariant requires the entire struct, this makes sense. However, if it is more specific, a larger unsafe block is too broad as it also allows unsafe code usage to initialise the safe fields, which should get their own unsafe blocks. Though perhaps this could just be linted against, e.g. don’t use unsafe expressions in a struct initialiser without putting them inside nested unsafe blocks.

The alternative is to do this:

let mut foo = Foo { safe_field: 0, unsafe_field: unsafe { 1 }, };

That feels worse to me.

Presumably the safety invariant being promised is an invariant within the struct, even if it is not strictly required that that be the case.

Ideally we'd also have partial initialization, such that it would be possible to do

let mut foo = Foo { safe_field: 0 }; unsafe { foo.unsafe_field = 1; }

but I expect that's quite a bit farther away.

I agree that just unsafe in one field initialiser expression is insufficient as it means something very different. I do like however that it clearly shows which field the unsafety applies to.

It reminds me a bit of unsafe blocks in unsafe functions (#2585). Ideally, the fact that struct init is unsafe would not allow unsafe field init expressions. I doubt that special-casing struct inits to not propagate outer unsafe blocks would be backwards compatible, so a new lint would be the only avenue in this direction.

Some new syntax like this

let mut foo = unsafe Foo { safe_field: 0, unsafe unsafe_field: 1, };

would communicate intent better but looks quite unnatural to me.

Perhaps in the future there might be explicit safe blocks to reassert a safe context within an unsafe block, so that it could be written as follows:

let mut foo = unsafe { Foo { safe_field: safe { /* some more complex expr here */ }, unsafe_field: safe { /* some more complex expr here */ }, } };

which could be encouraged with clippy lints. Though for now just moving safe non-trivial struct field initialisers into variables that are initialised outside the unsafe block.

Overall, I think going with the original syntax of

let mut foo = unsafe { Foo { safe_field: 2, unsafe_field: 1, } };

looks like a good and intuitive solution, though I still think that a lint against using unsafe expressions inside the struct expression without a nested unsafe block would still help.

Maybe this?

let mut foo = unsafe Foo { safe_field: 0, unsafe_field: 1, };

unsafe would be followed immediately by a struct expression, and doing so would only indicate that unsafe fields may be initialized. It would not introduce an unsafe context.

I like it - the syntax is still close enough to the unsafe block syntax (except when initialising tuple structs) that it's relatively intuitive what the unsafety applies to, but coupled strongly to the struct name so that it also makes sense why no unsafe context is introduced.

I think there's something nice about

let mut foo = unsafe Foo { safe_field: 0, unsafe_field: 1, };

but to me that would go with unsafe struct Foo, like how unsafe trait Bar leads to unsafe impl Bar.

Spitballing: if per-field safety is really needed, then maybe

let mut foo = Foo { safe_field: 0, unsafe unsafe_field: 1, };

though maybe the answer is the same as for unsafe { unsafe_fn_call(complicated_expr) }: if you don't want the expr in the block, use a let.

After all, there's always field shorthand available, so you can do

let safe_field = unsafe { complex to construct }; let unsafe_field = easy and safe to construct; unsafe { Foo { safe_field, unsafe_field } }

Perhaps the more important factor would be the workflow for the errors the programmer gets when changing a (internal and thus not semver break) field to unsafe. Anything at the struct level wouldn't give new "hey, you need unsafe here" if you already had another field that was unsafe.

Of course, if the safety is at the struct level (not the field level) then that doesn't come up.

Hmm, the "you added an additional requirement to something that's already in an unsafe block and there's no way to help you make sure you handed that" is a pre-existing problem that needs tooling for that everywhere, so maybe it's not something this RFC needs to think about.

If we had unsafe(aligned(a), nonzero(b)) { ... } so that tooling could help ensure that people thought about everything declared in the safety section, then we'd just have unsafe(initialized(ptr, len)) { ... } so that it acknowledged the discharge of the obligation for the type invariant, and the "I need 100 unsafe blocks" problem goes away.

algesten · 2023-07-13T06:34:05Z

By introducing unsafe fields, Rust can improve the situation where a field that is otherwise safe is used as a safety invariant.

I think the motivation could point out who would benefit from this.

I assume it's library authors a making unsafe fields a "reminder to self" about upholding some invariant as opposed to say expecting a unsafe field in an API surface. I.e we still expect unsafe set_len rather than pub unsafe len, right?

programmerjake · 2023-07-13T07:25:54Z

this seems closely related to mut(self) fields #3323 which should probably be mentioned.

jhpratt · 2023-07-13T07:43:11Z

I assume it's library authors a making unsafe fields a "reminder to self" about upholding some invariant as opposed to say expecting a unsafe field in an API surface. I.e we still expect unsafe set_len rather than pub unsafe len, right?

Not necessarily. It is entirely reasonable that Vec.len could be exposed. Whether it actually is exposed is a decision solely for T-libs-api. Likewise with the inner fields of the various nonzero types. I'd rather not tie down who this is intended for, as it truly is intended for everyone. I know I've written a binary that had fields relied upon in unsafe code — there was just no way to make it actually unsafe.

this seems closely related to mut(self) fields #3323 which should probably be mentioned.

Related, sure, but beyond the mention in unresolved questions, I'm not sure how it could be mentioned. Pretty much the only overlap is what's considered a "mutable access", which I didn't feel necessary to be copy-pasted.

jsgf · 2023-07-13T08:46:53Z

How does this look with functional struct update?

Foo {
// Stuff
.. unsafe { other }
}

?
Or does the whole initializer need to be unsafe?

Edit: or I guess it doesn't need unsafe if the source had been initialized with unsafe.

jhpratt · 2023-07-13T08:50:04Z

@jsgf Great question. I don't have an immediate answer, though I believe the mechanism currently in the compiler would require the entire initializer to be unsafe.

mo8it · 2023-07-13T09:20:51Z

I love it! This is much better than getters and setters, both for library authors and users.

typed-builder would have to adjust, but I would love to implement this feature there :)

@idanarye (the main author) Maybe you have some input regarding the builder pattern?

djc · 2023-07-13T10:12:48Z

The implicit notion that only mutation is unsafe (and reading is not) seems tricky. Do you have reasoning to prove that we'll never need fields that are unsafe to read? Maybe there should be an alternative syntax proposal (like unsafe(mut) or mut(unsafe)) that makes this more obvious/explicit?

idanarye · 2023-07-13T13:35:10Z

@mo8it I don't want to spam the comments here with a discussion about typed builder, so I've opened a discussion in my repository instead: idanarye/rust-typed-builder#103

idanarye · 2023-07-13T14:54:13Z

Regarding the RFC itself - I think you are trying to solve a visibility issue with the safety mechanism, which is the wrong tool for the job. You gave Vec as an example, and it looks like you want to grant public read access to its len field (so that you don't have to use len() as a method? Let's ignore the question if that's big enough an improvement to justify such a feature). To do so, you are willing to provide public write access to it as well but make that access unsafe.

But why?

I mean, it's obvious why you don't want to give regular write access to len. But why give any access at all? Even if you require an unsafe block, what good will come from letting external users modify the len field without going through a method that upholds the invariant? As long as we are devising a brand new feature, wouldn't it make more sense to add a feature that gives public read access without any write access at all (other than private access from inside the defining module, of course)?

I'm aware of the set_len method that grants such access, but this is an explicit decision to give such access, with a fully documented method. Not a side-effect of wanting to provide a non-method-call read access to the field.

Another thing - conceptually it never makes sense to define only one field as unsafe. The invariant is a property of the struct as a whole. If this is unsafe:

let mut vec = Vec::new();
vec.len = 4; // UNSAFE!!!

The why not this?

let mut vec = Vec::from([1, 2, 3, 4]);
vec.buf = RawVec::new(); // perfectly safe apparently

Yes, buf is not publicly exposed at all, but inside the module len will need unsafe block to modify and buf won't, even though the invariant is about both of them, together, and how they interact with each other.

Whatever the semantics of unsafe fields will be - conceptually it makes more sense to put the unsafe on the struct itself. If there are fields that are not part of the invariant, they should not be part of the struct - because unsafety should be contained as much as possible and not contaminated with unrelated data. The only reason putting unsafe only on len seems to make case in your example is that the goal - as I've said before - is about visibility, not about safety.

Jules-Bertholet · 2023-07-13T18:30:58Z

text/0000-unsafe-fields.md

+## "Mutable use" in the compiler
+
+The concept of a "mutable use" [already exists][mutating use] within the compiler. This catches all
+situations that are relevant here, including `ptr::addr_of_mut!`, `&mut`, and direct assignment to a


One could argue that ptr::addr_of_mut! on an unsafe field need not be unsafe, because writes through the pointer are unsafe.

Consider this in conjunction with the examples present in the RFC.

fn make_zero(p: *mut u8) { unsafe { *p = 0; } } let p = ptr::addr_of_mut!(foo.unsafe_field); make_zero(p);

Ignoring thread-safety, which can be easily achieved with locks, no single step appears wrong. make_zero does not do anything wrong — assigning zero to an arbitrary *mut u8 is fine. Passing a pointer to the method is naturally okay. Yet it still results in unsoundness, as foo.unsafe_field must be odd.

"assigning zero to an arbitrary *mut u8 is fine" what‽ No it is not fine‽ An arbitrary *mut u8 could be null, dangling, aliased... fn make_zero is unsound.

True — I typed that far too quickly and without thinking. Regardless, it's not immediately obvious to me that ptr::addr_of_mut! should be allowed safely.

I am not fully convinced either way, but I fear it would just be confusing to require unsafe for an operation that can't lead to unsoundness, especially as addr_of!(struct.field) as *mut _ would do the same thing with no unsafe.

especially as addr_of!(struct.field) as *mut _ would do the same thing

While you can do that, it's undefined behavior to actually mutate the resulting mut pointer. I've just confirmed this with miri.

It's UB under Stacked Borrows, but MIRIFLAGS=-Zmiri-tree-borrows accepts it.

That's surprising. I'm not familiar with tree borrows, admittedly.

ptr::from_mut(&mut struct).wrapping_offset(offset_of!(Struct, field)) should achieve the same thing as addr_of_mut!, and I doubt you'd want to make offset_of unsafe for unsafe fields. And even without offset_of, you could use ptr::from_mut(&mut struct).wrapping_offset(addr_of!(stuct.Field) - ptr::from_ref(&struct)). Both of these should be safe in every borrowing model.

burdges · 2023-07-13T23:19:22Z

Vec::len should not do this even if this feature exists, because Vec::set_len is better pedagogically.

static muts require unsafe blocks for both writing and reading. An unsafe field would likely be some similar construction, so unsafe for both writing and reading. An UnsafeCell already hits those requirements, but any variants should set their auto-traits.

Ain't clear this proposal handles auto-traits correctly, even if some use case exists. If you need this, then define your own type which provides this. We've the inner-builder or whatever deref polymorphism pattern, which comes up far more in practice, and can simulate this of desired.

pub struct ThingBuilder { ... }

impl ThingBuilder {
    fn build(self) -> Thing {
        ...
        Thing { ..., inner, self }
    }
}

pub struct Thing {
    ... 
    inner: ThingBuilder,
}

impl Deref for Thing {
    type Target = ThingBuilder;
    fn deref(&self) -> ThingBuilder {
        &self.inner
    }
}

// We stop mutating ThingBuilder once we create a Thing, so Thing: !DerefMut,
// but Thing: Deref<Target=ThingBuilder> to make reading & replicating the
// builder config easy.

jhpratt · 2023-07-14T00:27:16Z

Do you have reasoning to prove that we'll never need fields that are unsafe to read?

How could a field of a struct be unsafe to read?

You gave Vec as an example, and it looks like you want to grant public read access to its len field (so that you don't have to use len() as a method?

Within the module it's defined in (as the field is private), it is currently safe to assign any value, despite the fact that it can lead to undefined behavior. Said another way, the current behavior is inherently unsound.

Nothing in the RFC so much as hints at Vec.len being made public, nor is an RFC an appropriate place to make a change like that. It is an example of a field that should be unsafe to avoid unsoundness and nothing more.

Another thing - conceptually it never makes sense to define only one field as unsafe.

I never claimed that was the case.

The why not this?

let mut vec = Vec::from([1, 2, 3, 4]);
vec.buf = RawVec::new(); // perfectly safe apparently

Inclusion of one example does not mean that everything not included is forbidden. There is simply no reason to repeat the same thing for every field. Of course buf would also be unsafe. I used Vec.len as the example because it's a clear, obvious example where its safety invariants are publicly documented.

The invariant is a property of the struct as a whole.

If there are fields that are not part of the invariant, they should not be part of the struct

For Vec, yes, but only because all fields of the strict interact with all other fields.

I have real world code where this is not the case. The flags field is for whether other fields are initialized. Note that some fields have niche value optimization, and as such don't interact with other fields in any way. Are you asserting that month: Option<Month> and other similar fields should be in a separate struct solely because it has niche value optimization? That appears to a logical conclusion as a result of what you've said.

static muts require unsafe blocks for both writing and reading. An unsafe field would likely be some similar construction, so unsafe for both writing and reading.

static mut requires unsafe due to inherent data races between threads. Unsafe fields have no such issue.

Ain't clear this proposal handles auto-traits correctly, even if some use case exists.

I don't follow. What problems do you see with handling auto traits? The type of the field is unchanged, so there would be no impact on auto traits.

text/0000-unsafe-fields.md

scottmcm · 2023-07-14T02:06:37Z

(@jhpratt I'd love for you to steal something like the text in this post for the RFC)

I assume it's library authors a making unsafe fields a "reminder to self" about upholding some invariant as opposed to say expecting a unsafe field in an API surface.

A huge incentive for it, to me at least, is helping avoid misunderstandings about the model.

For example, a 2020 PACMPL paper contains the following statement:

To check how prevalent unsafe is used as a documentation feature, our queries gathered data for unsafe traits and unsafe functions with safe implementations.

set_len being an unsafe fn is of course not merely a "documentation feature". It's an absolutely critical part of the soundness of Vec. (And the paper does talk about "invariants that are potentially critical for upholding Rust’s safety guarantees" (emphasis added), so I don't think their analysis is incorrect, but I still find the phrasing curious.)

Thus a huge win of this would be to avoid the (from the same paper)

After all, the compiler does not force developers to declare such functions as unsafe -- in contrast to other unsafe features.

Having the body of set_len do something the compiler recognizes as unsafe is a big help to people understanding the soundness of Vec, and in general to people adding new features inside existing unsafety privacy boundaries.

Especially combined with other accepted work like https://rust-lang.github.io/rfcs/2316-safe-unsafe-trait-methods.html we could start even doing things like clippy lints for "why is this unsafe when it doesn't do anything unsafe? Should one of the types involved be marked unsafe?"

If we don't have to link to tootsie pops as often because changing things that are relied on by unsafe code is itself unsafe, I'd consider that a big win.

As another way to look at this, it's weird that when I'm writing a method on by type with a safety invariant that I can do Self { a, b } and it's totally "safe", whereas if I call Self::new_unchecked(a, b) I need an unsafe block and tidy nags me to write a safety comment.

Tidy should nag about a safety comment for the constructor too, so that I'm not disincentivized to use the other, correctly-marked-unsafe function when writing things.

burdges · 2023-07-14T02:26:59Z

We do not need unsafe per se when maintaining a safety invariant within its defining visibility boundary aka module:

"Because it relies on invariants of a struct field, this unsafe code does more than pollute a whole function: it pollutes a whole module. Generally, the only bullet-proof way to limit the scope of unsafe code is at the module boundary with privacy."

In other words, rust does not have unsafe types because you must enforce invariants at module boundaries anyways. Also various discussions in https://github.com/rust-lang/unsafe-code-guidelines clarify this point.

A method like Vec::set_len must be unsafe due to being public. An extern fn is unsafe because it points outside the module. etc.

Anyways..

If I understand, you want this type:

pub UnsafeInvariant<T>(T);
impl<T> UnsafeInvariant<T> {
    fn new(t: T) -> UnsafeInvariant<T> { UnsafeInvariant(t) }
    unsafe fn get(&self) -> &T { &self.0 }
    unsafe fn get_mut(&mut self) -> &mut T { &mut self.0 }
}

It's similar to UnsafeCell but propagates all auto-traits normally, based upon your comment above.

You still enforce invariants by visibility but UnsafeInvariant could provide the documentation for which you propose unsafe fields. In practice, I suspect you'd be better off like this:

    fn invariant_get(&self) -> &T { &self.0 }
    fn invariant_get_mut(&mut self) -> &mut T { &mut self.0 }

Why? All those unsafe blocks you'll write risk other mistakes, so ideally they should not exist if they merely maintain some invarant. Instead, you want a safe but distinctively named accessor method, which flags that you maintain the invariant.

Anecdotally, this type winds up being much more common:

struct HideMut<T>(T);
impl<T> Deref for HideMut<T> {
    type Target = T;
    fn deref(&self) -> &T { &self.0 }
}
impl<T> HideMut<T> {
    fn new(t: T) -> HideMut<T> { HideMut(t) }
    unsafe fn get_mut(&mut self) -> &mut T { &mut self.0 }
}

And UnsafeCell remains more common than both of course.

In fact, if you wrap the HideMut declaration inside some macro_rules! use_hide_mut then HideMut becomes module local, so the local module can access pub foo: HideMut<Foo> fields freely, but the outside world has only immutable access, even if given a &mut for the containing struct. This is really the common pattern.

We made this a local type for visibility modifiers, so a language level construct helps here. Also conversely, if you do not require visibility modifiers then simple types like UnsafeInvariant suffice, no language change necessary.

I suppose one might imagine pub(positive_visibility) unsafe(negative_visibility) mut(positive_visibility) field: type, except this still cannot capture when mutation becomes unsafe but reading remains safe. Yet, visibility control types like UnsafeInvariant and HideMut work fine.

text/0000-unsafe-fields.md

scottmcm · 2023-07-14T03:51:20Z

EDIT: see below; it looks like the thing I was worried about here is probably impossible for other reasons.

I do thing that "safe to read; unsafe to modify" is the 99%+ case, and should certainly be the default, but

How could a field of a struct be unsafe to read?

Well, the field in AtomicPtr is unsafe to read, because it could be a race, for example. https://github.com/rust-lang/rust/blob/7a5814f922f85370e773f2001886b8f57002811c/library/core/src/sync/atomic.rs#L176

Or the value field of a ShardedLock in crossbeam https://docs.rs/crossbeam-utils/0.8.11/src/crossbeam_utils/sync/sharded_lock.rs.html#81

So perhaps some nuance for !Freeze could make sense? I'm not sure what the semver implications of that would be, though.

Jules-Bertholet · 2023-07-14T03:55:25Z

Well, the field in AtomicPtr is unsafe to read, because it could be a race

I don't think that is correct? The unsafe operation is dereferencing the pointer returned by UnsafeCell::get(), accessing the field can't lead to UB on its own.

scottmcm · 2023-07-14T03:59:57Z

accessing the field can't lead to UB on its own

Ah, I guess an access can't actually read an UnsafeCell (without ownership) because it's never Copy.

So I think the PlaceMention is always ok for everything, and a read would be unsafe for an UnsafeCell, but you can't actually do a read of an UnsafeCell directly in Rust.

jhpratt · 2023-07-14T07:18:53Z

We do not need unsafe per se when maintaining a safety invariant within its defining visibility boundary aka module

rust does not have unsafe types because you must enforce invariants at module boundaries anyways

The nomicon describes current behavior; using it as an argument against this RFC is counter to the purpose of the RFC. It's circular reasoning at best.

if they merely maintain some invarant.

The invariants are "merely" there for soundness. If the invariant is violated, the result is undefined behavior. That's far more serious than you make it sound.

Yet, visibility control types like UnsafeInvariant and HideMut work fine.

I have never seen anyone write code like this in practice. The standard library and my own code (in time) is included in this. That is a significant argument in favor of something better.

@scottmcm I'll definitely include parts of that into the RFC. Also reading that blog post now — I'd never seen it before.

burdges · 2023-07-14T09:54:00Z

If the invariant is violated, the result is undefined behavior.

That is a significant argument in favor of something better.

This RFC is not better because memory safety also helps when writing unsafe code.

The unsafe keyword is not simply a marker to tell you where to read more carefully. Its a marker of where safety rules must be violated.

In other words, we always convert regular code invariants into memory safety assurance, but these regular code invariants have exactly the same risks as other regular code, including their own memory safety concerns. This RFC confuses the memory safety consumed in maintaining the regular code invariant with the actually unsafe options the code requires.

In the past, unsafe fn bodies were unsafe blocks, but rust changed this to reduce the unsafe code surface area. This RFC is a mistake because it increases the unsafe code surface area with no benefits, given the same cautions can be maintained in other ways, like by variable naming, etc.

UnsafeInvariant not being used is evidence this feature is not required. UnsafeInvariant would make sense if you wanted to split the regular code invariant across distant visibility boundaries. In practice, unsafe fns always sufficed, or indeed proved more nuanced than UnsafeInvariant.

Anyways..

I think this discussion belongs in https://github.com/rust-lang/unsafe-code-guidelines where at least some people think formally about the unsafe code boundary.

jhpratt · 2023-07-14T10:08:17Z

In other words, we always convert regular code invariants into memory safety assurance, but these regular code invariants have exactly the same risks as other regular code, including their own memory safety concerns. This RFC confuses the memory safety consumed in maintaining the regular code invariant with the actually unsafe options the code requires.

This is your fundamental misunderstanding.

Other code in Vec relies on the invariants of Vec.len in ways that leads to undefined behavior if the invariants are broken. Fields like Vec.len are not "regular code invariants" — they are tightly coupled to whether the code is sound or not. It is a soundness invariant. You cannot possibly claim otherwise.

given the same cautions can be maintained in other ways, like by variable naming, etc.

Frankly, it's thinking like this that led to the creation of Rust. Thread safety can be maintained if everyone is super careful, but we all know how that works out. Likewise with a million other things. Programmers can not be relied upon to do the right thing. We have to force them to do it by leveraging the compiler wherever possible.

I think this discussion belongs in https://github.com/rust-lang/unsafe-code-guidelines where at least some people think formally about the unsafe code boundary.

I have no idea why you think the discussion belongs there, particularly as you're the one that initiated it here.

burdges · 2023-07-14T10:48:17Z

Other code in Vec relies on the invariants of Vec.len in ways that leads to undefined behavior if the invariants are broken.

Yes, but this does not make altering Vec.len within the Vec module an unsafe operation. That's not how safety works.

What happens if I've unsafe code which relies upon an invariant between the values in a slice, so your unsafe field is a &[*mut Foo] or &[usize]? I want memory safety around the regular code invariants in how I maintain this slice. Yes, those regular code invariants control memory safety in how the module get used, hence privacy. It's clearly worse if I've many more unsafe blocks merely to access this slice, which now might intermix with some real unsafe code blocks for the *mut Foos or even violate slice invariants.

We often have this "catch your tail" phenomenon in formalalisms.

Frankly, it's thinking like this that led to the creation of Rust

No. There is a formal model from the rustbelt project about how unsafe works, which gets discussed in the unsafe code guidlines repo. We should only expand what falls under unsafe code if the formal model says so. This RFC confuses those really important formal models with mere "read this" markers.

In brief, you don't have a formal conception of when unsafe types should be used. It's unlikely one exists. That makes this change a regression towards the C days.

jhpratt · 2023-07-14T11:03:16Z

Yes, but this does not make altering Vec.len within the Vec module an unsafe operation. That's not how safety works.

It actually is. If something can result in undefined behavior, it is unsafe. Where that happens is wholly irrelevant. You're using circular logic regarding the UCG, which is by definition written about Rust as it currently is. You seem to be claiming that library undefined behavior doesn't exist, which is bizarre.

I'll leave it at that as there's clearly no convincing you that the mere concept of an unsafe field has merit. I won't be responding further to anything along those lines.

tgross35 · 2023-07-14T19:15:01Z

Could you add another motivating example to the RFC? It seems like the vec.len example isn't super convincing because changing anything in Vec/RawVec breaks the model down and is "unsafe", by definition: that is protected via the private/public module interface. This makes me think that the resolution would be something like unsafe modules or #[unsafe] types rather than fine grained per-field control, so I'm curious to see what motivates this specific solution.

jhpratt · 2023-07-15T03:27:37Z

@tgross35 I linked this earlier in the thread. Is that sufficient for you? It demonstrates that field-level safety is appropriate, as some of the fields are perfectly safe to set as they have no interaction with any other field. Yet at the same time, all fields are one logical unit that should not be split into separate structs.

tgross35 · 2023-07-15T19:25:42Z

Thank you for the link, I meant specifically to add an example to the RFC document (which perhaps you planned to do anyway).

Even with that example, it does seem to me that it would be more correct to nest flags into a separate type with the other items it has the invariant with. To me, it illustrates the concept better; a field alone is never unsafe but rather it is unsafe in the context of other fields, and it seems like this is a suitable level for abstraction to a separate type. Also:

If a struct of 12 fields has 8 marked unsafe, it isn't immediately clear how they may be related. Are they all tied together via an invariant? Are 6 tied together and the remaining 2 associated somehow? This becomes immediately clear by separating things that are related into separate types, and I think that behavior should be encouraged
For the example Parsed struct, if sufficiently many fields are related to flags then it seems like it would make sense to mark the entire struct #[upholds_unsafe_invariant] or something like that. Maybe a few fields aren't related to the invariant, but commenting // SAFETY: not related to the invariant is easy enough to make this clear (and forces you to make the change if they do become related to the invariant for some reason)

Niche optimization was mentioned as well as a reason for not wanting to split off types, but I think this is more applicable to all separations of logic, and not just those with safety concerns. I don't wish to derail this conversation, but the thought has crossed my mind before about how Rust could potentially use a #[flatten] attribute for nested struct fields that tells the compiler to merge the child struct's fields into the parent's and rearrange for best possible size. (yes - possible method duplication, but this would be meant for one-off structs).

I think that investigating something like that may be more widely useful than using niche optimization as a justification for why a single flat struct is the correct solution. (edit: I brought this up for some discussion on Zulip https://rust-lang.zulipchat.com/#narrow/stream/213817-t-lang/topic/.60.23.5Bflatten.5D.60.20struct.20field.20attribute)

…bility

Jules-Bertholet · 2025-04-15T14:10:27Z

text/0000-unsafe-fields.md

+breaking existing code. Over an edition boundary, safe reads of `unsafe` fields could be permitted
+by rewriting existing `unsafe` fields to wrap their field type in a tooling-aware `Unsafe` wrapper
+type.


I am not convinced that changing the type of the field over an edition is viable. (However, I agree that safe-to-& unsafe fields is something that can be left for later.)

It would be sound, but not seamless. Since the Unsafe wrapper is presumably #[repr(transparent)], its addition doesn't have opsem implications. Crate authors with public unsafe fields affected by this could not migrate editions without releasing a SemVer breaking change, since changing a public field's type is a SemVer breaking change.

I raise this all as a future possibility only for completeness — I don't think we'd walk through this door likely or lightly. I remain confident that the design proposed by this RFC offers the best tradeoffs.

Jules-Bertholet · 2025-04-15T14:16:42Z

text/0000-unsafe-fields.md

+- `unsafe(init,&mut,&,read)` (everything is unsafe)
+- `unsafe(init,&mut,&)` (everything except reading unsafe)
+- `unsafe(init,&mut)` (everything except reading and `&`-referencing unsafe)
+- etc.
+
+Besides the unclear semantics of an unparameterized `unsafe()`, this design has the disadvantage


The way I conceptualized it in my proposal at #3458 (comment): the argument to unsafe denotes the weakest reference type that is unsafe to obtain. So unsafe() makes & and &mut references unsafe, while unsafe(mut) only makes &mut unsafe.

If Rust ever gets partial borrows, the syntax for that could be adopted to extend this feature as well.

…tion.

…ecker. Also discuss future possible interaction with partial borrows.

text/0000-unsafe-fields.md

Resolves rust-lang#3458 (comment)

Jules-Bertholet · 2025-04-18T18:13:43Z

text/0000-unsafe-fields.md

+This alternative also inherits some of the disadvantages of [`Unsafe` wrapper
+types](#unsafe-wrapper-type); namely that the safety proofs needed to operate on an `Unsafe` wrapper
+value depend on the dataflow of the program; the wrapper value must be traced to its originating
+field so that field's safety documentation may be examined.


Again, the same is true of this RFC’s design. With this RFC, the unsafe is required at the site of the copy/move, and to determine whether the safety condition is met, you must trace dataflow forward to where the value is later used. With Unsafe wrappers, the unsafe is required at the use-site, and dataflow must be traced back to the copy/move site. Both solutions require tracing dataflow, just in different directions.

But isn't that doubly true for Unsafe? Unsafe's accessors also require unsafe, forcing you to further trace the dataflow forward to check that uses don't violate the field invariant (which you discovered by tracing the dataflow backwards).

True, but you will probably not use the accessor until the last possible moment. So you are unlikely to have to trace forward very far.

tmandry · 2025-04-24T00:24:23Z

According to @jswrenn this RFC has reached a fixed point.

There are two leading alternatives. This RFC picks the simpler one of making exactly one kind of unsafe field, where all accesses are unsafe. Cases where e.g. reads are always safe can be expressed via a helper method.

Nominating for lang team discussion to see if we can get to a quick consensus on this RFC. I'd like to hear from @scottmcm as the lang team champion before moving forward.

@rustbot label I-lang-nominated

Jules-Bertholet · 2025-04-24T01:07:08Z

text/0000-unsafe-fields.md

+block and attendant `SAFETY` comment is required. In most cases, this is a one-time chore: the
+maintainer can define a *safe* accessor (i.e.,
+[`Vec::len`](https://doc.rust-lang.org/std/vec/struct.Vec.html#method.len)) that encapsulates this
+proof. However, in cases where multiple, partial field borrows are required, such an accessor cannot
+be invoked. [Future language extensions that permit partial borrows may resolve this
+drawback.](#partial-borrows).


Even with partial borrows, it may not be possible to write a safe helper method for moving out of a field (while leaving the rest of the struct intact).

joshtriplett · 2025-05-28T17:48:18Z

I would definitely love to also have the future work of defining "unsafe to write but safe to read" fields. But I think this initial proposal as written is great and provides a good solution for many cases.

@rfcbot merge

rfcbot · 2025-05-28T17:48:20Z

Team member @joshtriplett has proposed to merge this. The next step is review by the rest of the tagged team members:

No concerns currently listed.

Once a majority of reviewers approve (and at most 2 approvals are outstanding), this will enter its final comment period. If you spot a major issue that hasn't been raised at any point in this process, please speak up!

cc @rust-lang/lang-advisors: FCP proposed for lang, please feel free to register concerns.
See this document for info about what commands tagged team members can give me.

kennytm · 2025-05-28T19:24:08Z

text/0000-unsafe-fields.md

+The [`StructField` syntax][struct syntax], used for the named fields of structs, enums, and unions,
+shall be updated to accommodate an optional `unsafe` keyword just before the field `IDENTIFIER`:
+
+```diff
+StructField :
+   OuterAttribute*
+   Visibility?
+  unsafe?
+   IDENTIFIER : Type
+```
+
+[struct syntax]: https://doc.rust-lang.org/stable/reference/items/structs.html#structs


how about TupleField? (oversight? future extension? by-design unsupported because of ambiguity with struct F(unsafe fn());?)

By-design unsupported because of ambiguity. Might be possible to disambiguate with parens, but that's future work.

jhpratt · 2025-05-28T19:28:43Z

I've not been following this discussion too closely. While I understand that there is the desire to have "unsafe to use" as the default, I would like to ensure that there is, at a minimum, syntax that the lang team finds acceptable for "safe to access" fields as I'd originally proposed.

This section touches on it, but the only thing I see that even seems plausible is unsafe(mut). I'm not seeking to block the RFC, find a concrete syntax now, or otherwise hold things up, but I'd like to at least know that there's some syntax that is acceptable for the future.

jswrenn · 2025-05-28T19:59:14Z

@jhpratt I'm taking the today off for my birthday, but I'll draft a Future Possibility tomorrow that sketches out the steps unsafe(mut). :-) A definition- site modifier that makes some subset of accesses safe is possible to retrofit, but we should tread carefully when doing so. The design proposed in this RFC requires a safety proofs at only particular, actual use-sites; by contrast, a definition modifier that makes a subset of uses always safe, requires a definite-site safety proof for all hypothetical safe-subset uses. That's a much harder task for users, and not unlike the experience we had trying to carve out always-safe subsets when iterating on this RFC: we kept discovering cases in the design process where we thought we had carved out an always-safe subset of accesses, only to be surprised by an edge case producing a soundness hole!

jhpratt · 2025-05-28T20:02:46Z

For clarity, I'm not disputing the need for "unsafe to use" by default — I understand the reasoning. I was solely asking for something like unsafe(mut) to be called out more clearly, with the general understanding that it's syntactically feasible. If that's already planned, then wonderful! I know figuring out a subset isn't easy.

(happy birthday 🍰)

See: - rust-lang#3458 (comment) - rust-lang#3458 (comment)

jswrenn · 2025-05-29T16:20:49Z

@joshtriplett, @jhpratt I've now called out unsafe(mut) in Future Possibilities: Syntactic Knobs and Wrapper Types.

Jules-Bertholet · 2025-06-01T01:44:37Z

The unsafe binders proposal has some relevance here, in that both it and this RFC are meant to address a situation where it’s unsafe to operate on a field (in the case of unsafe binders, because the field references another part of the struct). That proposal makes unsafety part of the type (like my proposed Unsafe<T>), not the field (like this RFC).

Nadrieril · 2025-06-01T11:59:11Z

The unsafe binders proposal has some relevance here, in that both it and this RFC are meant to address a situation where it’s unsafe to operate on a field (in the case of unsafe binders, because the field references another part of the struct). That proposal makes unsafety part of the type (like my proposed Unsafe<T>), not the field (like this RFC).

This is consistent with the additive/subtractive invariant frame: unsafe fields are for additive invariants, unsafe binders are a specific subtractive invariant ("a value of type Foo<'a> except the 'a is unknowable/a lie").

Jules-Bertholet · 2025-06-01T13:58:14Z

This is consistent with the additive/subtractive invariant frame: unsafe fields are for additive invariants, unsafe binders are a specific subtractive invariant ("a value of type Foo<'a> except the 'a is unknowable/a lie").

Yes, and I like that framing, but it conflicts with the current version of this RFC.

Add unsafe fields RFC

95dd295

juntyr reviewed Jul 13, 2023

View reviewed changes

ehuss added the T-lang Relevant to the language team, which will review and decide on the RFC. label Jul 13, 2023

Jules-Bertholet reviewed Jul 13, 2023

View reviewed changes

scottmcm reviewed Jul 14, 2023

View reviewed changes

text/0000-unsafe-fields.md Show resolved Hide resolved

scottmcm reviewed Jul 14, 2023

View reviewed changes

text/0000-unsafe-fields.md Outdated Show resolved Hide resolved

RFC3458: Document 'Syntactic Knobs and Wrapper Types' as future possi…

f4e85fb

…bility

Jules-Bertholet reviewed Apr 15, 2025

View reviewed changes

jswrenn added 4 commits April 18, 2025 14:57

RFC3458: Clarify that syntactic knob possibility is not an easy migra…

94c173a

…tion.

RFC3458: Document trivial proof drawback's interaction with borrow ch…

462cee4

…ecker. Also discuss future possible interaction with partial borrows.

RFC3458: Document alt of Mixing Syntactic Knobs with a Wrapper Type

d1d7388

RFC3458: Add links to RFC PR and Rust tracking issue

f5a82f8

Jules-Bertholet reviewed Apr 18, 2025

View reviewed changes

text/0000-unsafe-fields.md Outdated Show resolved Hide resolved

RFC3458: Typo fix

d4cfd14

Resolves rust-lang#3458 (comment)

Jules-Bertholet reviewed Apr 18, 2025

View reviewed changes

rustbot added the I-lang-nominated Indicates that an issue has been nominated for prioritizing at the next lang team meeting. label Apr 24, 2025

Jules-Bertholet reviewed Apr 24, 2025

View reviewed changes

traviscross added the P-lang-drag-3 Lang team prioritization drag level 3. label May 16, 2025

jswrenn mentioned this pull request May 22, 2025

Unsafe Fields rust-lang/rust-project-goals#273

Open

7 tasks

rfcbot added proposed-final-comment-period Currently awaiting signoff of all team members in order to enter the final comment period. disposition-merge This RFC is in PFCP or FCP with a disposition to merge it. labels May 28, 2025

kennytm reviewed May 28, 2025

View reviewed changes

traviscross removed I-lang-nominated Indicates that an issue has been nominated for prioritizing at the next lang team meeting. P-lang-drag-3 Lang team prioritization drag level 3. labels May 28, 2025

RFC3458: Clarify 'Syntactic Knobs and Wrapper Types' possibility

7769117

See: - rust-lang#3458 (comment) - rust-lang#3458 (comment)

sosthene-nitrokey mentioned this pull request Jun 11, 2025

Add missing Debug derive to vec::IntoIter rust-embedded/heapless#580

Open

Unsafe fields #3458

Are you sure you want to change the base?

Unsafe fields #3458

Uh oh!

Conversation

jhpratt commented Jul 13, 2023 • edited by jswrenn Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

juntyr Jul 13, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jhpratt Jul 13, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

juntyr Jul 13, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

scottmcm Jul 14, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

algesten commented Jul 13, 2023

Uh oh!

programmerjake commented Jul 13, 2023

Uh oh!

jhpratt commented Jul 13, 2023

Uh oh!

jsgf commented Jul 13, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jhpratt commented Jul 13, 2023

Uh oh!

mo8it commented Jul 13, 2023

Uh oh!

djc commented Jul 13, 2023

Uh oh!

idanarye commented Jul 13, 2023

Uh oh!

idanarye commented Jul 13, 2023

Uh oh!

Jules-Bertholet Jul 13, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jhpratt Jul 14, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

CodesInChaos Aug 8, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

burdges commented Jul 13, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jhpratt commented Jul 14, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

scottmcm commented Jul 14, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

jhpratt commented Jul 13, 2023 •

edited by jswrenn

Loading

juntyr Jul 13, 2023 •

edited

Loading

jhpratt Jul 13, 2023 •

edited

Loading

juntyr Jul 13, 2023 •

edited

Loading

scottmcm Jul 14, 2023 •

edited

Loading

jsgf commented Jul 13, 2023 •

edited

Loading

Jules-Bertholet Jul 13, 2023 •

edited

Loading

jhpratt Jul 14, 2023 •

edited

Loading

CodesInChaos Aug 8, 2024 •

edited

Loading

burdges commented Jul 13, 2023 •

edited

Loading

jhpratt commented Jul 14, 2023 •

edited

Loading

scottmcm commented Jul 14, 2023 •

edited

Loading

burdges commented Jul 14, 2023 •

edited

Loading

scottmcm commented Jul 14, 2023 •

edited

Loading

scottmcm commented Jul 14, 2023 •

edited

Loading

jhpratt commented Jul 14, 2023 •

edited

Loading

burdges commented Jul 14, 2023 •

edited

Loading

tgross35 commented Jul 15, 2023 •

edited

Loading

jswrenn Apr 18, 2025 •

edited

Loading