Proposal: Add a normalization interface

As noted in #606 and elsewhere, the URL APIs strongly lean towards preserving input in path and query components, and therefore differentiate URIs that are equivalent per e.g. https://www.rfc-editor.org/rfc/rfc9110#section-4.2.3. But users need to compare such URIs and/or map them to resources, and doing so robustly requires normalization. I think it therefore makes sense to provide a normalization interface, probably one that is configurable (or can become so in the future) to account for the various rungs of the "[comparison ladder](https://www.rfc-editor.org/rfc/rfc3986#section-6.2)": generic percent-decoding (with case normalization of the percent-encodings that survive), dot-segment removal, component-sensitive percent-decoding, scheme-based rules, and possibly even higher-order considerations such as full case normalization and/or query parameter ordering/combining/value normalization.

One possibility would be adding a `normalize` method to the [URL class](https://url.spec.whatwg.org/#url-class) with reasonable behavior in the absence of any arguments (e.g., as much normalization as possible without conflating URIs that implementations supporting the scheme are permitted to differentiate), such that e.g. `new URL("httpS://EXAMPLE.com:443/%7ESMith/./home.html").normalize() === "https://example.com/~SMith/home.html"` is true but so is `new URL("http://example.com/data/").normalize() !== new URL("http://example.com/data").normalize()` (_because a path with a trailing slash and one without are not equivalent at the level of an http-scheme URL_).
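As a rough illustration of that default behavior, here is a minimal userland sketch built on the existing URL parser. The names `normalizeUrl` and `normalizePercentEncoding` are hypothetical and not part of the URL Standard. The sketch assumes a special scheme such as http or https, relies on the parser for scheme/host lowercasing, default-port removal, and dot-segment removal, and only touches the path (not the query or fragment).

```javascript
// Hypothetical helpers for illustration only; not part of the URL Standard.

// RFC 3986 "unreserved" characters: percent-encoding them is never required.
const UNRESERVED = /^[A-Za-z0-9\-._~]$/;

// Decode percent-encoded unreserved characters and uppercase the hex digits
// of every percent-encoding that survives (single pass over the input).
function normalizePercentEncoding(component) {
  return component.replace(/%[0-9a-fA-F]{2}/g, (triplet) => {
    const char = String.fromCharCode(parseInt(triplet.slice(1), 16));
    return UNRESERVED.test(char) ? char : triplet.toUpperCase();
  });
}

// Assumption: input uses a special scheme (http, https, ...), so the
// pathname setter is effective. The parser has already lowercased the
// scheme and host, dropped the default port, and removed dot segments.
function normalizeUrl(input) {
  const url = new URL(input);
  url.pathname = normalizePercentEncoding(url.pathname);
  return url.href;
}

console.log(normalizeUrl("httpS://EXAMPLE.com:443/%7ESMith/./home.html"));
// → "https://example.com/~SMith/home.html"
```

With this sketch, `normalizeUrl("http://example.com/data/")` and `normalizeUrl("http://example.com/data")` remain distinct, matching the trailing-slash example above. A configurable interface could layer the higher rungs (scheme-based rules, query parameter handling) on top of this baseline.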

Proposal: Add a normalization interface #729
