Skip to content

8353013: java.net.URI.create(String) may have low performance to scan the host/domain name from URI string when the hostname starts with number #24295

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 10 commits into from

Conversation

rk-kmr
Copy link

@rk-kmr rk-kmr commented Mar 28, 2025

scanByte throws NumberFormatException for URIs that start with numbers, e.g., https://11111111.x.y/
The current flow is parseIPv4AddressscanIPv4AddressscanByte. parseIPv4Address uses NumberFormatException for control flow, so it captures the exception, ignores it, and returns -1. This has been reported by AWS customer to cause low performance. Details: JDK-8353013 & aws/aws-sdk-java-v2#5933

This PR avoids NumberFormatException by skipping calls to Integer.parseInt if the number of digits in the octet is > 3.

I ran following tests.

make test-tier1
make test-tier2
make test TEST=jdk/java/net


Progress

  • Change must be properly reviewed (1 review required, with at least 1 Reviewer)
  • Change must not contain extraneous whitespace
  • Commit message must refer to an issue

Issue

  • JDK-8353013: java.net.URI.create(String) may have low performance to scan the host/domain name from URI string when the hostname starts with number (Bug - P4)

Reviewers

Reviewing

Using git

Checkout this PR locally:
$ git fetch https://git.openjdk.org/jdk.git pull/24295/head:pull/24295
$ git checkout pull/24295

Update a local copy of the PR:
$ git checkout pull/24295
$ git pull https://git.openjdk.org/jdk.git pull/24295/head

Using Skara CLI tools

Checkout this PR locally:
$ git pr checkout 24295

View PR using the GUI difftool:
$ git pr show -t 24295

Using diff file

Download this PR as a diff file:
https://git.openjdk.org/jdk/pull/24295.diff

Using Webrev

Link to Webrev Comment

@bridgekeeper bridgekeeper bot added the oca Needs verification of OCA signatory status label Mar 28, 2025
@bridgekeeper
Copy link

bridgekeeper bot commented Mar 28, 2025

Hi @rk-kmr, welcome to this OpenJDK project and thanks for contributing!

We do not recognize you as Contributor and need to ensure you have signed the Oracle Contributor Agreement (OCA). If you have not signed the OCA, please follow the instructions. Please fill in your GitHub username in the "Username" field of the application. Once you have signed the OCA, please let us know by writing /signed in a comment in this pull request.

If you already are an OpenJDK Author, Committer or Reviewer, please click here to open a new issue so that we can record that fact. Please use "Add GitHub user rk-kmr" as summary for the issue.

If you are contributing this work on behalf of your employer and your employer has signed the OCA, please let us know by writing /covered in a comment in this pull request.

@openjdk
Copy link

openjdk bot commented Mar 28, 2025

@rk-kmr This change now passes all automated pre-integration checks.

ℹ️ This project also has non-automated pre-integration requirements. Please see the file CONTRIBUTING.md for details.

After integration, the commit message for the final commit will be:

8353013: java.net.URI.create(String) may have low performance to scan the host/domain name from URI string when the hostname starts with number

Reviewed-by: michaelm, xpeng

You can use pull request commands such as /summary, /contributor and /issue to adjust it as needed.

At the time when this comment was updated there had been 324 new commits pushed to the master branch:

As there are no conflicts, your changes will automatically be rebased on top of these commits when integrating. If you prefer to avoid this automatic rebasing, please check the documentation for the /integrate command for further details.

As you do not have Committer status in this project an existing Committer must agree to sponsor your change. Possible candidates are the reviewers of this PR (@Michael-Mc-Mahon) but any other Committer may sponsor as well.

➡️ To flag this PR as ready for integration with the above commit message, type /integrate in a new comment. (Afterwards, your sponsor types /sponsor in a new comment to perform the integration).

@openjdk
Copy link

openjdk bot commented Mar 28, 2025

@rk-kmr The following label will be automatically applied to this pull request:

  • net

When this pull request is ready to be reviewed, an "RFR" email will be sent to the corresponding mailing list. If you would like to change these labels, use the /label pull request command.

@rk-kmr
Copy link
Author

rk-kmr commented Mar 28, 2025

/covered

@bridgekeeper bridgekeeper bot added the oca-verify Needs verification of OCA signatory status label Mar 28, 2025
@bridgekeeper
Copy link

bridgekeeper bot commented Mar 28, 2025

Thank you! Please allow for a few business days to verify that your employer has signed the OCA. Also, please note that pull requests that are pending an OCA check will not usually be evaluated, so your patience is appreciated!

@rk-kmr rk-kmr changed the title 8353013: Avoid NumberFormatException within UI.create(String) to optimize handling of URI Strings that begin with numbers 8353013: java.net.URI.create(String) may have low performance to scan the host/domain name from URI string when the hostname starts with number Mar 28, 2025
Copy link

@pengxiaolong pengxiaolong left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me, thanks.

@myankelev
Copy link
Member

Could you please also add a test for this change/ add a bug id to the existing test if there already is one covering this functionality. Just in case

@rk-kmr
Copy link
Author

rk-kmr commented Mar 31, 2025

Thanks! I have added unit test.

@bridgekeeper bridgekeeper bot removed oca Needs verification of OCA signatory status oca-verify Needs verification of OCA signatory status labels Apr 11, 2025
@openjdk openjdk bot added the rfr Pull request is ready for review label Apr 11, 2025
@mlbridge
Copy link

mlbridge bot commented Apr 11, 2025

Webrevs

@@ -1786,6 +1787,28 @@ private static void b8272072() {
}
}

// 8353013 - java.net.URI.create(String) may have low performance to scan the host/domain name from
// URI string when the hostname starts with number
Copy link
Contributor

@AlanBateman AlanBateman Apr 12, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This comment looks a bit out of place in a unit test. I think start with a JMH benchmark and change the comment in the unit test to make it clearer that it's providing more test coverage for cases where the authority component of a hierarchical URI has a host component that starts with a number.

Copy link
Author

@rk-kmr rk-kmr Apr 14, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks! I have updated the comment and added a JMH benchmark.

## Before
Benchmark                                                      (uri)  Mode  Cnt     Score    Error  Units
URIAuthorityParsingBenchmark.create  https://98765432101.abc.xyz.com  avgt   15  1495.273 ± 86.157  ns/op
URIAuthorityParsingBenchmark.create  https://ABCDEFGHIJK.abc.xyz.com  avgt   15   363.048 ±  6.510  ns/op

## After
Benchmark                                                    Mode  Cnt    Score    Error  Units
Benchmark                                                      (uri)  Mode  Cnt    Score    Error  Units
URIAuthorityParsingBenchmark.create  https://98765432101.abc.xyz.com  avgt   15  377.741 ± 10.497  ns/op
URIAuthorityParsingBenchmark.create  https://ABCDEFGHIJK.abc.xyz.com  avgt   15  334.215 ±  2.148  ns/op

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Isn't is suspicious that non-number case got speedup too?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not sure if we should consider a 30ns speed up in a single run of the benchmark on a single node as significant.

Copy link
Member

@Michael-Mc-Mahon Michael-Mc-Mahon left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks like a useful improvement. LGTM.

@openjdk
Copy link

openjdk bot commented Apr 16, 2025

⚠️ @rk-kmr the full name on your profile does not match the author name in this pull requests' HEAD commit. If this pull request gets integrated then the author name from this pull requests' HEAD commit will be used for the resulting commit. If you wish to push a new commit with a different author name, then please run the following commands in a local repository of your personal fork:

$ git checkout JDK-8353013
$ git commit --author='Preferred Full Name <[email protected]>' --allow-empty -m 'Update full name'
$ git push

@openjdk openjdk bot added the ready Pull request is ready to be integrated label Apr 16, 2025
@rk-kmr
Copy link
Author

rk-kmr commented Apr 16, 2025

/integrate

@openjdk openjdk bot added the sponsor Pull request is ready to be sponsored label Apr 16, 2025
@openjdk
Copy link

openjdk bot commented Apr 16, 2025

@rk-kmr
Your change (at version 0164bca) is now ready to be sponsored by a Committer.

@Michael-Mc-Mahon
Copy link
Member

We are going to run some more tests before sponsoring this PR.

@Michael-Mc-Mahon
Copy link
Member

/sponsor

@openjdk
Copy link

openjdk bot commented Apr 16, 2025

Going to push as commit 84458ec.
Since your change was applied there have been 335 commits pushed to the master branch:

Your commit was automatically rebased without conflicts.

@openjdk openjdk bot added the integrated Pull request has been integrated label Apr 16, 2025
@openjdk openjdk bot closed this Apr 16, 2025
@openjdk openjdk bot removed ready Pull request is ready to be integrated rfr Pull request is ready for review sponsor Pull request is ready to be sponsored labels Apr 16, 2025
@openjdk
Copy link

openjdk bot commented Apr 16, 2025

@Michael-Mc-Mahon @rk-kmr Pushed as commit 84458ec.

💡 You may see a message that your pull request was closed with unmerged commits. This can be safely ignored.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
integrated Pull request has been integrated net [email protected]
Development

Successfully merging this pull request may close these issues.

8 participants