Add support for Spark 3.0 #1592

Merged
merged 11 commits into from
Feb 16, 2021

jbaiera (Member) commented Jan 29, 2021

This PR adds support for Spark version 3.0.

Closes #1412

Reviewer's note: The PR is best reviewed by evaluating the changes on each commit, since 6c0b177 is just a source copy operation from the Spark 2.x project.

jbaiera (Member, Author) commented Jan 29, 2021

Known issue with Spark 3.0 #1593

lyogev commented Jan 30, 2021

@jbaiera I was able to build your branch with JDK 11, although I had to export JAVA8_HOME pointing to my JDK 8 directory.
I tested it with a simple write test and it seems to work well. I will update after further testing.
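
For readers reproducing the build setup described above, a minimal sketch follows. The JDK paths are hypothetical and will differ per machine, and the Gradle invocation is left commented out because the exact task is an assumption rather than something stated in this thread.

```shell
# Hypothetical install locations; substitute the paths on your machine.
# JAVA_HOME points at the JDK 11 used to run the build.
export JAVA_HOME="${JAVA_HOME:-/usr/lib/jvm/java-11-openjdk}"
# JAVA8_HOME points at a JDK 8 install, as both commenters report needing.
export JAVA8_HOME="${JAVA8_HOME:-/usr/lib/jvm/java-8-openjdk}"

echo "JAVA_HOME=${JAVA_HOME}"
echo "JAVA8_HOME=${JAVA8_HOME}"

# Assumed build command, run from the repository root:
# ./gradlew build
```

The `${VAR:-default}` form keeps any value already exported in the environment and only falls back to the placeholder path when the variable is unset.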

@paulrossmeier

@jbaiera I was able to build your branch as well, using JDK 11, although I had to export JAVA8_HOME pointing to my JDK 8 directory.

I have tested this by writing a few tables to Elasticsearch, one with ~9 million rows, using the "WAN" setting since I am indexing into a hosted cluster.

I have had no issues thus far.
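
As a rough illustration of the setup mentioned above, and assuming the "WAN" setting refers to the connector's `es.nodes.wan.only` option, the flags for a `spark-submit` run might be assembled as below. The host and port are hypothetical placeholders, and the connector jar and job are left as placeholders because they are not named in this thread. The connector reads `es.*` settings from the Spark configuration when they carry a `spark.` prefix.

```shell
# Hypothetical hosted cluster endpoint; replace with your own.
ES_HOST="my-cluster.example.com"
ES_PORT="9243"

# Assumption: "WAN" means es.nodes.wan.only, which restricts the connector
# to the declared nodes instead of discovering the rest of the cluster.
SUBMIT_ARGS="--conf spark.es.nodes=${ES_HOST} --conf spark.es.port=${ES_PORT} --conf spark.es.nodes.wan.only=true"

# Print the command rather than run it; fill in the jar and job yourself.
echo "spark-submit ${SUBMIT_ARGS} --jars <connector-jar> <your-job>"
```

Printing the command first makes it easy to check the flags before pointing a real job at a remote cluster.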

danhermann self-requested a review February 15, 2021 16:50

danhermann (Contributor) left a comment

LGTM. I've walked through the code with @jbaiera and done some basic testing in Spark 3, both reading from and writing to ES.

jbaiera merged commit e9ce14a into elastic:master Feb 16, 2021
jbaiera deleted the upgrade-spark-3 branch February 16, 2021 16:02
jbaiera added a commit to jbaiera/elasticsearch-hadoop that referenced this pull request Feb 16, 2021
Spark 3.0 support is added via a new source directory and subproject. Supported Scala versions are only 2.12.
jbaiera added a commit that referenced this pull request Feb 16, 2021
Spark 3.0 support is added via a new source directory and subproject. Supported Scala versions are only 2.12.
Development

Successfully merging this pull request may close these issues.

[Feature] Spark3.0 support
4 participants