Skip to content

Updating to arrow 4.0 #418

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 15 commits into from
May 6, 2021
Merged

Conversation

nirandaperera
Copy link
Collaborator

No description provided.

@nirandaperera nirandaperera marked this pull request as ready for review May 6, 2021 13:27
@nirandaperera nirandaperera merged commit 71e57f8 into cylondata:main May 6, 2021
supunkamburugamuve added a commit that referenced this pull request Jun 6, 2021
* Initial commit for ucx integration. Still WiP

* Progress on making the code more readable

* Improved code readability

* Missing completion bug fix

* Missing completion bug fix

* Add in memory testing for UCX

* Remove UCX installation from git

* Initial move of current channel init to com init

* Move init to communicator

* Temporarily change docker to just check ucx cpp

* Test UCX with file creation

* Changes made for TODOs

* Remove unnecessary include

* Revert comment

* Update README.md

* fixing empty tables for ops (#395)

* Indexing improvement (#394)

* setup fix for pycylon upgrade for arrow 2.0.0

* add option for concat

* adding initial indexing

* initial version of arrow-input based indexing v1

* initial version of arrow-input based indexing v2

* initial version of arrow-input based indexing v3

* initial version of arrow-input based indexing v4

* initial version of arrow-input based indexing v5

* initial version of arrow-input based indexing v6

* initial version of arrow-input based indexing v7

* initial version of arrow-input based indexing v8

* removing Final to fix support for Python 3.7

* adding arrow iloc operations

* updating test cases

* arrow_index integration to python v1

* arrow_index integration to python v2

* arrow_index integration to python v3

* arrow_index integration to python v4

* arrow_index integration to python v5

* arrow_index integration to python v6

* arrow_index integration to python v7

* code clean up for indexing step 1

* code clean up for indexing step 2

* adding linear search with vector search

* indexing code clean up v1

* indexing code clean up v2

* indexing code clean up v3

* indexing code clean up v4

* modified for review v1

* modified for review v2

* modified for review v3

* modified for review v4

* modified for review v5

* modified for review v6

* modified for review v7

* modified for review v8

* modified for review v9

* modified for review v10

* modified for review v11

* modified for review v13

* modified for review v14

* modified for review v15

* modified for review v16

* modified for review v16

* modified for review v18

* modified for review v19

* modified for review v20

* adding temporal types and binary types v1

* adding temporal types and binary types v2

* adding temporal types and binary types v2

* fixed to provide output as pointers

* change unique value check for indexing with bool

* reformatting

* reformatting v1

* reformatting v2

* 0.4.0 release (#411)

* changes to docs

* Update cylon_release_0.4.0.md

* Update README.md

* changes to release note

* adding release notes to docs (#399)

* adding csv reader

* reorderd code snippets

* generated new frame.html

* adding docs changes

* adding group by

* updating docs

* changed front page code

* added missing "

* Adding group by (#405)

* reorderd code snippets

* generated new frame.html

* adding docs changes

* adding group by

* updating docs

* changed front page code

* added missing "

* partially fixing #375 (#406)

* partially fixing #375

* adding __len__ to df

* Bug fixes and cosmetic changes (#408)

* partially fixing #375

* adding __len__ to df

* passing pycylon.Table to df operations

* fixing buffer copy bug

* logging status errors in macros

* bug fix

* test case for chunked tables

* cosmetic changes

* fixed front page icons order

* Fixing 375 (#410)

* partially fixing #375

* adding __len__ to df

* passing pycylon.Table to df operations

* fixing buffer copy bug

* logging status errors in macros

* bug fix

* test case for chunked tables

* cosmetic changes

* fix #375

* fix #375

Co-authored-by: niranda perera <[email protected]>
Co-authored-by: Supun Kamburugamuve <[email protected]>

* Update README.md

* adding missing test case

* Update README.md

* adding finalize check and removing destructor finalize call. (#412)

* adding finalize check and removing destructor finalize call.

* adding env.finalize for dataframe examples

* finalize checks.

* Update c-cpp.yml

adding github CI again

* Bug fixes in dataframe  (#420)

* fixing #415 and #416

* changes to concat operation

* unwrapping sort options onto sort method

* Op bench refactor (#417)

* refactor duplicates to unique

* adding update for indexing reset

* improving drop_na axis=0 perf

* reformatting table

* reformatting table.cpp with macros

* Enabling scalars in df set_item (#425)

* fixing minor bug in select

* adding select tests

* accommodating comments

* minor improvements to #189

* enabling scalars in df set_item

* fixing boost error

* 0.4.1 release (#427)

* changes to docs

* Update cylon_release_0.4.0.md

* Update README.md

* changes to release note

* adding release notes to docs (#399)

* adding csv reader

* reorderd code snippets

* generated new frame.html

* adding docs changes

* adding group by

* updating docs

* changed front page code

* added missing "

* Adding group by (#405)

* reorderd code snippets

* generated new frame.html

* adding docs changes

* adding group by

* updating docs

* changed front page code

* added missing "

* partially fixing #375 (#406)

* partially fixing #375

* adding __len__ to df

* Bug fixes and cosmetic changes (#408)

* partially fixing #375

* adding __len__ to df

* passing pycylon.Table to df operations

* fixing buffer copy bug

* logging status errors in macros

* bug fix

* test case for chunked tables

* cosmetic changes

* fixed front page icons order

* Fixing 375 (#410)

* partially fixing #375

* adding __len__ to df

* passing pycylon.Table to df operations

* fixing buffer copy bug

* logging status errors in macros

* bug fix

* test case for chunked tables

* cosmetic changes

* fix #375

* fix #375

* Bug fixes after 0.4.0 release  (#421)

* Update README.md

(cherry picked from commit 5609535)

* adding missing test case

(cherry picked from commit 016c5c9)

* Update README.md

(cherry picked from commit 149919c)

* adding finalize check and removing destructor finalize call. (#412)

* adding finalize check and removing destructor finalize call.

* adding env.finalize for dataframe examples

* finalize checks.

(cherry picked from commit 0200c02)

* Update c-cpp.yml

adding github CI again

(cherry picked from commit e0ba964)

* fixing #415 and #416

* changes to concat operation

* unwrapping sort options onto sort method

* Enabling scalars in df set_item (#425)

* fixing minor bug in select

* adding select tests

* accommodating comments

* minor improvements to #189

* enabling scalars in df set_item

* fixing boost error

(cherry picked from commit 10f5a6a)

Co-authored-by: Supun Kamburugamuve <[email protected]>
Co-authored-by: Vibhatha Lakmal Abeykoon <[email protected]>

Co-authored-by: niranda perera <[email protected]>
Co-authored-by: Supun Kamburugamuve <[email protected]>
Co-authored-by: Vibhatha Lakmal Abeykoon <[email protected]>

* missing docs (#428)

* changes to docs

* Update cylon_release_0.4.0.md

* Update README.md

* changes to release note

* adding release notes to docs (#399)

* adding csv reader

* reorderd code snippets

* generated new frame.html

* adding docs changes

* adding group by

* updating docs

* changed front page code

* added missing "

* Adding group by (#405)

* reorderd code snippets

* generated new frame.html

* adding docs changes

* adding group by

* updating docs

* changed front page code

* added missing "

* partially fixing #375 (#406)

* partially fixing #375

* adding __len__ to df

* Bug fixes and cosmetic changes (#408)

* partially fixing #375

* adding __len__ to df

* passing pycylon.Table to df operations

* fixing buffer copy bug

* logging status errors in macros

* bug fix

* test case for chunked tables

* cosmetic changes

* fixed front page icons order

* Fixing 375 (#410)

* partially fixing #375

* adding __len__ to df

* passing pycylon.Table to df operations

* fixing buffer copy bug

* logging status errors in macros

* bug fix

* test case for chunked tables

* cosmetic changes

* fix #375

* fix #375

* Bug fixes after 0.4.0 release  (#421)

* Update README.md

(cherry picked from commit 5609535)

* adding missing test case

(cherry picked from commit 016c5c9)

* Update README.md

(cherry picked from commit 149919c)

* adding finalize check and removing destructor finalize call. (#412)

* adding finalize check and removing destructor finalize call.

* adding env.finalize for dataframe examples

* finalize checks.

(cherry picked from commit 0200c02)

* Update c-cpp.yml

adding github CI again

(cherry picked from commit e0ba964)

* fixing #415 and #416

* changes to concat operation

* unwrapping sort options onto sort method

* Enabling scalars in df set_item (#425)

* fixing minor bug in select

* adding select tests

* accommodating comments

* minor improvements to #189

* enabling scalars in df set_item

* fixing boost error

(cherry picked from commit 10f5a6a)

Co-authored-by: Supun Kamburugamuve <[email protected]>
Co-authored-by: Vibhatha Lakmal Abeykoon <[email protected]>

* adding missing release docs

* file name mistamatch fixed

Co-authored-by: niranda perera <[email protected]>
Co-authored-by: Supun Kamburugamuve <[email protected]>
Co-authored-by: Vibhatha Lakmal Abeykoon <[email protected]>

* changing ctx to const reference in methods (#419)

* changing ctx to const reference in methods

* minor changes

* adding const ctx ref for parquet ops

* Updating to arrow 4.0 (#418)

* fixing minor bug in select

* adding select tests

* accommodating comments

* minor improvements to #189

* arrow update

* minor change

* removing boost url from build.sh

* adding new operators to DataFrame API (#429)

* fixing minor bug in select

* adding select tests

* accommodating comments

* minor improvements to #189

* adding isin op for dataframes

* adding applymap operator

* minor changes

* Bump dns-packet from 1.3.1 to 1.3.4 in /docs (#435)

Bumps [dns-packet](https://github.com/mafintosh/dns-packet) from 1.3.1 to 1.3.4.
- [Release notes](https://github.com/mafintosh/dns-packet/releases)
- [Changelog](https://github.com/mafintosh/dns-packet/blob/master/CHANGELOG.md)
- [Commits](mafintosh/dns-packet@v1.3.1...v1.3.4)

Signed-off-by: dependabot[bot] <[email protected]>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

Co-authored-by: Supun Kamburugamuve <[email protected]>
Co-authored-by: niranda perera <[email protected]>
Co-authored-by: Vibhatha Lakmal Abeykoon <[email protected]>
Co-authored-by: Chathura Widanage <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Sandeepa1995 pushed a commit to Sandeepa1995/cylon that referenced this pull request Jun 6, 2021
* fixing minor bug in select

* adding select tests

* accommodating comments

* minor improvements to cylondata#189

* arrow update

* minor change

* removing boost url from build.sh
supunkamburugamuve added a commit that referenced this pull request Jun 20, 2021
* UCX Integration (#393)

* Initial commit for ucx integration. Still WiP

* Progress on making the code more readable

* Improved code readability

* Missing completion bug fix

* Missing completion bug fix

* Add in memory testing for UCX

* Remove UCX installation from git

* Initial move of current channel init to com init

* Move init to communicator

* Temporarily change docker to just check ucx cpp

* Test UCX with file creation

* Changes made for TODOs

* Remove unnecessary include

* Revert comment

* Update README.md

* fixing empty tables for ops (#395)

* Indexing improvement (#394)

* setup fix for pycylon upgrade for arrow 2.0.0

* add option for concat

* adding initial indexing

* initial version of arrow-input based indexing v1

* initial version of arrow-input based indexing v2

* initial version of arrow-input based indexing v3

* initial version of arrow-input based indexing v4

* initial version of arrow-input based indexing v5

* initial version of arrow-input based indexing v6

* initial version of arrow-input based indexing v7

* initial version of arrow-input based indexing v8

* removing Final to fix support for Python 3.7

* adding arrow iloc operations

* updating test cases

* arrow_index integration to python v1

* arrow_index integration to python v2

* arrow_index integration to python v3

* arrow_index integration to python v4

* arrow_index integration to python v5

* arrow_index integration to python v6

* arrow_index integration to python v7

* code clean up for indexing step 1

* code clean up for indexing step 2

* adding linear search with vector search

* indexing code clean up v1

* indexing code clean up v2

* indexing code clean up v3

* indexing code clean up v4

* modified for review v1

* modified for review v2

* modified for review v3

* modified for review v4

* modified for review v5

* modified for review v6

* modified for review v7

* modified for review v8

* modified for review v9

* modified for review v10

* modified for review v11

* modified for review v13

* modified for review v14

* modified for review v15

* modified for review v16

* modified for review v16

* modified for review v18

* modified for review v19

* modified for review v20

* adding temporal types and binary types v1

* adding temporal types and binary types v2

* adding temporal types and binary types v2

* fixed to provide output as pointers

* change unique value check for indexing with bool

* reformatting

* reformatting v1

* reformatting v2

* 0.4.0 release (#411)

* changes to docs

* Update cylon_release_0.4.0.md

* Update README.md

* changes to release note

* adding release notes to docs (#399)

* adding csv reader

* reorderd code snippets

* generated new frame.html

* adding docs changes

* adding group by

* updating docs

* changed front page code

* added missing "

* Adding group by (#405)

* reorderd code snippets

* generated new frame.html

* adding docs changes

* adding group by

* updating docs

* changed front page code

* added missing "

* partially fixing #375 (#406)

* partially fixing #375

* adding __len__ to df

* Bug fixes and cosmetic changes (#408)

* partially fixing #375

* adding __len__ to df

* passing pycylon.Table to df operations

* fixing buffer copy bug

* logging status errors in macros

* bug fix

* test case for chunked tables

* cosmetic changes

* fixed front page icons order

* Fixing 375 (#410)

* partially fixing #375

* adding __len__ to df

* passing pycylon.Table to df operations

* fixing buffer copy bug

* logging status errors in macros

* bug fix

* test case for chunked tables

* cosmetic changes

* fix #375

* fix #375

Co-authored-by: niranda perera <[email protected]>
Co-authored-by: Supun Kamburugamuve <[email protected]>

* Update README.md

* adding missing test case

* Update README.md

* adding finalize check and removing destructor finalize call. (#412)

* adding finalize check and removing destructor finalize call.

* adding env.finalize for dataframe examples

* finalize checks.

* Update c-cpp.yml

adding github CI again

* Bug fixes in dataframe  (#420)

* fixing #415 and #416

* changes to concat operation

* unwrapping sort options onto sort method

* Op bench refactor (#417)

* refactor duplicates to unique

* adding update for indexing reset

* improving drop_na axis=0 perf

* reformatting table

* reformatting table.cpp with macros

* Enabling scalars in df set_item (#425)

* fixing minor bug in select

* adding select tests

* accommodating comments

* minor improvements to #189

* enabling scalars in df set_item

* fixing boost error

* 0.4.1 release (#427)

* changes to docs

* Update cylon_release_0.4.0.md

* Update README.md

* changes to release note

* adding release notes to docs (#399)

* adding csv reader

* reorderd code snippets

* generated new frame.html

* adding docs changes

* adding group by

* updating docs

* changed front page code

* added missing "

* Adding group by (#405)

* reorderd code snippets

* generated new frame.html

* adding docs changes

* adding group by

* updating docs

* changed front page code

* added missing "

* partially fixing #375 (#406)

* partially fixing #375

* adding __len__ to df

* Bug fixes and cosmetic changes (#408)

* partially fixing #375

* adding __len__ to df

* passing pycylon.Table to df operations

* fixing buffer copy bug

* logging status errors in macros

* bug fix

* test case for chunked tables

* cosmetic changes

* fixed front page icons order

* Fixing 375 (#410)

* partially fixing #375

* adding __len__ to df

* passing pycylon.Table to df operations

* fixing buffer copy bug

* logging status errors in macros

* bug fix

* test case for chunked tables

* cosmetic changes

* fix #375

* fix #375

* Bug fixes after 0.4.0 release  (#421)

* Update README.md

(cherry picked from commit 5609535)

* adding missing test case

(cherry picked from commit 016c5c9)

* Update README.md

(cherry picked from commit 149919c)

* adding finalize check and removing destructor finalize call. (#412)

* adding finalize check and removing destructor finalize call.

* adding env.finalize for dataframe examples

* finalize checks.

(cherry picked from commit 0200c02)

* Update c-cpp.yml

adding github CI again

(cherry picked from commit e0ba964)

* fixing #415 and #416

* changes to concat operation

* unwrapping sort options onto sort method

* Enabling scalars in df set_item (#425)

* fixing minor bug in select

* adding select tests

* accommodating comments

* minor improvements to #189

* enabling scalars in df set_item

* fixing boost error

(cherry picked from commit 10f5a6a)

Co-authored-by: Supun Kamburugamuve <[email protected]>
Co-authored-by: Vibhatha Lakmal Abeykoon <[email protected]>

Co-authored-by: niranda perera <[email protected]>
Co-authored-by: Supun Kamburugamuve <[email protected]>
Co-authored-by: Vibhatha Lakmal Abeykoon <[email protected]>

* missing docs (#428)

* changes to docs

* Update cylon_release_0.4.0.md

* Update README.md

* changes to release note

* adding release notes to docs (#399)

* adding csv reader

* reorderd code snippets

* generated new frame.html

* adding docs changes

* adding group by

* updating docs

* changed front page code

* added missing "

* Adding group by (#405)

* reorderd code snippets

* generated new frame.html

* adding docs changes

* adding group by

* updating docs

* changed front page code

* added missing "

* partially fixing #375 (#406)

* partially fixing #375

* adding __len__ to df

* Bug fixes and cosmetic changes (#408)

* partially fixing #375

* adding __len__ to df

* passing pycylon.Table to df operations

* fixing buffer copy bug

* logging status errors in macros

* bug fix

* test case for chunked tables

* cosmetic changes

* fixed front page icons order

* Fixing 375 (#410)

* partially fixing #375

* adding __len__ to df

* passing pycylon.Table to df operations

* fixing buffer copy bug

* logging status errors in macros

* bug fix

* test case for chunked tables

* cosmetic changes

* fix #375

* fix #375

* Bug fixes after 0.4.0 release  (#421)

* Update README.md

(cherry picked from commit 5609535)

* adding missing test case

(cherry picked from commit 016c5c9)

* Update README.md

(cherry picked from commit 149919c)

* adding finalize check and removing destructor finalize call. (#412)

* adding finalize check and removing destructor finalize call.

* adding env.finalize for dataframe examples

* finalize checks.

(cherry picked from commit 0200c02)

* Update c-cpp.yml

adding github CI again

(cherry picked from commit e0ba964)

* fixing #415 and #416

* changes to concat operation

* unwrapping sort options onto sort method

* Enabling scalars in df set_item (#425)

* fixing minor bug in select

* adding select tests

* accommodating comments

* minor improvements to #189

* enabling scalars in df set_item

* fixing boost error

(cherry picked from commit 10f5a6a)

Co-authored-by: Supun Kamburugamuve <[email protected]>
Co-authored-by: Vibhatha Lakmal Abeykoon <[email protected]>

* adding missing release docs

* file name mistamatch fixed

Co-authored-by: niranda perera <[email protected]>
Co-authored-by: Supun Kamburugamuve <[email protected]>
Co-authored-by: Vibhatha Lakmal Abeykoon <[email protected]>

* changing ctx to const reference in methods (#419)

* changing ctx to const reference in methods

* minor changes

* adding const ctx ref for parquet ops

* Updating to arrow 4.0 (#418)

* fixing minor bug in select

* adding select tests

* accommodating comments

* minor improvements to #189

* arrow update

* minor change

* removing boost url from build.sh

* adding new operators to DataFrame API (#429)

* fixing minor bug in select

* adding select tests

* accommodating comments

* minor improvements to #189

* adding isin op for dataframes

* adding applymap operator

* minor changes

* Bump dns-packet from 1.3.1 to 1.3.4 in /docs (#435)

Bumps [dns-packet](https://github.com/mafintosh/dns-packet) from 1.3.1 to 1.3.4.
- [Release notes](https://github.com/mafintosh/dns-packet/releases)
- [Changelog](https://github.com/mafintosh/dns-packet/blob/master/CHANGELOG.md)
- [Commits](mafintosh/dns-packet@v1.3.1...v1.3.4)

Signed-off-by: dependabot[bot] <[email protected]>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

Co-authored-by: Supun Kamburugamuve <[email protected]>
Co-authored-by: niranda perera <[email protected]>
Co-authored-by: Vibhatha Lakmal Abeykoon <[email protected]>
Co-authored-by: Chathura Widanage <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Enabling scalars in df set_item (#425)

* fixing minor bug in select

* adding select tests

* accommodating comments

* minor improvements to #189

* enabling scalars in df set_item

* fixing boost error

* Updating to arrow 4.0 (#418)

* fixing minor bug in select

* adding select tests

* accommodating comments

* minor improvements to #189

* arrow update

* minor change

* removing boost url from build.sh

* Bump ws from 6.2.1 to 6.2.2 in /docs (#437)

Bumps [ws](https://github.com/websockets/ws) from 6.2.1 to 6.2.2.
- [Release notes](https://github.com/websockets/ws/releases)
- [Commits](https://github.com/websockets/ws/commits)

---
updated-dependencies:
- dependency-name: ws
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <[email protected]>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Remove commented like without -fsanitize=address for debug

* Add Status returns, UCX to travis, and const pending data size

* Change github actions to include UCX

Co-authored-by: Supun Kamburugamuve <[email protected]>
Co-authored-by: niranda perera <[email protected]>
Co-authored-by: Vibhatha Lakmal Abeykoon <[email protected]>
Co-authored-by: Chathura Widanage <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants