You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2020/12/31 10:16:49 UTC
[GitHub] [arrow] Dandandan opened a new pull request #9057: ARROW-11086: [Rust] Extend take implementation to more index types
Dandandan opened a new pull request #9057:
URL: https://github.com/apache/arrow/pull/9057
## Context
The context of this PR is that I want to experiment with a different, simple implementation of the hash join implementation which directly can index the build-side array instead of keeping a list of batches. This array could grow beyond 2 ^ 32 billion elements, so would need indexes of type `UInt64` rather than `UInt32`.
## Implementation
In the PR I just extend the public `take` to take any `IndexType` which implements `ArrowNumericType` and `ToPrimitive`.
I am not sure about the consideration before to restrict `take` to only `UInt32Array`.
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] alamb commented on pull request #9057: ARROW-11086: [Rust] Extend take implementation to more index types
Posted by GitBox <gi...@apache.org>.
alamb commented on pull request #9057:
URL: https://github.com/apache/arrow/pull/9057#issuecomment-753301391
Ah, I hadn't yet seen https://github.com/apache/arrow/pull/9061 which appears to fix the clippy errors -- thanks @Dandandan !
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] Dandandan commented on pull request #9057: ARROW-11086: [Rust] Extend take implementation to more index types
Posted by GitBox <gi...@apache.org>.
Dandandan commented on pull request #9057:
URL: https://github.com/apache/arrow/pull/9057#issuecomment-752990305
Rebased
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] alamb commented on pull request #9057: ARROW-11086: [Rust] Extend take implementation to more index types
Posted by GitBox <gi...@apache.org>.
alamb commented on pull request #9057:
URL: https://github.com/apache/arrow/pull/9057#issuecomment-752955953
The full set of Rust CI tests did not run on this PR :(
Can you please rebase this PR against [apache/master](https://github.com/apache/arrow) to pick up the changes in https://github.com/apache/arrow/pull/9056 so that they do?
I apologize for the inconvenience.
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] alamb commented on pull request #9057: ARROW-11086: [Rust] Extend take implementation to more index types
Posted by GitBox <gi...@apache.org>.
alamb commented on pull request #9057:
URL: https://github.com/apache/arrow/pull/9057#issuecomment-753300901
The clippy failures seem unrelated to your change -- let me check that out...
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] codecov-io edited a comment on pull request #9057: ARROW-11086: [Rust] Extend take implementation to more index types
Posted by GitBox <gi...@apache.org>.
codecov-io edited a comment on pull request #9057:
URL: https://github.com/apache/arrow/pull/9057#issuecomment-752993039
# [Codecov](https://codecov.io/gh/apache/arrow/pull/9057?src=pr&el=h1) Report
> Merging [#9057](https://codecov.io/gh/apache/arrow/pull/9057?src=pr&el=desc) (571bd65) into [master](https://codecov.io/gh/apache/arrow/commit/4b7cdcb9220b6d94b251aef32c21ef9b4097ecfa?el=desc) (4b7cdcb) will **not change** coverage.
> The diff coverage is `100.00%`.
[![Impacted file tree graph](https://codecov.io/gh/apache/arrow/pull/9057/graphs/tree.svg?width=650&height=150&src=pr&token=LpTCFbqVT1)](https://codecov.io/gh/apache/arrow/pull/9057?src=pr&el=tree)
```diff
@@ Coverage Diff @@
## master #9057 +/- ##
=======================================
Coverage 82.61% 82.61%
=======================================
Files 203 204 +1
Lines 50140 50140
=======================================
Hits 41422 41422
Misses 8718 8718
```
| [Impacted Files](https://codecov.io/gh/apache/arrow/pull/9057?src=pr&el=tree) | Coverage Δ | |
|---|---|---|
| [rust/arrow/src/compute/kernels/take.rs](https://codecov.io/gh/apache/arrow/pull/9057/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvY29tcHV0ZS9rZXJuZWxzL3Rha2UucnM=) | `95.21% <100.00%> (ø)` | |
| [rust/datafusion/src/physical\_plan/hash\_join.rs](https://codecov.io/gh/apache/arrow/pull/9057/diff?src=pr&el=tree#diff-cnVzdC9kYXRhZnVzaW9uL3NyYy9waHlzaWNhbF9wbGFuL2hhc2hfam9pbi5ycw==) | `89.53% <100.00%> (ø)` | |
| [rust/arrow/src/csv/writer.rs](https://codecov.io/gh/apache/arrow/pull/9057/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvY3N2L3dyaXRlci5ycw==) | `78.82% <0.00%> (-0.56%)` | :arrow_down: |
| [rust/arrow/src/util/serialization.rs](https://codecov.io/gh/apache/arrow/pull/9057/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvdXRpbC9zZXJpYWxpemF0aW9uLnJz) | `100.00% <0.00%> (ø)` | |
------
[Continue to review full report at Codecov](https://codecov.io/gh/apache/arrow/pull/9057?src=pr&el=continue).
> **Legend** - [Click here to learn more](https://docs.codecov.io/docs/codecov-delta)
> `Δ = absolute <relative> (impact)`, `ø = not affected`, `? = missing data`
> Powered by [Codecov](https://codecov.io/gh/apache/arrow/pull/9057?src=pr&el=footer). Last update [4b7cdcb...571bd65](https://codecov.io/gh/apache/arrow/pull/9057?src=pr&el=lastupdated). Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments).
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] Dandandan commented on pull request #9057: ARROW-11086: [Rust] Extend take implementation to more index types
Posted by GitBox <gi...@apache.org>.
Dandandan commented on pull request #9057:
URL: https://github.com/apache/arrow/pull/9057#issuecomment-753521743
@jorgecarleitao restarted the CI 😄
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] jorgecarleitao commented on pull request #9057: ARROW-11086: [Rust] Extend take implementation to more index types
Posted by GitBox <gi...@apache.org>.
jorgecarleitao commented on pull request #9057:
URL: https://github.com/apache/arrow/pull/9057#issuecomment-753517104
Clippy missing xD
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] Dandandan commented on pull request #9057: ARROW-11086: [Rust] Extend take implementation to more index types
Posted by GitBox <gi...@apache.org>.
Dandandan commented on pull request #9057:
URL: https://github.com/apache/arrow/pull/9057#issuecomment-753493329
FYI @jorgecarleitao @andygrove this (small) PR is needed to finish https://github.com/apache/arrow/pull/9070
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] github-actions[bot] commented on pull request #9057: ARROW-11086: [Rust] Extend take implementation to more index types
Posted by GitBox <gi...@apache.org>.
github-actions[bot] commented on pull request #9057:
URL: https://github.com/apache/arrow/pull/9057#issuecomment-752921933
https://issues.apache.org/jira/browse/ARROW-11086
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] codecov-io commented on pull request #9057: ARROW-11086: [Rust] Extend take implementation to more index types
Posted by GitBox <gi...@apache.org>.
codecov-io commented on pull request #9057:
URL: https://github.com/apache/arrow/pull/9057#issuecomment-752993039
# [Codecov](https://codecov.io/gh/apache/arrow/pull/9057?src=pr&el=h1) Report
> Merging [#9057](https://codecov.io/gh/apache/arrow/pull/9057?src=pr&el=desc) (c0af0d9) into [master](https://codecov.io/gh/apache/arrow/commit/dd5fe7095bb662236e27d3343eb82bc4375f93ef?el=desc) (dd5fe70) will **increase** coverage by `0.00%`.
> The diff coverage is `100.00%`.
[![Impacted file tree graph](https://codecov.io/gh/apache/arrow/pull/9057/graphs/tree.svg?width=650&height=150&src=pr&token=LpTCFbqVT1)](https://codecov.io/gh/apache/arrow/pull/9057?src=pr&el=tree)
```diff
@@ Coverage Diff @@
## master #9057 +/- ##
=======================================
Coverage 82.61% 82.61%
=======================================
Files 202 202
Lines 50052 50052
=======================================
+ Hits 41350 41351 +1
+ Misses 8702 8701 -1
```
| [Impacted Files](https://codecov.io/gh/apache/arrow/pull/9057?src=pr&el=tree) | Coverage Δ | |
|---|---|---|
| [rust/arrow/src/compute/kernels/take.rs](https://codecov.io/gh/apache/arrow/pull/9057/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvY29tcHV0ZS9rZXJuZWxzL3Rha2UucnM=) | `95.21% <100.00%> (ø)` | |
| [rust/datafusion/src/physical\_plan/hash\_join.rs](https://codecov.io/gh/apache/arrow/pull/9057/diff?src=pr&el=tree#diff-cnVzdC9kYXRhZnVzaW9uL3NyYy9waHlzaWNhbF9wbGFuL2hhc2hfam9pbi5ycw==) | `89.94% <100.00%> (ø)` | |
| [rust/parquet/src/encodings/encoding.rs](https://codecov.io/gh/apache/arrow/pull/9057/diff?src=pr&el=tree#diff-cnVzdC9wYXJxdWV0L3NyYy9lbmNvZGluZ3MvZW5jb2RpbmcucnM=) | `95.43% <0.00%> (+0.19%)` | :arrow_up: |
------
[Continue to review full report at Codecov](https://codecov.io/gh/apache/arrow/pull/9057?src=pr&el=continue).
> **Legend** - [Click here to learn more](https://docs.codecov.io/docs/codecov-delta)
> `Δ = absolute <relative> (impact)`, `ø = not affected`, `? = missing data`
> Powered by [Codecov](https://codecov.io/gh/apache/arrow/pull/9057?src=pr&el=footer). Last update [dd5fe70...c0af0d9](https://codecov.io/gh/apache/arrow/pull/9057?src=pr&el=lastupdated). Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments).
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] jorgecarleitao closed pull request #9057: ARROW-11086: [Rust] Extend take implementation to more index types
Posted by GitBox <gi...@apache.org>.
jorgecarleitao closed pull request #9057:
URL: https://github.com/apache/arrow/pull/9057
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] alamb edited a comment on pull request #9057: ARROW-11086: [Rust] Extend take implementation to more index types
Posted by GitBox <gi...@apache.org>.
alamb edited a comment on pull request #9057:
URL: https://github.com/apache/arrow/pull/9057#issuecomment-753300901
The clippy failures in https://github.com/apache/arrow/pull/9057/checks?check_run_id=1630788725 seem unrelated to your change -- let me check that out...
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org