Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2021/04/17 11:29:31 UTC
[GitHub] [arrow] Dandandan opened a new pull request #10077: [Rust][DataFusion] Reproduce topkexec failure
Dandandan opened a new pull request #10077:
URL: https://github.com/apache/arrow/pull/10077
This reproduces the test failure for the `TopKExec` test that @andygrove was seeing on a 24-core machine.
Also FYI @alamb .
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] github-actions[bot] commented on pull request #10077: [Rust][DataFusion] Reproduce topkexec failure
Posted by GitBox <gi...@apache.org>.
github-actions[bot] commented on pull request #10077:
URL: https://github.com/apache/arrow/pull/10077#issuecomment-821808634
Thanks for opening a pull request!
If this is not a [minor PR](https://github.com/apache/arrow/blob/master/CONTRIBUTING.md#Minor-Fixes), could you open an issue for this pull request on JIRA? https://issues.apache.org/jira/browse/ARROW
Opening JIRAs ahead of time contributes to the [Openness](http://theapacheway.com/open/#:~:text=Openness%20allows%20new%20users%20the,must%20happen%20in%20the%20open.) of the Apache Arrow project.
Then could you also rename the pull request title in the following format?
ARROW-${JIRA_ID}: [${COMPONENT}] ${SUMMARY}
or
MINOR: [${COMPONENT}] ${SUMMARY}
See also:
* [Other pull requests](https://github.com/apache/arrow/pulls/)
* [Contribution Guidelines - How to contribute patches](https://arrow.apache.org/docs/developers/contributing.html#how-to-contribute-patches)
[GitHub] [arrow] andygrove closed pull request #10077: ARROW-12421: [Rust] [DataFusion] Fix topkexec failure
Posted by GitBox <gi...@apache.org>.
andygrove closed pull request #10077:
URL: https://github.com/apache/arrow/pull/10077
[GitHub] [arrow] Dandandan commented on pull request #10077: ARROW-12421: [Rust] [DataFusion] Fix topkexec failure
Posted by GitBox <gi...@apache.org>.
Dandandan commented on pull request #10077:
URL: https://github.com/apache/arrow/pull/10077#issuecomment-821971600
Thanks @alamb @andygrove .
It would still be valuable to see if we can improve the example, or even add it to DataFusion once we have a good implementation (without the extension stuff of course)!
[GitHub] [arrow] andygrove commented on pull request #10077: [Rust][DataFusion] Reproduce topkexec failure
Posted by GitBox <gi...@apache.org>.
andygrove commented on pull request #10077:
URL: https://github.com/apache/arrow/pull/10077#issuecomment-821849479
Thanks @Dandandan :rocket:
So if the issue was just in the test, I should change my vote on the release to a +1?
[GitHub] [arrow] Dandandan commented on pull request #10077: [Rust][DataFusion] Reproduce topkexec failure
Posted by GitBox <gi...@apache.org>.
Dandandan commented on pull request #10077:
URL: https://github.com/apache/arrow/pull/10077#issuecomment-821851791
@andygrove as far as I can see, the issue was in the test and the faulty TopKExec implementation only, not really in repartition or the optimization of the repartition node.
[GitHub] [arrow] codecov-commenter commented on pull request #10077: [Rust][DataFusion] Reproduce topkexec failure
Posted by GitBox <gi...@apache.org>.
codecov-commenter commented on pull request #10077:
URL: https://github.com/apache/arrow/pull/10077#issuecomment-821849781
# [Codecov](https://codecov.io/gh/apache/arrow/pull/10077?src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=The+Apache+Software+Foundation) Report
> Merging [#10077](https://codecov.io/gh/apache/arrow/pull/10077?src=pr&el=desc&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=The+Apache+Software+Foundation) (7d058ea) into [master](https://codecov.io/gh/apache/arrow/commit/9c1e5bd19347635ea9f373bcf93f2cea0231d50a?el=desc&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=The+Apache+Software+Foundation) (9c1e5bd) will **increase** coverage by `0.01%`.
> The diff coverage is `83.33%`.
```diff
@@            Coverage Diff             @@
##           master   #10077      +/-   ##
==========================================
+ Coverage   78.90%   78.92%   +0.01%
==========================================
  Files         286      286
  Lines       64717    64713       -4
==========================================
+ Hits        51068    51075       +7
+ Misses      13649    13638      -11
```
| [Impacted Files](https://codecov.io/gh/apache/arrow/pull/10077?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=The+Apache+Software+Foundation) | Coverage Δ | |
|---|---|---|
| [rust/datafusion/src/execution/context.rs](https://codecov.io/gh/apache/arrow/pull/10077/diff?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=The+Apache+Software+Foundation#diff-cnVzdC9kYXRhZnVzaW9uL3NyYy9leGVjdXRpb24vY29udGV4dC5ycw==) | `92.60% <ø> (ø)` | |
| [rust/datafusion/tests/user\_defined\_plan.rs](https://codecov.io/gh/apache/arrow/pull/10077/diff?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=The+Apache+Software+Foundation#diff-cnVzdC9kYXRhZnVzaW9uL3Rlc3RzL3VzZXJfZGVmaW5lZF9wbGFuLnJz) | `87.74% <83.33%> (+0.32%)` | :arrow_up: |
| [rust/datafusion/src/scalar.rs](https://codecov.io/gh/apache/arrow/pull/10077/diff?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=The+Apache+Software+Foundation#diff-cnVzdC9kYXRhZnVzaW9uL3NyYy9zY2FsYXIucnM=) | `50.12% <0.00%> (+0.73%)` | :arrow_up: |
| [rust/arrow/src/array/array.rs](https://codecov.io/gh/apache/arrow/pull/10077/diff?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=The+Apache+Software+Foundation#diff-cnVzdC9hcnJvdy9zcmMvYXJyYXkvYXJyYXkucnM=) | `77.60% <0.00%> (+0.79%)` | :arrow_up: |
| [...t/datafusion/src/physical\_plan/coalesce\_batches.rs](https://codecov.io/gh/apache/arrow/pull/10077/diff?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=The+Apache+Software+Foundation#diff-cnVzdC9kYXRhZnVzaW9uL3NyYy9waHlzaWNhbF9wbGFuL2NvYWxlc2NlX2JhdGNoZXMucnM=) | `84.95% <0.00%> (+0.88%)` | :arrow_up: |
| [rust/arrow/src/array/equal/utils.rs](https://codecov.io/gh/apache/arrow/pull/10077/diff?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=The+Apache+Software+Foundation#diff-cnVzdC9hcnJvdy9zcmMvYXJyYXkvZXF1YWwvdXRpbHMucnM=) | `76.00% <0.00%> (+1.00%)` | :arrow_up: |
| [...t/datafusion/src/physical\_optimizer/repartition.rs](https://codecov.io/gh/apache/arrow/pull/10077/diff?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=The+Apache+Software+Foundation#diff-cnVzdC9kYXRhZnVzaW9uL3NyYy9waHlzaWNhbF9vcHRpbWl6ZXIvcmVwYXJ0aXRpb24ucnM=) | `96.72% <0.00%> (+4.91%)` | :arrow_up: |
------
[Continue to review full report at Codecov](https://codecov.io/gh/apache/arrow/pull/10077?src=pr&el=continue&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=The+Apache+Software+Foundation).
> **Legend** - [Click here to learn more](https://docs.codecov.io/docs/codecov-delta?utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=The+Apache+Software+Foundation)
> `Δ = absolute <relative> (impact)`, `ø = not affected`, `? = missing data`
> Powered by [Codecov](https://codecov.io/gh/apache/arrow/pull/10077?src=pr&el=footer&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=The+Apache+Software+Foundation). Last update [9c1e5bd...7d058ea](https://codecov.io/gh/apache/arrow/pull/10077?src=pr&el=lastupdated&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=The+Apache+Software+Foundation). Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments?utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=The+Apache+Software+Foundation).
[GitHub] [arrow] Dandandan commented on pull request #10077: [Rust][DataFusion] Reproduce topkexec failure
Posted by GitBox <gi...@apache.org>.
Dandandan commented on pull request #10077:
URL: https://github.com/apache/arrow/pull/10077#issuecomment-821854443
I probably won't get to that today. Thanks, that would be great.
[GitHub] [arrow] andygrove commented on pull request #10077: [Rust][DataFusion] Reproduce topkexec failure
Posted by GitBox <gi...@apache.org>.
andygrove commented on pull request #10077:
URL: https://github.com/apache/arrow/pull/10077#issuecomment-821852991
ok, we probably need to get the flatbuffer dependency change in anyway, so
we should re-introduce the repartition optimization. Do you want to create
a PR for that? If not, I can get to it later today.
[GitHub] [arrow] alamb commented on pull request #10077: ARROW-12421: [Rust] [DataFusion] Fix topkexec failure
Posted by GitBox <gi...@apache.org>.
alamb commented on pull request #10077:
URL: https://github.com/apache/arrow/pull/10077#issuecomment-821970804
Thanks @Dandandan
> @andygrove as far as I can see, the issue was in the test and the faulty TopKExec implementation only, not really in repartition or the optimization of the repartition node.
For the record, `TopKExec` is an example (not part of the DataFusion codebase).
[GitHub] [arrow] Dandandan commented on pull request #10077: [Rust][DataFusion] Reproduce topkexec failure
Posted by GitBox <gi...@apache.org>.
Dandandan commented on pull request #10077:
URL: https://github.com/apache/arrow/pull/10077#issuecomment-821847190
@alamb @andygrove I pushed a version with a fix (I believe) for the `TopKExec` example. I'm not sure whether the original version with `try_fold` could also be fixed to work, but this one works at least. It now calls `.clone()` on the map, which is not a big deal for small `k`, but it would probably be better to avoid that.
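To illustrate the trade-off, here is a minimal sketch of the general pattern, not the code in this PR. It assumes the map is an ordered map from the ranking value to a row identifier, and that each input batch is folded into that running state, with a snapshot cloned out per batch. All names and types (`TopValues`, `add_row`, `accumulate_batch`, `top_k`, plain tuples instead of Arrow record batches) are hypothetical.

```rust
use std::collections::BTreeMap;

/// Running top-k state: revenue -> customer id, ordered by revenue.
/// Hypothetical type for illustration; rows with equal revenue
/// overwrite each other in this sketch.
type TopValues = BTreeMap<i64, String>;

/// Insert one row, then evict the smallest key if the map grew past `k`.
fn add_row(top: &mut TopValues, customer_id: &str, revenue: i64, k: usize) {
    top.insert(revenue, customer_id.to_string());
    if top.len() > k {
        // The first key of a BTreeMap is the smallest one.
        if let Some(lowest) = top.keys().next().copied() {
            top.remove(&lowest);
        }
    }
}

/// Fold one batch into the state and return a snapshot of it.
/// The `clone` here copies at most `k` entries per batch.
fn accumulate_batch(top: &mut TopValues, batch: &[(&str, i64)], k: usize) -> TopValues {
    for &(customer_id, revenue) in batch {
        add_row(top, customer_id, revenue, k);
    }
    top.clone()
}

/// Run over all batches and keep only the snapshot from the last one,
/// returned with the largest revenue first.
fn top_k(batches: &[Vec<(&str, i64)>], k: usize) -> Vec<(String, i64)> {
    let mut state = TopValues::new();
    let mut last = TopValues::new();
    for batch in batches {
        last = accumulate_batch(&mut state, batch, k);
    }
    last.into_iter().rev().map(|(rev, id)| (id, rev)).collect()
}
```

Since the state never holds more than `k` entries after a row is processed, each clone is cheap for small `k`. Avoiding it would mean emitting the state only once, after the final batch.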
[GitHub] [arrow] github-actions[bot] commented on pull request #10077: ARROW-12421: [Rust] [DataFusion] Fix topkexec failure
Posted by GitBox <gi...@apache.org>.
github-actions[bot] commented on pull request #10077:
URL: https://github.com/apache/arrow/pull/10077#issuecomment-821897556
https://issues.apache.org/jira/browse/ARROW-12421