Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2022/11/28 20:58:56 UTC

[GitHub] [beam] Abacn commented on issue #24365: [Bug]: No parallelism using WriteToParquet in Apache Spark

Abacn commented on issue #24365:
URL: https://github.com/apache/beam/issues/24365#issuecomment-1329753494

   Thanks for reporting and triaging the issue. Adding a `None` key and then applying GroupByKey is surprising and makes no sense today; how GBK works has probably changed since that code was written. We should be able to replace the change made in #958 with a `Reshuffle()`. Would you mind testing whether that resolves your issue? A PR would be appreciated.
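
   As a rough illustration of why a single `None` key can remove parallelism, here is a toy model in plain Python. It is not Beam code and not the code from #958. It assumes that the values grouped under one key are handled as a single unit of work, so one key means one worker. The helper `group_by_key`, the `records` list, and the fixed random seed are all hypothetical names chosen for this sketch.

```python
import random
from collections import defaultdict


def group_by_key(pairs):
    """Collect values per key, loosely mimicking a group-by-key step."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(groups)


# Hypothetical input: 100 records to be written.
records = list(range(100))

# Pattern described in the comment: every element gets the same None key,
# so all records end up in one group.
single_key = group_by_key((None, r) for r in records)

# Assumed alternative for comparison: a varied key per element, so records
# spread over many groups that could be processed independently.
rng = random.Random(0)  # fixed seed only to make the sketch repeatable
random_keys = group_by_key((rng.getrandbits(32), r) for r in records)

print("groups with None key:", len(single_key))
print("groups with random keys:", len(random_keys))
```

   In a real pipeline the suggestion above is to use Beam's `Reshuffle()` transform rather than hand-rolled keys; whether it restores parallelism on the Spark runner still needs to be tested, as requested.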

