You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2022/06/03 22:46:29 UTC

[GitHub] [beam] kennknowles opened a new issue, #19122: Support "partition by XXX order by XXX" SQL

kennknowles opened a new issue, #19122:
URL: https://github.com/apache/beam/issues/19122

   Based on our expereince, looks like for bounded stream, beam SQL does not able to support statement like "partition by XXX order by XXX". It will not be able to parition data to different nodes and sorting data in each partition/node parallelly.
   
   We have to use Java SDK and extension to convert following SQL statement to GroupByKey **** SortValues to achieve this. 
   
    
   
   Does we miss anything? If not, is this something that we can improve? and took a quick look at calcite, seems that it can explain the query plan for "partition by... order by..." fine.
   
    
   
   Imported from Jira [BEAM-6105](https://issues.apache.org/jira/browse/BEAM-6105). Original Jira may contain additional context.
   Reported by: BrandonJ.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org