Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2021/06/29 05:59:25 UTC

[GitHub] [arrow-datafusion] Dandandan opened a new issue #640: Optimize min/max queries based on table statistics

Dandandan opened a new issue #640:
URL: https://github.com/apache/arrow-datafusion/issues/640


   **Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
   In #620 a feature has been added to optimize queries like `select count(*) from table`.
   We can do the same with `min` / `max` queries (`select min(col) from table`).
   
   **Describe the solution you'd like**
   Look up the min / max statistics for the column by name from the table provider, and use them to answer the query instead of scanning the data.
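
A minimal sketch of the idea, assuming the table provider can hand back per-column statistics. The types `ColumnStats`, `TableStats`, `Agg` and the function `answer_from_stats` are hypothetical and are not DataFusion's actual API. The `is_exact` flag is also an assumption: the shortcut is only valid when the statistics are known to be exact and the query has no filter or grouping.

```rust
/// Hypothetical per-column statistics (not DataFusion's real types).
#[derive(Debug, Clone, PartialEq)]
struct ColumnStats {
    min: Option<i64>,
    max: Option<i64>,
}

/// Hypothetical table-level statistics as a provider might expose them.
#[derive(Debug, Clone, PartialEq)]
struct TableStats {
    /// (column name, statistics) pairs.
    columns: Vec<(String, ColumnStats)>,
    /// Assumption: true only when the statistics are exact, not estimates.
    is_exact: bool,
}

/// The two aggregates this sketch knows how to answer.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Agg {
    Min,
    Max,
}

/// Answers `select min(col) from table` / `select max(col) from table`
/// from statistics alone. Returns `None` when the statistics are inexact,
/// the column is unknown, or the value is missing, in which case the
/// caller would fall back to a normal scan.
fn answer_from_stats(stats: &TableStats, agg: Agg, column: &str) -> Option<i64> {
    if !stats.is_exact {
        return None;
    }
    let (_, col) = stats
        .columns
        .iter()
        .find(|(name, _)| name.as_str() == column)?;
    match agg {
        Agg::Min => col.min,
        Agg::Max => col.max,
    }
}
```

With exact statistics for a column `a` of min 1 and max 9, `answer_from_stats(&stats, Agg::Max, "a")` yields `Some(9)`; any `None` result means the optimization does not apply and the query runs as usual.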
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [arrow-datafusion] alamb closed issue #640: Optimize min/max queries based on table statistics

Posted by GitBox <gi...@apache.org>.
alamb closed issue #640:
URL: https://github.com/apache/arrow-datafusion/issues/640


   





[GitHub] [arrow-datafusion] Dandandan commented on issue #640: Optimize min/max queries based on table statistics

Posted by GitBox <gi...@apache.org>.
Dandandan commented on issue #640:
URL: https://github.com/apache/arrow-datafusion/issues/640#issuecomment-870903170


   For it to work on parquet and in-memory data, a bit of work also has to be done there to summarize / expose those statistics.
   
   https://github.com/apache/arrow-datafusion/issues/537
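
One way to picture the "summarize" step: fold the min / max of each partition (or Parquet row group) into a single table-level pair. This is only a sketch under stated assumptions; `PartStats` and `summarize` are hypothetical names, not part of DataFusion or the parquet crate, and the column is assumed to be an `i64`.

```rust
/// Hypothetical min/max statistics for one partition or row group.
#[derive(Debug, Clone, Copy, PartialEq)]
struct PartStats {
    min: Option<i64>,
    max: Option<i64>,
}

/// Combines per-partition statistics into a table-level (min, max).
/// Returns `None` if there are no partitions or if any partition is
/// missing a statistic, since the combined value would then be unreliable.
fn summarize(parts: &[PartStats]) -> Option<(i64, i64)> {
    let mut acc: Option<(i64, i64)> = None;
    for p in parts {
        // Bail out as soon as one partition has no statistics.
        let (lo, hi) = (p.min?, p.max?);
        acc = Some(match acc {
            None => (lo, hi),
            Some((a, b)) => (a.min(lo), b.max(hi)),
        });
    }
    acc
}
```

Refusing to summarize when any partition lacks statistics is a deliberately conservative choice here: a partial summary could silently return a wrong `min` or `max`.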








[GitHub] [arrow-datafusion] alamb commented on issue #640: Optimize min/max queries based on table statistics

Posted by GitBox <gi...@apache.org>.
alamb commented on issue #640:
URL: https://github.com/apache/arrow-datafusion/issues/640#issuecomment-870813510


   Following the example in https://github.com/apache/arrow-datafusion/pull/620 probably makes this issue fairly approachable for anyone who is interested.





[GitHub] [arrow-datafusion] b41sh commented on issue #640: Optimize min/max queries based on table statistics

Posted by GitBox <gi...@apache.org>.
b41sh commented on issue #640:
URL: https://github.com/apache/arrow-datafusion/issues/640#issuecomment-871269286


   I'd like to do this.





[GitHub] [arrow-datafusion] alamb commented on issue #640: Optimize min/max queries based on table statistics

Posted by GitBox <gi...@apache.org>.
alamb commented on issue #640:
URL: https://github.com/apache/arrow-datafusion/issues/640#issuecomment-871711405


   Assigning to @b41sh per request. Thank you!

