You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2021/12/10 20:55:57 UTC

[GitHub] [arrow-datafusion] Dandandan commented on issue #1433: Query failing to return any results when filter is an equality check on strings

Dandandan commented on issue #1433:
URL: https://github.com/apache/arrow-datafusion/issues/1433#issuecomment-991287408


   This seems to be something in the pruning logic:
   
   ```
   ❯ explain analyze SELECT "adt" FROM t  WHERE "direction" = 'Two Way';
   +-------------------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
   | plan_type         | plan                                                                                                                                                                                                                       |
   +-------------------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
   | Plan with Metrics | CoalescePartitionsExec, metrics=[output_rows=0, elapsed_compute=32.249µs]                                                                                                                                                  |
   |                   |   ProjectionExec: expr=[adt@1 as adt], metrics=[output_rows=0, elapsed_compute=NOT RECORDED]                                                                                                                               |
   |                   |     CoalesceBatchesExec: target_batch_size=4096, metrics=[output_rows=0, elapsed_compute=26.051µs]                                                                                                                         |
   |                   |       FilterExec: direction@0 = Two Way, metrics=[output_rows=0, elapsed_compute=NOT RECORDED]                                                                                                                             |
   |                   |         RepartitionExec: partitioning=RoundRobinBatch(16), metrics=[repart_time{inputPartition=0}=NOT RECORDED, send_time{inputPartition=0}=NOT RECORDED, fetch_time{inputPartition=0}=509.572µs]                          |
   |                   |           ParquetExec: batch_size=8192, limit=None, partitions=[test.parquet], metrics=[row_groups_pruned{filename=test.parquet}=1, num_predicate_creation_errors=0, predicate_evaluation_errors{filename=test.parquet}=0] |
   |                   |                                                                                                                                                                                                                            |
   +-------------------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
   1 row in set. Query took 0.002 seconds.
   ```
   
   Already in `ParquetExec` there is no output (RepartitionExec doesn't hae to repartition any input row).


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org