Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/04/13 03:15:01 UTC

[GitHub] [arrow-datafusion] Ted-Jiang opened a new issue, #2221: Aggregate func Approx_median not work with Parquet format

Ted-Jiang opened a new issue, #2221:
URL: https://github.com/apache/arrow-datafusion/issues/2221

   **Describe the bug**
   ```
       Finished dev [unoptimized + debuginfo] target(s) in 0.58s
        Running `target/debug/datafusion-cli`
   DataFusion CLI v7.0.0
   ❯ create external table test STORED AS PARQUET LOCATION '/Users/yangjiang/CLionProjects/github/arrow-datafusion/parquet-testing/data/alltypes_plain.parquet';
   0 rows in set. Query took 0.004 seconds.
   ❯ select approx_median(tinyint_col) from test;
   thread 'main' panicked at 'index out of bounds: the len is 0 but the index is 0', /Users/yangjiang/.cargo/registry/src/github.com-1ecc6299db9ec823/arrow-11.1.0/src/datatypes/schema.rs:193:10
   note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
   ```
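
The panic message above ("the len is 0 but the index is 0") is what Rust prints when a zero-length collection is indexed at position 0. A minimal sketch of that failure mode follows, assuming the schema involved ended up with no fields. `ToySchema`, `field`, and `try_field` are hypothetical stand-ins for illustration, not arrow's actual types or API.

```rust
/// Hypothetical stand-in for a schema: just a list of field names.
struct ToySchema {
    fields: Vec<String>,
}

impl ToySchema {
    /// Plain indexing: panics with "index out of bounds" when `i` is out of range.
    fn field(&self, i: usize) -> &String {
        &self.fields[i]
    }

    /// Checked lookup: returns `None` instead of panicking.
    fn try_field(&self, i: usize) -> Option<&String> {
        self.fields.get(i)
    }
}
```

Calling `field(0)` on a `ToySchema` with an empty `fields` vector panics in the same way as the log above, while `try_field(0)` returns `None` and lets the caller report an error instead.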
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow-datafusion] jychen7 commented on issue #2221: Aggregate func Approx_median not work with Parquet format

Posted by GitBox <gi...@apache.org>.
jychen7 commented on issue #2221:
URL: https://github.com/apache/arrow-datafusion/issues/2221#issuecomment-1100201872

   More context: it only fails for `approx_median`; `approx_percentile_cont` works fine.
   
   ```
   ❯ select approx_percentile_cont(tinyint_col, 0.5) from test;
   +-----------------------------------------------------+
   | APPROXPERCENTILECONT(test.tinyint_col,Float64(0.5)) |
   +-----------------------------------------------------+
   | 0                                                   |
   +-----------------------------------------------------+
   ```
   
   https://github.com/apache/arrow-datafusion/blob/9f2ed423dc63f9f5d0a5e586925d2c31e3b9f5b8/datafusion/core/src/optimizer/to_approx_perc.rs#L97-L100
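
The linked file name (`to_approx_perc.rs`) suggests an optimizer rule that turns `approx_median(x)` into `approx_percentile_cont(x, 0.5)`, which matches the working query above. A minimal sketch of such a rewrite on a toy expression tree follows. `ToyExpr` and `rewrite_approx_median` are hypothetical names for illustration. This is not DataFusion's `Expr` type or the actual rule.

```rust
/// Toy expression tree; a hypothetical stand-in, not DataFusion's `Expr`.
#[derive(Debug, Clone, PartialEq)]
enum ToyExpr {
    Column(String),
    Literal(f64),
    Aggregate { func: String, args: Vec<ToyExpr> },
}

/// Rewrite `approx_median(x)` into `approx_percentile_cont(x, 0.5)`,
/// recursing into the arguments of every other aggregate.
fn rewrite_approx_median(expr: ToyExpr) -> ToyExpr {
    match expr {
        ToyExpr::Aggregate { func, mut args } if func == "approx_median" => {
            // The median is the 0.5 percentile, so append that literal.
            args.push(ToyExpr::Literal(0.5));
            ToyExpr::Aggregate {
                func: "approx_percentile_cont".to_string(),
                args,
            }
        }
        ToyExpr::Aggregate { func, args } => ToyExpr::Aggregate {
            func,
            args: args.into_iter().map(rewrite_approx_median).collect(),
        },
        // Columns and literals are left unchanged.
        other => other,
    }
}
```

In this sketch the rewritten expression no longer contains `approx_median`, so a second pass leaves it unchanged. Whether the real rule has that property is not established by this thread.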




[GitHub] [arrow-datafusion] andygrove commented on issue #2221: Aggregate func Approx_median not work with Parquet format

Posted by GitBox <gi...@apache.org>.
andygrove commented on issue #2221:
URL: https://github.com/apache/arrow-datafusion/issues/2221#issuecomment-1112327827

   I just ran into this, and in my case it was caused by optimizing the query twice (which really should be safe but apparently is not).
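
"Safe to optimize twice" amounts to the rule being idempotent: applying it to its own output changes nothing. A minimal sketch of how that property could be checked follows. `is_idempotent`, `add_alias_once`, and `add_alias_always` are hypothetical helpers that operate on a plain list of column names rather than a DataFusion plan. The sketch does not reproduce the actual bug.

```rust
/// Returns true when applying `rule` twice gives the same result as applying it once.
fn is_idempotent<T, F>(rule: F, input: T) -> bool
where
    T: Clone + PartialEq,
    F: Fn(T) -> T,
{
    let once = rule(input);
    let twice = rule(once.clone());
    once == twice
}

/// Hypothetical rule that is safe to repeat: it only adds the alias if it is missing.
fn add_alias_once(mut columns: Vec<String>) -> Vec<String> {
    if !columns.iter().any(|c| c == "alias") {
        columns.push("alias".to_string());
    }
    columns
}

/// Hypothetical rule that is not safe to repeat: every pass appends again.
fn add_alias_always(mut columns: Vec<String>) -> Vec<String> {
    columns.push("alias".to_string());
    columns
}
```

A test that runs each optimizer rule through a check like `is_idempotent` on representative plans would flag a rule whose second pass changes the output before it reaches users.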




[GitHub] [arrow-datafusion] andygrove closed issue #2221: Aggregate func Approx_median not work with Parquet format

Posted by GitBox <gi...@apache.org>.
andygrove closed issue #2221: Aggregate func Approx_median not work with Parquet format
URL: https://github.com/apache/arrow-datafusion/issues/2221

