You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by "assignUser (via GitHub)" <gi...@apache.org> on 2023/02/24 00:55:33 UTC

[GitHub] [arrow] assignUser commented on issue #34319: v11.0.0.2 extremely slow with parquet files written in v9.0.0

assignUser commented on issue #34319:
URL: https://github.com/apache/arrow/issues/34319#issuecomment-1442640386

   Hello thanks for the detailed report! 
   
   I am able to reproduce the issue, my times are overall a bit faster (192ms for 9.0.0 read) but 192ms -> 5s is still an unacceptable regression.
   ```
   r$> packageVersion('arrow')
       microbenchmark::microbenchmark(
         arrow::read_parquet(file, col_select = 1:200), times = 10
       )
   [1] '11.0.0.2'
   Unit: seconds
                                             expr      min       lq     mean   median       uq      max neval
    arrow::read_parquet(file, col_select = 1:200) 5.736177 5.993478 6.068209 6.093701 6.165921 6.355394    10
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org