Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/06/23 13:44:07 UTC

[GitHub] [arrow] paleolimbot commented on pull request #13415: ARROW-16578: [R] unique() and is.na() on a column of a tibble is much slower after writing to and reading from a parquet file

paleolimbot commented on PR #13415:
URL: https://github.com/apache/arrow/pull/13415#issuecomment-1164426447

   I believe vroom is reading from disk, which should be much slower. I suspect that our ALTREP implementation is slow because it uses unwind protects; my reading of vroom's implementation is that it carefully avoids them.
   
   Speeding up our implementation is orthogonal to this PR, though. This PR is a huge improvement in the meantime, until we get to systematic profiling to see if there's anything we can do.
