You are viewing a plain text version of this content. The canonical link for it is here.

Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2022/08/12 04:26:40 UTC

[GitHub] [spark] sadikovi opened a new pull request, #37485: [SPARK-40052] Handle direct byte buffers in VectorizedDeltaBinaryPackedReader

sadikovi opened a new pull request, #37485:
URL: https://github.com/apache/spark/pull/37485

### What changes were proposed in this pull request?

This PR is a follow-up for https://github.com/apache/spark/pull/37293. The patch proposes to use `hasArray()` API to check whether or not the `ByteBuffer` has an underlying array that could be leveraged to direct access.

If the condition is not met, the code falls back to the original way of handling things using `ByteBuffer`s.

### Why are the changes needed?

Fixes a potential issue of using direct byte buffers in Parquet-MR and makes the code forward-compatible.

### Does this PR introduce _any_ user-facing change?

No. This is an internal change.

### How was this patch tested?

Existing unit tests. I reran `DataSourceReadBenchmark` to ensure this `if-else` does not have a significant impact on performance.

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org