You are viewing a plain text version of this content. The canonical link for it is here.

Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2021/11/04 21:16:18 UTC

[GitHub] [spark] sadikovi commented on pull request #34308: [SPARK-37035][SQL] Improve error message when use parquet vectorize reader

sadikovi commented on pull request #34308:
URL: https://github.com/apache/spark/pull/34308#issuecomment-961430537


   I usually load the file in memory and flip bits or replace bytes of data in a byte range that corresponds to the dictionary page. Because everything else can be constant, this allows you to trigger various decoding issues from compression to thrift metadata to data page encoding for dictionary. I can provide code if needed.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org