You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Ryan Blue (JIRA)" <ji...@apache.org> on 2016/10/14 23:08:20 UTC

[jira] [Created] (PARQUET-751) DictionaryFilter patch broke column projection

Ryan Blue created PARQUET-751:
---------------------------------

             Summary: DictionaryFilter patch broke column projection
                 Key: PARQUET-751
                 URL: https://issues.apache.org/jira/browse/PARQUET-751
             Project: Parquet
          Issue Type: Bug
          Components: parquet-mr
    Affects Versions: 1.9.0
            Reporter: Ryan Blue
            Assignee: Ryan Blue
             Fix For: 1.9.0


The {{DictionaryFilter}} patch included a change to reuse a {{ParquetFileReader}} rather than creating new ones (and opening new file streams). But by reusing a reader, the projection columns are no longer set. The result is that all columns are read from disk, but only the requested columns are materialized. The fix is to set the requested schema on the file reader when it is fetched from the read context inside {{InternalParquetRecordReader}}.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)