You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "ASF subversion and git services (JIRA)" <ji...@apache.org> on 2018/09/21 16:04:00 UTC

[jira] [Commented] (IMPALA-5843) Use page index in Parquet files to skip pages

    [ https://issues.apache.org/jira/browse/IMPALA-5843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16623820#comment-16623820 ] 

ASF subversion and git services commented on IMPALA-5843:
---------------------------------------------------------

Commit d7b8275ed04581c4b9bef60ae0abdb7201661618 in impala's branch refs/heads/master from Zoltan Borok-Nagy
[ https://git-wip-us.apache.org/repos/asf?p=impala.git;h=d7b8275 ]

IMPALA-7556: part 1: handle different file systems via polymorphism

This commit reorganizes some parts of the ScanRange class.

File operations are handled through the abstract FileReader
class. The interface supports positional read that will
be needed by IMPALA-5843. The concrete file operations are
implemented in sub-classes LocalFileReader and
HdfsFileReader.

File reader classes are responsible for setting counters and
metrics related to file operations.

The core logic haven't been changed significantly, but quite
a lot code fragments were relocated.

Testing: Debug exhaustive tests passed

Change-Id: Ia3d3d2d774075008285230606b992603d5be1a82
Reviewed-on: http://gerrit.cloudera.org:8080/11444
Reviewed-by: Impala Public Jenkins <im...@cloudera.com>
Tested-by: Impala Public Jenkins <im...@cloudera.com>


> Use page index in Parquet files to skip pages
> ---------------------------------------------
>
>                 Key: IMPALA-5843
>                 URL: https://issues.apache.org/jira/browse/IMPALA-5843
>             Project: IMPALA
>          Issue Type: New Feature
>          Components: Backend
>    Affects Versions: Impala 2.10.0
>            Reporter: Lars Volker
>            Assignee: Zoltán Borók-Nagy
>            Priority: Critical
>              Labels: parquet, performance
>
> Once IMPALA-5842 has been resolved, we should skip pages based on the page index in Parquet files.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org