You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Alexey Kudinkin (Jira)" <ji...@apache.org> on 2022/01/12 18:05:00 UTC

[jira] [Commented] (HUDI-2647) Ensure both Spark SQL + Datasource reads can take advantage of Data Skipping

    [ https://issues.apache.org/jira/browse/HUDI-2647?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17474772#comment-17474772 ] 

Alexey Kudinkin commented on HUDI-2647:
---------------------------------------

[~vinoth] can you please elaborate what this task is about? 

IIUC, this is what we have out of the box, since data-skipping is orchestrated on the `HoodieFileIndex` level, and as such all Spark's Relations implementations that leverage it will benefit from data-skipping (if enabled) out of the box

> Ensure both Spark SQL + Datasource reads can take advantage of Data Skipping
> ----------------------------------------------------------------------------
>
>                 Key: HUDI-2647
>                 URL: https://issues.apache.org/jira/browse/HUDI-2647
>             Project: Apache Hudi
>          Issue Type: Task
>          Components: Spark Integration
>            Reporter: Vinoth Chandar
>            Assignee: Yann Byron
>            Priority: Critical
>             Fix For: 0.11.0
>
>




--
This message was sent by Atlassian Jira
(v8.20.1#820001)