You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/08/28 08:40:00 UTC

[jira] [Commented] (HUDI-1680) Add getPreferredLocations for RDD

    [ https://issues.apache.org/jira/browse/HUDI-1680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17406154#comment-17406154 ] 

ASF GitHub Bot commented on HUDI-1680:
--------------------------------------

pratyakshsharma commented on pull request #2663:
URL: https://github.com/apache/hudi/pull/2663#issuecomment-907594575


   @yui2010 Gentle reminder :) 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


> Add getPreferredLocations for RDD
> ---------------------------------
>
>                 Key: HUDI-1680
>                 URL: https://issues.apache.org/jira/browse/HUDI-1680
>             Project: Apache Hudi
>          Issue Type: Improvement
>          Components: Spark Integration
>    Affects Versions: 0.9.0
>            Reporter: steven zhang
>            Assignee: steven zhang
>            Priority: Minor
>              Labels: pull-request-available
>             Fix For: 0.10.0
>
>
> Currently HoodieMergeOnReadRDD/HoodieBootstrapRDD's partition may have multiple PartitionedFiles. we should
> compute the hosts with the most data of the PartitionedFiles for data locality



--
This message was sent by Atlassian Jira
(v8.3.4#803005)