You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Maciej Bryński (JIRA)" <ji...@apache.org> on 2018/10/20 14:39:00 UTC

[jira] [Updated] (SPARK-25787) [K8S] Spark can't use data locality information

     [ https://issues.apache.org/jira/browse/SPARK-25787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Maciej Bryński updated SPARK-25787:
-----------------------------------
    Description: 
I started experimenting with Spark based on this presentation:
https://www.slideshare.net/databricks/hdfs-on-kuberneteslessons-learned-with-kimoon-kim

I'm using excelent https://github.com/apache-spark-on-k8s/kubernetes-HDFS
charts to deploy HDFS.

Unfortunately reading from HDFS gives ANY locality for every task.
Is data locality working on Kubernetes cluster ?

  was:
I started to experimenting with Spark based on this presentation:
https://www.slideshare.net/databricks/hdfs-on-kuberneteslessons-learned-with-kimoon-kim

I'm using excelent https://github.com/apache-spark-on-k8s/kubernetes-HDFS
charts to deploy HDFS.

Unfortunately reading from HDFS gives ANY locality for every task.
Do data locality is working on Kubernetes cluster ?


> [K8S] Spark can't use data locality information
> -----------------------------------------------
>
>                 Key: SPARK-25787
>                 URL: https://issues.apache.org/jira/browse/SPARK-25787
>             Project: Spark
>          Issue Type: Bug
>          Components: Kubernetes
>    Affects Versions: 2.4.0
>            Reporter: Maciej Bryński
>            Priority: Major
>
> I started experimenting with Spark based on this presentation:
> https://www.slideshare.net/databricks/hdfs-on-kuberneteslessons-learned-with-kimoon-kim
> I'm using excelent https://github.com/apache-spark-on-k8s/kubernetes-HDFS
> charts to deploy HDFS.
> Unfortunately reading from HDFS gives ANY locality for every task.
> Is data locality working on Kubernetes cluster ?



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org