You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@arrow.apache.org by "Martin Durant (JIRA)" <ji...@apache.org> on 2017/08/02 15:06:00 UTC
[jira] [Created] (ARROW-1320) hdfs block locations
Martin Durant created ARROW-1320:
------------------------------------
Summary: hdfs block locations
Key: ARROW-1320
URL: https://issues.apache.org/jira/browse/ARROW-1320
Project: Apache Arrow
Issue Type: Improvement
Reporter: Martin Durant
To provide a function which can return the set of machines on which the data blocks of a given hdfs file are stored. This is best for scheduling systems (e.g., dask) which can move the computation to the machine which has the data, and so cut out network data traffic.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)