Posted to issues@drill.apache.org by "Matt Keranen (JIRA)" <ji...@apache.org> on 2018/12/01 01:25:00 UTC

[jira] [Updated] (DRILL-6873) Cluster of drillbits on local files expects same set of filenames on all nodes

     [ https://issues.apache.org/jira/browse/DRILL-6873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Matt Keranen updated DRILL-6873:
--------------------------------
    Summary: Cluster of drillbits on local files expects same set of filenames on all nodes  (was: Cluster of drillbits on local files throws "DATA_READ ERROR file does not exist")

> Cluster of drillbits on local files expects same set of filenames on all nodes
> ------------------------------------------------------------------------------
>
>                 Key: DRILL-6873
>                 URL: https://issues.apache.org/jira/browse/DRILL-6873
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - JSON
>    Affects Versions: 1.14.0
>         Environment: Drill v1.14.0
> Zookeeper 3.4.13
> Centos 7.5
>  
>            Reporter: Matt Keranen
>            Priority: Major
>
> Drillbits are running on multiple servers coordinated by Zookeeper, with data on each node's local filesystem rather than HDFS. When file storage is configured to a common path but not every filename is present on every node, queries fail with an error such as:
>     Error: DATA_READ ERROR: Failure reading JSON file - File file:/localdata/logs/fileX.json.gz does not exist
> Example use case: querying log files in place on multiple machines that form a ZK cluster, each reading from its own local filesystem, without first moving the files to a distributed file system (which may not be in use).
> Is there an existing or planned configuration option to simply skip filenames that exist on some but not all nodes?
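
The condition described above (a filename present on some nodes but not all) can be checked outside Drill before running a query. The following is a minimal sketch, not a Drill feature or API: the function name, the node names, and the hard-coded listings are all hypothetical, and it assumes the directory listing of the common path has already been collected from each node by some other means.

```python
from typing import Dict, Set


def files_missing_somewhere(listings: Dict[str, Set[str]]) -> Dict[str, Set[str]]:
    """Return {filename: nodes lacking it} for files absent on at least one node.

    `listings` maps a node name to the set of filenames found under the
    common storage path on that node (hypothetical input, gathered elsewhere).
    """
    all_files: Set[str] = set().union(*listings.values()) if listings else set()
    missing: Dict[str, Set[str]] = {}
    for name in sorted(all_files):
        # Nodes whose listing does not contain this filename
        absent = {node for node, files in listings.items() if name not in files}
        if absent:
            missing[name] = absent
    return missing


# Hypothetical listings of /localdata/logs on two nodes
listings = {
    "node1": {"fileA.json.gz", "fileX.json.gz"},
    "node2": {"fileA.json.gz"},
}
print(files_missing_somewhere(listings))
```

Any filename reported here exists on only a subset of nodes, which is the situation that produces the "does not exist" error in the report.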



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)