You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@drill.apache.org by "Matt Keranen (JIRA)" <ji...@apache.org> on 2018/11/27 21:01:00 UTC
[jira] [Created] (DRILL-6873) Cluster without dfs throws DATA_READ
ERROR file does not exist
Matt Keranen created DRILL-6873:
-----------------------------------
Summary: Cluster without dfs throws DATA_READ ERROR file does not exist
Key: DRILL-6873
URL: https://issues.apache.org/jira/browse/DRILL-6873
Project: Apache Drill
Issue Type: Bug
Components: Storage - JSON
Affects Versions: 1.14.0
Environment: Drill v1.14.0
Zookeeper 3.4.13
Centos 7.5
Reporter: Matt Keranen
Running drillbits on multiple servers with Zookeeper but without HDFS. When file storage is configured to a common path, but not all filenames are present on all nodes, errors are thrown:
Error: DATA_READ ERROR: Failure reading JSON file - File file:/localdata/logs/fileX.json.gz does not exist
Example use case: Querying log files on multiple machines as a ZK cluster from their local filesystems without moving them to a distributed file system which may not be in use.
Is there a (planned) configuration option to simply skip filenames that exist on some but not all nodes?
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)