You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Paul Rogers (JIRA)" <ji...@apache.org> on 2017/05/09 05:13:04 UTC
[jira] [Created] (DRILL-5492) CSV with spaces for header uses
spaces as field name
Paul Rogers created DRILL-5492:
----------------------------------
Summary: CSV with spaces for header uses spaces as field name
Key: DRILL-5492
URL: https://issues.apache.org/jira/browse/DRILL-5492
Project: Apache Drill
Issue Type: Bug
Reporter: Paul Rogers
Priority: Minor
Consider the same test case as in DRILL-5491, but with a slightly different input file:
{code}
___
a,b,c
d,e,f
{code}
The underscores represent three spaces: use spaces in the real test.
In this case, the code discussed in DRILL-5491 finds some characters and happily returns the following array:
{code}
[" "]
{code}
The field name of three blanks is returned to the client to produce the following bizarre output:
{code}
2 row(s):
a
d
{code}
The blank line is normally the header, but the header here was considered to be three blanks. (In fact, the blanks are actually printed.)
Since the blanks were considered to be a field, the file is assumed to have only one field, so only the first column was returned.
The expected behavior is that spaces are trimmed from field names, so the field name list would be empty and a User Error thrown. (That is, it is confusing to the user why a blank line produces NPE, some produce the {{ExecutionSetupException}} shown in DRILL-5491, and some produce blank headings. Behavior should be consistent.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)