You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@drill.apache.org by François Méthot <fm...@gmail.com> on 2016/09/28 15:55:20 UTC

select count(1) : Cannot convert Indexed schema to NamePart

Hi,

 Since release 1.8,

we have a workspace hdfs.datarepo1 mapped to
/year/month/day/
containging csv.gz

if we do select count(1) on any level of the dir structure like
   select count(1) from hdfs.datarepo1.`/2016/08`;
We get
    Error: SYSTEM ERROR: IllegalStateException: You cannot convert a
indexed schema path to a   NamePart. NameParts can only reference Vectors,
not individual records or values.

same error with
   select count(1) from hdfs.datarepo1.`/` where dir0=2016 and dir1=08;


While this query works (or any select column)
   select count(column1) from hdfs.datarepo1.`/2016/08`;


Should I create a ticket?


Francois

Re: select count(1) : Cannot convert Indexed schema to NamePart

Posted by Zelaine Fong <zf...@maprtech.com>.

This looks like it probably got introduced some time shortly before the 1.8
release went out.  I tried your query on one of the early release candidate
builds for 1.8, and your query works fine in that older build.

We'll put this on the list to take a look at.

Thanks for reporting this.

-- Zelaine


On Fri, Sep 30, 2016 at 6:08 AM, François Méthot <fm...@gmail.com>
wrote:

> I have created a ticket:
>
> https://issues.apache.org/jira/browse/DRILL-4919
>
> The error happen on csv with header.
>
> The actual error from the Drill's original TextFormatPlugin is
>
> Error: UNSUPPORTED_OPERATION ERROR: With extractHeader enabled, only header
> names are supported
>
>
> Forget about the originally reported error, it happens on a modified
> version of the TextFormatPlugin we are using.
>
>
> On Wed, Sep 28, 2016 at 1:01 PM, Jinfeng Ni <jn...@apache.org> wrote:
>
> > I tried to query a regular csv file and a csv.gz file, and did not run
> > into the problem you saw. When you create a JIRA, it would be helpful
> > if you can share a sample file for re-produce purpose.
> >
> >
> >
> > On Wed, Sep 28, 2016 at 9:33 AM, Aman Sinha <am...@apache.org>
> wrote:
> > > Is this specific to CSV format files ?  Yes, you should create a JIRA
> for
> > > this.   Thanks for reporting.
> > >
> > > On Wed, Sep 28, 2016 at 8:55 AM, François Méthot <fm...@gmail.com>
> > > wrote:
> > >
> > >> Hi,
> > >>
> > >>  Since release 1.8,
> > >>
> > >> we have a workspace hdfs.datarepo1 mapped to
> > >> /year/month/day/
> > >> containging csv.gz
> > >>
> > >> if we do select count(1) on any level of the dir structure like
> > >>    select count(1) from hdfs.datarepo1.`/2016/08`;
> > >> We get
> > >>     Error: SYSTEM ERROR: IllegalStateException: You cannot convert a
> > >> indexed schema path to a   NamePart. NameParts can only reference
> > Vectors,
> > >> not individual records or values.
> > >>
> > >> same error with
> > >>    select count(1) from hdfs.datarepo1.`/` where dir0=2016 and
> dir1=08;
> > >>
> > >>
> > >> While this query works (or any select column)
> > >>    select count(column1) from hdfs.datarepo1.`/2016/08`;
> > >>
> > >>
> > >> Should I create a ticket?
> > >>
> > >>
> > >> Francois
> > >>
> >
>

Re: select count(1) : Cannot convert Indexed schema to NamePart

Posted by François Méthot <fm...@gmail.com>.

I have created a ticket:

https://issues.apache.org/jira/browse/DRILL-4919

The error happen on csv with header.

The actual error from the Drill's original TextFormatPlugin is

Error: UNSUPPORTED_OPERATION ERROR: With extractHeader enabled, only header
names are supported


Forget about the originally reported error, it happens on a modified
version of the TextFormatPlugin we are using.


On Wed, Sep 28, 2016 at 1:01 PM, Jinfeng Ni <jn...@apache.org> wrote:

> I tried to query a regular csv file and a csv.gz file, and did not run
> into the problem you saw. When you create a JIRA, it would be helpful
> if you can share a sample file for re-produce purpose.
>
>
>
> On Wed, Sep 28, 2016 at 9:33 AM, Aman Sinha <am...@apache.org> wrote:
> > Is this specific to CSV format files ?  Yes, you should create a JIRA for
> > this.   Thanks for reporting.
> >
> > On Wed, Sep 28, 2016 at 8:55 AM, François Méthot <fm...@gmail.com>
> > wrote:
> >
> >> Hi,
> >>
> >>  Since release 1.8,
> >>
> >> we have a workspace hdfs.datarepo1 mapped to
> >> /year/month/day/
> >> containging csv.gz
> >>
> >> if we do select count(1) on any level of the dir structure like
> >>    select count(1) from hdfs.datarepo1.`/2016/08`;
> >> We get
> >>     Error: SYSTEM ERROR: IllegalStateException: You cannot convert a
> >> indexed schema path to a   NamePart. NameParts can only reference
> Vectors,
> >> not individual records or values.
> >>
> >> same error with
> >>    select count(1) from hdfs.datarepo1.`/` where dir0=2016 and dir1=08;
> >>
> >>
> >> While this query works (or any select column)
> >>    select count(column1) from hdfs.datarepo1.`/2016/08`;
> >>
> >>
> >> Should I create a ticket?
> >>
> >>
> >> Francois
> >>
>

Re: select count(1) : Cannot convert Indexed schema to NamePart

Posted by Jinfeng Ni <jn...@apache.org>.

I tried to query a regular csv file and a csv.gz file, and did not run
into the problem you saw. When you create a JIRA, it would be helpful
if you can share a sample file for re-produce purpose.



On Wed, Sep 28, 2016 at 9:33 AM, Aman Sinha <am...@apache.org> wrote:
> Is this specific to CSV format files ?  Yes, you should create a JIRA for
> this.   Thanks for reporting.
>
> On Wed, Sep 28, 2016 at 8:55 AM, François Méthot <fm...@gmail.com>
> wrote:
>
>> Hi,
>>
>>  Since release 1.8,
>>
>> we have a workspace hdfs.datarepo1 mapped to
>> /year/month/day/
>> containging csv.gz
>>
>> if we do select count(1) on any level of the dir structure like
>>    select count(1) from hdfs.datarepo1.`/2016/08`;
>> We get
>>     Error: SYSTEM ERROR: IllegalStateException: You cannot convert a
>> indexed schema path to a   NamePart. NameParts can only reference Vectors,
>> not individual records or values.
>>
>> same error with
>>    select count(1) from hdfs.datarepo1.`/` where dir0=2016 and dir1=08;
>>
>>
>> While this query works (or any select column)
>>    select count(column1) from hdfs.datarepo1.`/2016/08`;
>>
>>
>> Should I create a ticket?
>>
>>
>> Francois
>>

Re: select count(1) : Cannot convert Indexed schema to NamePart

Posted by Aman Sinha <am...@apache.org>.

Is this specific to CSV format files ?  Yes, you should create a JIRA for
this.   Thanks for reporting.

On Wed, Sep 28, 2016 at 8:55 AM, François Méthot <fm...@gmail.com>
wrote:

> Hi,
>
>  Since release 1.8,
>
> we have a workspace hdfs.datarepo1 mapped to
> /year/month/day/
> containging csv.gz
>
> if we do select count(1) on any level of the dir structure like
>    select count(1) from hdfs.datarepo1.`/2016/08`;
> We get
>     Error: SYSTEM ERROR: IllegalStateException: You cannot convert a
> indexed schema path to a   NamePart. NameParts can only reference Vectors,
> not individual records or values.
>
> same error with
>    select count(1) from hdfs.datarepo1.`/` where dir0=2016 and dir1=08;
>
>
> While this query works (or any select column)
>    select count(column1) from hdfs.datarepo1.`/2016/08`;
>
>
> Should I create a ticket?
>
>
> Francois
>