You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@drill.apache.org by John Omernik <jo...@omernik.com> on 2016/01/19 18:35:01 UTC

Working with non-sane data, 2nd Edition

ok, this topic is complete separate from the original topic.

I have some data where in some files

it's just ({} is the record, with data, that part is sane)

{},
{},
{},
{},

and others where it's

[
{},
{},
{},
]

If there are no square brackets around the data, then drill doesn't like
that each object has a , on each line, if there are brackets it throws
 different error about starting in the middle of a record.

Are there any run time options that would/could allow us to better specify
what we are seeing. (I used sed to remove the, at the end of each line,
that seemed to work, but I'd rather not munge the source data).

Some options like "ignore command before newline" would be handy, or "data
enclosed in array" for the second error.

Do these exist?