Posted to user@drill.apache.org by Tom Kiley <to...@kiley.us> on 2014/02/07 14:49:03 UTC

Sequence File Support

Hello,


Are there plans to support Hadoop's SequenceFile format
(http://wiki.apache.org/hadoop/SequenceFile)?  Or is it already supported
and I missed it?  It would be useful for running Drill directly on the
output of MapReduce jobs.

The sequence files I have currently all have NULL keys and JSON objects as
values.  Does anyone have a recommendation for converting them to JSON or
Parquet files for Drill?  The JSON objects generally share the same
format, but there may be some outliers.  Some fields may be missing from
some objects.
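
To make the conversion question concrete, here is a minimal Python sketch of
one possible approach, not a Drill feature. It assumes the values have
already been dumped to text with one record per line (for example with
`hadoop fs -text`, which may prefix each line with the key; the sketch
simply starts parsing at the first `{`). The function names
`normalize_records` and `to_json_lines` are hypothetical. It pads missing
top-level fields with null so the outliers become explicit; nested objects
are left untouched, and whether padding is needed at all depends on the
reader.

```python
import json


def normalize_records(lines):
    """Parse one JSON object per line and pad missing top-level fields with None.

    Assumption: each usable line contains exactly one JSON object, optionally
    preceded by a key prefix (anything before the first '{' is ignored).
    """
    records = []
    fields = []   # union of field names, in first-seen order
    seen = set()
    for line in lines:
        start = line.find("{")
        if start == -1:
            continue  # no JSON object on this line, skip it
        record = json.loads(line[start:])
        records.append(record)
        for name in record:
            if name not in seen:
                seen.add(name)
                fields.append(name)
    # Second pass: give every record the same set of keys.
    return [{name: record.get(name) for name in fields} for record in records]


def to_json_lines(records):
    """Serialize records as newline-delimited JSON, one object per line."""
    return "\n".join(json.dumps(record) for record in records)
```

This keeps every record in memory, so for large inputs the same idea would
need two streaming passes: one to collect the field names and one to write
the padded records.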


Thanks,
Tom

P.S. Apologies for the noob questions.  I've just started looking at Drill.

Re: Sequence File Support

Posted by Sebastian Schelter <ss...@apache.org>.

There's no need to apologize for asking questions :)
