You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by Ryan Blue <rb...@netflix.com.INVALID> on 2016/10/26 22:42:03 UTC

New Parquet CLI project

Hi everyone,

Last Parquet sync-up, I mentioned that I've been working on a new Parquet
CLI tool (based on Cloudera's Kite CLI). I haven't had a chance to move the
build to maven or get the licensing taken care of for an Apache submission,
but it is clean enough that people can start looking at it. I've posted it
here:

  https://github.com/rdblue/parquet-cli

The build uses gradle and the jar is run with the hadoop command, like the
current tools. It is based on parquet-avro and can convert between Avro,
Parquet, CSV, and JSON. It has been a great tool for trying different
settings and having an easier time inspecting Parquet file
metadata/dictionaries.

Please have a look, I'm interested to know if anyone would like this added
to the Parquet project. Thanks!

rb

-- 
Ryan Blue
Software Engineer
Netflix

Re: New Parquet CLI project

Posted by Julien Le Dem <ju...@dremio.com>.
Thanks Ryan,
This looks interesting.
I'm curious to hear other people's thoughts.
Let us know if you tried it.
Julien

On Wed, Oct 26, 2016 at 3:42 PM, Ryan Blue <rb...@netflix.com.invalid>
wrote:

> Hi everyone,
>
> Last Parquet sync-up, I mentioned that I've been working on a new Parquet
> CLI tool (based on Cloudera's Kite CLI). I haven't had a chance to move the
> build to maven or get the licensing taken care of for an Apache submission,
> but it is clean enough that people can start looking at it. I've posted it
> here:
>
>   https://github.com/rdblue/parquet-cli
>
> The build uses gradle and the jar is run with the hadoop command, like the
> current tools. It is based on parquet-avro and can convert between Avro,
> Parquet, CSV, and JSON. It has been a great tool for trying different
> settings and having an easier time inspecting Parquet file
> metadata/dictionaries.
>
> Please have a look, I'm interested to know if anyone would like this added
> to the Parquet project. Thanks!
>
> rb
>
> --
> Ryan Blue
> Software Engineer
> Netflix
>



-- 
Julien