You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@gobblin.apache.org by "Abhishek Tiwari (JIRA)" <ji...@apache.org> on 2018/05/14 15:43:00 UTC
[jira] [Resolved] (GOBBLIN-352) Add example for using
gobblin-parquet module
[ https://issues.apache.org/jira/browse/GOBBLIN-352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Abhishek Tiwari resolved GOBBLIN-352.
-------------------------------------
Resolution: Fixed
Fix Version/s: 0.13.0
Issue resolved by pull request #2222
[https://github.com/apache/incubator-gobblin/pull/2222]
> Add example for using gobblin-parquet module
> --------------------------------------------
>
> Key: GOBBLIN-352
> URL: https://issues.apache.org/jira/browse/GOBBLIN-352
> Project: Apache Gobblin
> Issue Type: Improvement
> Reporter: Tilak Patidar
> Priority: Minor
> Fix For: 0.13.0
>
>
> A Gobblin CLI application to download GitHub archive data for the provided day.
> Github data is a JSON archive and this example uses ```org.apache.gobblin.converter.parquet.JsonIntermediateToParquetGroupConverter``` to convert JSON to parquet and ```org.apache.gobblin.writer.ParquetHdfsDataWriter``` to write parquet files.
> This example demonstrates usage of Json source To parquet files.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)