You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@gobblin.apache.org by "Shirshanka Das (Jira)" <ji...@apache.org> on 2020/10/13 20:51:00 UTC

[jira] [Commented] (GOBBLIN-1289) Documentation is missing for configuring Kafka topic name inclusion / exclusion from the output path

    [ https://issues.apache.org/jira/browse/GOBBLIN-1289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17213364#comment-17213364 ] 

Shirshanka Das commented on GOBBLIN-1289:
-----------------------------------------

From slack:

 
Hi Team,
I am using gobblin to extract data from multiple Kafka topics and write it to single hdfs directory. Currently, topic name is added in output path.Does gobblin provide out of the box configuration to skip topic name in output path.?Please note I am using {{ParquetDataWriterBuilder}} as writer builder class, {{TimeBasedAvroWriterPartitioner}} as partitioner class and {{TimePartitionedDataPublisher}} as publisher class.
And property {{writer.file.path.type=tablename}}
 
 
Answer: You can also try setting {{data.publisher.appendExtractToFinalDir=false}} 
 

> Documentation is missing for configuring Kafka topic name inclusion / exclusion from the output path
> ----------------------------------------------------------------------------------------------------
>
>                 Key: GOBBLIN-1289
>                 URL: https://issues.apache.org/jira/browse/GOBBLIN-1289
>             Project: Apache Gobblin
>          Issue Type: Bug
>    Affects Versions: 0.15.0
>            Reporter: Shirshanka Das
>            Priority: Minor
>              Labels: documentation
>             Fix For: 0.16.0
>
>
> Improve Gobblin documentation to make it clear how to 
>  # Attach Kafka Topic Name to the output path
>  # How to remove it from the output path
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)