You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@kafka.apache.org by "Prabhat Verma (JIRA)" <ji...@apache.org> on 2018/05/01 05:51:00 UTC

[jira] [Commented] (KAFKA-6831) FileStreamSink is very slow

    [ https://issues.apache.org/jira/browse/KAFKA-6831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16459466#comment-16459466 ] 

Prabhat Verma commented on KAFKA-6831:
--------------------------------------

Any help please ...

> FileStreamSink is very slow
> ---------------------------
>
>                 Key: KAFKA-6831
>                 URL: https://issues.apache.org/jira/browse/KAFKA-6831
>             Project: Kafka
>          Issue Type: Test
>          Components: consumer
>    Affects Versions: 1.1.0
>            Reporter: Prabhat Verma
>            Priority: Major
>             Fix For: 1.1.0
>
>
> Hi Team,
>  
> I am very new in kafka. My project requirement is fetch data from source location and place it in other other location (consumer location). I am using FileStreamSink class to perform above action.
> I am using Linux machine having memory of 32 GB. 
> When i start FIleStreamSink , It is syncing to consumer location very very slowly. Not sure why it is taking 2000 message at a time and then sync it. After that it wait for few second then sync again. This waiting time increases per run .
>  
> I am processing 600K message but it took 1 hrs to process only 60K message.
>  
> Below are my config details : 
>  
> connect-file-sink.property
> Name = local-file
> Connector.class = FileStreamSource
> task.max=20
> file=/d/d1/kafka/destination/outfile.txt
> topic=abc_partion_20
> connect-file-source.property
> Name = local-file
> Connector.class = FileStreamSource
> task.max=20
> file=/d/d1/kafka/source/infile.txt
> topic=abc_partion_20
>  
> Can you please help ?
>  
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)