You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@kafka.apache.org by "Prabhat Verma (JIRA)" <ji...@apache.org> on 2018/04/27 01:20:00 UTC

[jira] [Created] (KAFKA-6831) FileStreamSink is very slow

Prabhat Verma created KAFKA-6831:
------------------------------------

             Summary: FileStreamSink is very slow
                 Key: KAFKA-6831
                 URL: https://issues.apache.org/jira/browse/KAFKA-6831
             Project: Kafka
          Issue Type: Test
          Components: consumer
    Affects Versions: 1.1.0
            Reporter: Prabhat Verma
             Fix For: 1.1.0


Hi Team,

 

I am very new in kafka. My project requirement is fetch data from source location and place it in other other location (consumer location). I am using FileStreamSink class to perform above action.

I am using Linux machine having memory of 32 GB. 

When i start FIleStreamSink , It is syncing to consumer location very very slowly. Not sure why it is taking 2000 message at a time and then sync it. After that it wait for few second then sync again. This waiting time increases per run .

 

I am processing 600K message but it took 1 hrs to process only 60K message.

 

Below are my config details : 

 

connect-file-sink.property

Name = local-file

Connector.class = FileStreamSource

task.max=20

file=/d/d1/kafka/destination/outfile.txt

topic=abc_partion_20

connect-file-source.property

Name = local-file

Connector.class = FileStreamSource

task.max=20

file=/d/d1/kafka/source/infile.txt

topic=abc_partion_20

 

Can you please help ?

 

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)