Posted to dev@kafka.apache.org by "John Fung (JIRA)" <ji...@apache.org> on 2012/10/25 00:26:11 UTC
[jira] [Created] (KAFKA-583) SimpleConsumerShell may receive less data inconsistently
John Fung created KAFKA-583:
-------------------------------
Summary: SimpleConsumerShell may receive less data inconsistently
Key: KAFKA-583
URL: https://issues.apache.org/jira/browse/KAFKA-583
Project: Kafka
Issue Type: Bug
Reporter: John Fung
Attachments: broker_log_segment_files.tar.gz
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (KAFKA-583) SimpleConsumerShell may receive less data inconsistently
Posted by "John Fung (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/KAFKA-583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
John Fung updated KAFKA-583:
----------------------------
Attachment: broker_log_segment_files.tar.gz
> SimpleConsumerShell may receive less data inconsistently
> --------------------------------------------------------
>
> Key: KAFKA-583
> URL: https://issues.apache.org/jira/browse/KAFKA-583
> Project: Kafka
> Issue Type: Bug
> Reporter: John Fung
> Attachments: broker_log_segment_files.tar.gz
>
>
[jira] [Commented] (KAFKA-583) SimpleConsumerShell may receive less data inconsistently
Posted by "John Fung (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/KAFKA-583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13486125#comment-13486125 ]
John Fung commented on KAFKA-583:
---------------------------------
This issue can be reproduced consistently using the attached broker log segment files and ZooKeeper data.
Please do the following to reproduce:
1. Check out the latest 0.8 branch
2. The attached data file "kafka_583_zk_kafka_data.tar.gz" contains the following directories:
/tmp/zookeeper_0
/tmp/kafka_server_1_logs
/tmp/kafka_server_2_logs
/tmp/kafka_server_3_logs
If your local "/tmp" directory already contains any of the above directories, please rename them first.
3. Download "kafka_583_zk_kafka_data.tar.gz" and extract it into your local "/tmp" directory
4. Download "kafka_583_reproduce_issue.patch" and apply under <kafka_home>:
patch -p0 -i kafka_583_reproduce_issue.patch
5. Build kafka as: <kafka_home> $ ./sbt update package
6. In <kafka_home>, execute "chmod u+x validate_data_and_log_segment.sh"
7. In <kafka_home>/system_test : execute "python -B system_test_runner.py"
8. Wait about 1 minute until the following message appears on the console:
=====================================================
*** Sleeping for 30 min ...
You may now run : <kafka_home>/run_simple_consumer.sh
=====================================================
Then execute this command under <kafka_home>: ./validate_data_and_log_segment.sh
9. The following output will be shown:
$ ./validate_data_and_log_segment.sh
Validated by SimpleConsumerShell :
replica 1 message count:
735
735
630
replica 2 message count:
735
735
630
replica 3 message count:
735
735
200
Validated by DumpLogSegments :
broker 1 partition 0 messages count : 735
broker 1 partition 1 messages count : 735
broker 1 partition 2 messages count : 630
broker 2 partition 0 messages count : 735
broker 2 partition 1 messages count : 735
broker 2 partition 2 messages count : 630
broker 3 partition 0 messages count : 735
broker 3 partition 1 messages count : 735
broker 3 partition 2 messages count : 630
10. The message count for broker 3, partition 2 differs between SimpleConsumerShell (200) and DumpLogSegments (630).
11. If you get the following messages instead, please press Ctrl-C and re-run the test:
Error: replica 1 does not exist for partition (test_1, 0)
Error: replica 1 does not exist for partition (test_1, 1)
Error: replica 1 does not exist for partition (test_1, 2)
Error: replica 2 does not exist for partition (test_1, 0)
Error: replica 2 does not exist for partition (test_1, 1)
Error: replica 2 does not exist for partition (test_1, 2)
Error: replica 3 does not exist for partition (test_1, 0)
Error: replica 3 does not exist for partition (test_1, 1)
Error: replica 3 does not exist for partition (test_1, 2)
Validated by SimpleConsumerShell :
replica 1 message count:
0
0
0
replica 2 message count:
0
0
0
replica 3 message count:
0
0
0
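The comparison in step 10 can be automated. The following is only a sketch, not part of the attached scripts: it assumes the output format shown in step 9 and that "replica N" in the first section corresponds to "broker N" in the second. The function name find_count_mismatches is hypothetical.

```shell
#!/bin/sh
# Report every broker/partition whose SimpleConsumerShell count differs from
# its DumpLogSegments count. Reads the validation output from stdin or files.
# Assumption: "replica N" in the first section maps to "broker N" in the second.
find_count_mismatches() {
    awk '
        /Validated by SimpleConsumerShell/ { section = "scs"; next }
        /Validated by DumpLogSegments/     { section = "dump"; next }

        # "replica <id> message count:" starts a new replica; partitions count up from 0.
        section == "scs" && $1 == "replica" { replica = $2; partition = 0; next }

        # A bare number is the count for the next partition of the current replica.
        section == "scs" && NF == 1 && $1 ~ /^[0-9]+$/ {
            scs[replica, partition] = $1
            partition++
            next
        }

        # "broker <id> partition <p> messages count : <n>"
        section == "dump" && $1 == "broker" && $3 == "partition" {
            if (($2, $4) in scs && scs[$2, $4] + 0 != $NF + 0)
                printf "broker %s partition %s: SimpleConsumerShell=%s DumpLogSegments=%s\n", $2, $4, scs[$2, $4], $NF
        }
    ' "$@"
}
```

One way to use it would be: ./validate_data_and_log_segment.sh | find_count_mismatches
An empty result would mean that both tools agree on every partition.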
> SimpleConsumerShell may receive less data inconsistently
> --------------------------------------------------------
>
> Key: KAFKA-583
> URL: https://issues.apache.org/jira/browse/KAFKA-583
> Project: Kafka
> Issue Type: Bug
> Reporter: John Fung
> Labels: replication-testing
> Attachments: kafka_583_reproduce_issue.patch, kafka_583_zk_kafka_data.tar.gz
>
>
[jira] [Commented] (KAFKA-583) SimpleConsumerShell may receive less data inconsistently
Posted by "John Fung (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/KAFKA-583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13483673#comment-13483673 ]
John Fung commented on KAFKA-583:
---------------------------------
This happens intermittently when executing testcase_0108.
Test description:
1. Start ZooKeeper and a 3-broker cluster with 3 replicas.
2. Start the producer and wait until it finishes producing data.
3. Keep ZooKeeper and the brokers running.
4. Start SimpleConsumerShell to consume data as follows:
bin/kafka-run-class.sh kafka.tools.SimpleConsumerShell --broker-list localhost:9091,localhost:9092,localhost:9093 --topic test_1 --partition 0 --replica 1 --no-wait-at-logend > replica_data_r1_p0.log
bin/kafka-run-class.sh kafka.tools.SimpleConsumerShell --broker-list localhost:9091,localhost:9092,localhost:9093 --topic test_1 --partition 1 --replica 1 --no-wait-at-logend > replica_data_r1_p1.log
. . .
5. Count the unique MessageIDs in each replica's output:
grep MessageID replica_data_r1_p0.log | sed 's/.*MessageID://' | sed 's/:.*//' | sort -u | wc -l
grep MessageID replica_data_r1_p1.log | sed 's/.*MessageID://' | sed 's/:.*//' | sort -u | wc -l
. . .
(a shell script is attached for the above shell commands)
6. The following is the output:
./run-simple-consumer.sh
735
735
630
735
735
200
735
735
630
7. The numbers above show that only 200 messages were consumed from replica 2, partition 2 (the 3rd partition).
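Steps 4 and 5 can be wrapped in a small loop over all replicas and partitions. This is a sketch only and is not the attached run-simple-consumer.sh: the broker list, topic, and flags are copied from step 4, while the function names and the r2/r3 file names (following the replica_data_r1_p0.log pattern) are assumptions.

```shell
#!/bin/sh
# Broker list and topic are taken from step 4.
BROKERS="localhost:9091,localhost:9092,localhost:9093"
TOPIC="test_1"

# Consume one partition from one replica into its own log file.
# Usage: consume_replica <replica-id> <partition>
consume_replica() {
    r=$1
    p=$2
    bin/kafka-run-class.sh kafka.tools.SimpleConsumerShell \
        --broker-list "$BROKERS" --topic "$TOPIC" \
        --partition "$p" --replica "$r" \
        --no-wait-at-logend > "replica_data_r${r}_p${p}.log"
}

# Count distinct MessageID values in one consumer output file.
# Same pipeline as step 5; "sort -u" drops duplicates before counting.
count_message_ids() {
    grep MessageID "$1" | sed 's/.*MessageID://' | sed 's/:.*//' | sort -u | wc -l | tr -d ' '
}

# Consume and count every replica/partition pair, one labelled count per line.
consume_and_count_all() {
    for replica in 1 2 3; do
        for partition in 0 1 2; do
            consume_replica "$replica" "$partition"
            count=$(count_message_ids "replica_data_r${replica}_p${partition}.log")
            echo "replica $replica partition $partition: $count"
        done
    done
}
```

Calling consume_and_count_all from <kafka_home> while the brokers are running would print nine labelled counts, which makes a short count such as the 200 in step 6 easier to attribute to a specific replica and partition.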
> SimpleConsumerShell may receive less data inconsistently
> --------------------------------------------------------
>
> Key: KAFKA-583
> URL: https://issues.apache.org/jira/browse/KAFKA-583
> Project: Kafka
> Issue Type: Bug
> Reporter: John Fung
> Attachments: broker_log_segment_files.tar.gz
>
>
[jira] [Updated] (KAFKA-583) SimpleConsumerShell may receive less data inconsistently
Posted by "John Fung (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/KAFKA-583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
John Fung updated KAFKA-583:
----------------------------
Attachment: log4j_messages.tar.gz
> SimpleConsumerShell may receive less data inconsistently
> --------------------------------------------------------
>
> Key: KAFKA-583
> URL: https://issues.apache.org/jira/browse/KAFKA-583
> Project: Kafka
> Issue Type: Bug
> Reporter: John Fung
> Attachments: broker_log_segment_files.tar.gz, log4j_messages.tar.gz, run-simple-consumer.sh
>
>
[jira] [Updated] (KAFKA-583) SimpleConsumerShell may receive less data inconsistently
Posted by "John Fung (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/KAFKA-583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
John Fung updated KAFKA-583:
----------------------------
Attachment: kafka_583_reproduce_issue.patch
> SimpleConsumerShell may receive less data inconsistently
> --------------------------------------------------------
>
> Key: KAFKA-583
> URL: https://issues.apache.org/jira/browse/KAFKA-583
> Project: Kafka
> Issue Type: Bug
> Reporter: John Fung
> Attachments: kafka_583_reproduce_issue.patch
>
>
[jira] [Updated] (KAFKA-583) SimpleConsumerShell may receive less data inconsistently
Posted by "John Fung (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/KAFKA-583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
John Fung updated KAFKA-583:
----------------------------
Attachment: run-simple-consumer.sh
> SimpleConsumerShell may receive less data inconsistently
> --------------------------------------------------------
>
> Key: KAFKA-583
> URL: https://issues.apache.org/jira/browse/KAFKA-583
> Project: Kafka
> Issue Type: Bug
> Reporter: John Fung
> Attachments: broker_log_segment_files.tar.gz, run-simple-consumer.sh
>
>
[jira] [Updated] (KAFKA-583) SimpleConsumerShell may receive less data inconsistently
Posted by "John Fung (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/KAFKA-583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
John Fung updated KAFKA-583:
----------------------------
Attachment: (was: log4j_messages.tar.gz)
> SimpleConsumerShell may receive less data inconsistently
> --------------------------------------------------------
>
> Key: KAFKA-583
> URL: https://issues.apache.org/jira/browse/KAFKA-583
> Project: Kafka
> Issue Type: Bug
> Reporter: John Fung
>
[jira] [Updated] (KAFKA-583) SimpleConsumerShell may receive less data inconsistently
Posted by "John Fung (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/KAFKA-583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
John Fung updated KAFKA-583:
----------------------------
Attachment: (was: run-simple-consumer.sh)
> SimpleConsumerShell may receive less data inconsistently
> --------------------------------------------------------
>
> Key: KAFKA-583
> URL: https://issues.apache.org/jira/browse/KAFKA-583
> Project: Kafka
> Issue Type: Bug
> Reporter: John Fung
>
[jira] [Updated] (KAFKA-583) SimpleConsumerShell may receive less data inconsistently
Posted by "John Fung (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/KAFKA-583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
John Fung updated KAFKA-583:
----------------------------
Attachment: kafka_583_zk_kafka_data.tar.gz
> SimpleConsumerShell may receive less data inconsistently
> --------------------------------------------------------
>
> Key: KAFKA-583
> URL: https://issues.apache.org/jira/browse/KAFKA-583
> Project: Kafka
> Issue Type: Bug
> Reporter: John Fung
> Attachments: kafka_583_reproduce_issue.patch, kafka_583_zk_kafka_data.tar.gz
>
>
[jira] [Updated] (KAFKA-583) SimpleConsumerShell may receive less data inconsistently
Posted by "John Fung (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/KAFKA-583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
John Fung updated KAFKA-583:
----------------------------
Attachment: (was: broker_log_segment_files.tar.gz)
> SimpleConsumerShell may receive less data inconsistently
> --------------------------------------------------------
>
> Key: KAFKA-583
> URL: https://issues.apache.org/jira/browse/KAFKA-583
> Project: Kafka
> Issue Type: Bug
> Reporter: John Fung
>