You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2023/01/10 17:03:16 UTC
[GitHub] [beam] Abacn opened a new pull request, #24964: Fix kafka performance test write failing due to server overwhelmed
Abacn opened a new pull request, #24964:
URL: https://github.com/apache/beam/pull/24964
Part of #22303
Kafka performance test starts failing again after #24879 in. This is because we removed shuffle=appliance there and then read from shuffle becomes faster. However, the producer is unable to digests data within timeout (see flooding warning log of `send failed : 'Expiring 148 record(s) for beam-sdf-0:130675 ms has passed since batch creation'`)
entered #24963 for the IO connector fix. For now remove ReShuffle and make the test green again.
* Remove ReShuffle before write
* Minor Optimizations on WriteCallBack
**Please** add a meaningful description for your change here
------------------------
Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily:
- [ ] Mention the appropriate issue in your description (for example: `addresses #123`), if applicable. This will automatically add a link to the pull request in the issue. If you would like the issue to automatically close on merging the pull request, comment `fixes #<ISSUE NUMBER>` instead.
- [ ] Update `CHANGES.md` with noteworthy changes.
- [ ] If this contribution is large, please file an Apache [Individual Contributor License Agreement](https://www.apache.org/licenses/icla.pdf).
See the [Contributor Guide](https://beam.apache.org/contribute) for more tips on [how to make review process smoother](https://beam.apache.org/contribute/get-started-contributing/#make-the-reviewers-job-easier).
To check the build health, please visit [https://github.com/apache/beam/blob/master/.test-infra/BUILD_STATUS.md](https://github.com/apache/beam/blob/master/.test-infra/BUILD_STATUS.md)
GitHub Actions Tests Status (on master branch)
------------------------------------------------------------------------------------------------
[![Build python source distribution and wheels](https://github.com/apache/beam/workflows/Build%20python%20source%20distribution%20and%20wheels/badge.svg?branch=master&event=schedule)](https://github.com/apache/beam/actions?query=workflow%3A%22Build+python+source+distribution+and+wheels%22+branch%3Amaster+event%3Aschedule)
[![Python tests](https://github.com/apache/beam/workflows/Python%20tests/badge.svg?branch=master&event=schedule)](https://github.com/apache/beam/actions?query=workflow%3A%22Python+Tests%22+branch%3Amaster+event%3Aschedule)
[![Java tests](https://github.com/apache/beam/workflows/Java%20Tests/badge.svg?branch=master&event=schedule)](https://github.com/apache/beam/actions?query=workflow%3A%22Java+Tests%22+branch%3Amaster+event%3Aschedule)
[![Go tests](https://github.com/apache/beam/workflows/Go%20tests/badge.svg?branch=master&event=schedule)](https://github.com/apache/beam/actions?query=workflow%3A%22Go+tests%22+branch%3Amaster+event%3Aschedule)
See [CI.md](https://github.com/apache/beam/blob/master/CI.md) for more information about GitHub Actions CI.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] Abacn commented on pull request #24964: Fix kafka performance test write failing due to server overwhelmed
Posted by GitBox <gi...@apache.org>.
Abacn commented on PR #24964:
URL: https://github.com/apache/beam/pull/24964#issuecomment-1378001852
Run Java_Kafka_IO_Direct PreCommit
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] Abacn merged pull request #24964: Fix kafka performance test write failing due to server overwhelmed
Posted by GitBox <gi...@apache.org>.
Abacn merged PR #24964:
URL: https://github.com/apache/beam/pull/24964
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] Abacn commented on pull request #24964: Fix kafka performance test write failing due to server overwhelmed
Posted by GitBox <gi...@apache.org>.
Abacn commented on PR #24964:
URL: https://github.com/apache/beam/pull/24964#issuecomment-1377580364
Run Java KafkaIO Performance Test
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] Abacn commented on pull request #24964: Fix kafka performance test write failing due to server overwhelmed
Posted by GitBox <gi...@apache.org>.
Abacn commented on PR #24964:
URL: https://github.com/apache/beam/pull/24964#issuecomment-1378009975
Run Java KafkaIO Performance Test
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] Abacn commented on pull request #24964: Fix kafka performance test write failing due to server overwhelmed
Posted by GitBox <gi...@apache.org>.
Abacn commented on PR #24964:
URL: https://github.com/apache/beam/pull/24964#issuecomment-1378818024
hold on for now. Removing the reshuffle in testbatch write causing batch test has high probability fail (reshuffle added before to fix this issue)
We need to fix the IO connector to make sure it is robust to incoming traffic
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] Abacn commented on pull request #24964: Fix kafka performance test write failing due to server overwhelmed
Posted by GitBox <gi...@apache.org>.
Abacn commented on PR #24964:
URL: https://github.com/apache/beam/pull/24964#issuecomment-1379543289
Run Java KafkaIO Performance Test
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] Abacn commented on pull request #24964: Fix kafka performance test write failing due to server overwhelmed
Posted by GitBox <gi...@apache.org>.
Abacn commented on PR #24964:
URL: https://github.com/apache/beam/pull/24964#issuecomment-1379686826
Run Java KafkaIO Performance Test
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] github-actions[bot] commented on pull request #24964: Fix kafka performance test write failing due to server overwhelmed
Posted by GitBox <gi...@apache.org>.
github-actions[bot] commented on PR #24964:
URL: https://github.com/apache/beam/pull/24964#issuecomment-1377581114
Stopping reviewer notifications for this pull request: review requested by someone other than the bot, ceding control
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] Abacn commented on a diff in pull request #24964: Fix kafka performance test write failing due to server overwhelmed
Posted by GitBox <gi...@apache.org>.
Abacn commented on code in PR #24964:
URL: https://github.com/apache/beam/pull/24964#discussion_r1067646215
##########
sdks/java/io/kafka/src/test/java/org/apache/beam/sdk/io/kafka/KafkaIOIT.java:
##########
@@ -212,14 +210,16 @@ public void testKafkaIOReadsAndWritesCorrectlyInStreaming() throws IOException {
PipelineResult writeResult = writePipeline.run();
PipelineResult.State writeState = writeResult.waitUntilFinish();
+ // Fail the test if pipeline failed.
+ assertNotEquals(PipelineResult.State.FAILED, writeState);
Review Comment:
Current failing on master is due to the write pipeline continuously hitting OOM and cannot finish the bundle within timeout; then read pipeline does not receive expected number of data, e.g. https://ci-beam.apache.org/view/PerformanceTests/job/beam_PerformanceTests_Kafka_IO/3546/console
```
org.apache.beam.sdk.io.kafka.KafkaIOIT > testKafkaIOReadsAndWritesCorrectlyInStreaming FAILED
20:26:04 java.lang.AssertionError: actual number of records 34569527 smaller than expected: 100000000.
```
put this assertion right after writeResult.waitUntilFinish() to make test fail early if write pipeline fails.
The OOM issue is also a motivation of using single Callback instance (though not have decisive help)
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] Abacn commented on pull request #24964: Fix kafka performance test write failing due to server overwhelmed
Posted by GitBox <gi...@apache.org>.
Abacn commented on PR #24964:
URL: https://github.com/apache/beam/pull/24964#issuecomment-1378010127
Run Python_PVR_Flink PreCommit
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] Abacn commented on pull request #24964: Fix kafka performance test write failing due to server overwhelmed
Posted by GitBox <gi...@apache.org>.
Abacn commented on PR #24964:
URL: https://github.com/apache/beam/pull/24964#issuecomment-1378001372
Run RAT PreCommit
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] Abacn commented on pull request #24964: Fix kafka performance test write failing due to server overwhelmed
Posted by GitBox <gi...@apache.org>.
Abacn commented on PR #24964:
URL: https://github.com/apache/beam/pull/24964#issuecomment-1377693470
Performance test succeeded after removing Reshuffle (on test branch)
https://ci-beam.apache.org/job/beam_PerformanceTests_Kafka_IO/3538/
Seems like jenkins trigger broken again
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] Abacn commented on pull request #24964: Fix kafka performance test write failing due to server overwhelmed
Posted by GitBox <gi...@apache.org>.
Abacn commented on PR #24964:
URL: https://github.com/apache/beam/pull/24964#issuecomment-1379735937
test succeeded. Anyway flaky is better than permared. PTAL @johnjcasey
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] Abacn commented on pull request #24964: Fix kafka performance test write failing due to server overwhelmed
Posted by GitBox <gi...@apache.org>.
Abacn commented on PR #24964:
URL: https://github.com/apache/beam/pull/24964#issuecomment-1382459970
GA Playground Examples CI failure irrelevant. merging for now
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] Abacn commented on pull request #24964: Fix kafka performance test write failing due to server overwhelmed
Posted by GitBox <gi...@apache.org>.
Abacn commented on PR #24964:
URL: https://github.com/apache/beam/pull/24964#issuecomment-1377579226
R: @johnjcasey
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] johnjcasey commented on a diff in pull request #24964: Fix kafka performance test write failing due to server overwhelmed
Posted by GitBox <gi...@apache.org>.
johnjcasey commented on code in PR #24964:
URL: https://github.com/apache/beam/pull/24964#discussion_r1066290860
##########
sdks/java/io/kafka/src/main/java/org/apache/beam/sdk/io/kafka/KafkaWriter.java:
##########
@@ -42,13 +42,16 @@
*/
class KafkaWriter<K, V> extends DoFn<ProducerRecord<K, V>, Void> {
+ protected transient @Nullable Callback callback;
Review Comment:
what is the motivation for a single callback?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] Abacn commented on a diff in pull request #24964: Fix kafka performance test write failing due to server overwhelmed
Posted by GitBox <gi...@apache.org>.
Abacn commented on code in PR #24964:
URL: https://github.com/apache/beam/pull/24964#discussion_r1066429749
##########
sdks/java/io/kafka/src/main/java/org/apache/beam/sdk/io/kafka/KafkaWriter.java:
##########
@@ -42,13 +42,16 @@
*/
class KafkaWriter<K, V> extends DoFn<ProducerRecord<K, V>, Void> {
+ protected transient @Nullable Callback callback;
Review Comment:
SendCallback does not hold any state other than a reference to its outer class (KafkaWriter.this). It modified member variables of KafkaWriter in a synchronized block. This can be set as a transient member of KafkaWriter which has less overhead (no need new object each processElement call, and this is also what flink kafka connector doing)
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org