You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@pinot.apache.org by GitBox <gi...@apache.org> on 2021/10/04 22:50:12 UTC

[GitHub] [pinot] mcvsubbu commented on pull request #7500: upgrade helix to 1.0.2

mcvsubbu commented on pull request #7500:
URL: https://github.com/apache/pinot/pull/7500#issuecomment-933913510


   Copy pasting message from @richardstartin  before it disappears from chat
   
   hi Subbu, the upgrade wasn't simple and it's possible I made some mistakes doing it, but several controller tests time out on my branch. E.g. in ControllerInstanceToggleTest the cluster doesn't get set up properly.
   09:26:27.461 [ZkClient-EventThread-33-localhost:2191] ERROR org.apache.helix.zookeeper.zkclient.callback.ZkAsyncCallbacks - Interrupted waiting for success
   java.lang.InterruptedException: null
   	at java.lang.Object.wait(Native Method) ~[?:?]
   	at java.lang.Object.wait(Object.java:328) ~[?:?]
   	at org.apache.helix.zookeeper.zkclient.callback.ZkAsyncCallbacks$DefaultCallback.waitForSuccess(ZkAsyncCallbacks.java:249) [helix-core-1.0.2.jar:1.0.2]
   	at org.apache.helix.zookeeper.zkclient.ZkClient.issueSync(ZkClient.java:1306) [helix-core-1.0.2.jar:1.0.2]
   	at org.apache.helix.zookeeper.zkclient.ZkClient.access$300(ZkClient.java:85) [helix-core-1.0.2.jar:1.0.2]
   	at org.apache.helix.zookeeper.zkclient.ZkClient$4.run(ZkClient.java:1332) [helix-core-1.0.2.jar:1.0.2]
   	at org.apache.helix.zookeeper.zkclient.ZkEventThread.run(ZkEventThread.java:99) [helix-core-1.0.2.jar:1.0.2]
   09:26:30.819 [HelixController-pipeline-default-ControllerTestUtils-(ed00bafa_DEFAULT)] ERROR org.apache.helix.controller.rebalancer.DelayedAutoRebalancer - No instances or active instances available for resource leadControllerResource, allInstances: [], liveInstances: [], activeInstances: []
   09:26:30.828 [HelixController-pipeline-default-ControllerTestUtils-(6ca9c1f1_DEFAULT)] ERROR org.apache.helix.controller.rebalancer.DelayedAutoRebalancer - No instances or active instances available for resource leadControllerResource, allInstances: [], liveInstances: [], activeInstances: []
   09:26:30.834 [HelixController-pipeline-default-ControllerTestUtils-(a5f87de1_DEFAULT)] ERROR org.apache.helix.controller.rebalancer.DelayedAutoRebalancer - No instances or active instances available for resource leadControllerResource, allInstances: [], liveInstances: [], activeInstances: []
   09:26:30.841 [HelixController-pipeline-default-ControllerTestUtils-(3801afe7_DEFAULT)] ERROR org.apache.helix.controller.rebalancer.DelayedAutoRebalancer - No instances or active instances available for resource leadControllerResource, allInstances: [], liveInstances: [], activeInstances: []
   09:26:30.852 [HelixController-pipeline-default-ControllerTestUtils-(3805570d_DEFAULT)] ERROR org.apache.helix.controller.rebalancer.DelayedAutoRebalancer - No instances or active instances available for resource leadControllerResource, allInstances: [], liveInstances: [], activeInstances: []
   09:26:31.251 [HelixController-pipeline-default-ControllerTestUtils-(aaec8ce8_DEFAULT)] ERROR org.apache.helix.controller.rebalancer.DelayedAutoRebalancer - No instances or active instances available for resource leadControllerResource, allInstances: [], liveInstances: [], activeInstances: []
   09:26:31.277 [HelixController-pipeline-default-ControllerTestUtils-(d8cc3e98_DEFAULT)] ERROR org.apache.helix.controller.rebalancer.DelayedAutoRebalancer - No instances or active instances available for resource leadControllerResource, allInstances: [], liveInstances: [], activeInstances: []
   09:26:31.289 [HelixController-pipeline-default-ControllerTestUtils-(1b94a4b9_DEFAULT)] ERROR org.apache.helix.controller.rebalancer.DelayedAutoRebalancer - No instances or active instances available for resource leadControllerResource, allInstances: [], liveInstances: [], activeInstances: []
   09:26:31.297 [HelixController-pipeline-default-ControllerTestUtils-(842b241b_DEFAULT)] ERROR org.apache.helix.controller.rebalancer.DelayedAutoRebalancer - No instances or active instances available for resource leadControllerResource, allInstances: [], liveInstances: [], activeInstances: []
   09:26:31.307 [HelixController-pipeline-default-ControllerTestUtils-(0b4b5414_DEFAULT)] ERROR org.apache.helix.controller.rebalancer.DelayedAutoRebalancer - No instances or active instances available for resource leadControllerResource, allInstances: [], liveInstances: [], activeInstances: []
   09:26:31.314 [HelixController-pipeline-default-ControllerTestUtils-(8bedd9f7_DEFAULT)] ERROR org.apache.helix.controller.rebalancer.DelayedAutoRebalancer - No instances or active instances available for resource leadControllerResource, allInstances: [], liveInstances: [], activeInstances: []
   09:26:31.320 [HelixController-pipeline-default-ControllerTestUtils-(6477b0f8_DEFAULT)] ERROR org.apache.helix.controller.rebalancer.DelayedAutoRebalancer - No instances or active instances available for resource leadControllerResource, allInstances: [], liveInstances: [], activeInstances: []
   Oct 01, 2021 9:26:37 AM org.glassfish.grizzly.http.server.NetworkListener start
   INFO: Started listener bound to [0.0.0.0:18998]
   Oct 01, 2021 9:26:37 AM org.glassfish.grizzly.http.server.HttpServer start
   INFO: [HttpServer] Started.
   09:26:41.821 [ZkClient-EventThread-320-localhost:2191] ERROR org.apache.helix.messaging.handling.HelixTaskExecutor - Message 1375bc51-25a5-4e8e-a291-e407ecb959c2 cannot be processed: 1375bc51-25a5-4e8e-a291-e407ecb959c2, {CREATE_TIMESTAMP=1633076801818, ClusterEventName=InstanceConfigChange, FROM_STATE=ONLINE, MSG_ID=1375bc51-25a5-4e8e-a291-e407ecb959c2, MSG_STATE=new, MSG_TYPE=STATE_TRANSITION, PARTITION_NAME=toggleTable72090524845971, RESOURCE_NAME=toggleTable_OFFLINE, RESOURCE_TAG=toggleTable_OFFLINE, RETRY_COUNT=3, SRC_NAME=localhost_18998, SRC_SESSION_ID=100044bca960003, STATE_MODEL_DEF=SegmentOnlineOfflineStateModel, STATE_MODEL_FACTORY_NAME=DEFAULT, TGT_NAME=Server_localhost_3, TGT_SESSION_ID=100044bca960018, TO_STATE=OFFLINE}{}{}Partition toggleTable72090524845971 current state is same as toState (ONLINE->OFFLINE) from message.
   09:26:41.822 [ZkClient-EventThread-320-localhost:2191] ERROR org.apache.helix.util.StatusUpdateUtil - StatusUpdate Error record: 100044bca960018__toggleTable_OFFLINE, {}{HELIX_ERROR     20211001-092641.000821 STATE_TRANSITION f04e6b17-edab-495a-bf33-ce056dec5c3a={AdditionalInfo=Partition toggleTable72090524845971 current state is same as toState (ONLINE->OFFLINE) from message., Class=class org.apache.helix.participant.HelixStateMachineEngine, MSG_ID=1375bc51-25a5-4e8e-a291-e407ecb959c2, Message state=NEW}}{}
   My only motivation to upgrade was to avoid creating an ObjectMapper on each ZN serialization and it was taking more time than justified. So if you plan to do it at LinkedIn with the Helix team I think it's better that way.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@pinot.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@pinot.apache.org
For additional commands, e-mail: commits-help@pinot.apache.org