Posted to notifications@skywalking.apache.org by GitBox <gi...@apache.org> on 2019/09/06 03:07:52 UTC

[GitHub] [skywalking] liyongfa0311 opened a new issue #3424: Why does the collector always crash after I upgrade to version 6.3?

URL: https://github.com/apache/skywalking/issues/3424
 
 
   Please answer these questions before submitting your issue.
   
   - Why do you submit this issue?
   - [X] Question or discussion
   - [ ] Bug
   - [ ] Requirement
   - [ ] Feature or performance improvement
   
   ___
   ### Question
   - What do you want to know?
   After deleting all the indexes in Elasticsearch (ES), I started the 6.3 collector and then started all the agent services. After about one hour, the collector log showed various exceptions, such as:
   2019-09-06 10:44:26,321 - org.apache.skywalking.oap.server.receiver.trace.provider.handler.v6.grpc.TraceSegmentReportServiceHandler - 64 [grpc-default-executor-575] ERROR [] - CANCELLED: cancelled before receiving half close
   io.grpc.StatusRuntimeException: CANCELLED: cancelled before receiving half close
   	at io.grpc.Status.asRuntimeException(Status.java:517) ~[grpc-core-1.15.1.jar:1.15.1]
   	at io.grpc.stub.ServerCalls$StreamingServerCallHandler$StreamingServerCallListener.onCancel(ServerCalls.java:272) [grpc-stub-1.15.1.jar:1.15.1]
   	at io.grpc.internal.ServerCallImpl$ServerStreamListenerImpl.closed(ServerCallImpl.java:293) [grpc-core-1.15.1.jar:1.15.1]
   	at io.grpc.internal.ServerImpl$JumpToApplicationThreadServerStreamListener$1Closed.runInContext(ServerImpl.java:738) [grpc-core-1.15.1.jar:1.15.1]
   	at io.grpc.internal.ContextRunnable.run(ContextRunnable.java:37) [grpc-core-1.15.1.jar:1.15.1]
   	at io.grpc.internal.SerializingExecutor.run(SerializingExecutor.java:123) [grpc-core-1.15.1.jar:1.15.1]
   	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [?:1.8.0_65]
   	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [?:1.8.0_65]
   	at java.lang.Thread.run(Thread.java:745) [?:1.8.0_65]
   After that, the following error occurred many times:
   2019-09-06 10:44:30,572 - org.apache.skywalking.oap.server.core.register.worker.RegisterPersistentWorker - 112 [DataCarrier.REGISTER_L2.BulkConsumePool.0.Thread] ERROR [] - GC overhead limit exceeded
   java.lang.OutOfMemoryError: GC overhead limit exceeded
   2019-09-06 10:44:30,573 - org.apache.skywalking.oap.server.core.register.worker.RegisterPersistentWorker - 112 [DataCarrier.REGISTER_L2.BulkConsumePool.0.Thread] ERROR [] - GC overhead limit exceeded
   java.lang.OutOfMemoryError: GC overhead limit exceeded
   2019-09-06 10:44:30,573 - org.apache.skywalking.oap.server.core.register.worker.RegisterPersistentWorker - 112 [DataCarrier.REGISTER_L2.BulkConsumePool.0.Thread] ERROR [] - GC overhead limit exceeded
   java.lang.OutOfMemoryError: GC overhead limit exceeded
   2019-09-06 10:44:30,573 - org.apache.skywalking.oap.server.core.register.worker.RegisterPersistentWorker - 112 [DataCarrier.REGISTER_L2.BulkConsumePool.0.Thread] ERROR [] - GC overhead limit exceeded
   java.lang.OutOfMemoryError: GC overhead limit exceeded
   2019-09-06 10:44:30,574 - org.apache.skywalking.oap.server.core.register.worker.RegisterPersistentWorker - 112 [DataCarrier.REGISTER_L2.BulkConsumePool.0.Thread] ERROR [] - GC overhead limit exceeded
   java.lang.OutOfMemoryError: GC overhead limit exceeded
   2019-09-06 10:44:30,574 - org.apache.skywalking.oap.server.core.register.worker.RegisterPersistentWorker - 112 [DataCarrier.REGISTER_L2.BulkConsumePool.0.Thread] ERROR [] - GC overhead limit exceeded
   java.lang.OutOfMemoryError: GC overhead limit exceeded
   
   The collector still does not work after a restart, and the following exception may appear:
   2019-09-06 00:47:24,324 - org.apache.skywalking.oap.server.storage.plugin.elasticsearch.cache.NetworkAddressInventoryCacheEsDAO - 95 [pool-14-thread-1] ERROR [] - Request cannot be executed; I/O reactor status: STOPPED
   java.lang.IllegalStateException: Request cannot be executed; I/O reactor status: STOPPED
   	at org.apache.http.util.Asserts.check(Asserts.java:46) ~[httpcore-4.4.5.jar:4.4.5]
   	at org.apache.http.impl.nio.client.CloseableHttpAsyncClientBase.ensureRunning(CloseableHttpAsyncClientBase.java:90) ~[httpasyncclient-4.1.2.jar:4.1.2]
   	at org.apache.http.impl.nio.client.InternalHttpAsyncClient.execute(InternalHttpAsyncClient.java:123) ~[httpasyncclient-4.1.2.jar:4.1.2]
   	at org.elasticsearch.client.RestClient.performRequestAsync(RestClient.java:366) ~[elasticsearch-rest-client-6.3.2.jar:6.3.2]
   	at org.elasticsearch.client.RestClient.performRequestAsyncNoCatch(RestClient.java:351) ~[elasticsearch-rest-client-6.3.2.jar:6.3.2]
   	at org.elasticsearch.client.RestClient.performRequest(RestClient.java:233) ~[elasticsearch-rest-client-6.3.2.jar:6.3.2]
   	at org.elasticsearch.client.RestClient.performRequest(RestClient.java:198) ~[elasticsearch-rest-client-6.3.2.jar:6.3.2]
   	at org.elasticsearch.client.RestHighLevelClient.performRequest(RestHighLevelClient.java:522) ~[elasticsearch-rest-high-level-client-6.3.2.jar:6.3.2]
   	at org.elasticsearch.client.RestHighLevelClient.performRequestAndParseEntity(RestHighLevelClient.java:508) ~[elasticsearch-rest-high-level-client-6.3.2.jar:6.3.2]
   	at org.elasticsearch.client.RestHighLevelClient.search(RestHighLevelClient.java:404) ~[elasticsearch-rest-high-level-client-6.3.2.jar:6.3.2]
   	at org.apache.skywalking.oap.server.library.client.elasticsearch.ElasticSearchClient.search(ElasticSearchClient.java:220) ~[library-client-6.3.0.jar:6.3.0]
   	at org.apache.skywalking.oap.server.storage.plugin.elasticsearch.cache.NetworkAddressInventoryCacheEsDAO.loadLastUpdate(NetworkAddressInventoryCacheEsDAO.java:89) ~[storage-elasticsearch-plugin-6.3.0.jar:6.3.0]
   	at org.apache.skywalking.oap.server.core.cache.CacheUpdateTimer.updateNetAddressInventory(CacheUpdateTimer.java:83) ~[server-core-6.3.0.jar:6.3.0]
   	at org.apache.skywalking.oap.server.core.cache.CacheUpdateTimer.update(CacheUpdateTimer.java:57) ~[server-core-6.3.0.jar:6.3.0]
   	at org.apache.skywalking.oap.server.core.cache.CacheUpdateTimer.lambda$start$0(CacheUpdateTimer.java:48) ~[server-core-6.3.0.jar:6.3.0]
   	at org.apache.skywalking.apm.util.RunnableWithExceptionProtection.run(RunnableWithExceptionProtection.java:36) [apm-util-6.3.0.jar:6.3.0]
   	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [?:1.8.0_65]
   	at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308) [?:1.8.0_65]
   	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180) [?:1.8.0_65]
   	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294) [?:1.8.0_65]
   	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [?:1.8.0_65]
   	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [?:1.8.0_65]
   	at java.lang.Thread.run(Thread.java:745) [?:1.8.0_65]
   
   
   
   
   
   The following exception may also appear:
   
   java.io.IOException: listener timeout after waiting for [30000] ms
   	at org.elasticsearch.client.RestClient$SyncResponseListener.get(RestClient.java:684) ~[elasticsearch-rest-client-6.3.2.jar:6.3.2]
   	at org.elasticsearch.client.RestClient.performRequest(RestClient.java:235) ~[elasticsearch-rest-client-6.3.2.jar:6.3.2]
   	at org.elasticsearch.client.RestClient.performRequest(RestClient.java:198) ~[elasticsearch-rest-client-6.3.2.jar:6.3.2]
   	at org.elasticsearch.client.RestHighLevelClient.performRequest(RestHighLevelClient.java:522) ~[elasticsearch-rest-high-level-client-6.3.2.jar:6.3.2]
   	at org.elasticsearch.client.RestHighLevelClient.performRequestAndParseEntity(RestHighLevelClient.java:508) ~[elasticsearch-rest-high-level-client-6.3.2.jar:6.3.2]
   	at org.elasticsearch.client.RestHighLevelClient.get(RestHighLevelClient.java:293) ~[elasticsearch-rest-high-level-client-6.3.2.jar:6.3.2]
   	at org.apache.skywalking.oap.server.library.client.elasticsearch.ElasticSearchClient.get(ElasticSearchClient.java:226) ~[library-client-6.3.0.jar:6.3.0]
   	at org.apache.skywalking.oap.server.storage.plugin.elasticsearch.cache.EndpointInventoryCacheEsDAO.getEndpointId(EndpointInventoryCacheEsDAO.java:49) ~[storage-elasticsearch-plugin-6.3.0.jar:6.3.0]
   	at org.apache.skywalking.oap.server.core.cache.EndpointInventoryCache.getEndpointId(EndpointInventoryCache.java:69) ~[server-core-6.3.0.jar:6.3.0]
   	at org.apache.skywalking.oap.server.core.register.service.EndpointInventoryRegister.getOrCreate(EndpointInventoryRegister.java:54) ~[server-core-6.3.0.jar:6.3.0]
   	at org.apache.skywalking.oap.server.receiver.register.provider.handler.v6.grpc.RegisterServiceHandler.lambda$doEndpointRegister$2(RegisterServiceHandler.java:140) ~[skywalking-register-receiver-plugin-6.3.0.jar:6.3.0]
   	at java.util.ArrayList.forEach(ArrayList.java:1249) [?:1.8.0_65]
   	at java.util.Collections$UnmodifiableCollection.forEach(Collections.java:1080) [?:1.8.0_65]
   	at org.apache.skywalking.oap.server.receiver.register.provider.handler.v6.grpc.RegisterServiceHandler.doEndpointRegister(RegisterServiceHandler.java:136) [skywalking-register-receiver-plugin-6.3.0.jar:6.3.0]
   	at org.apache.skywalking.apm.network.register.v2.RegisterGrpc$MethodHandlers.invoke(RegisterGrpc.java:513) [apm-network-6.3.0.jar:6.3.0]
   	at io.grpc.stub.ServerCalls$UnaryServerCallHandler$UnaryServerCallListener.onHalfClose(ServerCalls.java:171) [grpc-stub-1.15.1.jar:1.15.1]
   	at io.grpc.internal.ServerCallImpl$ServerStreamListenerImpl.halfClosed(ServerCallImpl.java:283) [grpc-core-1.15.1.jar:1.15.1]
   	at io.grpc.internal.ServerImpl$JumpToApplicationThreadServerStreamListener$1HalfClosed.runInContext(ServerImpl.java:707) [grpc-core-1.15.1.jar:1.15.1]
   	at io.grpc.internal.ContextRunnable.run(ContextRunnable.java:37) [grpc-core-1.15.1.jar:1.15.1]
   	at io.grpc.internal.SerializingExecutor.run(SerializingExecutor.java:123) [grpc-core-1.15.1.jar:1.15.1]
   	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [?:1.8.0_65]
   	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [?:1.8.0_65]
   	at java.lang.Thread.run(Thread.java:745) [?:1.8.0_65]
   
   
   After these exceptions occur, CPU usage on the machine becomes very high. I want to know what I can do to ensure that the collector works properly. Elasticsearch itself shows no problems. The above exceptions occur in both cluster mode and single-node mode. I have restarted the collector more than 10 times, but each time it stops working after about an hour.
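
To see which error appears first and how often each one repeats, the collector log can be tallied per logger and message. The following is only a minimal sketch: it assumes every log entry follows the layout of the excerpts above (`timestamp - logger - line [thread] LEVEL [] - message`), and the names `LOG_LINE` and `summarize_errors` are hypothetical helpers, not part of SkyWalking.

```python
import re
from collections import Counter

# Assumed layout, taken from the excerpts in this report:
# 2019-09-06 10:44:30,572 - <logger> - <line> [<thread>] ERROR [] - <message>
LOG_LINE = re.compile(
    r"^\s*(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3}) - "
    r"(?P<logger>\S+) - \d+ \[(?P<thread>[^\]]+)\] "
    r"(?P<level>[A-Z]+) \[[^\]]*\] - (?P<message>.*)$"
)


def summarize_errors(lines):
    """Count ERROR entries and record when each kind was first seen.

    Returns (counts, first_seen). Both are keyed by
    (short logger name, message); stack-trace lines are skipped
    because they do not match the entry layout.
    """
    counts = Counter()
    first_seen = {}
    for line in lines:
        match = LOG_LINE.match(line)
        if match is None or match.group("level") != "ERROR":
            continue
        short_logger = match.group("logger").rsplit(".", 1)[-1]
        key = (short_logger, match.group("message"))
        counts[key] += 1
        # Keep only the earliest timestamp for each kind of error.
        first_seen.setdefault(key, match.group("ts"))
    return counts, first_seen
```

Passing an open log file to `summarize_errors` and sorting the keys of `first_seen` by timestamp would show whether the `GC overhead limit exceeded` errors start before or after the gRPC and Elasticsearch client errors.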
   ___
   ### Bug
   - Which version of SkyWalking, OS and JRE?
   skywalking 6.3.0 version
   OS CentOs7.3
   JRE 1.9
   - Which company or project?
   
   - What happened?
   If possible, provide a way to reproduce the error, e.g. a demo application and component versions.
   
   ___
   ### Requirement or improvement
   - Please describe your requirements or improvement suggestions.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services