You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@kudu.apache.org by "zhangsong (JIRA)" <ji...@apache.org> on 2016/05/29 14:39:12 UTC

[jira] [Commented] (KUDU-1472) kudu-tserver crash unexpected

    [ https://issues.apache.org/jira/browse/KUDU-1472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15305931#comment-15305931 ] 

zhangsong commented on KUDU-1472:
---------------------------------

linked with KUDU-1462

> kudu-tserver crash unexpected
> -----------------------------
>
>                 Key: KUDU-1472
>                 URL: https://issues.apache.org/jira/browse/KUDU-1472
>             Project: Kudu
>          Issue Type: Bug
>    Affects Versions: 0.8.0, 0.9.0
>            Reporter: zhangsong
>            Priority: Critical
>
> kudu-tserver will crash under some case, in jd.com 200-node env, it occurring frequently.
> some crash   info  from core file:
> (gdb) bt
> #0  0x0000000000a2489f in kudu::tablet::RowSetDataPB::SharedDtor (this=0x58fb5b180)
>    at /export/ldb/kudu-master/build/release/src/kudu/tablet/metadata.pb.cc:815
> #1  kudu::tablet::RowSetDataPB::~RowSetDataPB (this=0x58fb5b180, __in_chrg=<optimized out>)
>    at /export/ldb/kudu-master/build/release/src/kudu/tablet/metadata.pb.cc:809
> #2  kudu::tablet::RowSetDataPB::~RowSetDataPB (this=0x58fb5b180, __in_chrg=<optimized out>)
>    at /export/ldb/kudu-master/build/release/src/kudu/tablet/metadata.pb.cc:810
> #3  google::protobuf::internal::GenericTypeHandler<kudu::tablet::RowSetDataPB>::Delete (value=0x58fb5b180)
>    at /export/ldb/kudu-master/thirdparty/installed-deps/include/google/protobuf/repeated_field.h:363
> #4  google::protobuf::internal::RepeatedPtrFieldBase::Destroy<google::protobuf::RepeatedPtrField<kudu::tablet::RowSetDataPB>::TypeHandler> (
>    this=<optimized out>, this=<optimized out>) at /export/ldb/kudu-master/thirdparty/installed-deps/include/google/protobuf/repeated_field.h:869
> Backtrace stopped: Cannot access memory at address 0x7fc1f230fd08
> after crash , kudu-tserver will not be restarted successfully, due to some pb validation  check failed, for example:
>  check failed: _s.ok() Bad status: IO error: Could not init Tablet Manager: Failed to open tablet metadata for tablet: 260359a41a134c1f91631e9094847bcf: Failed to load tablet metadata for tablet id 260359a41a134c1f91631e9094847bcf: Could not load tablet metadata from /export/servers/kudu/tserver_data_7052/tablet-meta/260359a41a134c1f91631e9094847bcf: Unable to parse PB from path: /export/servers/kudu/tserver_data_7052/tablet-meta/260359a41a134c1f91631e9094847bcf
> kudu version is 0.9.0-snapshot, last commit id :  be10f8514c48950b64c7d59bbce848f3792ec52d 
> workload is: several write tasks  keeps inserting into kudu table, some task using java api, while others using impala.
> kudu-table will be scanned while whose tasks are running.
> almost everyday there will be a crash case. same phenomenon as described above. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)