Posted to issues@kudu.apache.org by "Adar Dembo (JIRA)" <ji...@apache.org> on 2016/02/26 07:51:18 UTC

[jira] [Resolved] (KUDU-1325) TS crashes trying to remote bootstrap a non-running tablet

     [ https://issues.apache.org/jira/browse/KUDU-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Adar Dembo resolved KUDU-1325.
------------------------------
       Resolution: Fixed
    Fix Version/s: 0.8.0

> TS crashes trying to remote bootstrap a non-running tablet
> ----------------------------------------------------------
>
>                 Key: KUDU-1325
>                 URL: https://issues.apache.org/jira/browse/KUDU-1325
>             Project: Kudu
>          Issue Type: Bug
>    Affects Versions: 0.6.0
>            Reporter: Jean-Daniel Cryans
>            Assignee: Adar Dembo
>             Fix For: 0.8.0
>
>
> This happened on a TS while YCSB was running against the cluster:
> {noformat}
> E0205 08:59:11.157722 66927 consensus_queue.cc:348] T b98bed072a534c54a08794d61dbb2736 P 6f83d42f66c34337a12f93f39c31c5db [LEADER]: Error trying to read ahead of the log while preparing peer request: Incomplete: Op with index 35582 is ahead of the local log (next sequential op: 35582). Destination peer: Peer: fbd5dc753e264f819f1bc24fdc724982, Is new: false, Last received: 240.35582, Next index: 35583, Last known committed idx: 35580, Last exchange result: SUCCESS, Needs remote bootstrap: false
> *** Aborted at 1454703705 (unix time) try "date -d @1454703705" if you are using GNU date ***
> PC: @           0x978cd3 kudu::log::LogReader::GetSegmentsSnapshot()
> *** SIGSEGV (@0x48) received by PID 107515 (TID 0x7fe3e2420700) from PID 72; stack trace: ***
>     @       0x316de0f710 (unknown)
>     @           0x978cd3 kudu::log::LogReader::GetSegmentsSnapshot()
>     @           0x78f42c kudu::tserver::RemoteBootstrapSession::Init()
>     @           0x788b98 kudu::tserver::RemoteBootstrapServiceImpl::BeginRemoteBootstrapSession()
>     @           0x7f549a kudu::tserver::RemoteBootstrapServiceIf::Handle()
>     @           0x9d60a8 kudu::rpc::ServicePool::RunThread()
>     @          0x179da1a kudu::Thread::SuperviseThread()
>     @       0x316de079d1 (unknown)
>     @       0x316dae88fd (unknown)
>     @                0x0 (unknown)
> {noformat}
> This is where the error happens:
> {noformat}
> /usr/src/debug/kudu-0.7.0-kudu0.7.0-SNAPSHOT/src/kudu/gutil/atomicops-internals-x86.h:87
> /usr/src/debug/kudu-0.7.0-kudu0.7.0-SNAPSHOT/src/kudu/gutil/atomicops-internals-x86.h:144
> /usr/src/debug/kudu-0.7.0-kudu0.7.0-SNAPSHOT/src/kudu/gutil/spinlock.h:74
> /usr/src/debug/kudu-0.7.0-kudu0.7.0-SNAPSHOT/src/kudu/util/locks.h:45
> /opt/toolchain/boost-pic-1.55.0/include/boost/thread/lock_guard.hpp:38
> /usr/src/debug/kudu-0.7.0-kudu0.7.0-SNAPSHOT/src/kudu/consensus/log_reader.cc:397
> {noformat}
> [~tlipcon] suspects that a NULL log_reader is being dereferenced during a bad remote bootstrap. I'm saving the server logs and will do more digging.
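
The suspected failure mode can be illustrated with a minimal sketch. The fault address 0x48 and the resolved frames ending in a spinlock acquisition appear consistent with a lock member being touched through a NULL object pointer. Everything below (Status, FakeLogReader, FakeTablet, InitSession) is a hypothetical stand-in written for illustration; it is not Kudu's code and not necessarily how the issue was fixed. It only shows the general idea of checking for a missing reader and returning an error instead of dereferencing it.

```cpp
#include <cstdint>
#include <mutex>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for an error-carrying return type (not Kudu's Status).
struct Status {
  bool ok;
  std::string message;
  static Status OK() { return {true, ""}; }
  static Status IllegalState(std::string msg) { return {false, std::move(msg)}; }
};

// Hypothetical reader whose snapshot method takes a member lock first.
class FakeLogReader {
 public:
  // Copies the segment list under the lock. Calling this through a null
  // pointer is undefined behavior; in practice it tends to fault on the
  // lock access, at a small address equal to the member's offset.
  void GetSegmentsSnapshot(std::vector<int64_t>* out) const {
    std::lock_guard<std::mutex> l(lock_);
    *out = segments_;
  }

  void AppendSegment(int64_t seqno) {
    std::lock_guard<std::mutex> l(lock_);
    segments_.push_back(seqno);
  }

 private:
  mutable std::mutex lock_;
  std::vector<int64_t> segments_;
};

// Hypothetical tablet: the reader stays null until the tablet is running.
struct FakeTablet {
  FakeLogReader* reader = nullptr;
};

// Assumed shape of a session initializer: refuse to serve a tablet that
// has no reader rather than dereferencing the null pointer.
Status InitSession(const FakeTablet& tablet, std::vector<int64_t>* segments) {
  if (tablet.reader == nullptr) {
    return Status::IllegalState("tablet is not running; log reader unavailable");
  }
  tablet.reader->GetSegmentsSnapshot(segments);
  return Status::OK();
}
```

In this sketch, calling InitSession on a FakeTablet whose reader is still null returns a failed Status that the caller can report back to the requesting peer, while a tablet with a live reader yields its segment snapshot as usual.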


