Posted to issues@kudu.apache.org by "Alexey Serbin (JIRA)" <ji...@apache.org> on 2018/03/10 20:57:00 UTC

[jira] [Commented] (KUDU-2295) nullptr dereference while scanning on already shutdown tablet replica

    [ https://issues.apache.org/jira/browse/KUDU-2295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16394329#comment-16394329 ] 

Alexey Serbin commented on KUDU-2295:
-------------------------------------

One more stack trace, captured after the RYW changes were committed:
{noformat}
PC: @     0x7f563b5e31cb std::atomic_bool::load()                               
*** SIGSEGV (@0x1f8) received by PID 19893 (TID 0x7f561ce82700) from PID 504; stack trace: ***
    @     0x7f5638713330 (unknown) at ??:0                                      
    @     0x7f563b5e31cb std::atomic_bool::load() at ??:0                       
    @     0x7f563b609c31 kudu::tablet::MvccManager::is_open() at ??:0           
    @     0x7f563b6085f3 kudu::tablet::MvccManager::CheckOpen() at ??:0         
    @     0x7f563b607fc5 kudu::tablet::MvccManager::WaitUntil() at ??:0         
    @     0x7f563b608938 kudu::tablet::MvccManager::WaitForSnapshotWithAllCommitted() at ??:0
    @     0x7f563ca61b55 kudu::tserver::TabletServiceImpl::HandleScanAtSnapshot() at ??:0
    @     0x7f563ca5c0e2 kudu::tserver::TabletServiceImpl::HandleNewScanRequest() at ??:0
    @     0x7f563ca59793 kudu::tserver::TabletServiceImpl::Scan() at ??:0       
    @     0x7f5637324e4d kudu::tserver::TabletServerServiceIf::TabletServerServiceIf()::$_5::operator()() at ??:0
    @     0x7f5637324c92 std::_Function_handler<>::_M_invoke() at ??:0          
    @     0x7f563648992b std::function<>::operator()() at ??:0                  
    @     0x7f56364891ed kudu::rpc::GeneratedServiceIf::Handle() at ??:0        
    @     0x7f563648b5e6 kudu::rpc::ServicePool::RunThread() at ??:0            
    @     0x7f563648dc29 boost::_mfi::mf0<>::operator()() at ??:0               
    @     0x7f563648db90 boost::_bi::list1<>::operator()<>() at ??:0            
    @     0x7f563648db3a boost::_bi::bind_t<>::operator()() at ??:0             
    @     0x7f563648d91d boost::detail::function::void_function_obj_invoker0<>::invoke() at ??:0
    @     0x7f5636430078 boost::function0<>::operator()() at ??:0               
    @     0x7f563472c08d kudu::Thread::SuperviseThread() at ??:0                
    @     0x7f563870b184 start_thread at ??:0                                   
    @     0x7f5630a2affd clone at ??:0  
{noformat}

> nullptr dereference while scanning on already shutdown tablet replica
> ---------------------------------------------------------------------
>
>                 Key: KUDU-2295
>                 URL: https://issues.apache.org/jira/browse/KUDU-2295
>             Project: Kudu
>          Issue Type: Bug
>          Components: tserver
>    Affects Versions: 1.7.0
>            Reporter: Alexey Serbin
>            Assignee: Alexey Serbin
>            Priority: Major
>
> While running {{raft_consensus_stress-itest}}, I found that one of the tablet servers crashed with the following stack trace:
> {noformat}
>                                      
> *** Aborted at 1518480865 (unix time) try "date -d @1518480865" if you are using GNU date ***
> PC: @     0x7f1e02025790 scoped_refptr<>::operator->()                          
> *** SIGSEGV (@0x160) received by PID 8782 (TID 0x7f1de3c7e700) from PID 352; stack trace: ***
>     @     0x7f1dfdcfc330 (unknown) at ??:0                                      
>     @     0x7f1e02025790 scoped_refptr<>::operator->() at ??:0                  
>     @     0x7f1e00ae62e7 kudu::tablet::Tablet::GetTabletAncientHistoryMark() at ??:0
>     @     0x7f1e00ae627d kudu::tablet::Tablet::GetHistoryGcOpts() at ??:0       
>     @     0x7f1e02012c53 kudu::tserver::(anonymous namespace)::VerifyNotAncientHistory() at ??:0
>     @     0x7f1e0201223b kudu::tserver::TabletServiceImpl::HandleScanAtSnapshot() at ??:0
>     @     0x7f1e0200c6dd kudu::tserver::TabletServiceImpl::HandleNewScanRequest() at ??:0
>     @     0x7f1e02009d33 kudu::tserver::TabletServiceImpl::Scan() at ??:0       
>     @     0x7f1dfc90de4d kudu::tserver::TabletServerServiceIf::TabletServerServiceIf()::$_5::operator()() at ??:0
>     @     0x7f1dfc90dc92 std::_Function_handler<>::_M_invoke() at ??:0          
>     @     0x7f1dfba728ab std::function<>::operator()() at ??:0                  
>     @     0x7f1dfba7216d kudu::rpc::GeneratedServiceIf::Handle() at ??:0        
>     @     0x7f1dfba74526 kudu::rpc::ServicePool::RunThread() at ??:0            
>     @     0x7f1dfba76ad9 boost::_mfi::mf0<>::operator()() at ??:0               
>     @     0x7f1dfba76a40 boost::_bi::list1<>::operator()<>() at ??:0            
>     @     0x7f1dfba769ea boost::_bi::bind_t<>::operator()() at ??:0             
>     @     0x7f1dfba767cd boost::detail::function::void_function_obj_invoker0<>::invoke() at ??:0
>     @     0x7f1dfba190f8 boost::function0<>::operator()() at ??:0               
>     @     0x7f1df9d1788d kudu::Thread::SuperviseThread() at ??:0                
>     @     0x7f1dfdcf4184 start_thread at ??:0                                   
>     @     0x7f1df6023ffd clone at ??:0                                          
>     @                0x0 (unknown)
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)