You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mesos.apache.org by "Vinod Kone (JIRA)" <ji...@apache.org> on 2015/06/08 19:11:01 UTC

[jira] [Updated] (MESOS-2768) SIGPIPE in process::run_in_event_loop()

     [ https://issues.apache.org/jira/browse/MESOS-2768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vinod Kone updated MESOS-2768:
------------------------------
    Priority: Critical  (was: Major)

Saw this again. [~benjaminhindman] and [~jvanremoortere] can you take a look?

{code}
I0608 07:51:42.654503 33802 slave.cpp:3733] Current disk usage 4.65%. Max allowed age: 5.974269766096158days
I0608 07:52:42.655881 33802 slave.cpp:3733] Current disk usage 4.65%. Max allowed age: 5.974269123435694days
I0608 07:53:42.656980 33803 slave.cpp:3733] Current disk usage 4.65%. Max allowed age: 5.974268530210648days
I0608 07:54:42.658324 33799 slave.cpp:3733] Current disk usage 4.65%. Max allowed age: 5.974268283033542days
I0608 07:55:42.659076 33791 slave.cpp:3733] Current disk usage 4.65%. Max allowed age: 5.974259582399502days
I0608 07:56:42.660492 33801 slave.cpp:3733] Current disk usage 4.65%. Max allowed age: 5.974259088045301days
I0608 07:57:42.662521 33797 slave.cpp:3733] Current disk usage 4.65%. Max allowed age: 5.974258692561933days
I0608 07:58:42.663774 33792 slave.cpp:3733] Current disk usage 4.65%. Max allowed age: 5.974258099336887days
I0608 07:59:42.665227 33793 slave.cpp:3733] Current disk usage 4.65%. Max allowed age: 5.974257604982673days
W0608 07:59:42.665227 33796 logging.cpp:91] RAW: Received signal SIGPIPE; escalating to SIGABRT
*** Aborted at 1433750412 (unix time) try "date -d @1433750412" if you are using GNU date ***
PC: @     0x7f1e53cc1b6d raise
*** SIGABRT (@0x83fc) received by PID 33788 (TID 0x7f1e4b4d2940) from PID 33788; stack trace: ***
    @     0x7f1e53cc1ca0 (unknown)
    @     0x7f1e53cc1b6d raise
    @     0x7f1e546808e7 mesos::internal::logging::handler()
    @     0x7f1e53cc1ca0 (unknown)
    @     0x7f1e53cc0a2b __libc_write
    @     0x7f1e545bef7d evpipe_write.part.5
    @     0x7f1e54c998d0 process::run_in_event_loop<>()
    @     0x7f1e54c9790d process::io::poll()
    @     0x7f1e54c51c07 process::network::PollSocketImpl::send()
    @     0x7f1e54c5bf94 process::internal::send()
    @     0x7f1e54c6b05a process::SocketManager::send()
    @     0x7f1e54c6b6df process::SocketManager::send()
    @     0x7f1e54c6b986 process::HttpProxy::process()
    @     0x7f1e54c6c6a9 process::HttpProxy::waited()
    @     0x7f1e54c6409a process::ProcessManager::resume()
    @     0x7f1e54c6435c process::schedule()
    @     0x7f1e53cb983d start_thread
    @     0x7f1e534abfcd clone
/usr/local/bin/mesos-slave.sh: line 102: 33788 Aborted                 (core dumped) $debug /usr/local/sbin/mesos-slave "${MESOS_FLAGS[@]}"
Slave Exit Status: 134
{code}

> SIGPIPE in process::run_in_event_loop()
> ---------------------------------------
>
>                 Key: MESOS-2768
>                 URL: https://issues.apache.org/jira/browse/MESOS-2768
>             Project: Mesos
>          Issue Type: Bug
>    Affects Versions: 0.23.0
>            Reporter: Yan Xu
>            Priority: Critical
>
> Observed in production.
> {noformat:title=slave log}
> I0526 12:17:48.027257 51633 slave.cpp:4077] Received a new estimation of the oversubscribable resources 
> W0526 12:17:48.027257 51636 logging.cpp:91] RAW: Received signal SIGPIPE; escalating to SIGABRT
> *** Aborted at 1432642668 (unix time) try "date -d @1432642668" if you are using GNU date ***
> PC: @     0x7fa58c23eb6d raise
> *** SIGABRT (@0xc9a5) received by PID 51621 (TID 0x7fa58224c940) from PID 51621; stack trace: ***
>     @     0x7fa58c23eca0 (unknown)
>     @     0x7fa58c23eb6d raise
>     @     0x7fa58cc19ba7 mesos::internal::logging::handler()
>     @     0x7fa58c23eca0 (unknown)
>     @     0x7fa58c23da2b __libc_write
>     @     0x7fa58cb57b6f evpipe_write.part.5
>     @     0x7fa58d245070 process::run_in_event_loop<>()
>     @     0x7fa58d2441ba process::EventLoop::delay()
>     @     0x7fa58d1c3c9c process::clock::scheduleTick()
>     @     0x7fa58d1c65b1 process::Clock::timer()
>     @     0x7fa58d23915a process::delay<>()
>     @     0x7fa58d23a740 process::ReaperProcess::wait()
>     @     0x7fa58d21261a process::ProcessManager::resume()
>     @     0x7fa58d2128dc process::schedule()
>     @     0x7fa58c23683d start_thread
>     @     0x7fa58ba28fcd clone
> {noformat}
> {noformat:title=gdb}
> (gdb) bt
> #0  0x00007fa58c23eb6d in raise () from /lib64/libpthread.so.0
> #1  0x00007fa58cc19ba7 in mesos::internal::logging::handler (signal=Unhandled dwarf expression opcode 0xf3
> ) at logging/logging.cpp:92
> #2  <signal handler called>
> #3  0x00007fa58c23da2b in write () from /lib64/libpthread.so.0
> #4  0x00007fa58cb57b6f in evpipe_write (loop=0x7fa58e1e79c0, flag=Unhandled dwarf expression opcode 0xfa
> ) at ev.c:2172
> #5  0x00007fa58d245070 in process::run_in_event_loop<Nothing>(const std::function<process::Future<Nothing>()> &) (f=Unhandled dwarf expression opcode 0xf3
> ) at src/libev.hpp:80
> #6  0x00007fa58d2441ba in process::EventLoop::delay(const Duration &, const std::function<void()> &) (duration=Unhandled dwarf expression opcode 0xf3
> ) at src/libev.cpp:106
> #7  0x00007fa58d1c3c9c in process::clock::scheduleTick (timers=Unhandled dwarf expression opcode 0xf3
> ) at src/clock.cpp:119
> #8  0x00007fa58d1c65b1 in process::Clock::timer(const Duration &, const std::function<void()> &) (duration=Unhandled dwarf expression opcode 0xf3
> ) at src/clock.cpp:254
> #9  0x00007fa58d23915a in process::delay<process::ReaperProcess> (duration=..., pid=Unhandled dwarf expression opcode 0xf3
> ) at ./include/process/delay.hpp:25
> #10 0x00007fa58d23a740 in process::ReaperProcess::wait (this=0x2056920) at src/reap.cpp:93
> #11 0x00007fa58d21261a in process::ProcessManager::resume (this=0x1db8d20, process=0x2056958) at src/process.cpp:2172
> #12 0x00007fa58d2128dc in process::schedule (arg=Unhandled dwarf expression opcode 0xf3
> ) at src/process.cpp:602
> #13 0x00007fa58c23683d in start_thread () from /lib64/libpthread.so.0
> #14 0x00007fa58ba28fcd in clone () from /lib64/libc.so.6
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)