Posted to issues@mesos.apache.org by "Jean-Baptiste (JIRA)" <ji...@apache.org> on 2017/10/02 09:29:00 UTC

[jira] [Commented] (MESOS-8011) Enabling Port mapping generate segfault

    [ https://issues.apache.org/jira/browse/MESOS-8011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16187798#comment-16187798 ] 

Jean-Baptiste commented on MESOS-8011:
--------------------------------------

Adding *strace* output in case it helps:
{code}
open("/proc/self/maps", O_RDONLY)                           = 10
read(10, "561e673f0000-561e6742e000 r-xp 0"..., 1024)       = 1024
read(10, "000 ---p 00000000 00:00 0 \n7f119"..., 1002)      = 1002
read(10, "ogger-1.3.1.so\n7f11ba91f000-7f11"..., 887)       = 887
read(10, "p 00000000 00:00 0 \n7f11bd928000"..., 995)       = 995
read(10, "0 0 \n7f11c2932000-7f11c2933000 -"..., 980)       = 980
read(10, "          /lib/x86_64-linux-gnu/"..., 961)        = 961
read(10, "4-linux-gnu/libkrb5support.so.0."..., 937)        = 937
read(10, "lib/x86_64-linux-gnu/libp11-kit."..., 946)        = 946
read(10, "u/libgmp.so.10.2.0\n7f11c431c000-"..., 927)       = 927
read(10, "o.5.0.0\n7f11c476e000-7f11c477600"..., 920)       = 920
read(10, "b/x86_64-linux-gnu/libcom_err.so"..., 948)        = 948
read(10, "0941                     /usr/li"..., 976)        = 976
read(10, "0                     /usr/lib/x"..., 973)        = 973
read(10, "0 ca:02 272539                  "..., 986)        = 986
read(10, "11.6.12\n7f11c5ec5000-7f11c5eef00"..., 915)       = 915
read(10, "         /lib/x86_64-linux-gnu/l"..., 960)        = 960
read(10, "    /usr/lib/x86_64-linux-gnu/li"..., 955)        = 955
read(10, "a:02 271003                     "..., 983)        = 983
read(10, "\n7f11c7087000-7f11c7088000 rw-p "..., 904)       = 904
read(10, "00005000 ca:02 295137           "..., 993)        = 993
read(10, "                         /lib/x8"..., 976)        = 976
read(10, "76                        /lib/x"..., 977)        = 977
read(10, "-gnutls.so.4.3.0\n7f11c8491000-7f"..., 918)       = 918
read(10, "a:02 294964                     "..., 983)        = 983
read(10, "143                     /usr/lib"..., 975)        = 975
read(10, "ibsasl2.so.2.0.25\n7f11c8f8b000-7"..., 924)       = 924
read(10, "         /usr/lib/x86_64-linux-g"..., 960)        = 960
read(10, "00000000 ca:02 295023           "..., 993)        = 993
read(10, "                 /lib/x86_64-lin"..., 968)        = 968
read(10, "0 ca:02 77                      "..., 986)        = 986
read(10, "                 /usr/lib/x86_64"..., 968)        = 968
read(10, "cd3d6000-7f11cd3f7000 rw-p 00000"..., 1020)       = 680
read(10, "", 340)                                           = 0
read(10, "", 1024)                                          = 0
close(10)                                                   = 0
write(2, "    @                0x0 (unknow"..., 35    @                0x0 (unknown)
)         = 35
gettimeofday({1506936412, 36558}, NULL)                     = 0
gettimeofday({1506936412, 36613}, NULL)                     = 0
gettimeofday({1506936412, 36666}, NULL)                     = 0
gettimeofday({1506936412, 36719}, NULL)                     = 0
rt_sigaction(SIGABRT, {SIG_DFL, [], SA_RESTORER, 0x7f11ca091890}, NULL, 8) = 0
kill(30960, SIGABRT)                                        = 0
+++ killed by SIGABRT +++
Aborted
{code}

> Enabling Port mapping generate segfault 
> ----------------------------------------
>
>                 Key: MESOS-8011
>                 URL: https://issues.apache.org/jira/browse/MESOS-8011
>             Project: Mesos
>          Issue Type: Bug
>          Components: agent, network
>    Affects Versions: 1.3.0, 1.3.1, 1.4.0
>            Reporter: Jean-Baptiste
>              Labels: core, isolation, reliability
>
> h2. Overview
> After a successful build of Mesos in several versions (1.3.0 / 1.3.1 / 1.4.0 / 1.5.0), I still hit the following segfault when starting the Mesos agent:
> h2. Environment
> * *Debian* Linux 8.7 (Jessie)
> * *Kernel* 4.12 (also tried with 3.16 and 4.9)
> * *Mesos* 1.3.0 (also tried with 1.3.1, 1.4.0 and 1.5.0)
> * *Libnl* 3.2.27-2
> h2. Stack trace
> {code}
> Sep 25 12:41:46 ip-10-43-20-218 systemd[1]: Starting Mesos Slave...
> Sep 25 12:41:46 ip-10-43-20-218 systemd[1]: Started Mesos Slave.
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: WARNING: Logging before InitGoogleLogging() is written to STDERR
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: W0925 12:41:46.510066  2717 parse.hpp:97] Specifying an absolute filename to read a command line option out of without using 'file:// is deprecated and will be removed in a future release. Simply adding 'file://' to the beginning of the path should eliminate this warning.
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: I0925 12:41:46.510259  2717 main.cpp:322] Build: 2017-09-04 19:29:27 by pbuilder
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: I0925 12:41:46.510275  2717 main.cpp:323] Version: 1.3.1
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: I0925 12:41:46.511230  2717 logging.cpp:194] INFO level logging started!
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: I0925 12:41:46.517127  2717 systemd.cpp:238] systemd version `215` detected
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: W0925 12:41:46.517174  2717 systemd.cpp:246] Required functionality `Delegate` was introduced in Version `218`. Your system may not function properly; however since some distributions have patched systemd packages, your system may still be functional. This is why we keep running. See MESOS-3352 for more information
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: I0925 12:41:46.517293  2717 main.cpp:432] Inializing systemd state
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: I0925 12:41:46.520074  2717 systemd.cpp:326] Started systemd slice `mesos_executors.slice`
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: W0925 12:41:46.611994  2717 containerizer.cpp:189] 'posix/disk' has been renamed as 'disk/du', please update your --isolation flag to use 'disk/du'
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: I0925 12:41:46.612027  2717 containerizer.cpp:221] Using isolation: cgroups/cpu,posix/mem,posix/disk,network/port_mapping,filesystem/posix
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: I0925 12:41:46.615073  2717 linux_launcher.cpp:150] Using /sys/fs/cgroup/freezer as the freezer hierarchy for the Linux launcher
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: I0925 12:41:46.615413  2717 provisioner.cpp:249] Using default backend 'overlay'
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: mesos-slave: ../3rdparty/boost-1.53.0/boost/icl/concept/interval.hpp:586: typename boost::enable_if<boost::icl::is_asymmetric_interval<Type>, bool>::type boost::icl::non_empty::exclusive_less(const Type&, const Type&) [with Type = Interval<short unsigned int>; typename boost::enable_if<boost::icl::is_asymmetric_interval<Type>, bool>::type = bool]: Assertion `!(icl::is_empty(left) || icl::is_empty(right))' failed.
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: *** Aborted at 1506343306 (unix time) try "date -d @1506343306" if you are using GNU date ***
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: PC: @     0x7f27069d1067 (unknown)
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: *** SIGABRT (@0xa9d) received by PID 2717 (TID 0x7f270a0a2800) from PID 2717; stack trace: ***
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: @     0x7f2706d56890 (unknown)
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: @     0x7f27069d1067 (unknown)
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: @     0x7f27069d2448 (unknown)
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: @     0x7f27069ca266 (unknown)
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: @     0x7f27069ca312 (unknown)
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: @     0x7f2708d124c3 (unknown)
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: @     0x7f2708d126a7 (unknown)
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: @     0x7f2708d4d0dc (unknown)
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: @     0x7f2708d38dc2 (unknown)
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: @     0x7f27089dbe2c (unknown)
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: @     0x7f27089cf201 (unknown)
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: @     0x7f2708944198 (unknown)
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: @     0x557ff33c4e7a (unknown)
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: @     0x7f27069bdb45 (unknown)
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: @     0x557ff33c62db (unknown)
> Sep 25 12:41:46 ip-10-43-20-218 mesos-slave[2754]: @                0x0 (unknown)
> Sep 25 12:41:46 ip-10-43-20-218 systemd[1]: mesos-slave.service: main process exited, code=killed, status=6/ABRT
> Sep 25 12:41:46 ip-10-43-20-218 systemd[1]: Unit mesos-slave.service entered failed state.
> {code}
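
The assertion in the log above fires when Boost ICL's {{exclusive_less}} is given an empty interval of {{short unsigned int}}. As a minimal sketch of what "empty" means here, and not a reproduction of the Mesos or Boost code, the Python snippet below assumes a right-open interval [lower, upper) over 16-bit values. The helpers {{to_u16}} and {{is_empty_right_open}} and the port numbers are hypothetical, chosen only for illustration. One way such an interval could end up empty, stated purely as an assumption, is an upper bound of 65536 wrapping to 0 when stored in 16 bits.

```python
U16_MODULUS = 1 << 16  # number of distinct values in an unsigned 16-bit integer


def to_u16(value: int) -> int:
    """Truncate an integer to 16 bits, as storing it in an unsigned short would."""
    return value % U16_MODULUS


def is_empty_right_open(lower: int, upper: int) -> bool:
    """A right-open interval [lower, upper) holds no values when upper <= lower."""
    return upper <= lower


# Hypothetical port range, used only for illustration (not taken from the log).
lower = to_u16(32768)
upper = to_u16(65536)  # wraps around to 0 in 16 bits

print(lower, upper, is_empty_right_open(lower, upper))
```

If the agent's configured port ranges can be inspected, checking whether any range has an upper bound at or beyond the 16-bit limit may help confirm or rule out this guess.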


