You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@trafodion.apache.org by "Alice Chen (JIRA)" <ji...@apache.org> on 2015/07/22 20:15:20 UTC

[jira] [Created] (TRAFODION-243) LP Bug: 1315194 - Core files generated with no useful stack

Alice Chen created TRAFODION-243:
------------------------------------

             Summary: LP Bug: 1315194 - Core files generated with no useful stack
                 Key: TRAFODION-243
                 URL: https://issues.apache.org/jira/browse/TRAFODION-243
             Project: Apache Trafodion
          Issue Type: Bug
          Components: sql-exe
            Reporter: Weishiun Tsai
            Assignee: Apache Trafodion


We occasionally see a core that does not have any useful stack information.  The following tdm_arkesp core generated on rhel-mapr1, a MapR machine, is one of such examples.  Discussion with Hans and Suresh indicated that the problem may be related to JIT.  But gdb does not show any useful stack with a mixture of Java and C++.  Presumably hs_err_pid<pid>.log files can help debug the problem.  But in this particular case, no hs_err_pid<pid>.log files were generated either.

This is such a problem that there is no certain way to reproduce and the core file does not provide enough information to debug the problem.  However, there were core files and they do show up every now and then when running QA or dev regression tests.  This case is currently filed as a Medium severity case just to track this problem until we figure out a better way to diagnose this problem.

This may or may not be related to another bug report https://bugs.launchpad.net/trafodion/+bug/1305066 "Intermittent core files during SQL dev regressions".  The two issues can probably be investigated together.

The build is the datalake v39613 build.  rhel-mapr1 is a cluster with 4 nodes.

[trafodion@rhel-mapr1 cores]$ pwd
/opt/cores
[trafodion@rhel-mapr1 cores]$ ls
tdm_arkesp.core.30577.rhel-mapr1  tdm_arkesp.core.30589.rhel-mapr1
tdm_arkesp.core.30583.rhel-mapr1  tdm_arkesp.core.30596.rhel-mapr1

gdb /home/trafodion/trafodion/export/bin64d/tdm_arkesp tdm_arkesp.core.30589.rhel-mapr1

(gdb) bt
#0  0x0000003c93032925 in raise () from /lib64/libc.so.6
#1  0x0000003c93034105 in abort () from /lib64/libc.so.6
#2  0x00007f0bd8b4d8c5 in ?? ()
   from /usr/lib/jvm/java/jre/lib/amd64/server/libjvm.so
#3  0x00007f0bd8cbb78f in ?? ()
   from /usr/lib/jvm/java/jre/lib/amd64/server/libjvm.so
#4  0x00007f0bd8b52a92 in JVM_handle_linux_signal ()
   from /usr/lib/jvm/java/jre/lib/amd64/server/libjvm.so
#5  <signal handler called>
#6  0x0000000000000000 in ?? ()
#7  0x00007f0bc81e3350 in ?? ()
#8  0x00000007aaa80da0 in ?? ()
#9  0x00007fffa0bc6390 in ?? ()
#10 0x00000006f5a26ee3 in ?? ()
#11 0x00007fffa0bc63d8 in ?? ()
#12 0x00000006f5ab4eb0 in ?? ()
#13 0x0000000000000000 in ?? ()

(gdb) info thread
  15 Thread 0x7f0bccfe1700 (LWP 30594)  0x0000003c930e9163 in epoll_wait ()
   from /lib64/libc.so.6
  14 Thread 0x7f0bc701c700 (LWP 30612)  0x0000003c9340b5bc in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0
  13 Thread 0x7f0bc6c18700 (LWP 30620)  0x0000003c9340b5bc in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0
  12 Thread 0x7f0bc6f1b700 (LWP 30614)  0x0000003c9340b5bc in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0
  11 Thread 0x7f0bc6e1a700 (LWP 30616)  0x0000003c9340b5bc in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0
  10 Thread 0x7f0bc721e700 (LWP 30608)  0x0000003c9340b5bc in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0
  9 Thread 0x7f0bcda40700 (LWP 30593)  0x0000003c930accdd in nanosleep ()
   from /lib64/libc.so.6
  8 Thread 0x7f0bbbe0c700 (LWP 30639)  0x0000003c9340b98e in pthread_cond_timedwait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0
  7 Thread 0x7f0bc731f700 (LWP 30607)  0x0000003c9340b5bc in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0
  6 Thread 0x7f0bc6d19700 (LWP 30618)  0x0000003c9340b5bc in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0
  5 Thread 0x7f0bd1a67700 (LWP 30590)  0x0000003c930e972d in accept ()
   from /lib64/libc.so.6
  4 Thread 0x7f0bd1066700 (LWP 30591)  0x0000003c9340b5bc in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0
  3 Thread 0x7f0bc711d700 (LWP 30610)  0x0000003c9340b5bc in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0
  2 Thread 0x7f0bce742700 (LWP 30592)  0x0000003c93033a1d in sigtimedwait ()
   from /lib64/libc.so.6
* 1 Thread 0x7f0bd1cc68a0 (LWP 30589)  0x0000003c93032925 in raise ()
   from /lib64/libc.so.6
(gdb)
Assigned to LaunchPad User James Capps



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)