You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@trafodion.apache.org by Radu Marias <ra...@gmail.com> on 2015/08/27 18:04:40 UTC

sqstart fails

Hi,

I have a cluster of 5 nodes, each as a virtual machine.
This is on them:
Centos 7
Ambari 2.1
HDP 2.2
jdk1.7.0_67, installed by ambari

I managed to run the installer with success (though some warning were
present, see bellow). When I try to run sqstart then as trafodion user it
fails.

Processing cluster.conf on local host node5
[SHELL] Shell/shell Version 1.0.1 Release 1.2.0 (Build release
[1.0.0_core-1121-g5928f31_Bld184], date 20150827_083009)
^[[?1034h
[SHELL] %
! Start the monitor processes across the cluster
startup
[SHELL] %startup
[SHELL] Unable to communicate with monitor because monitor port file
/home/trafodion/trafodion-20150827_0830/tmp/monitor.port.node5 is missing.
[SHELL] Failed to start environment!

[SHELL] %
exit
[SHELL] %exit
Trying to connect to the SQ monitor ..........
There seems to be a problem connecting to the SQ monitor.
Aborting startup.
/logs/sqcheckmon.log: No such file or directory
Error while executing the startup script!!!

Please check the SQ shell log file :
/home/trafodion/trafodion-20150827_0830/logs/sqmon.log

SQ Startup (from /home/trafodion/trafodion-20150827_0830/sql/scripts) Failed

Checking if processes are up.
^MChecking attempt: 1; user specified max: 2. Execution time in seconds: 4.

The SQ environment is not up all, or partially up and not operational.
Check the logs.

Process         Configured      Actual      Down
-------         ----------      ------      ----
DTM             5               0           \$TM0 \$TM1 \$TM2 \$TM3 \$TM4
RMS             10              0           \$ZSC000 \$ZSC001 \$ZSC002
\$ZSC003 \$ZSC004 \$ZSM000 \$ZSM001 \$ZSM002 \$ZSM003 \$ZSM004
MXOSRVR         2               0           2


The SQ environment is down.]
Zookeeper is listening on port 2181
Dcs Master is not started ...

Attached are some logs.

I also did *sqgen* but the same. Also tried *ckillall* and restarted hbase
server between multiple starts.
I have *log4cxx* installed.

-- 
And in the end, it's not the years in your life that count. It's the life
in your years.

Re: sqstart fails

Posted by Radu Marias <ra...@gmail.com>.
/home/trafodion/trafodion-20150827_0830/sql/scripts
$ ls -la | grep -i core
-rwxr-x--- 1 trafodion trafodion   7293 Aug 27 14:20 sqcore
-rwxr-x--- 1 trafodion trafodion   3345 Aug 27 14:20 sqcorefile

$ ulimit -a
core file size          (blocks, -c) unlimited
data seg size           (kbytes, -d) unlimited
scheduling priority             (-e) 0
file size               (blocks, -f) unlimited
pending signals                 (-i) 772842
max locked memory       (kbytes, -l) unlimited
max memory size         (kbytes, -m) unlimited
open files                      (-n) 32768
pipe size            (512 bytes, -p) 8
POSIX message queues     (bytes, -q) 819200
real-time priority              (-r) 0
stack size              (kbytes, -s) 10240
cpu time               (seconds, -t) unlimited
max user processes              (-u) 100000
virtual memory          (kbytes, -v) unlimited
file locks                      (-x) unlimited


On Thu, Aug 27, 2015 at 8:26 PM, Narendra Goyal <na...@esgyn.com>
wrote:

> Also, I believe you are running Centos 7 (that comes with Linux kernel
> 3.*).
> We have, so far, never tried on that kernel version.
>
> Thanks,
> -Narendra
>
> -----Original Message-----
> From: Narendra Goyal [mailto:narendra.goyal@esgyn.com]
> Sent: Thursday, August 27, 2015 9:50 AM
> To: 'dev@trafodion.incubator.apache.org'
> <de...@trafodion.incubator.apache.org>
> Subject: RE: sqstart fails
>
> Hi Radu,
>
> -  Are there any core files in the directory:
> /home/trafodion/trafodion-20150827_0830/sql/scripts
>
> If so, could you please do a 'file core*' on them
>
> - Could you run 'ulimit -a', say, on node5. Trafodion requires a certain
> amount of locked memory, open files etc.
>
> Thanks,
> -Narendra
>
> -----Original Message-----
> From: Radu Marias [mailto:radumarias@gmail.com]
> Sent: Thursday, August 27, 2015 9:15 AM
> To: dev <de...@trafodion.incubator.apache.org>
> Subject: Re: sqstart fails
>
> sudo sysctl -q kernel.pid_max
> kernel.pid_max = 65535
>
>
> On Thu, Aug 27, 2015 at 7:10 PM, Amanda Moran <am...@esgyn.com>
> wrote:
>
> > Can you try *sudo sysctl -q kernel.pid_max* on all nodes?
> >
> > FYI: sudo sysctl -w kernel.pid_max=65535 (is what it should be set to).
> >
> > Thanks!
> >
> >
> >
> > On Thu, Aug 27, 2015 at 9:05 AM, Radu Marias <ra...@gmail.com>
> wrote:
> >
> > > These lines are from the install process:
> > >
> > > *pdcp@node5: can't stat /home/trafodion/sqcert*
> > > *pdcp@node5: can't stat shell.env*
> > > *pdcp@node5: can't stat mon.env*
> > >
> > >
> > > On Thu, Aug 27, 2015 at 7:04 PM, Radu Marias <ra...@gmail.com>
> > wrote:
> > >
> > > > Hi,
> > > >
> > > > I have a cluster of 5 nodes, each as a virtual machine.
> > > > This is on them:
> > > > Centos 7
> > > > Ambari 2.1
> > > > HDP 2.2
> > > > jdk1.7.0_67, installed by ambari
> > > >
> > > > I managed to run the installer with success (though some warning
> > > > were present, see bellow). When I try to run sqstart then as
> > > > trafodion user
> > it
> > > > fails.
> > > >
> > > > Processing cluster.conf on local host node5 [SHELL] Shell/shell
> > > > Version 1.0.1 Release 1.2.0 (Build release
> > > > [1.0.0_core-1121-g5928f31_Bld184], date 20150827_083009) ^[[?1034h
> > > > [SHELL] % ! Start the monitor processes across the cluster startup
> > > > [SHELL] %startup [SHELL] Unable to communicate with monitor
> > > > because monitor port file
> > > > /home/trafodion/trafodion-20150827_0830/tmp/monitor.port.node5 is
> > > missing.
> > > > [SHELL] Failed to start environment!
> > > >
> > > > [SHELL] %
> > > > exit
> > > > [SHELL] %exit
> > > > Trying to connect to the SQ monitor ..........
> > > > There seems to be a problem connecting to the SQ monitor.
> > > > Aborting startup.
> > > > /logs/sqcheckmon.log: No such file or directory Error while
> > > > executing the startup script!!!
> > > >
> > > > Please check the SQ shell log file :
> > > > /home/trafodion/trafodion-20150827_0830/logs/sqmon.log
> > > >
> > > > SQ Startup (from
> > > > /home/trafodion/trafodion-20150827_0830/sql/scripts)
> > > > Failed
> > > >
> > > > Checking if processes are up.
> > > > ^MChecking attempt: 1; user specified max: 2. Execution time in
> > seconds:
> > > 4.
> > > >
> > > > The SQ environment is not up all, or partially up and not
> operational.
> > > > Check the logs.
> > > >
> > > > Process         Configured      Actual      Down
> > > > -------         ----------      ------      ----
> > > > DTM             5               0           \$TM0 \$TM1 \$TM2 \$TM3
> > \$TM4
> > > > RMS             10              0           \$ZSC000 \$ZSC001
> \$ZSC002
> > > > \$ZSC003 \$ZSC004 \$ZSM000 \$ZSM001 \$ZSM002 \$ZSM003 \$ZSM004
> > > > MXOSRVR         2               0           2
> > > >
> > > >
> > > > The SQ environment is down.]
> > > > Zookeeper is listening on port 2181 Dcs Master is not started ...
> > > >
> > > > Attached are some logs.
> > > >
> > > > I also did *sqgen* but the same. Also tried *ckillall* and
> > > > restarted hbase server between multiple starts.
> > > > I have *log4cxx* installed.
> > > >
> > > > --
> > > > And in the end, it's not the years in your life that count. It's
> > > > the
> > life
> > > > in your years.
> > > >
> > >
> > >
> > >
> > > --
> > > And in the end, it's not the years in your life that count. It's the
> > > life in your years.
> > >
> >
> >
> >
> > --
> > Thanks,
> >
> > Amanda Moran
> >
>
>
>
> --
> And in the end, it's not the years in your life that count. It's the life
> in
> your years.
>



-- 
And in the end, it's not the years in your life that count. It's the life
in your years.

RE: sqstart fails

Posted by Narendra Goyal <na...@esgyn.com>.
Also, I believe you are running Centos 7 (that comes with Linux kernel 3.*).
We have, so far, never tried on that kernel version.

Thanks,
-Narendra

-----Original Message-----
From: Narendra Goyal [mailto:narendra.goyal@esgyn.com]
Sent: Thursday, August 27, 2015 9:50 AM
To: 'dev@trafodion.incubator.apache.org'
<de...@trafodion.incubator.apache.org>
Subject: RE: sqstart fails

Hi Radu,

-  Are there any core files in the directory:
/home/trafodion/trafodion-20150827_0830/sql/scripts

If so, could you please do a 'file core*' on them

- Could you run 'ulimit -a', say, on node5. Trafodion requires a certain
amount of locked memory, open files etc.

Thanks,
-Narendra

-----Original Message-----
From: Radu Marias [mailto:radumarias@gmail.com]
Sent: Thursday, August 27, 2015 9:15 AM
To: dev <de...@trafodion.incubator.apache.org>
Subject: Re: sqstart fails

sudo sysctl -q kernel.pid_max
kernel.pid_max = 65535


On Thu, Aug 27, 2015 at 7:10 PM, Amanda Moran <am...@esgyn.com>
wrote:

> Can you try *sudo sysctl -q kernel.pid_max* on all nodes?
>
> FYI: sudo sysctl -w kernel.pid_max=65535 (is what it should be set to).
>
> Thanks!
>
>
>
> On Thu, Aug 27, 2015 at 9:05 AM, Radu Marias <ra...@gmail.com> wrote:
>
> > These lines are from the install process:
> >
> > *pdcp@node5: can't stat /home/trafodion/sqcert*
> > *pdcp@node5: can't stat shell.env*
> > *pdcp@node5: can't stat mon.env*
> >
> >
> > On Thu, Aug 27, 2015 at 7:04 PM, Radu Marias <ra...@gmail.com>
> wrote:
> >
> > > Hi,
> > >
> > > I have a cluster of 5 nodes, each as a virtual machine.
> > > This is on them:
> > > Centos 7
> > > Ambari 2.1
> > > HDP 2.2
> > > jdk1.7.0_67, installed by ambari
> > >
> > > I managed to run the installer with success (though some warning
> > > were present, see bellow). When I try to run sqstart then as
> > > trafodion user
> it
> > > fails.
> > >
> > > Processing cluster.conf on local host node5 [SHELL] Shell/shell
> > > Version 1.0.1 Release 1.2.0 (Build release
> > > [1.0.0_core-1121-g5928f31_Bld184], date 20150827_083009) ^[[?1034h
> > > [SHELL] % ! Start the monitor processes across the cluster startup
> > > [SHELL] %startup [SHELL] Unable to communicate with monitor
> > > because monitor port file
> > > /home/trafodion/trafodion-20150827_0830/tmp/monitor.port.node5 is
> > missing.
> > > [SHELL] Failed to start environment!
> > >
> > > [SHELL] %
> > > exit
> > > [SHELL] %exit
> > > Trying to connect to the SQ monitor ..........
> > > There seems to be a problem connecting to the SQ monitor.
> > > Aborting startup.
> > > /logs/sqcheckmon.log: No such file or directory Error while
> > > executing the startup script!!!
> > >
> > > Please check the SQ shell log file :
> > > /home/trafodion/trafodion-20150827_0830/logs/sqmon.log
> > >
> > > SQ Startup (from
> > > /home/trafodion/trafodion-20150827_0830/sql/scripts)
> > > Failed
> > >
> > > Checking if processes are up.
> > > ^MChecking attempt: 1; user specified max: 2. Execution time in
> seconds:
> > 4.
> > >
> > > The SQ environment is not up all, or partially up and not operational.
> > > Check the logs.
> > >
> > > Process         Configured      Actual      Down
> > > -------         ----------      ------      ----
> > > DTM             5               0           \$TM0 \$TM1 \$TM2 \$TM3
> \$TM4
> > > RMS             10              0           \$ZSC000 \$ZSC001 \$ZSC002
> > > \$ZSC003 \$ZSC004 \$ZSM000 \$ZSM001 \$ZSM002 \$ZSM003 \$ZSM004
> > > MXOSRVR         2               0           2
> > >
> > >
> > > The SQ environment is down.]
> > > Zookeeper is listening on port 2181 Dcs Master is not started ...
> > >
> > > Attached are some logs.
> > >
> > > I also did *sqgen* but the same. Also tried *ckillall* and
> > > restarted hbase server between multiple starts.
> > > I have *log4cxx* installed.
> > >
> > > --
> > > And in the end, it's not the years in your life that count. It's
> > > the
> life
> > > in your years.
> > >
> >
> >
> >
> > --
> > And in the end, it's not the years in your life that count. It's the
> > life in your years.
> >
>
>
>
> --
> Thanks,
>
> Amanda Moran
>



--
And in the end, it's not the years in your life that count. It's the life in
your years.

RE: sqstart fails

Posted by Narendra Goyal <na...@esgyn.com>.
Hi Radu,

-  Are there any core files in the directory:
/home/trafodion/trafodion-20150827_0830/sql/scripts

If so, could you please do a 'file core*' on them

- Could you run 'ulimit -a', say, on node5. Trafodion requires a certain
amount of locked memory, open files etc.

Thanks,
-Narendra

-----Original Message-----
From: Radu Marias [mailto:radumarias@gmail.com]
Sent: Thursday, August 27, 2015 9:15 AM
To: dev <de...@trafodion.incubator.apache.org>
Subject: Re: sqstart fails

sudo sysctl -q kernel.pid_max
kernel.pid_max = 65535


On Thu, Aug 27, 2015 at 7:10 PM, Amanda Moran <am...@esgyn.com>
wrote:

> Can you try *sudo sysctl -q kernel.pid_max* on all nodes?
>
> FYI: sudo sysctl -w kernel.pid_max=65535 (is what it should be set to).
>
> Thanks!
>
>
>
> On Thu, Aug 27, 2015 at 9:05 AM, Radu Marias <ra...@gmail.com> wrote:
>
> > These lines are from the install process:
> >
> > *pdcp@node5: can't stat /home/trafodion/sqcert*
> > *pdcp@node5: can't stat shell.env*
> > *pdcp@node5: can't stat mon.env*
> >
> >
> > On Thu, Aug 27, 2015 at 7:04 PM, Radu Marias <ra...@gmail.com>
> wrote:
> >
> > > Hi,
> > >
> > > I have a cluster of 5 nodes, each as a virtual machine.
> > > This is on them:
> > > Centos 7
> > > Ambari 2.1
> > > HDP 2.2
> > > jdk1.7.0_67, installed by ambari
> > >
> > > I managed to run the installer with success (though some warning
> > > were present, see bellow). When I try to run sqstart then as
> > > trafodion user
> it
> > > fails.
> > >
> > > Processing cluster.conf on local host node5 [SHELL] Shell/shell
> > > Version 1.0.1 Release 1.2.0 (Build release
> > > [1.0.0_core-1121-g5928f31_Bld184], date 20150827_083009) ^[[?1034h
> > > [SHELL] % ! Start the monitor processes across the cluster startup
> > > [SHELL] %startup [SHELL] Unable to communicate with monitor
> > > because monitor port file
> > > /home/trafodion/trafodion-20150827_0830/tmp/monitor.port.node5 is
> > missing.
> > > [SHELL] Failed to start environment!
> > >
> > > [SHELL] %
> > > exit
> > > [SHELL] %exit
> > > Trying to connect to the SQ monitor ..........
> > > There seems to be a problem connecting to the SQ monitor.
> > > Aborting startup.
> > > /logs/sqcheckmon.log: No such file or directory Error while
> > > executing the startup script!!!
> > >
> > > Please check the SQ shell log file :
> > > /home/trafodion/trafodion-20150827_0830/logs/sqmon.log
> > >
> > > SQ Startup (from
> > > /home/trafodion/trafodion-20150827_0830/sql/scripts)
> > > Failed
> > >
> > > Checking if processes are up.
> > > ^MChecking attempt: 1; user specified max: 2. Execution time in
> seconds:
> > 4.
> > >
> > > The SQ environment is not up all, or partially up and not operational.
> > > Check the logs.
> > >
> > > Process         Configured      Actual      Down
> > > -------         ----------      ------      ----
> > > DTM             5               0           \$TM0 \$TM1 \$TM2 \$TM3
> \$TM4
> > > RMS             10              0           \$ZSC000 \$ZSC001 \$ZSC002
> > > \$ZSC003 \$ZSC004 \$ZSM000 \$ZSM001 \$ZSM002 \$ZSM003 \$ZSM004
> > > MXOSRVR         2               0           2
> > >
> > >
> > > The SQ environment is down.]
> > > Zookeeper is listening on port 2181 Dcs Master is not started ...
> > >
> > > Attached are some logs.
> > >
> > > I also did *sqgen* but the same. Also tried *ckillall* and
> > > restarted hbase server between multiple starts.
> > > I have *log4cxx* installed.
> > >
> > > --
> > > And in the end, it's not the years in your life that count. It's
> > > the
> life
> > > in your years.
> > >
> >
> >
> >
> > --
> > And in the end, it's not the years in your life that count. It's the
> > life in your years.
> >
>
>
>
> --
> Thanks,
>
> Amanda Moran
>



--
And in the end, it's not the years in your life that count. It's the life in
your years.

Re: sqstart fails

Posted by Radu Marias <ra...@gmail.com>.
sudo sysctl -q kernel.pid_max
kernel.pid_max = 65535


On Thu, Aug 27, 2015 at 7:10 PM, Amanda Moran <am...@esgyn.com>
wrote:

> Can you try *sudo sysctl -q kernel.pid_max* on all nodes?
>
> FYI: sudo sysctl -w kernel.pid_max=65535 (is what it should be set to).
>
> Thanks!
>
>
>
> On Thu, Aug 27, 2015 at 9:05 AM, Radu Marias <ra...@gmail.com> wrote:
>
> > These lines are from the install process:
> >
> > *pdcp@node5: can't stat /home/trafodion/sqcert*
> > *pdcp@node5: can't stat shell.env*
> > *pdcp@node5: can't stat mon.env*
> >
> >
> > On Thu, Aug 27, 2015 at 7:04 PM, Radu Marias <ra...@gmail.com>
> wrote:
> >
> > > Hi,
> > >
> > > I have a cluster of 5 nodes, each as a virtual machine.
> > > This is on them:
> > > Centos 7
> > > Ambari 2.1
> > > HDP 2.2
> > > jdk1.7.0_67, installed by ambari
> > >
> > > I managed to run the installer with success (though some warning were
> > > present, see bellow). When I try to run sqstart then as trafodion user
> it
> > > fails.
> > >
> > > Processing cluster.conf on local host node5
> > > [SHELL] Shell/shell Version 1.0.1 Release 1.2.0 (Build release
> > > [1.0.0_core-1121-g5928f31_Bld184], date 20150827_083009)
> > > ^[[?1034h
> > > [SHELL] %
> > > ! Start the monitor processes across the cluster
> > > startup
> > > [SHELL] %startup
> > > [SHELL] Unable to communicate with monitor because monitor port file
> > > /home/trafodion/trafodion-20150827_0830/tmp/monitor.port.node5 is
> > missing.
> > > [SHELL] Failed to start environment!
> > >
> > > [SHELL] %
> > > exit
> > > [SHELL] %exit
> > > Trying to connect to the SQ monitor ..........
> > > There seems to be a problem connecting to the SQ monitor.
> > > Aborting startup.
> > > /logs/sqcheckmon.log: No such file or directory
> > > Error while executing the startup script!!!
> > >
> > > Please check the SQ shell log file :
> > > /home/trafodion/trafodion-20150827_0830/logs/sqmon.log
> > >
> > > SQ Startup (from /home/trafodion/trafodion-20150827_0830/sql/scripts)
> > > Failed
> > >
> > > Checking if processes are up.
> > > ^MChecking attempt: 1; user specified max: 2. Execution time in
> seconds:
> > 4.
> > >
> > > The SQ environment is not up all, or partially up and not operational.
> > > Check the logs.
> > >
> > > Process         Configured      Actual      Down
> > > -------         ----------      ------      ----
> > > DTM             5               0           \$TM0 \$TM1 \$TM2 \$TM3
> \$TM4
> > > RMS             10              0           \$ZSC000 \$ZSC001 \$ZSC002
> > > \$ZSC003 \$ZSC004 \$ZSM000 \$ZSM001 \$ZSM002 \$ZSM003 \$ZSM004
> > > MXOSRVR         2               0           2
> > >
> > >
> > > The SQ environment is down.]
> > > Zookeeper is listening on port 2181
> > > Dcs Master is not started ...
> > >
> > > Attached are some logs.
> > >
> > > I also did *sqgen* but the same. Also tried *ckillall* and restarted
> > > hbase server between multiple starts.
> > > I have *log4cxx* installed.
> > >
> > > --
> > > And in the end, it's not the years in your life that count. It's the
> life
> > > in your years.
> > >
> >
> >
> >
> > --
> > And in the end, it's not the years in your life that count. It's the life
> > in your years.
> >
>
>
>
> --
> Thanks,
>
> Amanda Moran
>



-- 
And in the end, it's not the years in your life that count. It's the life
in your years.

Re: sqstart fails

Posted by Amanda Moran <am...@esgyn.com>.
Can you try *sudo sysctl -q kernel.pid_max* on all nodes?

FYI: sudo sysctl -w kernel.pid_max=65535 (is what it should be set to).

Thanks!



On Thu, Aug 27, 2015 at 9:05 AM, Radu Marias <ra...@gmail.com> wrote:

> These lines are from the install process:
>
> *pdcp@node5: can't stat /home/trafodion/sqcert*
> *pdcp@node5: can't stat shell.env*
> *pdcp@node5: can't stat mon.env*
>
>
> On Thu, Aug 27, 2015 at 7:04 PM, Radu Marias <ra...@gmail.com> wrote:
>
> > Hi,
> >
> > I have a cluster of 5 nodes, each as a virtual machine.
> > This is on them:
> > Centos 7
> > Ambari 2.1
> > HDP 2.2
> > jdk1.7.0_67, installed by ambari
> >
> > I managed to run the installer with success (though some warning were
> > present, see bellow). When I try to run sqstart then as trafodion user it
> > fails.
> >
> > Processing cluster.conf on local host node5
> > [SHELL] Shell/shell Version 1.0.1 Release 1.2.0 (Build release
> > [1.0.0_core-1121-g5928f31_Bld184], date 20150827_083009)
> > ^[[?1034h
> > [SHELL] %
> > ! Start the monitor processes across the cluster
> > startup
> > [SHELL] %startup
> > [SHELL] Unable to communicate with monitor because monitor port file
> > /home/trafodion/trafodion-20150827_0830/tmp/monitor.port.node5 is
> missing.
> > [SHELL] Failed to start environment!
> >
> > [SHELL] %
> > exit
> > [SHELL] %exit
> > Trying to connect to the SQ monitor ..........
> > There seems to be a problem connecting to the SQ monitor.
> > Aborting startup.
> > /logs/sqcheckmon.log: No such file or directory
> > Error while executing the startup script!!!
> >
> > Please check the SQ shell log file :
> > /home/trafodion/trafodion-20150827_0830/logs/sqmon.log
> >
> > SQ Startup (from /home/trafodion/trafodion-20150827_0830/sql/scripts)
> > Failed
> >
> > Checking if processes are up.
> > ^MChecking attempt: 1; user specified max: 2. Execution time in seconds:
> 4.
> >
> > The SQ environment is not up all, or partially up and not operational.
> > Check the logs.
> >
> > Process         Configured      Actual      Down
> > -------         ----------      ------      ----
> > DTM             5               0           \$TM0 \$TM1 \$TM2 \$TM3 \$TM4
> > RMS             10              0           \$ZSC000 \$ZSC001 \$ZSC002
> > \$ZSC003 \$ZSC004 \$ZSM000 \$ZSM001 \$ZSM002 \$ZSM003 \$ZSM004
> > MXOSRVR         2               0           2
> >
> >
> > The SQ environment is down.]
> > Zookeeper is listening on port 2181
> > Dcs Master is not started ...
> >
> > Attached are some logs.
> >
> > I also did *sqgen* but the same. Also tried *ckillall* and restarted
> > hbase server between multiple starts.
> > I have *log4cxx* installed.
> >
> > --
> > And in the end, it's not the years in your life that count. It's the life
> > in your years.
> >
>
>
>
> --
> And in the end, it's not the years in your life that count. It's the life
> in your years.
>



-- 
Thanks,

Amanda Moran

Re: sqstart fails

Posted by Radu Marias <ra...@gmail.com>.
These lines are from the install process:

*pdcp@node5: can't stat /home/trafodion/sqcert*
*pdcp@node5: can't stat shell.env*
*pdcp@node5: can't stat mon.env*


On Thu, Aug 27, 2015 at 7:04 PM, Radu Marias <ra...@gmail.com> wrote:

> Hi,
>
> I have a cluster of 5 nodes, each as a virtual machine.
> This is on them:
> Centos 7
> Ambari 2.1
> HDP 2.2
> jdk1.7.0_67, installed by ambari
>
> I managed to run the installer with success (though some warning were
> present, see bellow). When I try to run sqstart then as trafodion user it
> fails.
>
> Processing cluster.conf on local host node5
> [SHELL] Shell/shell Version 1.0.1 Release 1.2.0 (Build release
> [1.0.0_core-1121-g5928f31_Bld184], date 20150827_083009)
> ^[[?1034h
> [SHELL] %
> ! Start the monitor processes across the cluster
> startup
> [SHELL] %startup
> [SHELL] Unable to communicate with monitor because monitor port file
> /home/trafodion/trafodion-20150827_0830/tmp/monitor.port.node5 is missing.
> [SHELL] Failed to start environment!
>
> [SHELL] %
> exit
> [SHELL] %exit
> Trying to connect to the SQ monitor ..........
> There seems to be a problem connecting to the SQ monitor.
> Aborting startup.
> /logs/sqcheckmon.log: No such file or directory
> Error while executing the startup script!!!
>
> Please check the SQ shell log file :
> /home/trafodion/trafodion-20150827_0830/logs/sqmon.log
>
> SQ Startup (from /home/trafodion/trafodion-20150827_0830/sql/scripts)
> Failed
>
> Checking if processes are up.
> ^MChecking attempt: 1; user specified max: 2. Execution time in seconds: 4.
>
> The SQ environment is not up all, or partially up and not operational.
> Check the logs.
>
> Process         Configured      Actual      Down
> -------         ----------      ------      ----
> DTM             5               0           \$TM0 \$TM1 \$TM2 \$TM3 \$TM4
> RMS             10              0           \$ZSC000 \$ZSC001 \$ZSC002
> \$ZSC003 \$ZSC004 \$ZSM000 \$ZSM001 \$ZSM002 \$ZSM003 \$ZSM004
> MXOSRVR         2               0           2
>
>
> The SQ environment is down.]
> Zookeeper is listening on port 2181
> Dcs Master is not started ...
>
> Attached are some logs.
>
> I also did *sqgen* but the same. Also tried *ckillall* and restarted
> hbase server between multiple starts.
> I have *log4cxx* installed.
>
> --
> And in the end, it's not the years in your life that count. It's the life
> in your years.
>



-- 
And in the end, it's not the years in your life that count. It's the life
in your years.