Posted to dev@griffin.apache.org by jenny li <su...@gmail.com> on 2018/07/11 03:42:51 UTC

griffin job crashed due to OOM

Hi Experts,

Our Griffin job stopped running and reported the following OOM error:

```
save source data count: 6672
write path: hdfs:///griffin/streaming/pri7406in/dump/source/new
#
# There is insufficient memory for the Java Runtime Environment to continue.
# Native memory allocation (mmap) failed to map 12288 bytes for committing reserved memory.
# An error report file with more information is saved as:
# /home/relmgmt/griffin-job/pri7406in/hs_err_pid12123.log
```

I have attached hs_err_pid12123.log.


We start the Griffin job manually with:

    spark-submit --class org.apache.griffin.measure.Application --master yarn \
      --deploy-mode client --queue default \
      --driver-memory 512m --executor-memory 512m --num-executors 3 \
      --conf "spark.driver.extraJavaOptions=-Djava.security.auth.login.config=jaas.conf" \
      --conf "spark.executor.extraJavaOptions=-Djava.security.auth.login.config=jaas.conf" \
      --files "jaas.conf,keystore.jks,truststore.jks" \
      griffin-measure-rheos-test.jar env.json config.json local,local
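
To put the memory flags in context, here is a rough sizing sketch in Python. It assumes Spark's documented default overhead rule for executors on YARN (the larger of 384 MiB or 10% of executor memory); the cluster may override this, and YARN may round each request up to its own allocation increment, so the numbers are only an estimate. The helper names `parse_mib` and `container_mib` are made up for this illustration. Note that with `--deploy-mode client` the driver runs as a local JVM on the submitting host rather than in a YARN container, and the error above is a failed native (mmap) allocation, which usually points at the operating system being unable to commit more memory rather than at the Java heap limit itself.

```python
# Rough sizing sketch for the spark-submit flags quoted above.
# Assumption: per-executor overhead follows Spark's documented default,
# max(384 MiB, 10% of executor memory). A cluster can override this.

MIN_OVERHEAD_MIB = 384
OVERHEAD_FRACTION = 0.10


def parse_mib(size: str) -> int:
    """Convert a JVM-style size such as '512m' or '2g' to MiB."""
    units = {"m": 1, "g": 1024}
    unit = size[-1].lower()
    if unit not in units:
        raise ValueError(f"unsupported size suffix in {size!r}")
    return int(size[:-1]) * units[unit]


def container_mib(heap_mib: int) -> int:
    """Heap plus the assumed default overhead, in MiB."""
    overhead = max(MIN_OVERHEAD_MIB, int(heap_mib * OVERHEAD_FRACTION))
    return heap_mib + overhead


# Values taken from the command line in this message.
executor_heap = parse_mib("512m")
driver_heap = parse_mib("512m")
num_executors = 3

per_executor = container_mib(executor_heap)
total_executors = per_executor * num_executors

print(f"per-executor request: {per_executor} MiB")
print(f"all executors:        {total_executors} MiB")
print(f"local driver heap:    {driver_heap} MiB (plus JVM native memory)")
```

Under these assumptions the three executors together request well under 3 GiB from YARN, so the free memory on the host that runs the driver may be the more relevant thing to check.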

Could you please help check? Many thanks.

I created a jira ticket for it as well:
https://issues.apache.org/jira/browse/GRIFFIN-176

BR-
Juan

Re: griffin job crashed due to OOM

Posted by William Guo <gu...@apache.org>.
Hi Juan,

Thanks for your question.
I will follow this issue.

Thanks,
William
