You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@griffin.apache.org by jenny li <su...@gmail.com> on 2018/07/11 03:42:51 UTC
griffin job crashed due to OOM
Hi Experts,
our griffin job stopped running reporting OOM error as below:
```save source data count: 6672
write path: hdfs:///griffin/streaming/pri7406in/dump/source/new
#
1. There is insufficient memory for the Java Runtime Environment to
continue.
1. Native memory allocation (mmap) failed to map 12288 bytes for
committing reserved memory.
1. An error report file with more information is saved as:
/home/relmgmt/griffin-job/pri7406in/hs_err_pid12123.log```
attached the hs_err_pid12123.log
and we manually start a griffin job with:
*spark-submit --class org.apache.griffin.measure.Application --master yarn
--deploy-mode client --queue default --driver-memory 512m --executor-memory
512m --num-executors 3 --conf
"spark.driver.extraJavaOptions=-Djava.security.auth.login.config=jaas.conf"
--conf
"spark.executor.extraJavaOptions=-Djava.security.auth.login.config=jaas.conf"
--files "jaas.conf,keystore.jks,truststore.jks"
griffin-measure-rheos-test.jar env.json config.json local,local*
would you please help to check? many thanks
I created a jira ticket for it as well:
https://issues.apache.org/jira/browse/GRIFFIN-176
BR-
Juan
Re: griffin job crashed due to OOM
Posted by William Guo <gu...@apache.org>.
Hi Juan,
Thanks for your question.
I will follow this issue.
Thanks,
William
On Wed, Jul 11, 2018 at 11:42 AM, jenny li <su...@gmail.com> wrote:
> Hi Experts,
>
> our griffin job stopped running reporting OOM error as below:
>
> ```save source data count: 6672
>
> write path: hdfs:///griffin/streaming/pri7406in/dump/source/new
>
> #
>
> 1. There is insufficient memory for the Java Runtime Environment to
> continue.
>
>
> 1. Native memory allocation (mmap) failed to map 12288 bytes for
> committing reserved memory.
>
>
> 1. An error report file with more information is saved as:
>
> /home/relmgmt/griffin-job/pri7406in/hs_err_pid12123.log```
>
> attached the hs_err_pid12123.log
>
>
> and we manually start a griffin job with:
>
> *spark-submit --class org.apache.griffin.measure.Application --master yarn
> --deploy-mode client --queue default --driver-memory 512m --executor-memory
> 512m --num-executors 3 --conf
> "spark.driver.extraJavaOptions=-Djava.security.auth.login.config=jaas.conf"
> --conf
> "spark.executor.extraJavaOptions=-Djava.security.auth.login.config=jaas.conf"
> --files "jaas.conf,keystore.jks,truststore.jks"
> griffin-measure-rheos-test.jar env.json config.json local,local*
>
> would you please help to check? many thanks
>
> I created a jira ticket for it as well: https://issues.apache.
> org/jira/browse/GRIFFIN-176
>
> BR-
> Juan
>
>