Posted to issues@hbase.apache.org by "Peter Somogyi (Jira)" <ji...@apache.org> on 2020/08/17 17:46:00 UTC

[jira] [Updated] (HBASE-19848) Zookeeper thread leaks in hbase-spark bulkLoad method

     [ https://issues.apache.org/jira/browse/HBASE-19848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Peter Somogyi updated HBASE-19848:
----------------------------------
    Fix Version/s: 3.0.0-alpha-1

> Zookeeper thread leaks in hbase-spark bulkLoad method
> -----------------------------------------------------
>
>                 Key: HBASE-19848
>                 URL: https://issues.apache.org/jira/browse/HBASE-19848
>             Project: HBase
>          Issue Type: Bug
>          Components: spark, Zookeeper
>    Affects Versions: 1.2.0
>         Environment: hbase-spark-1.2.0-cdh5.12.1 version
> spark 1.6
>            Reporter: Key Hutu
>            Assignee: Key Hutu
>            Priority: Major
>              Labels: performance
>             Fix For: 3.0.0-alpha-1
>
>         Attachments: HBASE-19848-V2.patch, HBASE-19848-V3.patch, HBaseContext.patch, HBaseContext.scala
>
>   Original Estimate: 72h
>  Remaining Estimate: 72h
>
> In the hbase-spark project, HBaseContext provides a bulkLoad method for easily loading Spark RDD data into HBase. But when I call it frequently, the program throws a "cannot create native thread" exception.
> Running the pstack command against the Spark driver process shows that the thread count keeps increasing.
> Running jstack shows a large number of threads named "main-SendThread" and "main-EventThread".
> It seems that a connection is created before the bulk load, but its close method is never invoked afterwards.
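
To illustrate the failure mode described above, here is a minimal, self-contained Java sketch. It does not use the HBase or hbase-spark API: FakeConnection, leakyBulkLoads and safeBulkLoads are hypothetical names, and the stand-in connection simply owns one background thread until it is closed. Under those assumptions, it shows how opening a connection per call without closing it makes the thread count grow, and how try-with-resources keeps it flat.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.atomic.AtomicInteger;

final class ConnectionLeakSketch {

    /** Hypothetical stand-in for a client connection; this is NOT the HBase API. */
    static final class FakeConnection implements AutoCloseable {
        // Number of stand-in connections whose background thread is still running.
        static final AtomicInteger LIVE = new AtomicInteger();

        private final CountDownLatch stop = new CountDownLatch(1);
        private final Thread worker;

        FakeConnection() {
            // Each connection starts one thread that runs until close() is called.
            worker = new Thread(() -> {
                try {
                    stop.await();
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            }, "fake-SendThread");
            worker.setDaemon(true); // let the JVM exit even if a thread is leaked
            worker.start();
            LIVE.incrementAndGet();
        }

        @Override
        public void close() {
            if (stop.getCount() == 0) {
                return; // already closed
            }
            stop.countDown();
            try {
                worker.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
            LIVE.decrementAndGet();
        }
    }

    /** Opens a connection per call and never closes it; returns how many threads were left behind. */
    static int leakyBulkLoads(int calls) {
        int before = FakeConnection.LIVE.get();
        for (int i = 0; i < calls; i++) {
            FakeConnection conn = new FakeConnection();
            // ... the load would happen here; conn.close() is never called
        }
        return FakeConnection.LIVE.get() - before;
    }

    /** Same loop, but try-with-resources closes each connection when the load finishes. */
    static int safeBulkLoads(int calls) {
        int before = FakeConnection.LIVE.get();
        for (int i = 0; i < calls; i++) {
            try (FakeConnection conn = new FakeConnection()) {
                // ... the load would happen here
            }
        }
        return FakeConnection.LIVE.get() - before;
    }

    public static void main(String[] args) {
        System.out.println("threads left by leaky loads: " + leakyBulkLoads(5));
        System.out.println("threads left by safe loads:  " + safeBulkLoads(5));
    }
}
```

In the sketch, the first count grows with the number of calls while the second stays at zero. In real client code the analogous change would presumably be to close the org.apache.hadoop.hbase.client.Connection obtained for the bulk load (for example in a finally block or with try-with-resources); whether the attached patches take that exact approach is not stated in this message.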



--
This message was sent by Atlassian Jira
(v8.3.4#803005)