You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Abhishek Shakya (Jira)" <ji...@apache.org> on 2021/09/21 17:58:00 UTC

[jira] [Created] (SPARK-36817) Does Apache Spark 3 support GPU usage for Spark RDDs?

Abhishek Shakya created SPARK-36817:
---------------------------------------

             Summary: Does Apache Spark 3 support GPU usage for Spark RDDs?
                 Key: SPARK-36817
                 URL: https://issues.apache.org/jira/browse/SPARK-36817
             Project: Spark
          Issue Type: Question
          Components: Spark Core
    Affects Versions: 3.1.2
            Reporter: Abhishek Shakya


I am currently trying to run genomic analyses pipelines using [Hail|https://hail.is/](library for genomics analyses written in python and Scala). Recently, Apache Spark 3 was released and it supported GPU usage.

I tried [spark-rapids|https://nvidia.github.io/spark-rapids/] library start an on-premise slurm cluster with gpu nodes. I was able to initialise the cluster. However, when I tried running hail tasks, the executors keep getting killed.

On querying in Hail forum, I got the response that
{quote}That’s a GPU code generator for Spark-SQL, and Hail doesn’t use any Spark-SQL interfaces, only the RDD interfaces.
{quote}
So, does Spark3 not support GPU usage for RDD interfaces?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org