You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Abhishek Shakya (Jira)" <ji...@apache.org> on 2021/09/21 17:58:00 UTC
[jira] [Created] (SPARK-36817) Does Apache Spark 3 support GPU
usage for Spark RDDs?
Abhishek Shakya created SPARK-36817:
---------------------------------------
Summary: Does Apache Spark 3 support GPU usage for Spark RDDs?
Key: SPARK-36817
URL: https://issues.apache.org/jira/browse/SPARK-36817
Project: Spark
Issue Type: Question
Components: Spark Core
Affects Versions: 3.1.2
Reporter: Abhishek Shakya
I am currently trying to run genomic analyses pipelines using [Hail|https://hail.is/](library for genomics analyses written in python and Scala). Recently, Apache Spark 3 was released and it supported GPU usage.
I tried [spark-rapids|https://nvidia.github.io/spark-rapids/] library start an on-premise slurm cluster with gpu nodes. I was able to initialise the cluster. However, when I tried running hail tasks, the executors keep getting killed.
On querying in Hail forum, I got the response that
{quote}That’s a GPU code generator for Spark-SQL, and Hail doesn’t use any Spark-SQL interfaces, only the RDD interfaces.
{quote}
So, does Spark3 not support GPU usage for RDD interfaces?
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org