Posted to issues@spark.apache.org by "Jelther Oliveira Gonçalves (Jira)" <ji...@apache.org> on 2019/12/12 23:00:00 UTC

[jira] [Created] (SPARK-30242) Support reading Parquet files from Stream Buffer

Jelther Oliveira Gonçalves created SPARK-30242:
--------------------------------------------------

             Summary: Support reading Parquet files from Stream Buffer
                 Key: SPARK-30242
                 URL: https://issues.apache.org/jira/browse/SPARK-30242
             Project: Spark
          Issue Type: Wish
          Components: Spark Core
    Affects Versions: 2.4.4
            Reporter: Jelther Oliveira Gonçalves


Reading a Parquet file from a Python BytesIO buffer is not possible with PySpark.



Using:

 
{code:python}
from io import BytesIO

parquetbytes: bytes = b'PAR...'

# load() expects a path string (or a list of path strings), not a file-like object
df = spark.read.format("parquet").load(BytesIO(parquetbytes))
{code}
Raises:
{code:java}
java.lang.ClassCastException: java.util.ArrayList cannot be cast to java.lang.String{code}
 

Is there any chance this will be available in the future?
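
In the meantime, a possible workaround is to write the bytes to a temporary file and pass its path to the reader. The following is only a minimal sketch: the helper names bytes_to_temp_parquet and read_parquet_bytes are hypothetical, and it assumes that spark is an existing SparkSession in local mode, so that the driver and the executors share the same filesystem.

```python
import os
import tempfile


def bytes_to_temp_parquet(parquet_bytes: bytes) -> str:
    """Write in-memory Parquet bytes to a temporary file and return its path."""
    fd, path = tempfile.mkstemp(suffix=".parquet")
    with os.fdopen(fd, "wb") as f:
        f.write(parquet_bytes)
    return path


def read_parquet_bytes(spark, parquet_bytes: bytes):
    """Hypothetical helper: load Parquet bytes through a temporary file.

    Assumes `spark` is an existing SparkSession running in local mode,
    so the temporary file is visible to whatever process reads it.
    """
    path = bytes_to_temp_parquet(parquet_bytes)
    # The reader is given a path string here instead of a BytesIO object.
    return spark.read.format("parquet").load(path)
```

Because Spark evaluates lazily, the temporary file has to stay in place until the DataFrame has been materialized; the caller would remove it afterwards. On a cluster the file would need to live on storage that every executor can reach, which this sketch does not handle.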

 


