You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Tim Ludwinski (JIRA)" <ji...@apache.org> on 2019/05/14 21:01:00 UTC
[jira] [Created] (SPARK-27712) createDataFrame() reorders row
Tim Ludwinski created SPARK-27712:
-------------------------------------
Summary: createDataFrame() reorders row
Key: SPARK-27712
URL: https://issues.apache.org/jira/browse/SPARK-27712
Project: Spark
Issue Type: Bug
Components: PySpark
Affects Versions: 2.4.0
Environment: emr-5.20.0
PySpark 2.4.0
Python 2.7.15
Reporter: Tim Ludwinski
Executing the following:
{code:java}
my_schema = pyspark.sql.types.StructType([
pyspark.sql.types.StructField("B", pyspark.sql.types.StringType(), True),
pyspark.sql.types.StructField("A", pyspark.sql.types.StringType(), True)
])
spark.createDataFrame(spark.sparkContext.parallelize([pyspark.sql.Row(A="1", B="2")]), my_schema).collect()
{code}
should produce this:
{code:java}
[Row(A="1", B="2")]
{code}
or this:
{code:java}
[Row(B='2', A='1')]
{code}
but produces this instead:
{code:java}
[Row(B=u'1', A=u'2')]
{code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org