You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "ocean (Jira)" <ji...@apache.org> on 2021/11/16 02:01:00 UTC

[jira] [Created] (SPARK-37344) split function behave differently between spark 2.3 and spark 3.2

ocean created SPARK-37344:
-----------------------------

             Summary: split function behave differently between spark 2.3 and spark 3.2
                 Key: SPARK-37344
                 URL: https://issues.apache.org/jira/browse/SPARK-37344
             Project: Spark
          Issue Type: Bug
          Components: SQL
    Affects Versions: 3.2.0, 3.1.2, 3.1.1
            Reporter: ocean


while use split function in sql, it behave differently between 2.3 and 3.2, which cause incorrect problem.

we can use this sql to reproduce this problem:

 

create table split_test ( id int,name string)

insert into split_test values(1,"abc;def")

explain extended select split(name,'\\\\;') from split_test

 

spark3:

spark-sql> Explain extended select split(name,'\\\\;') from split_test;

== Parsed Logical Plan ==

'Project [unresolvedalias('split('name, \\;), None)]

+- 'UnresolvedRelation [split_test], [], false

 

spark2:

 

spark-sql> Explain extended select split(name,'\\\\;') from split_test;

== Parsed Logical Plan ==

'Project [unresolvedalias('split('name, \;), None)]

+- 'UnresolvedRelation split_test

 

It looks like the deal of escape is different



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org