Posted to issues@spark.apache.org by "Menelais Karavelas (Jira)" <ji...@apache.org> on 2021/10/18 17:56:00 UTC

[jira] [Created] (SPARK-37047) Add overloads for lpad and rpad for BINARY strings

Menelais Karavelas created SPARK-37047:
------------------------------------------

             Summary: Add overloads for lpad and rpad for BINARY strings
                 Key: SPARK-37047
                 URL: https://issues.apache.org/jira/browse/SPARK-37047
             Project: Spark
          Issue Type: New Feature
          Components: SQL
    Affects Versions: 3.3.0
            Reporter: Menelais Karavelas


Currently, `lpad` and `rpad` accept BINARY strings as input (both for the string to be padded and for the padding pattern), and these strings are cast to UTF8 strings. The result of the operation is a UTF8 string that may be invalid, because it can contain byte sequences that are not valid UTF-8.
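
To illustrate the problem outside of Spark, here is a minimal Python sketch. It is not Spark code; the helper `is_valid_utf8` and the sample bytes are hypothetical and only show that a padded byte sequence is not necessarily decodable as UTF-8.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if the bytes decode cleanly as UTF-8 (hypothetical helper)."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Sample binary value and padding pattern, chosen for illustration only.
value = bytes([0x01, 0x02])
pad = bytes([0xFF])  # 0xFF never appears in well-formed UTF-8

# The bytes one would expect after left-padding `value` to 5 bytes.
padded = pad * 3 + value

print(is_valid_utf8(value))   # the unpadded input happens to be valid
print(is_valid_utf8(padded))  # the padded bytes are not
```

Treating such a result as a UTF8 string is what makes the current behavior unsafe for BINARY inputs.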

We would like to overload `lpad` and `rpad` to accept BINARY strings as inputs (both for the string to be padded and for the padding pattern) and to produce a left- or right-padded BINARY string as output.
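
The intended byte-wise behavior could look roughly like the following Python sketch. The function names `lpad_bytes` and `rpad_bytes` are hypothetical, and the edge-case rules (length counted in bytes, truncation when the input is longer than the target length, an empty pattern leaving the input unpadded) are assumptions made for illustration, not decisions stated in this issue.

```python
def _padding(missing: int, pad: bytes) -> bytes:
    """Repeat `pad` and cut it so the result is exactly `missing` bytes long."""
    repeats, remainder = divmod(missing, len(pad))
    return pad * repeats + pad[:remainder]


def lpad_bytes(data: bytes, length: int, pad: bytes) -> bytes:
    """Left-pad `data` with `pad` to `length` bytes (assumed semantics)."""
    if length <= 0:
        return b""
    if len(data) >= length:
        return data[:length]  # assumption: truncate inputs that are too long
    if not pad:
        return data           # assumption: empty pattern means no padding
    return _padding(length - len(data), pad) + data


def rpad_bytes(data: bytes, length: int, pad: bytes) -> bytes:
    """Right-pad `data` with `pad` to `length` bytes (assumed semantics)."""
    if length <= 0:
        return b""
    if len(data) >= length:
        return data[:length]
    if not pad:
        return data
    return data + _padding(length - len(data), pad)


print(lpad_bytes(b"\x01\x02", 5, b"\xff\x00"))
print(rpad_bytes(b"\x01\x02", 5, b"\xff\x00"))
```

Because both inputs and the output stay as raw bytes, no UTF-8 decoding step is involved and no invalid string can be produced.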


