Posted to issues@spark.apache.org by "xuejianbest (JIRA)" <ji...@apache.org> on 2018/08/14 02:15:00 UTC

[jira] [Created] (SPARK-25108) Dataset.show() generates incorrect padding for Unicode Character

xuejianbest created SPARK-25108:
-----------------------------------

             Summary: Dataset.show() generates incorrect padding for Unicode Character
                 Key: SPARK-25108
                 URL: https://issues.apache.org/jira/browse/SPARK-25108
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 2.3.1, 2.3.0
         Environment: spark-shell on Xshell
            Reporter: xuejianbest


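The Dataset.show() method generates incorrect space padding when a column name or column value contains Unicode characters that occupy two terminal columns (for example CJK characters). Such rows come out wider than the others, so the right-hand border of the table no longer lines up.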
{code:java}
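// Run in spark-shell, where `spark` and its implicits are already in scope.
val df = spark.createDataset(Seq(
  "γύρος",
  "pears",
  "linguiça",
  "xoriço",
  "hamburger",
  "éclair",
  "smørbrød",
  "spätzle",
  "包子",
  "jamón serrano",
  "pêches",
  "シュークリーム",
  "막걸리",
  "寿司",
  "おもち",
  "crème brûlée",
  "fideuà",
  "pâté",
  "お好み焼き")).toDF("value")

// Print the table whose padding is shown below.
df.show()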

before:
+-------------+
|        value|
+-------------+
|        γύρος|
|        pears|
|     linguiça|
|       xoriço|
|    hamburger|
|       éclair|
|     smørbrød|
|      spätzle|
|           包子|
|jamón serrano|
|       pêches|
|      シュークリーム|
|          막걸리|
|           寿司|
|          おもち|
| crème brûlée|
|       fideuà|
|         pâté|
|        お好み焼き|
+-------------+

after fix:
+--------------+
|         value|
+--------------+
|         γύρος|
|         pears|
|      linguiça|
|        xoriço|
|     hamburger|
|        éclair|
|      smørbrød|
|       spätzle|
|          包子|
| jamón serrano|
|        pêches|
|シュークリーム|
|        막걸리|
|          寿司|
|        おもち|
|  crème brûlée|
|        fideuà|
|          pâté|
|    お好み焼き|
+--------------+

{code}
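
One possible way to compute the padding is sketched below, purely as an illustration and not as the actual Spark change. It pads by display width rather than by String length. The object DisplayWidthSketch, its helpers isWide, displayWidth and padLeft, and the list of code point ranges are all hypothetical. The ranges are a rough subset of the East Asian Wide and Fullwidth characters, and the sketch assumes precomposed accented letters (combining marks would be miscounted as one column each).

```scala
object DisplayWidthSketch {
  // Assumption: code points in these ranges take two terminal columns.
  // This is a simplified approximation, not the full Unicode width table.
  private val wideRanges: Seq[(Int, Int)] = Seq(
    0x1100 -> 0x115F, // Hangul Jamo initial consonants
    0x2E80 -> 0xA4CF, // CJK radicals, kana, CJK ideographs, Yi
    0xAC00 -> 0xD7A3, // Hangul syllables
    0xF900 -> 0xFAFF, // CJK compatibility ideographs
    0xFE30 -> 0xFE4F, // CJK compatibility forms
    0xFF00 -> 0xFF60, // Fullwidth ASCII variants
    0xFFE0 -> 0xFFE6  // Fullwidth signs
  )

  def isWide(codePoint: Int): Boolean =
    wideRanges.exists { case (lo, hi) => codePoint >= lo && codePoint <= hi }

  // Number of terminal columns the string is expected to occupy.
  def displayWidth(s: String): Int =
    s.codePoints().toArray.map(cp => if (isWide(cp)) 2 else 1).sum

  // Right-align `s` in a cell of `width` columns, measured by display width.
  def padLeft(s: String, width: Int): String =
    " " * math.max(0, width - displayWidth(s)) + s

  def main(args: Array[String]): Unit = {
    val values = Seq("pears", "jamón serrano", "包子", "シュークリーム", "막걸리")
    val width = values.map(displayWidth).max
    val border = "+" + "-" * width + "+"
    println(border)
    values.foreach(v => println("|" + padLeft(v, width) + "|"))
    println(border)
  }
}
```

With this measure "シュークリーム" counts as 14 columns instead of 7, which would make it the widest cell and widen the border by one dash, as in the "after fix" output.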



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)