You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "xuejianbest (JIRA)" <ji...@apache.org> on 2018/08/14 02:15:00 UTC
[jira] [Created] (SPARK-25108) Dataset.show() generates incorrect
padding for Unicode Character
xuejianbest created SPARK-25108:
-----------------------------------
Summary: Dataset.show() generates incorrect padding for Unicode Character
Key: SPARK-25108
URL: https://issues.apache.org/jira/browse/SPARK-25108
Project: Spark
Issue Type: Improvement
Components: SQL
Affects Versions: 2.3.1, 2.3.0
Environment: spark-shell on Xshell
Reporter: xuejianbest
The Dataset.show() method generates incorrect space padding since column name or column value has Unicode Character.
{code:java}
val df = spark.createDataset(Seq(
"γύρος",
"pears",
"linguiça",
"xoriço",
"hamburger",
"éclair",
"smørbrød",
"spätzle",
"包子",
"jamón serrano",
"pêches",
"シュークリーム",
"막걸리",
"寿司",
"おもち",
"crème brûlée",
"fideuà",
"pâté",
"お好み焼き")).toDF("value")
before:
+-------------+
| value|
+-------------+
| γύρος|
| pears|
| linguiça|
| xoriço|
| hamburger|
| éclair|
| smørbrød|
| spätzle|
| 包子|
|jamón serrano|
| pêches|
| シュークリーム|
| 막걸리|
| 寿司|
| おもち|
| crème brûlée|
| fideuà|
| pâté|
| お好み焼き|
+-------------+
after fix:
+--------------+
| value|
+--------------+
| γύρος|
| pears|
| linguiça|
| xoriço|
| hamburger|
| éclair|
| smørbrød|
| spätzle|
| 包子|
| jamón serrano|
| pêches|
|シュークリーム|
| 막걸리|
| 寿司|
| おもち|
| crème brûlée|
| fideuà|
| pâté|
| お好み焼き|
+--------------+
{code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org